月別アーカイブ: 2025年4月

NormalCrafter: Learning Temporally Consistent Normals from Video Diffusion Priors

投稿日: 2025年4月16日作成者: jarxiv

要約表面の通常の推定は、コンピュータービジョンアプリケーションのスペクトルの基 … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Enhancing Out-of-Distribution Detection with Extended Logit Normalization

投稿日: 2025年4月16日作成者: jarxiv

要約分散除外（OOD）検出は、機械学習モデルの安全な展開に不可欠です。最近の … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Mamba-Based Ensemble learning for White Blood Cell Classification

投稿日: 2025年4月16日作成者: jarxiv

要約白血球（WBC）の分類は、免疫の健康の評価とさまざまな疾患の診断に役立ちま … 続きを読む →

カテゴリー: cs.CV, cs.LG | コメントを受け付けていません

TADACap: Time-series Adaptive Domain-Aware Captioning

投稿日: 2025年4月16日作成者: jarxiv

要約画像キャプションは大きな注目を集めていますが、金融やヘルスケアなどの分野で … 続きを読む →

カテゴリー: cs.CL, cs.CV | コメントを受け付けていません

Reference-Based 3D-Aware Image Editing with Triplanes

投稿日: 2025年4月16日作成者: jarxiv

要約生成的敵対ネットワーク（GAN）は、潜在スペースを操作することにより、高品 … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Diffusion Distillation With Direct Preference Optimization For Efficient 3D LiDAR Scene Completion

投稿日: 2025年4月16日作成者: jarxiv

要約 3D LIDARシーンの完了における拡散モデルの適用は、拡散のサンプリング … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

PARTFIELD: Learning 3D Feature Fields for Part Segmentation and Beyond

投稿日: 2025年4月16日作成者: jarxiv

要約 Partfieldを提案します。これは、定義済みのテンプレートやテキストベ … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

SimpleAR: Pushing the Frontier of Autoregressive Visual Generation through Pretraining, SFT, and RL

投稿日: 2025年4月16日作成者: jarxiv

要約この作業は、複雑なアーキテクチャの変更なしに、バニラの自己回帰視覚生成フレ … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Aligning Generative Denoising with Discriminative Objectives Unleashes Diffusion for Visual Perception

投稿日: 2025年4月16日作成者: jarxiv

要約画像生成の成功に伴い、ピクセル生成が統一された知覚インターフェイスを提供す … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

MultiLoKo: a multilingual local knowledge benchmark for LLMs spanning 31 languages

投稿日: 2025年4月16日作成者: jarxiv

要約 31の言語をカバーするLLMSの多言語性を評価するための新しいベンチマーク … 続きを読む →

カテゴリー: cs.CL | コメントを受け付けていません

月別アーカイブ: 2025年4月

NormalCrafter: Learning Temporally Consistent Normals from Video Diffusion Priors

Enhancing Out-of-Distribution Detection with Extended Logit Normalization

Mamba-Based Ensemble learning for White Blood Cell Classification

TADACap: Time-series Adaptive Domain-Aware Captioning

Reference-Based 3D-Aware Image Editing with Triplanes

Diffusion Distillation With Direct Preference Optimization For Efficient 3D LiDAR Scene Completion

PARTFIELD: Learning 3D Feature Fields for Part Segmentation and Beyond

SimpleAR: Pushing the Frontier of Autoregressive Visual Generation through Pretraining, SFT, and RL

Aligning Generative Denoising with Discriminative Objectives Unleashes Diffusion for Visual Perception

MultiLoKo: a multilingual local knowledge benchmark for LLMs spanning 31 languages

最近の投稿

最近のコメント

アーカイブ

カテゴリー