「cs.CV」カテゴリーアーカイブ

Improving Dynamic Object Interactions in Text-to-Video Generation with AI Feedback

投稿日: 2024年12月4日作成者: jarxiv

要約大規模なテキストからビデオへのモデルは、幅広い下流アプリケーションに計り知 … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.LG | コメントを受け付けていません

Continual Learning of Personalized Generative Face Models with Experience Replay

投稿日: 2024年12月4日作成者: jarxiv

要約つまり、異なる外見、スタイル、ポーズ、照明の新しい写真が定期的に撮影される … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Sharp-It: A Multi-view to Multi-view Diffusion Model for 3D Synthesis and Manipulation

投稿日: 2024年12月4日作成者: jarxiv

要約テキストから画像への拡散モデルの進歩により、3Dコンテンツの高速作成が大き … 続きを読む →

カテゴリー: cs.CV, cs.LG | コメントを受け付けていません

Scaling Image Tokenizers with Grouped Spherical Quantization

投稿日: 2024年12月4日作成者: jarxiv

要約ビジョントークナイザーは、そのスケーラビリティとコンパクト性から多くの注目 … 続きを読む →

カテゴリー: cs.AI, cs.CV | コメントを受け付けていません

MetaShadow: Object-Centered Shadow Detection, Removal, and Synthesis

投稿日: 2024年12月4日作成者: jarxiv

要約画像編集アプリケーションにおいて、影はしばしば十分に考慮されないか、無視さ … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Robust soybean seed yield estimation using high-throughput ground robot videos

投稿日: 2024年12月4日作成者: jarxiv

要約我々は、コンピュータビジョンとディープラーニング技術による高スループットな … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

A Bidirectional Long Short Term Memory Approach for Infrastructure Health Monitoring Using On-board Vibration Response

投稿日: 2024年12月4日作成者: jarxiv

要約利用可能なインフラ監視データの量が増加しているため、直接計測を使用してイン … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Decoupling Dark Knowledge via Block-wise Logit Distillation for Feature-level Alignment

投稿日: 2024年12月4日作成者: jarxiv

要約知識蒸留(Knowledge Distillation: KD)は、より大 … 続きを読む →

カテゴリー: cs.CV, cs.LG | コメントを受け付けていません

Go beyond End-to-End Training: Boosting Greedy Local Learning with Context Supply

投稿日: 2024年12月4日作成者: jarxiv

要約ディープネットワークの従来のE2E（end-to-end）学習では、バック … 続きを読む →

カテゴリー: cs.CV, cs.LG, stat.ML | コメントを受け付けていません

STRIDE: Single-video based Temporally Continuous Occlusion Robust 3D Pose Estimation

投稿日: 2024年12月4日作成者: jarxiv

要約人間の3Dポーズを正確に推定する能力は、行動認識、歩行認識、仮想現実／拡張 … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

「cs.CV」カテゴリーアーカイブ

Improving Dynamic Object Interactions in Text-to-Video Generation with AI Feedback

Continual Learning of Personalized Generative Face Models with Experience Replay

Sharp-It: A Multi-view to Multi-view Diffusion Model for 3D Synthesis and Manipulation

Scaling Image Tokenizers with Grouped Spherical Quantization

MetaShadow: Object-Centered Shadow Detection, Removal, and Synthesis

Robust soybean seed yield estimation using high-throughput ground robot videos

A Bidirectional Long Short Term Memory Approach for Infrastructure Health Monitoring Using On-board Vibration Response

Decoupling Dark Knowledge via Block-wise Logit Distillation for Feature-level Alignment

Go beyond End-to-End Training: Boosting Greedy Local Learning with Context Supply

STRIDE: Single-video based Temporally Continuous Occlusion Robust 3D Pose Estimation

最近の投稿

最近のコメント

アーカイブ

カテゴリー