「cs.CV」カテゴリーアーカイブ

Adaptive Caching for Faster Video Generation with Diffusion Transformers

投稿日: 2024年11月5日作成者: jarxiv

要約時間的に一貫性のある忠実度の高い映像を生成することは、特に長い時間スパンで … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

SlowFast-VGen: Slow-Fast Learning for Action-Driven Long Video Generation

投稿日: 2024年11月4日作成者: jarxiv

要約人間には相補的な学習システムが備わっており、一般的な世界ダイナミクスのゆっ … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.CV, cs.LG, cs.RO | コメントを受け付けていません

LucidGrasp: Robotic Framework for Autonomous Manipulation of Laboratory Equipment with Different Degrees of Transparency via 6D Pose Estimation

投稿日: 2024年11月4日作成者: jarxiv

要約最新のロボットシステムの多くは自律的に動作するが、環境を正確に分析し、変化 … 続きを読む →

カテゴリー: cs.CV, cs.RO, cs.SE, cs.SY, eess.SY | コメントを受け付けていません

Elliptical Attention

投稿日: 2024年11月4日作成者: jarxiv

要約一対のドット積自己アテンションは、言語と視覚の様々な応用において最先端の性 … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.CV, cs.LG, stat.ML | コメントを受け付けていません

Exploring Behavior-Relevant and Disentangled Neural Dynamics with Generative Diffusion Models

投稿日: 2024年11月4日作成者: jarxiv

要約行動の神経基盤を理解することは、神経科学における基本的な目標である。大規模 … 続きを読む →

カテゴリー: cs.CV, cs.LG, q-bio.NC | コメントを受け付けていません

VILA$^2$: VILA Augmented VILA

投稿日: 2024年11月4日作成者: jarxiv

要約視覚言語モデルのアーキテクチャや学習インフラが急速に進歩する一方で、データ … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

RACCooN: A Versatile Instructional Video Editing Framework with Auto-Generated Narratives

投稿日: 2024年11月4日作成者: jarxiv

要約最近の動画生成モデルは、インペインティングやスタイル編集のような特定のタス … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.CV | コメントを受け付けていません

DiffusionPDE: Generative PDE-Solving Under Partial Observation

投稿日: 2024年11月4日作成者: jarxiv

要約生成的拡散モデルを用いて偏微分方程式（PDE）を解くための一般的な枠組みを … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.LG, cs.NA, math.NA | コメントを受け付けていません

LRM-Zero: Training Large Reconstruction Models with Synthesized Data

投稿日: 2024年11月4日作成者: jarxiv

要約 LRM-ZEROは、合成された3Dデータのみで学習され、高品質なスパースビ … 続きを読む →

カテゴリー: cs.CV, cs.LG | コメントを受け付けていません

From Question to Exploration: Test-Time Adaptation in Semantic Segmentation?

投稿日: 2024年11月4日作成者: jarxiv

要約テスト時間適応（TTA）は、最初に訓練データで訓練されたモデルを、潜在的な … 続きを読む →

カテゴリー: cs.AI, cs.CV | コメントを受け付けていません

「cs.CV」カテゴリーアーカイブ

Adaptive Caching for Faster Video Generation with Diffusion Transformers

SlowFast-VGen: Slow-Fast Learning for Action-Driven Long Video Generation

LucidGrasp: Robotic Framework for Autonomous Manipulation of Laboratory Equipment with Different Degrees of Transparency via 6D Pose Estimation

Elliptical Attention

Exploring Behavior-Relevant and Disentangled Neural Dynamics with Generative Diffusion Models

VILA$^2$: VILA Augmented VILA

RACCooN: A Versatile Instructional Video Editing Framework with Auto-Generated Narratives

DiffusionPDE: Generative PDE-Solving Under Partial Observation

LRM-Zero: Training Large Reconstruction Models with Synthesized Data

From Question to Exploration: Test-Time Adaptation in Semantic Segmentation?

最近の投稿

最近のコメント

アーカイブ

カテゴリー