"cs.CV" Category Archive

Instruction-based Image Manipulation by Watching How Things Move

Posted on: December 17, 2024 | Author: jarxiv

Summary: In this paper, pairs of frames are sampled from videos, and multimodal large …

Categories: cs.CV

Stabilizing Reinforcement Learning in Differentiable Multiphysics Simulation

Posted on: December 17, 2024 | Author: jarxiv

Summary: With recent advances in GPU-based parallel simulation, practitioners … large amounts of …

Categories: cs.AI, cs.CV, cs.LG, cs.RO

Wonderland: Navigating 3D Scenes from a Single Image

Posted on: December 17, 2024 | Author: jarxiv

Summary: In this paper, from a single arbitrary image, high-quality, wide-scope 3D scenes are efficiently …

Categories: cs.CV

CAP4D: Creating Animatable 4D Portrait Avatars with Morphable Multi-View Diffusion Models

Posted on: December 17, 2024 | Author: jarxiv

Summary: Reconstructing photorealistic, dynamic portrait avatars from images …

Categories: cs.CV

Causal Diffusion Transformers for Generative Modeling

Posted on: December 17, 2024 | Author: jarxiv

Summary: We introduce Causal Diffusion as the autoregressive (AR) counterpart of diffusion models. It …

Categories: cs.CV

PanSplat: 4K Panorama Synthesis with Feed-Forward Gaussian Splatting

Posted on: December 17, 2024 | Author: jarxiv

Summary: With the advent of portable 360° cameras, panoramas … virtual reality …

Categories: cs.CV

BrushEdit: All-In-One Image Inpainting and Editing

Posted on: December 17, 2024 | Author: jarxiv

Summary: Image editing, with diffusion models that use both inversion-based and instruction-based methods, …

Categories: cs.AI, cs.CV

MVQ: Towards Efficient DNN Compression and Acceleration with Masked Vector Quantization

Posted on: December 17, 2024 | Author: jarxiv

Summary: Vector quantization (VQ), with respect to storage cost and hardware …

Categories: cs.AR, cs.CV

TIV-Diffusion: Towards Object-Centric Movement for Text-driven Image to Video Generation

Posted on: December 17, 2024 | Author: jarxiv

Summary: Text-driven image-to-video generation (TI2V), the first frame and …

Categories: cs.CV

LLMPhy: Complex Physical Reasoning Using Large Language Models and World Models

Posted on: December 16, 2024 | Author: jarxiv

Summary: Physical reasoning is an important skill required by robotic agents operating in the real world …

Categories: cs.AI, cs.CV, cs.LG, cs.RO
