月別アーカイブ: 2025年2月

Learning Real-World Action-Video Dynamics with Heterogeneous Masked Autoregression

投稿日: 2025年2月7日作成者: jarxiv

要約アクションとビデオのダイナミクスをモデリングするための不均一なマスク自己網 … 続きを読む →

カテゴリー: cs.CV, cs.LG, cs.RO | コメントを受け付けていません

MotionCanvas: Cinematic Shot Design with Controllable Image-to-Video Generation

投稿日: 2025年2月7日作成者: jarxiv

要約このペーパーでは、ユーザーが画像間生成のコンテキストで映画のビデオショット … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

SWAG: Long-term Surgical Workflow Prediction with Generative-based Anticipation

投稿日: 2025年2月7日作成者: jarxiv

要約既存のアプローチは現在の外科段階を認識することに優れていますが、将来の手続 … 続きを読む →

カテゴリー: cs.CV, cs.LG | コメントを受け付けていません

SoNIC: Safe Social Navigation with Adaptive Conformal Inference and Constrained Reinforcement Learning

投稿日: 2025年2月7日作成者: jarxiv

要約強化学習（RL）により、ソーシャルロボットは、人間が設計したルールや介入に … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.LG, cs.RO, cs.SY, eess.SY | コメントを受け付けていません

Factorized Implicit Global Convolution for Automotive Computational Fluid Dynamics Prediction

投稿日: 2025年2月7日作成者: jarxiv

要約計算流体ダイナミクス（CFD）は自動車設計に不可欠であり、大きな3Dポイン … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

sshELF: Single-Shot Hierarchical Extrapolation of Latent Features for 3D Reconstruction from Sparse-Views

投稿日: 2025年2月7日作成者: jarxiv

要約まばらな外向きの景色から無制限の屋外シーンを再構築することは、最小限の視野 … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

ConceptAttention: Diffusion Transformers Learn Highly Interpretable Features

投稿日: 2025年2月7日作成者: jarxiv

要約マルチモーダル拡散トランス（DITS）の豊富な表現は、解釈可能性を高めるユ … 続きを読む →

カテゴリー: cs.CV, cs.LG | コメントを受け付けていません

WorldSense: Evaluating Real-world Omnimodal Understanding for Multimodal LLMs

投稿日: 2025年2月7日作成者: jarxiv

要約このペーパーでは、視覚、オーディオ、テキスト入力を同時に網羅するマルチモー … 続きを読む →

カテゴリー: cs.AI, cs.CV | コメントを受け付けていません

Ola: Pushing the Frontiers of Omni-Modal Language Model with Progressive Modality Alignment

投稿日: 2025年2月7日作成者: jarxiv

要約特にGPT-4Oに続く大規模な言語モデルの最近の進歩により、より多くのモダ … 続きを読む →

カテゴリー: cs.CL, cs.CV, cs.MM, cs.SD, eess.AS, eess.IV | コメントを受け付けていません

SMART: Advancing Scalable Map Priors for Driving Topology Reasoning

投稿日: 2025年2月7日作成者: jarxiv

要約トポロジーの推論は、車線と交通要素の間の接続性と関係を包括的に理解すること … 続きを読む →

カテゴリー: cs.CV, cs.RO | コメントを受け付けていません

月別アーカイブ: 2025年2月

Learning Real-World Action-Video Dynamics with Heterogeneous Masked Autoregression

MotionCanvas: Cinematic Shot Design with Controllable Image-to-Video Generation

SWAG: Long-term Surgical Workflow Prediction with Generative-based Anticipation

SoNIC: Safe Social Navigation with Adaptive Conformal Inference and Constrained Reinforcement Learning

Factorized Implicit Global Convolution for Automotive Computational Fluid Dynamics Prediction

sshELF: Single-Shot Hierarchical Extrapolation of Latent Features for 3D Reconstruction from Sparse-Views

ConceptAttention: Diffusion Transformers Learn Highly Interpretable Features

WorldSense: Evaluating Real-world Omnimodal Understanding for Multimodal LLMs

Ola: Pushing the Frontiers of Omni-Modal Language Model with Progressive Modality Alignment

SMART: Advancing Scalable Map Priors for Driving Topology Reasoning

最近の投稿

最近のコメント

アーカイブ

カテゴリー