"cs.CV" Category Archive

Point2RBox-v2: Rethinking Point-supervised Oriented Object Detection with Spatial Layout Among Instances

Posted: February 7, 2025 | Author: jarxiv

Abstract: As demand for oriented object detection (OOD) grows rapidly, …

Categories: cs.AI, cs.CV

Articulate-Anything: Automatic Modeling of Articulated Objects via a Vision-Language Foundation Model

Posted: February 7, 2025 | Author: jarxiv

Abstract: Interactive 3D simulated objects in AR/VR, …

Categories: cs.CV

GCE-Pose: Global Context Enhancement for Category-level Object Pose Estimation

Posted: February 7, 2025 | Author: jarxiv

Abstract: A key challenge in model-free category-level pose estimation is …

Categories: cs.CV

Learning Real-World Action-Video Dynamics with Heterogeneous Masked Autoregression

Posted: February 7, 2025 | Author: jarxiv

Abstract: Heterogeneous Masked Autoregression for modeling action-video dynamics …

Categories: cs.CV, cs.LG, cs.RO

MotionCanvas: Cinematic Shot Design with Controllable Image-to-Video Generation

Posted: February 7, 2025 | Author: jarxiv

Abstract: This paper concerns cinematic video shots by users in the context of image-to-video generation …

Categories: cs.CV

SWAG: Long-term Surgical Workflow Prediction with Generative-based Anticipation

Posted: February 7, 2025 | Author: jarxiv

Abstract: Existing approaches excel at recognizing the current surgical phase, but future …

Categories: cs.CV, cs.LG

SoNIC: Safe Social Navigation with Adaptive Conformal Inference and Constrained Reinforcement Learning

Posted: February 7, 2025 | Author: jarxiv

Abstract: Reinforcement learning (RL) lets social robots … human-designed rules or interventions …

Categories: cs.AI, cs.CV, cs.LG, cs.RO, cs.SY, eess.SY

Factorized Implicit Global Convolution for Automotive Computational Fluid Dynamics Prediction

Posted: February 7, 2025 | Author: jarxiv

Abstract: Computational fluid dynamics (CFD) is essential to automotive design, and large 3D …

Categories: cs.CV

sshELF: Single-Shot Hierarchical Extrapolation of Latent Features for 3D Reconstruction from Sparse-Views

Posted: February 7, 2025 | Author: jarxiv

Abstract: Reconstructing unbounded outdoor scenes from sparse outward-facing views, with minimal view …

Categories: cs.CV

ConceptAttention: Diffusion Transformers Learn Highly Interpretable Features

Posted: February 7, 2025 | Author: jarxiv

Abstract: The rich representations of multimodal diffusion transformers (DiTs) … that enhance interpretability …

Categories: cs.CV, cs.LG
