"cs.CV" Category Archive

Latent Radiance Fields with 3D-aware 2D Representations

Posted on: February 14, 2025 | Author: jarxiv

Summary: Latent 3D reconstruction, by distilling 2D features into 3D space, 3D semantic …

Category: cs.CV

Opening Articulated Objects in the Real World

Posted on: February 14, 2025 | Author: jarxiv

Summary: Capable of operating competently on previously unseen objects in previously unseen environments …

Categories: cs.AI, cs.CV, cs.LG, cs.RO

RigAnything: Template-Free Autoregressive Rigging for Diverse 3D Assets

Posted on: February 14, 2025 | Author: jarxiv

Summary: We present RigAnything, a novel autoregressive transformer-based model …

Category: cs.CV

DexTrack: Towards Generalizable Neural Tracking Control for Dexterous Manipulation from Human References

Posted on: February 14, 2025 | Author: jarxiv

Summary: A generalizable neural tracking controller for dexterous manipulation from human references …

Categories: cs.AI, cs.CV, cs.LG, cs.RO

Variational Rectified Flow Matching

Posted on: February 14, 2025 | Author: jarxiv

Summary: By modeling a multi-modal velocity vector field, classic rectified …

Categories: cs.CV, cs.LG

LIFe-GoM: Generalizable Human Rendering with Learned Iterative Feedback Over Multi-Resolution Gaussians-on-Mesh

Posted on: February 14, 2025 | Author: jarxiv

Summary: Generalizable rendering of animatable human avatars from sparse inputs …

Category: cs.CV

Can this Model Also Recognize Dogs? Zero-Shot Model Search from Weights

Posted on: February 14, 2025 | Author: jarxiv

Summary: As the number of publicly available models grows, for most tasks that users need …

Categories: cs.CV, cs.LG

Exploring the Potential of Encoder-free Architectures in 3D LMMs

Posted on: February 14, 2025 | Author: jarxiv

Summary: Encoder-free architectures have been preliminarily explored in the 2D visual domain …

Categories: cs.AI, cs.CL, cs.CV

MME-CoT: Benchmarking Chain-of-Thought in Large Multimodal Models for Reasoning Quality, Robustness, and Efficiency

Posted on: February 14, 2025 | Author: jarxiv

Summary: Answering questions with Chain-of-Thought (CoT), large language models (L …

Categories: cs.AI, cs.CL, cs.CV

Embed Any NeRF: Graph Meta-Networks for Neural Tasks on Arbitrary NeRF Architectures

Posted on: February 14, 2025 | Author: jarxiv

Summary: Neural Radiance Fields (NeRF), in the weights of a neural network, shape and …

Category: cs.CV
