「cs.CV」カテゴリーアーカイブ

SDFit: 3D Object Pose and Shape by Fitting a Morphable SDF to a Single Image

投稿日: 2025年3月11日作成者: jarxiv

要約単一の画像から3Dオブジェクトのポーズと形状を回復することは、挑戦的で非常 … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

MIBench: A Comprehensive Framework for Benchmarking Model Inversion Attack and Defense

投稿日: 2025年3月11日作成者: jarxiv

要約モデル反転（MI）攻撃は、ターゲットモデルの出力情報を活用してプライバシー … 続きを読む →

カテゴリー: cs.CR, cs.CV | コメントを受け付けていません

TriDi: Trilateral Diffusion of 3D Humans, Objects, and Interactions

投稿日: 2025年3月11日作成者: jarxiv

要約 3D Human-Object Interaction（HOI）のモデリン … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Personalized Generative Low-light Image Denoising and Enhancement

投稿日: 2025年3月11日作成者: jarxiv

要約今日のスマートフォンカメラは驚くほど良い写真を生成することができますが、光 … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Look Inside for More: Internal Spatial Modality Perception for 3D Anomaly Detection

投稿日: 2025年3月11日作成者: jarxiv

要約 3D異常検出は最近、コンピュータービジョンに大きな焦点となっています。い … 続きを読む →

カテゴリー: cs.AI, cs.CV, eess.IV | コメントを受け付けていません

MGNiceNet: Unified Monocular Geometric Scene Understanding

投稿日: 2025年3月11日作成者: jarxiv

要約単眼の幾何学的シーンの理解は、パノプティックセグメンテーションと自己監視の … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Sparrow: Data-Efficient Video-LLM with Text-to-Image Augmentation

投稿日: 2025年3月11日作成者: jarxiv

要約近年、ビジョン理解ドメインにおけるマルチモーダル大手言語モデル（MLLM） … 続きを読む →

カテゴリー: cs.CL, cs.CV, cs.LG | コメントを受け付けていません

NeAS: 3D Reconstruction from X-ray Images using Neural Attenuation Surface

投稿日: 2025年3月11日作成者: jarxiv

要約 2次元（2D）X線画像からの3次元（3D）構造の再構築は、コンピューター断 … 続きを読む →

カテゴリー: cs.CV, eess.IV | コメントを受け付けていません

V2Flow: Unifying Visual Tokenization and Large Language Model Vocabularies for Autoregressive Image Generation

投稿日: 2025年3月11日作成者: jarxiv

要約 V2Flowを提案します。これは、高忠実度の再構成が可能な離散視覚トークン … 続きを読む →

カテゴリー: cs.AI, cs.CV | コメントを受け付けていません

AthletePose3D: A Benchmark Dataset for 3D Human Pose Estimation and Kinematic Validation in Athletic Movements

投稿日: 2025年3月11日作成者: jarxiv

要約人間のポーズ推定は、スポーツ科学、リハビリテーション、および生体力学的研究 … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

「cs.CV」カテゴリーアーカイブ

SDFit: 3D Object Pose and Shape by Fitting a Morphable SDF to a Single Image

MIBench: A Comprehensive Framework for Benchmarking Model Inversion Attack and Defense

TriDi: Trilateral Diffusion of 3D Humans, Objects, and Interactions

Personalized Generative Low-light Image Denoising and Enhancement

Look Inside for More: Internal Spatial Modality Perception for 3D Anomaly Detection

MGNiceNet: Unified Monocular Geometric Scene Understanding

Sparrow: Data-Efficient Video-LLM with Text-to-Image Augmentation

NeAS: 3D Reconstruction from X-ray Images using Neural Attenuation Surface

V2Flow: Unifying Visual Tokenization and Large Language Model Vocabularies for Autoregressive Image Generation

AthletePose3D: A Benchmark Dataset for 3D Human Pose Estimation and Kinematic Validation in Athletic Movements

最近の投稿

最近のコメント

アーカイブ

カテゴリー