"cs.CV" Category Archive

3D Reconstruction of non-visible surfaces of objects from a Single Depth View — Comparative Study

Posted on January 28, 2025 by jarxiv

Abstract: Scene and object reconstruction, in particular for planning collision-free trajectories or object …

Categories: cs.CV, cs.RO

3DGS$^2$: Near Second-order Converging 3D Gaussian Splatting

Posted on January 28, 2025 by jarxiv

Abstract: 3D Gaussian Splatting (3DGS), for novel view synthesis and 3D reconstruction …

Categories: cs.CV, cs.GR

Toward Efficient Generalization in 3D Human Pose Estimation via a Canonical Domain Approach

Posted on January 28, 2025 by jarxiv

Abstract: With recent advances in deep learning methods, the performance of 3D human pose estimation (HPE) …

Categories: cs.AI, cs.CV

Efficient Portrait Matte Creation With Layer Diffusion and Connectivity Priors

Posted on January 28, 2025 by jarxiv

Abstract: Learning an effective deep portrait matting model requires both high-quality and large-scale …

Categories: cs.CV

Learning Point Spread Function Invertibility Assessment for Image Deconvolution

Posted on January 28, 2025 by jarxiv

Abstract: Deep-learning (DL)-based image deconvolution (ID) …

Categories: 68T10, 94A08, cs.CV, eess.IV, I.4.5

VCRScore: Image captioning metric based on V&L Transformers, CLIP, and precision-recall

Posted on January 28, 2025 by jarxiv

Abstract: Image captioning has become an essential vision and language research task. …

Categories: 68Txx, cs.CL, cs.CV, I.4

BAG: Body-Aligned 3D Wearable Asset Generation

Posted on January 28, 2025 by jarxiv

Abstract: While recent advances have shown remarkable progress in general 3D shape generation models, …

Categories: cs.AI, cs.CV, cs.GR

Text-driven Adaptation of Foundation Models for Few-shot Surgical Workflow Analysis

Posted on January 28, 2025 by jarxiv

Abstract: Purpose: Surgical workflow analysis is crucial for improving surgical efficiency and safety …

Categories: cs.AI, cs.CV

The Linear Attention Resurrection in Vision Transformer

Posted on January 28, 2025 by jarxiv

Abstract: Vision Transformers (ViTs) have recently, in computer …

Categories: cs.AI, cs.CV

MoColl: Agent-Based Specific and General Model Collaboration for Image Captioning

Posted on January 28, 2025 by jarxiv

Abstract: Image captioning, at the intersection of computer vision and natural language processing, is a …

Categories: cs.AI, cs.CV
