「cs.CV」カテゴリーアーカイブ

Enhancing Autonomous Navigation by Imaging Hidden Objects using Single-Photon LiDAR

投稿日: 2024年10月7日作成者: jarxiv

要約ロボット工学において、視界の限られた環境下でのロバストな自律ナビゲーション … 続きを読む →

カテゴリー: cs.CV, cs.RO | コメントを受け付けていません

Not All Diffusion Model Activations Have Been Evaluated as Discriminative Features

投稿日: 2024年10月7日作成者: jarxiv

要約拡散モデルは当初、画像生成のために設計された。最近の研究により、そのバック … 続きを読む →

カテゴリー: cs.AI, cs.CV | コメントを受け付けていません

VideoCLIP-XL: Advancing Long Description Understanding for Video CLIP Models

投稿日: 2024年10月7日作成者: jarxiv

要約対照的言語イメージ事前学習（CLIP）は広く研究され、多くのアプリケーショ … 続きを読む →

カテゴリー: cs.CL, cs.CV, cs.MM | コメントを受け付けていません

Look Twice Before You Answer: Memory-Space Visual Retracing for Hallucination Mitigation in Multimodal Large Language Models

投稿日: 2024年10月7日作成者: jarxiv

要約その素晴らしい能力にもかかわらず、マルチモーダル大規模言語モデル（MLLM … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Lessons Learned from a Unifying Empirical Study of Parameter-Efficient Transfer Learning (PETL) in Visual Recognition

投稿日: 2024年10月7日作成者: jarxiv

要約パラメータ効率的伝達学習(PETL)は、事前学習されたモデルのサイズが大き … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.LG | コメントを受け付けていません

Data Diet: Can Trimming PET/CT Datasets Enhance Lesion Segmentation?

投稿日: 2024年10月7日作成者: jarxiv

要約この研究では、autoPET3データセントリックトラックに出場するための我 … 続きを読む →

カテゴリー: cs.CV, eess.IV | コメントを受け付けていません

Resfusion: Denoising Diffusion Probabilistic Models for Image Restoration Based on Prior Residual Noise

投稿日: 2024年10月7日作成者: jarxiv

要約近年、ノイズ除去拡散モデルの研究は、画像復元の分野にも応用を広げている。従 … 続きを読む →

カテゴリー: cs.AI, cs.CV | コメントを受け付けていません

Variational Bayes Gaussian Splatting

投稿日: 2024年10月7日作成者: jarxiv

要約近年、3Dガウススプラッティングは、ガウスの混合を使用して3Dシーンをモデ … 続きを読む →

カテゴリー: cs.AI, cs.CV | コメントを受け付けていません

Unraveling the Truth: Do VLMs really Understand Charts? A Deep Dive into Consistency and Robustness

投稿日: 2024年10月7日作成者: jarxiv

要約図表質問応答（CQA）は、視覚言語理解の重要な分野である。しかし、この分野 … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.CV, cs.HC, cs.LG | コメントを受け付けていません

AID: Attention Interpolation of Text-to-Image Diffusion

投稿日: 2024年10月7日作成者: jarxiv

要約条件拡散モデルは、様々な環境において未見の画像を作成し、画像補間を支援する … 続きを読む →

カテゴリー: cs.AI, cs.CV | コメントを受け付けていません

「cs.CV」カテゴリーアーカイブ

Enhancing Autonomous Navigation by Imaging Hidden Objects using Single-Photon LiDAR

Not All Diffusion Model Activations Have Been Evaluated as Discriminative Features

VideoCLIP-XL: Advancing Long Description Understanding for Video CLIP Models

Look Twice Before You Answer: Memory-Space Visual Retracing for Hallucination Mitigation in Multimodal Large Language Models

Lessons Learned from a Unifying Empirical Study of Parameter-Efficient Transfer Learning (PETL) in Visual Recognition

Data Diet: Can Trimming PET/CT Datasets Enhance Lesion Segmentation?

Resfusion: Denoising Diffusion Probabilistic Models for Image Restoration Based on Prior Residual Noise

Variational Bayes Gaussian Splatting

Unraveling the Truth: Do VLMs really Understand Charts? A Deep Dive into Consistency and Robustness

AID: Attention Interpolation of Text-to-Image Diffusion

最近の投稿

最近のコメント

アーカイブ

カテゴリー