月別アーカイブ: 2024年5月

Test-Time Adaptation for Depth Completion

投稿日: 2024年5月9日作成者: jarxiv

要約一部の (ソース) データセットでトレーニングされたモデルをターゲットのテ … 続きを読む →

カテゴリー: cs.CV, cs.LG | コメントを受け付けていません

EVA-X: A Foundation Model for General Chest X-ray Analysis with Self-supervised Learning

投稿日: 2024年5月9日作成者: jarxiv

要約胸部疾患の診断と治療は、人間の健康を維持する上で重要な役割を果たします。 … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

BenthicNet: A global compilation of seafloor images for deep learning applications

投稿日: 2024年5月9日作成者: jarxiv

要約水中イメージングの進歩により、重要な底生生態系の監視に必要な広範な海底画像 … 続きを読む →

カテゴリー: cs.CV, cs.LG | コメントを受け付けていません

D4C Glove-train: Solving the RPM and Bongard-logo Problem by Circumscribing and Building Distribution for Concepts

投稿日: 2024年5月9日作成者: jarxiv

要約この論文は、抽象推論の領域、特に Raven の Progressive … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Attention-Driven Training-Free Efficiency Enhancement of Diffusion Models

投稿日: 2024年5月9日作成者: jarxiv

要約拡散モデル (DM) は、高品質で多様な画像を生成する際に優れたパフォーマ … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.LG, eess.IV, eess.SP | コメントを受け付けていません

THRONE: An Object-based Hallucination Benchmark for the Free-form Generations of Large Vision-Language Models

投稿日: 2024年5月9日作成者: jarxiv

要約大規模視覚言語モデル (LVLM) における幻覚の軽減は依然として未解決の … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.LG | コメントを受け付けていません

Multi-Modal Data-Efficient 3D Scene Understanding for Autonomous Driving

投稿日: 2024年5月9日作成者: jarxiv

要約自動運転における 3D シーンの理解を進めるには、データの効率的な利用が不 … 続きを読む →

カテゴリー: cs.CV, cs.LG, cs.RO | コメントを受け付けていません

OpenESS: Event-based Semantic Scene Understanding with Open Vocabularies

投稿日: 2024年5月9日作成者: jarxiv

要約イベントベースのセマンティックセグメンテーション (ESS) は、イベン … 続きを読む →

カテゴリー: cs.CV, cs.RO | コメントを受け付けていません

DistGrid: Scalable Scene Reconstruction with Distributed Multi-resolution Hash Grid

投稿日: 2024年5月9日作成者: jarxiv

要約 Neural Radiance Field~(NeRF) は、オブジェクト … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Inf-DiT: Upsampling Any-Resolution Image with Memory-Efficient Diffusion Transformer

投稿日: 2024年5月9日作成者: jarxiv

要約近年、拡散モデルは画像生成において目覚ましい性能を発揮しています。ただし … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

月別アーカイブ: 2024年5月

Test-Time Adaptation for Depth Completion

EVA-X: A Foundation Model for General Chest X-ray Analysis with Self-supervised Learning

BenthicNet: A global compilation of seafloor images for deep learning applications

D4C Glove-train: Solving the RPM and Bongard-logo Problem by Circumscribing and Building Distribution for Concepts

Attention-Driven Training-Free Efficiency Enhancement of Diffusion Models

THRONE: An Object-based Hallucination Benchmark for the Free-form Generations of Large Vision-Language Models

Multi-Modal Data-Efficient 3D Scene Understanding for Autonomous Driving

OpenESS: Event-based Semantic Scene Understanding with Open Vocabularies

DistGrid: Scalable Scene Reconstruction with Distributed Multi-resolution Hash Grid

Inf-DiT: Upsampling Any-Resolution Image with Memory-Efficient Diffusion Transformer

最近の投稿

最近のコメント

アーカイブ

カテゴリー