月別アーカイブ: 2024年3月

MedPromptX: Grounded Multimodal Prompting for Chest X-ray Diagnosis

投稿日: 2024年3月27日作成者: jarxiv

要約胸部 X 線画像は、急性および慢性の心肺疾患の予測によく使用されますが、胸 … 続きを読む →

カテゴリー: cs.AI, cs.CV | コメントを受け付けていません

MUTE-SLAM: Real-Time Neural SLAM with Multiple Tri-Plane Hash Representations

投稿日: 2024年3月27日作成者: jarxiv

要約効率的なシーン表現のために複数のトライプレーンハッシュエンコーディング … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

ReMoS: 3D Motion-Conditioned Reaction Synthesis for Two-Person Interactions

投稿日: 2024年3月27日作成者: jarxiv

要約 3D ヒューマンモーション合成の現在のアプローチは、さまざまなアクション … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

CT Synthesis with Conditional Diffusion Models for Abdominal Lymph Node Segmentation

投稿日: 2024年3月27日作成者: jarxiv

要約医用画像セグメンテーションにおいてディープラーニング手法が大きな成功を収め … 続きを読む →

カテゴリー: cs.CV, eess.IV | コメントを受け付けていません

Pushing Auto-regressive Models for 3D Shape Generation at Capacity and Scalability

投稿日: 2024年3月27日作成者: jarxiv

要約自己回帰モデルは、グリッド空間内の結合分布をモデル化することにより、2D … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Exploiting Semantic Reconstruction to Mitigate Hallucinations in Vision-Language Models

投稿日: 2024年3月27日作成者: jarxiv

要約視覚言語モデルにおける幻覚は、特に長いキャプションの生成において、その信頼 … 続きを読む →

カテゴリー: cs.CL, cs.CV | コメントを受け付けていません

GenesisTex: Adapting Image Denoising Diffusion to Texture Space

投稿日: 2024年3月27日作成者: jarxiv

要約テキスト記述から 3D ジオメトリのテクスチャを合成する新しい方法である … 続きを読む →

カテゴリー: cs.CV, cs.GR | コメントを受け付けていません

Evaluating the Efficacy of Prompt-Engineered Large Multimodal Models Versus Fine-Tuned Vision Transformers in Image-Based Security Applications

投稿日: 2024年3月27日作成者: jarxiv

要約大規模言語モデル (LLM) の成功により、Gemini-pro などの大 … 続きを読む →

カテゴリー: cs.AI, cs.CR, cs.CV | コメントを受け付けていません

Towards 3D Vision with Low-Cost Single-Photon Cameras

投稿日: 2024年3月27日作成者: jarxiv

要約小型でエネルギー効率が高く、低コストの単一光子カメラによる測定に基づいて、 … 続きを読む →

カテゴリー: cs.CV, eess.IV | コメントを受け付けていません

HIMap: HybrId Representation Learning for End-to-end Vectorized HD Map Construction

投稿日: 2024年3月27日作成者: jarxiv

要約ベクトル化された高精細 (HD) 地図の構築には、地図要素 (道路境界線、 … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

月別アーカイブ: 2024年3月

MedPromptX: Grounded Multimodal Prompting for Chest X-ray Diagnosis

MUTE-SLAM: Real-Time Neural SLAM with Multiple Tri-Plane Hash Representations

ReMoS: 3D Motion-Conditioned Reaction Synthesis for Two-Person Interactions

CT Synthesis with Conditional Diffusion Models for Abdominal Lymph Node Segmentation

Pushing Auto-regressive Models for 3D Shape Generation at Capacity and Scalability

Exploiting Semantic Reconstruction to Mitigate Hallucinations in Vision-Language Models

GenesisTex: Adapting Image Denoising Diffusion to Texture Space

Evaluating the Efficacy of Prompt-Engineered Large Multimodal Models Versus Fine-Tuned Vision Transformers in Image-Based Security Applications

Towards 3D Vision with Low-Cost Single-Photon Cameras

HIMap: HybrId Representation Learning for End-to-end Vectorized HD Map Construction

最近の投稿

最近のコメント

アーカイブ

カテゴリー