月別アーカイブ: 2025年4月

MBE-ARI: A Multimodal Dataset Mapping Bi-directional Engagement in Animal-Robot Interaction

投稿日: 2025年4月14日作成者: jarxiv

要約ロボットは、ボディーランゲージ、動き、発声などの動物の複雑でマルチモーダル … 続きを読む →

カテゴリー: cs.CV, cs.RO | コメントを受け付けていません

A Multi-Modal AI System for Screening Mammography: Integrating 2D and 3D Imaging to Improve Breast Cancer Detection in a Prospective Clinical Study

投稿日: 2025年4月14日作成者: jarxiv

要約デジタル乳房トモシンセシス（DBT）は、フルフィールドデジタルマンモグラフ … 続きを読む →

カテゴリー: cs.CV, cs.LG, eess.IV | コメントを受け付けていません

The Invisible EgoHand: 3D Hand Forecasting through EgoBody Pose Estimation

投稿日: 2025年4月14日作成者: jarxiv

要約エゴセントリックな視点からの手の動きとポーズを予測することは、人間の意図を … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Fine-Grained Retrieval-Augmented Generation for Visual Question Answering

投稿日: 2025年4月14日作成者: jarxiv

要約視覚的な質問回答（VQA）は、画像からの情報を利用することにより、自然言語 … 続きを読む →

カテゴリー: cs.AI, cs.CV | コメントを受け付けていません

X2BR: High-Fidelity 3D Bone Reconstruction from a Planar X-Ray Image with Hybrid Neural Implicit Methods

投稿日: 2025年4月14日作成者: jarxiv

要約単一の平面X線からの正確な3D骨再建は、解剖学的複雑さと限られた入力データ … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

HRDecoder: High-Resolution Decoder Network for Fundus Image Lesion Segmentation

投稿日: 2025年4月14日作成者: jarxiv

要約 Fundus画像の正確なセグメンテーションには高解像度が重要ですが、高解像 … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

UNEM: UNrolled Generalized EM for Transductive Few-Shot Learning

投稿日: 2025年4月14日作成者: jarxiv

要約トランスダクトの少数のショット学習は、最近、コンピュータービジョンにおいて … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Seaweed-7B: Cost-Effective Training of Video Generation Foundation Model

投稿日: 2025年4月14日作成者: jarxiv

要約このテクニカルレポートは、ビデオジェネレーションファンデーションモデルをト … 続きを読む →

カテゴリー: cs.AI, cs.CV | コメントを受け付けていません

Hypergraph Vision Transformers: Images are More than Nodes, More than Edges

投稿日: 2025年4月14日作成者: jarxiv

要約コンピュータービジョンの最近の進歩により、さまざまなタスクにわたる視覚変圧 … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Generating Fine Details of Entity Interactions

投稿日: 2025年4月14日作成者: jarxiv

要約画像は、オブジェクトを描写するだけでなく、それらの間の豊富な相互作用もカプ … 続きを読む →

カテゴリー: cs.CL, cs.CV, cs.LG | コメントを受け付けていません

月別アーカイブ: 2025年4月

MBE-ARI: A Multimodal Dataset Mapping Bi-directional Engagement in Animal-Robot Interaction

A Multi-Modal AI System for Screening Mammography: Integrating 2D and 3D Imaging to Improve Breast Cancer Detection in a Prospective Clinical Study

The Invisible EgoHand: 3D Hand Forecasting through EgoBody Pose Estimation

Fine-Grained Retrieval-Augmented Generation for Visual Question Answering

X2BR: High-Fidelity 3D Bone Reconstruction from a Planar X-Ray Image with Hybrid Neural Implicit Methods

HRDecoder: High-Resolution Decoder Network for Fundus Image Lesion Segmentation

UNEM: UNrolled Generalized EM for Transductive Few-Shot Learning

Seaweed-7B: Cost-Effective Training of Video Generation Foundation Model

Hypergraph Vision Transformers: Images are More than Nodes, More than Edges

Generating Fine Details of Entity Interactions

最近の投稿

最近のコメント

アーカイブ

カテゴリー