月別アーカイブ: 2025年4月

Efficient Contrastive Decoding with Probabilistic Hallucination Detection – Mitigating Hallucinations in Large Vision Language Models –

投稿日: 2025年4月17日作成者: jarxiv

要約大規模なビジョン言語モデル（LVLMS）の最近の進歩にもかかわらず、これら … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.CV, cs.LG | コメントを受け付けていません

OmniDrive: A Holistic Vision-Language Dataset for Autonomous Driving with Counterfactual Reasoning

投稿日: 2025年4月17日作成者: jarxiv

要約ビジョン言語モデル（VLM）の進歩により、強力な推論能力を活用するための自 … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Document Parsing Unveiled: Techniques, Challenges, and Prospects for Structured Information Extraction

投稿日: 2025年4月17日作成者: jarxiv

要約ドキュメント解析は、契約、学術論文、請求書などの非構造化および半構造化され … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.CV, cs.MM | コメントを受け付けていません

FocusedAD: Character-centric Movie Audio Description

投稿日: 2025年4月17日作成者: jarxiv

要約映画オーディオの説明（AD）は、対話のないセグメント中に視覚的なコンテンツ … 続きを読む →

カテゴリー: cs.CV, I.2.10 | コメントを受け付けていません

StructRe: Rewriting for Structured Shape Modeling

投稿日: 2025年4月17日作成者: jarxiv

要約人工の3D形状は、部品と階層で自然に編成されています。このような構造は、 … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Cross-Modal Mapping: Mitigating the Modality Gap for Few-Shot Image Classification

投稿日: 2025年4月17日作成者: jarxiv

要約少ないショット画像分類は、コンピュータービジョンの分野、特にデータスカース … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Exploring Self-supervised Skeleton-based Action Recognition in Occluded Environments

投稿日: 2025年4月17日作成者: jarxiv

要約アクション認識を自律的なロボットシステムに統合するには、人の閉塞などの課題 … 続きを読む →

カテゴリー: cs.CV, cs.MM, cs.RO, eess.IV | コメントを受け付けていません

OmniDrive: A Holistic Vision-Language Dataset for Autonomous Driving with Counterfactual Reasoning

投稿日: 2025年4月17日作成者: jarxiv

要約ビジョン言語モデル（VLM）の進歩により、強力な推論能力を活用するための自 … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Strategic Client Selection to Address Non-IIDness in HAPS-enabled FL Networks

投稿日: 2025年4月17日作成者: jarxiv

要約高高度プラットフォームステーション（HAPS）によってサポートされている非 … 続きを読む →

カテゴリー: cs.CV, cs.LG, cs.NI | コメントを受け付けていません

Deep Anatomical Federated Network (Dafne): An open client-server framework for the continuous, collaborative improvement of deep learning-based medical image segmentation

投稿日: 2025年4月17日作成者: jarxiv

要約目的：Dafne（深い解剖学的フェデレーションネットワーク）を提示して評価 … 続きを読む →

カテゴリー: cs.CV, cs.LG, eess.IV | コメントを受け付けていません

月別アーカイブ: 2025年4月

Efficient Contrastive Decoding with Probabilistic Hallucination Detection – Mitigating Hallucinations in Large Vision Language Models –

OmniDrive: A Holistic Vision-Language Dataset for Autonomous Driving with Counterfactual Reasoning

Document Parsing Unveiled: Techniques, Challenges, and Prospects for Structured Information Extraction

FocusedAD: Character-centric Movie Audio Description

StructRe: Rewriting for Structured Shape Modeling

Cross-Modal Mapping: Mitigating the Modality Gap for Few-Shot Image Classification

Exploring Self-supervised Skeleton-based Action Recognition in Occluded Environments

OmniDrive: A Holistic Vision-Language Dataset for Autonomous Driving with Counterfactual Reasoning

Strategic Client Selection to Address Non-IIDness in HAPS-enabled FL Networks

Deep Anatomical Federated Network (Dafne): An open client-server framework for the continuous, collaborative improvement of deep learning-based medical image segmentation

最近の投稿

最近のコメント

アーカイブ

カテゴリー