Category archive: cs.CV

Towards long-term player tracking with graph hierarchies and domain-specific features

Posted on March 3, 2025 by jarxiv

Summary: In team sports analysis, similarity in player appearance, occlusion, and dynamic motion …

Categories: cs.CV

Anatomically-guided masked autoencoder pre-training for aneurysm detection

Posted on March 3, 2025 by jarxiv

Summary: Intracranial aneurysms are a major cause of morbidity and mortality worldwide, and detecting them manually …

Categories: cs.CV

AeroReformer: Aerial Referring Transformer for UAV-based Referring Image Segmentation

Posted on March 3, 2025 by jarxiv

Summary: As a novel and challenging task, through referring segmentation, computer …

Categories: cs.CV

Dual Thinking and Logical Processing — Are Multi-modal Large Language Models Closing the Gap with Human Vision ?

Posted on March 3, 2025 by jarxiv

Summary: The dual thinking framework considers fast, intuitive thinking and slower logical processing …

Categories: cs.AI, cs.CV, eess.IV

RoboBrain: A Unified Brain Model for Robotic Manipulation from Abstract to Concrete

Posted on March 3, 2025 by jarxiv

Summary: Recent advances in multimodal large language models (MLLMs), in various multimodal …

Categories: cs.CV, cs.RO

Exploring the Effectiveness of Object-Centric Representations in Visual Question Answering: Comparative Insights with Foundation Models

Posted on March 3, 2025 by jarxiv

Summary: Modeling visual scenes as compositions of discrete objects, object-centric (OC …

Categories: cs.CV, cs.LG

Language-Informed Hyperspectral Image Synthesis for Imbalanced-Small Sample Classification via Semi-Supervised Conditional Diffusion Model

Posted on March 3, 2025 by jarxiv

Summary: Data augmentation, for imbalanced small-sample hyperspectral image classification (HSIC), …

Categories: cs.CV

Foundation Models — A Panacea for Artificial Intelligence in Pathology?

Posted on March 3, 2025 by jarxiv

Summary: The role of artificial intelligence (AI) in pathology, from assisting diagnosis to whole-slide …

Categories: cs.AI, cs.CV

Adaptive Keyframe Sampling for Long Video Understanding

Posted on March 3, 2025 by jarxiv

Summary: Multimodal large language models (MLLMs), using visual input as context for large …

Categories: cs.AI, cs.CV, cs.LG

Back to the Future Cyclopean Stereo: a human perception approach unifying deep and geometric constraints

Posted on March 3, 2025 by jarxiv

Summary: As seen in the cyclopean eye model, which incorporates depth discontinuities and occlusions, …

Categories: cs.CV
