月別アーカイブ: 2025年3月

The PanAf-FGBG Dataset: Understanding the Impact of Backgrounds in Wildlife Behaviour Recognition

投稿日: 2025年3月3日作成者: jarxiv

要約カメラトラップビデオ映像のコンピュータービジョン分析は、キャプチャされた行 … 続きを読む →

カテゴリー: cs.AI, cs.CV | コメントを受け付けていません

ReMatching Dynamic Reconstruction Flow

投稿日: 2025年3月3日作成者: jarxiv

要約画像入力から動的シーンを再構築することは、多くのダウンストリームアプリケー … 続きを読む →

カテゴリー: cs.CV, cs.LG, eess.IV | コメントを受け付けていません

Towards long-term player tracking with graph hierarchies and domain-specific features

投稿日: 2025年3月3日作成者: jarxiv

要約チームスポーツ分析では、プレーヤーの外観の類似性、閉塞、および動的モーショ … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Anatomically-guided masked autoencoder pre-training for aneurysm detection

投稿日: 2025年3月3日作成者: jarxiv

要約頭蓋内動脈瘤は、世界中の罹患率と死亡率の主な原因であり、それらを手動で検出 … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

AeroReformer: Aerial Referring Transformer for UAV-based Referring Image Segmentation

投稿日: 2025年3月3日作成者: jarxiv

要約斬新で挑戦的なタスクとして、セグメンテーションを参照することで、コンピュー … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Dual Thinking and Logical Processing — Are Multi-modal Large Language Models Closing the Gap with Human Vision ?

投稿日: 2025年3月3日作成者: jarxiv

要約二重の思考フレームワークは、高速で直感的で、論理処理が遅くなることを考慮し … 続きを読む →

カテゴリー: cs.AI, cs.CV, eess.IV | コメントを受け付けていません

RoboBrain: A Unified Brain Model for Robotic Manipulation from Abstract to Concrete

投稿日: 2025年3月3日作成者: jarxiv

要約マルチモーダル大手言語モデル（MLLM）の最近の進歩は、さまざまなマルチモ … 続きを読む →

カテゴリー: cs.CV, cs.RO | コメントを受け付けていません

Exploring the Effectiveness of Object-Centric Representations in Visual Question Answering: Comparative Insights with Foundation Models

投稿日: 2025年3月3日作成者: jarxiv

要約離散オブジェクトの構成として視覚シーンをモデル化するオブジェクト中心（OC … 続きを読む →

カテゴリー: cs.CV, cs.LG | コメントを受け付けていません

Language-Informed Hyperspectral Image Synthesis for Imbalanced-Small Sample Classification via Semi-Supervised Conditional Diffusion Model

投稿日: 2025年3月3日作成者: jarxiv

要約データ増強は、ハイパースペクトル画像分類（HSIC）の不均衡なスマルサンプ … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Foundation Models — A Panacea for Artificial Intelligence in Pathology?

投稿日: 2025年3月3日作成者: jarxiv

要約病理学における人工知能（AI）の役割は、診断を支援することから、全体のスラ … 続きを読む →

カテゴリー: cs.AI, cs.CV | コメントを受け付けていません

月別アーカイブ: 2025年3月

The PanAf-FGBG Dataset: Understanding the Impact of Backgrounds in Wildlife Behaviour Recognition

ReMatching Dynamic Reconstruction Flow

Towards long-term player tracking with graph hierarchies and domain-specific features

Anatomically-guided masked autoencoder pre-training for aneurysm detection

AeroReformer: Aerial Referring Transformer for UAV-based Referring Image Segmentation

Dual Thinking and Logical Processing — Are Multi-modal Large Language Models Closing the Gap with Human Vision ?

RoboBrain: A Unified Brain Model for Robotic Manipulation from Abstract to Concrete

Exploring the Effectiveness of Object-Centric Representations in Visual Question Answering: Comparative Insights with Foundation Models

Language-Informed Hyperspectral Image Synthesis for Imbalanced-Small Sample Classification via Semi-Supervised Conditional Diffusion Model

Foundation Models — A Panacea for Artificial Intelligence in Pathology?

最近の投稿

最近のコメント

アーカイブ

カテゴリー