Monthly Archives: February 2024

Controllable Dense Captioner with Multimodal Embedding Bridging

Posted on February 1, 2024 by jarxiv

Summary: In this paper, by introducing language guidance, the user's … for dense captioning … Continue reading →

Category: cs.CV

Source-free Domain Adaptive Object Detection in Remote Sensing Images

Posted on February 1, 2024 by jarxiv

Summary: In recent research, the domain gap in remote sensing (RS) images … Continue reading →

Category: cs.CV

On the Generalizability of ECG-based Stress Detection Models

Posted on February 1, 2024 by jarxiv

Summary: Stress is prevalent in many aspects of daily life, including work, healthcare, and social interaction … Continue reading →

Categories: cs.AI, cs.CV, cs.LG

Endo-4DGS: Endoscopic Monocular Scene Reconstruction with 4D Gaussian Splatting

Posted on February 1, 2024 by jarxiv

Summary: In the field of robot-assisted minimally invasive surgery, dynamic scene reconstruction … downstream … Continue reading →

Category: cs.CV

PanAf20K: A Large Video Dataset for Wild Ape Detection and Behaviour Recognition

Posted on February 1, 2024 by jarxiv

Summary: We … the largest and most diverse annotated open-access … of great apes in their natural environment … Continue reading →

Category: cs.CV

HyperZ$\cdot$Z$\cdot$W Operator Connects Slow-Fast Networks for Full Context Interaction

Posted on February 1, 2024 by jarxiv

Summary: The self-attention mechanism, with trainable parameters that are very … Continue reading →

Category: cs.CV

Collaborative Multi-Object Tracking with Conformal Uncertainty Propagation

Posted on February 1, 2024 by jarxiv

Summary: Object detection and multi-object tracking (MOT) are key components of autonomous driving systems … Continue reading →

Category: cs.CV

Combining Deep Learning and Street View Imagery to Map Smallholder Crop Types

Posted on February 1, 2024 by jarxiv

Summary: Accurate crop type maps monitor yield trends at scale, forecast global crop production, … Continue reading →

Categories: cs.AI, cs.CV

MelNet: A Real-Time Deep Learning Algorithm for Object Detection

Posted on February 1, 2024 by jarxiv

Summary: In this study, a new deep learning algorithm for object detection named MelNet … Continue reading →

Categories: cs.AI, cs.CV, cs.LG

Enhancing Multimodal Large Language Models with Vision Detection Models: An Empirical Study

Posted on February 1, 2024 by jarxiv

Summary: Multimodal large language models (MLLMs), which integrate text and image modalities, … Continue reading →

Categories: cs.AI, cs.CV
