「cs.CV」カテゴリーアーカイブ

Efficient Camera Exposure Control for Visual Odometry via Deep Reinforcement Learning

投稿日: 2024年12月24日作成者: jarxiv

要約ビジュアルオドメトリ (VO) システムの安定性は、特に照明の変化が大き … 続きを読む →

カテゴリー: cs.CV, cs.RO | コメントを受け付けていません

Diving into Self-Evolving Training for Multimodal Reasoning

投稿日: 2024年12月24日作成者: jarxiv

要約大規模マルチモーダルモデル (LMM) には推論能力が不可欠です。マル … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.CV, cs.LG | コメントを受け付けていません

CLEAR: Character Unlearning in Textual and Visual Modalities

投稿日: 2024年12月24日作成者: jarxiv

要約機械学習 (MU) は、特定の個人情報や危険な情報を削除することにより、深 … 続きを読む →

カテゴリー: cs.CL, cs.CV | コメントを受け付けていません

The Dynamic Duo of Collaborative Masking and Target for Advanced Masked Autoencoder Learning

投稿日: 2024年12月24日作成者: jarxiv

要約マスクされたオートエンコーダ (MAE) は最近、自己教師あり視覚表現学習 … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.LG, eess.IV, eess.SP | コメントを受け付けていません

Empathetic Response in Audio-Visual Conversations Using Emotion Preference Optimization and MambaCompressor

投稿日: 2024年12月24日作成者: jarxiv

要約カスタマーサポートやメンタルヘルスケアなど、人との対話が必要な分野でチャッ … 続きを読む →

カテゴリー: cs.AI, cs.CV | コメントを受け付けていません

HumanVBench: Exploring Human-Centric Video Understanding Capabilities of MLLMs with Synthetic Benchmark Data

投稿日: 2024年12月24日作成者: jarxiv

要約マルチモーダル大規模言語モデル (MLLM) の分野では、人間中心のビデオ … 続きを読む →

カテゴリー: cs.AI, cs.CV | コメントを受け付けていません

Enhancing Reconstruction-Based Out-of-Distribution Detection in Brain MRI with Model and Metric Ensembles

投稿日: 2024年12月24日作成者: jarxiv

要約画像内の異常なパターンがパフォーマンスを妨げる可能性があるため、配信外 ( … 続きを読む →

カテゴリー: 68T07, cs.CV, cs.LG, eess.IV | コメントを受け付けていません

Improved Cotton Leaf Disease Classification Using Parameter-Efficient Deep Learning Framework

投稿日: 2024年12月24日作成者: jarxiv

要約「白い黄金」とも呼ばれる綿作物は、主に葉に影響を及ぼすさまざまな病気が原因 … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.LG | コメントを受け付けていません

Ditto: Motion-Space Diffusion for Controllable Realtime Talking Head Synthesis

投稿日: 2024年12月24日作成者: jarxiv

要約拡散モデルの最近の進歩により、オーディオ駆動のトーキングヘッド合成に革命 … 続きを読む →

カテゴリー: cs.CV, cs.LG, cs.SD, eess.AS | コメントを受け付けていません

V$^2$-SfMLearner: Learning Monocular Depth and Ego-motion for Multimodal Wireless Capsule Endoscopy

投稿日: 2024年12月24日作成者: jarxiv

要約深層学習は、カプセル内視鏡ビデオから深度マップとカプセルのエゴモーションを … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.RO | コメントを受け付けていません

「cs.CV」カテゴリーアーカイブ

Efficient Camera Exposure Control for Visual Odometry via Deep Reinforcement Learning

Diving into Self-Evolving Training for Multimodal Reasoning

CLEAR: Character Unlearning in Textual and Visual Modalities

The Dynamic Duo of Collaborative Masking and Target for Advanced Masked Autoencoder Learning

Empathetic Response in Audio-Visual Conversations Using Emotion Preference Optimization and MambaCompressor

HumanVBench: Exploring Human-Centric Video Understanding Capabilities of MLLMs with Synthetic Benchmark Data

Enhancing Reconstruction-Based Out-of-Distribution Detection in Brain MRI with Model and Metric Ensembles

Improved Cotton Leaf Disease Classification Using Parameter-Efficient Deep Learning Framework

Ditto: Motion-Space Diffusion for Controllable Realtime Talking Head Synthesis

V$^2$-SfMLearner: Learning Monocular Depth and Ego-motion for Multimodal Wireless Capsule Endoscopy

最近の投稿

最近のコメント

アーカイブ

カテゴリー