「cs.AI」カテゴリーアーカイブ

The Linear Attention Resurrection in Vision Transformer

投稿日: 2025年1月28日作成者: jarxiv

要約 Vision Transformers（VITS）は最近、コンピュータービ … 続きを読む →

カテゴリー: cs.AI, cs.CV | コメントを受け付けていません

MoColl: Agent-Based Specific and General Model Collaboration for Image Captioning

投稿日: 2025年1月28日作成者: jarxiv

要約画像キャプションは、コンピュータービジョンと自然言語処理の交差点における重 … 続きを読む →

カテゴリー: cs.AI, cs.CV | コメントを受け付けていません

UDBE: Unsupervised Diffusion-based Brightness Enhancement in Underwater Images

投稿日: 2025年1月28日作成者: jarxiv

要約水中環境でのアクティビティは、いくつかのシナリオで最も重要であり、水中画像 … 続きを読む →

カテゴリー: cs.AI, cs.CV, eess.IV | コメントを受け付けていません

From Dashcam Videos to Driving Simulations: Stress Testing Automated Vehicles against Rare Events

投稿日: 2025年1月28日作成者: jarxiv

要約現実的な運転シナリオを使用したシミュレーションで自動化された運転システム（ … 続きを読む →

カテゴリー: cs.AI, cs.CV | コメントを受け付けていません

Lightweight Weighted Average Ensemble Model for Pneumonia Detection in Chest X-Ray Images

投稿日: 2025年1月28日作成者: jarxiv

要約肺炎は、子供の病気と死の主な原因であり、早期かつ正確な検出の必要性を強調し … 続きを読む →

カテゴリー: cs.AI, cs.CV, eess.IV | コメントを受け付けていません

Return of the Encoder: Maximizing Parameter Efficiency for SLMs

投稿日: 2025年1月28日作成者: jarxiv

要約大規模なデコーダーのみの言語モデルの優位性は、シーケンス処理における基本的 … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.CV | コメントを受け付けていません

Brain-Adapter: Enhancing Neurological Disorder Analysis with Adapter-Tuning Multimodal Large Language Models

投稿日: 2025年1月28日作成者: jarxiv

要約脳障害を理解することは、正確な臨床診断と治療のために重要です。マルチモー … 続きを読む →

カテゴリー: cs.AI, cs.CV, eess.IV | コメントを受け付けていません

Mixture-of-Mamba: Enhancing Multi-Modal State-Space Models with Modality-Aware Sparsity

投稿日: 2025年1月28日作成者: jarxiv

要約状態空間モデル（SSM）は、シーケンシャルモデリングのための変圧器の効率的 … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.CV, cs.LG | コメントを受け付けていません

Large Models in Dialogue for Active Perception and Anomaly Detection

投稿日: 2025年1月28日作成者: jarxiv

要約自律航空監視は、人間が簡単にアクセスできない地域から情報を収集することを目 … 続きを読む →

カテゴリー: cs.AI, cs.CV | コメントを受け付けていません

MedPromptX: Grounded Multimodal Prompting for Chest X-ray Diagnosis

投稿日: 2025年1月28日作成者: jarxiv

要約胸部X線画像は、一般的に急性および慢性の心肺状態を予測するために使用されま … 続きを読む →

カテゴリー: cs.AI, cs.CV | コメントを受け付けていません

「cs.AI」カテゴリーアーカイブ

The Linear Attention Resurrection in Vision Transformer

MoColl: Agent-Based Specific and General Model Collaboration for Image Captioning

UDBE: Unsupervised Diffusion-based Brightness Enhancement in Underwater Images

From Dashcam Videos to Driving Simulations: Stress Testing Automated Vehicles against Rare Events

Lightweight Weighted Average Ensemble Model for Pneumonia Detection in Chest X-Ray Images

Return of the Encoder: Maximizing Parameter Efficiency for SLMs

Brain-Adapter: Enhancing Neurological Disorder Analysis with Adapter-Tuning Multimodal Large Language Models

Mixture-of-Mamba: Enhancing Multi-Modal State-Space Models with Modality-Aware Sparsity

Large Models in Dialogue for Active Perception and Anomaly Detection

MedPromptX: Grounded Multimodal Prompting for Chest X-ray Diagnosis

最近の投稿

最近のコメント

アーカイブ

カテゴリー