"cs.CV" Category Archive

Brain-Adapter: Enhancing Neurological Disorder Analysis with Adapter-Tuning Multimodal Large Language Models

Posted on: January 28, 2025 | Author: jarxiv

Abstract: Understanding brain disorders is crucial for accurate clinical diagnosis and treatment. …

Categories: cs.AI, cs.CV, eess.IV

PEP-GS: Perceptually-Enhanced Precise Structured 3D Gaussians for View-Adaptive Rendering

Posted on: January 28, 2025 | Author: jarxiv

Abstract: Recently, 3D Gaussian Splatting (3D-GS) …

Categories: cs.CV

Multi-view Structural Convolution Network for Domain-Invariant Point Cloud Recognition of Autonomous Vehicles

Posted on: January 28, 2025 | Author: jarxiv

Abstract: In the field of computer vision, point cloud representation has recently …

Categories: cs.CV

Mixture-of-Mamba: Enhancing Multi-Modal State-Space Models with Modality-Aware Sparsity

Posted on: January 28, 2025 | Author: jarxiv

Abstract: State space models (SSMs) are, for sequential modeling, an efficient Transformer …

Categories: cs.AI, cs.CL, cs.CV, cs.LG

FALCON: Resolving Visual Redundancy and Fragmentation in High-resolution Multimodal Large Language Models via Visual Registers

Posted on: January 28, 2025 | Author: jarxiv

Abstract: Incorporating high-resolution visual inputs enhances visual perception capabilities for real-world tasks …

Categories: cs.CV

Large Models in Dialogue for Active Perception and Anomaly Detection

Posted on: January 28, 2025 | Author: jarxiv

Abstract: Autonomous aerial monitoring, gathering information from areas that humans cannot easily access, …

Categories: cs.AI, cs.CV

MedPromptX: Grounded Multimodal Prompting for Chest X-ray Diagnosis

Posted on: January 28, 2025 | Author: jarxiv

Abstract: Chest X-ray images are commonly used to predict acute and chronic cardiopulmonary conditions …

Categories: cs.AI, cs.CV

LinPrim: Linear Primitives for Differentiable Volumetric Rendering

Posted on: January 28, 2025 | Author: jarxiv

Abstract: Volume rendering, optimizing 3D scene representations directly from observed views, …

Categories: cs.CV

Adaptive Iterative Compression for High-Resolution Files: an Approach Focused on Preserving Visual Quality in Cinematic Workflows

Posted on: January 28, 2025 | Author: jarxiv

Abstract: In this study, high-resolution DPX … used in cinematography workflows and digital preservation …

Categories: cs.CV, cs.ET, cs.LG, cs.PF

GUI-Bee: Align GUI Action Grounding to Novel Environments via Autonomous Exploration

Posted on: January 28, 2025 | Author: jarxiv

Abstract: Graphical user interface (GUI) action grounding, GUI screen …

Categories: cs.AI, cs.CL, cs.CV, cs.LG
