「cs.CV」カテゴリーアーカイブ

P-TAME: Explain Any Image Classifier with Trained Perturbations

投稿日: 2025年1月30日作成者: jarxiv

要約予測を正当化する必要がある重要な分野での深いニューラルネットワーク（DNN … 続きを読む →

カテゴリー: cs.AI, cs.CV | コメントを受け付けていません

SSF: Sparse Long-Range Scene Flow for Autonomous Driving

投稿日: 2025年1月30日作成者: jarxiv

要約シーンフローにより、3D世界の環境の動き特性を理解することができます。そ … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Aggregation Schemes for Single-Vector WSI Representation Learning in Digital Pathology

投稿日: 2025年1月30日作成者: jarxiv

要約計算病理学で全体のスライド画像（WSI）を効率的に統合するための重要なステ … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.IR, eess.IV, q-bio.QM | コメントを受け付けていません

U2A: Unified Unimodal Adaptation for Robust and Efficient Multimodal Learning

投稿日: 2025年1月30日作成者: jarxiv

要約マルチモーダル学習は、多くの場合、最適なパフォーマンスを実現するために、新 … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.LG | コメントを受け付けていません

Synthesizing 3D Abstractions by Inverting Procedural Buildings with Transformers

投稿日: 2025年1月30日作成者: jarxiv

要約手続きモデルを反転させることを学ぶことにより、建物の抽象化を生成し、それら … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.LG | コメントを受け付けていません

PhysBench: Benchmarking and Enhancing Vision-Language Models for Physical World Understanding

投稿日: 2025年1月30日作成者: jarxiv

要約物理的な世界を理解することは、具体化されたAIの基本的な課題であり、エージ … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.CV, cs.LG, cs.RO | コメントを受け付けていません

Beyond-Labels: Advancing Open-Vocabulary Segmentation With Vision-Language Models

投稿日: 2025年1月30日作成者: jarxiv

要約自己学習学習は、効果的に訓練された場合、多数の画像または言語処理の問題を解 … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

GFE-Mamba: Mamba-based AD Multi-modal Progression Assessment via Generative Feature Extraction from MCI

投稿日: 2025年1月30日作成者: jarxiv

要約アルツハイマー病（AD）は、しばしば軽度の認知障害（MCI）に由来する進行 … 続きを読む →

カテゴリー: cs.AI, cs.CV | コメントを受け付けていません

PhysBench: Benchmarking and Enhancing Vision-Language Models for Physical World Understanding

投稿日: 2025年1月29日作成者: jarxiv

要約物理的な世界を理解することは、具体化されたAIの基本的な課題であり、エージ … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.CV, cs.LG, cs.RO | コメントを受け付けていません

BiFold: Bimanual Cloth Folding with Language Guidance

投稿日: 2025年1月29日作成者: jarxiv

要約布の折りたたみは、衣服の避けられない自己閉鎖、複雑なダイナミクス、衣服が持 … 続きを読む →

カテゴリー: cs.CV, cs.RO | コメントを受け付けていません

「cs.CV」カテゴリーアーカイブ

P-TAME: Explain Any Image Classifier with Trained Perturbations

SSF: Sparse Long-Range Scene Flow for Autonomous Driving

Aggregation Schemes for Single-Vector WSI Representation Learning in Digital Pathology

U2A: Unified Unimodal Adaptation for Robust and Efficient Multimodal Learning

Synthesizing 3D Abstractions by Inverting Procedural Buildings with Transformers

PhysBench: Benchmarking and Enhancing Vision-Language Models for Physical World Understanding

Beyond-Labels: Advancing Open-Vocabulary Segmentation With Vision-Language Models

GFE-Mamba: Mamba-based AD Multi-modal Progression Assessment via Generative Feature Extraction from MCI

PhysBench: Benchmarking and Enhancing Vision-Language Models for Physical World Understanding

BiFold: Bimanual Cloth Folding with Language Guidance

最近の投稿

最近のコメント

アーカイブ

カテゴリー