Category archive: "cs.CV"

Scalable Whole Slide Image Representation Using K-Mean Clustering and Fisher Vector Aggregation

Posted: January 22, 2025 | Author: jarxiv

Abstract: Whole slide images (WSIs) are high-resolution, gigapixel-sized images … Continue reading →

Categories: cs.AI, cs.CV, cs.LG

Proxies for Distortion and Consistency with Applications for Real-World Image Restoration

Posted: January 22, 2025 | Author: jarxiv

Abstract: Real-world image restoration deals with recovering images that have undergone an unknown degradation. … Continue reading →

Categories: cs.AI, cs.CV, cs.LG, eess.IV

Teacher Encoder-Student Decoder Denoising Guided Segmentation Network for Anomaly Detection

Posted: January 22, 2025 | Author: jarxiv

Abstract: Visual anomaly detection is a highly challenging task, often framed as one-class classification and … Continue reading →

Categories: cs.AI, cs.CV

Beyond Specialization: Assessing the Capabilities of MLLMs in Age and Gender Estimation

Posted: January 22, 2025 | Author: jarxiv

Abstract: Multimodal large language models (MLLMs) have recently become very popular … Continue reading →

Categories: cs.AI, cs.CV, cs.LG, I.2.0

Explainability for Vision Foundation Models: A Survey

Posted: January 22, 2025 | Author: jarxiv

Abstract: As artificial intelligence systems become increasingly integrated into daily life, the field of explainability … Continue reading →

Categories: cs.CV

Fixing Imbalanced Attention to Mitigate In-Context Hallucination of Large Vision-Language Model

Posted: January 22, 2025 | Author: jarxiv

Abstract: Large Vision Language Models (LVLMs) … Continue reading →

Categories: cs.CL, cs.CV

Multi-Scale Texture Loss for CT denoising with GANs

Posted: January 22, 2025 | Author: jarxiv

Abstract: Generative Adversarial Networks (GANs) … Continue reading →

Categories: cs.CV, cs.LG, eess.IV

RL-RC-DoT: A Block-level RL agent for Task-Aware Video Compression

Posted: January 22, 2025 | Author: jarxiv

Abstract: Video encoders, under bitrate constraints, minimize reconstruction error … Continue reading →

Categories: cs.CV, cs.LG, eess.IV

VITA-1.5: Towards GPT-4o Level Real-Time Vision and Speech Interaction

Posted: January 22, 2025 | Author: jarxiv

Abstract: Recent multimodal large language models (MLLMs) typically … Continue reading →

Categories: cs.CV, cs.SD, eess.AS

Early Detection and Classification of Breast Cancer Using Deep Learning Techniques

Posted: January 22, 2025 | Author: jarxiv

Abstract: According to the WHO, breast cancer is the deadliest cancer, with a vast number of patients worldwide each year … Continue reading →

Categories: cs.CV, cs.LG, J.3
