「cs.CV」カテゴリーアーカイブ

Improving Zero-Shot Object-Level Change Detection by Incorporating Visual Correspondence

投稿日: 2025年1月17日作成者: jarxiv

要約異なるビューにわたる 2 つの画像間のオブジェクトレベルの変化を検出するこ … 続きを読む →

カテゴリー: cs.AI, cs.CV | コメントを受け付けていません

A Comprehensive Survey of Foundation Models in Medicine

投稿日: 2025年1月17日作成者: jarxiv

要約基礎モデル (FM) は、多くの場合、自己教師あり学習手法を使用して、大規 … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.LG | コメントを受け付けていません

Unified Face Matching and Physical-Digital Spoofing Attack Detection

投稿日: 2025年1月17日作成者: jarxiv

要約顔認識テクノロジーは、セキュリティ、監視、認証システムの状況を劇的に変革し … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

A Comparative Study on Multi-task Uncertainty Quantification in Semantic Segmentation and Monocular Depth Estimation

投稿日: 2025年1月17日作成者: jarxiv

要約ディープニューラルネットワークは、セマンティックセグメンテーションや … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.LG | コメントを受け付けていません

VIS-MAE: An Efficient Self-supervised Learning Approach on Medical Image Segmentation and Classification

投稿日: 2025年1月17日作成者: jarxiv

要約人工知能 (AI) は、医療画像における診断とセグメンテーションに革命をも … 続きを読む →

カテゴリー: cs.CV, eess.IV | コメントを受け付けていません

Robin: a Suite of Multi-Scale Vision-Language Models and the CHIRP Evaluation Benchmark

投稿日: 2025年1月17日作成者: jarxiv

要約過去数年間における視覚言語モデル (VLM) の急増により、厳密かつ包括的 … 続きを読む →

カテゴリー: cs.AI, cs.CV | コメントを受け付けていません

Super-class guided Transformer for Zero-Shot Attribute Classification

投稿日: 2025年1月17日作成者: jarxiv

要約属性分類は、画像領域内の特定の特徴を識別するために重要です。ビジョン言語 … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Vulnerability-Aware Spatio-Temporal Learning for Generalizable and Interpretable Deepfake Video Detection

投稿日: 2025年1月17日作成者: jarxiv

要約偽造シーケンスには空間的および時間的なアーチファクトが複雑に絡み合っている … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Fine-Grained Image-Text Correspondence with Cost Aggregation for Open-Vocabulary Part Segmentation

投稿日: 2025年1月17日作成者: jarxiv

要約 Open-Vocabulary Part Segmentation (OV … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Mitigating Hallucinations in Large Vision-Language Models via DPO: On-Policy Data Hold the Key

投稿日: 2025年1月17日作成者: jarxiv

要約幻覚は依然として大規模視覚言語モデル (LVLM) にとって大きな課題です … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

「cs.CV」カテゴリーアーカイブ

Improving Zero-Shot Object-Level Change Detection by Incorporating Visual Correspondence

A Comprehensive Survey of Foundation Models in Medicine

Unified Face Matching and Physical-Digital Spoofing Attack Detection

A Comparative Study on Multi-task Uncertainty Quantification in Semantic Segmentation and Monocular Depth Estimation

VIS-MAE: An Efficient Self-supervised Learning Approach on Medical Image Segmentation and Classification

Robin: a Suite of Multi-Scale Vision-Language Models and the CHIRP Evaluation Benchmark

Super-class guided Transformer for Zero-Shot Attribute Classification

Vulnerability-Aware Spatio-Temporal Learning for Generalizable and Interpretable Deepfake Video Detection

Fine-Grained Image-Text Correspondence with Cost Aggregation for Open-Vocabulary Part Segmentation

Mitigating Hallucinations in Large Vision-Language Models via DPO: On-Policy Data Hold the Key

最近の投稿

最近のコメント

アーカイブ

カテゴリー