月別アーカイブ: 2025年5月

VP Lab: a PEFT-Enabled Visual Prompting Laboratory for Semantic Segmentation

投稿日: 2025年5月22日作成者: jarxiv

要約大規模な前処理されたビジョンバックボーンは、セマンティックセグメンテーショ … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Beyond Classification: Evaluating Diffusion Denoised Smoothing for Security-Utility Trade off

投稿日: 2025年5月22日作成者: jarxiv

要約基礎モデルは、さまざまなタスクで印象的なパフォーマンスを示していますが、敵 … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.LG | コメントを受け付けていません

A Methodology to Evaluate Strategies Predicting Rankings on Unseen Domains

投稿日: 2025年5月22日作成者: jarxiv

要約多くの場合、複数のエンティティ（メソッド、アルゴリズム、手順、ソリューショ … 続きを読む →

カテゴリー: cs.CV, cs.PF | コメントを受け付けていません

Aggregation Schemes for Single-Vector WSI Representation Learning in Digital Pathology

投稿日: 2025年5月22日作成者: jarxiv

要約計算病理学で全体のスライド画像（WSI）を効率的に統合するための重要なステ … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.IR, eess.IV, q-bio.QM | コメントを受け付けていません

LENS: Multi-level Evaluation of Multimodal Reasoning with Large Language Models

投稿日: 2025年5月22日作成者: jarxiv

要約マルチモーダル大手言語モデル（MLLM）は、視覚的および言語情報の統合に大 … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

SNAP: A Benchmark for Testing the Effects of Capture Conditions on Fundamental Vision Tasks

投稿日: 2025年5月22日作成者: jarxiv

要約 Deep-Rearningベースの（DL）コンピュータービジョンアルゴリズ … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Oral Imaging for Malocclusion Issues Assessments: OMNI Dataset, Deep Learning Baselines and Benchmarking

投稿日: 2025年5月22日作成者: jarxiv

要約不正咬合は歯科矯正の主要な課題であり、その複雑な症状と多様な臨床症状により … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

FragFake: A Dataset for Fine-Grained Detection of Edited Images with Vision Language Models

投稿日: 2025年5月22日作成者: jarxiv

要約特に、最新の拡散モデルと画像編集方法が非常に現実的な操作を生成する可能性が … 続きを読む →

カテゴリー: cs.AI, cs.CR, cs.CV | コメントを受け付けていません

How far can we go with ImageNet for Text-to-Image generation?

投稿日: 2025年5月22日作成者: jarxiv

要約最近のテキストからイメージの生成モデルは、「より大きなISが優れている」パ … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

The Devil is in Fine-tuning and Long-tailed Problems:A New Benchmark for Scene Text Detection

投稿日: 2025年5月22日作成者: jarxiv

要約シーンのテキスト検出では、アカデミックベンチマークで優れた高性能な方法の出 … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

月別アーカイブ: 2025年5月

VP Lab: a PEFT-Enabled Visual Prompting Laboratory for Semantic Segmentation

Beyond Classification: Evaluating Diffusion Denoised Smoothing for Security-Utility Trade off

A Methodology to Evaluate Strategies Predicting Rankings on Unseen Domains

Aggregation Schemes for Single-Vector WSI Representation Learning in Digital Pathology

LENS: Multi-level Evaluation of Multimodal Reasoning with Large Language Models

SNAP: A Benchmark for Testing the Effects of Capture Conditions on Fundamental Vision Tasks

Oral Imaging for Malocclusion Issues Assessments: OMNI Dataset, Deep Learning Baselines and Benchmarking

FragFake: A Dataset for Fine-Grained Detection of Edited Images with Vision Language Models

How far can we go with ImageNet for Text-to-Image generation?

The Devil is in Fine-tuning and Long-tailed Problems:A New Benchmark for Scene Text Detection

最近の投稿

最近のコメント

アーカイブ

カテゴリー