"cs.CV" Category Archive

CostFilter-AD: Enhancing Anomaly Detection through Matching Cost Filtering

Posted: May 26, 2025 | Author: jarxiv

Abstract: In unsupervised anomaly detection (UAD), anomalies in an input image relative to normal samples …

Categories: cs.AI, cs.CV, eess.IV

RestoreVAR: Visual Autoregressive Generation for All-in-One Image Restoration

Posted: May 26, 2025 | Author: jarxiv

Abstract: The use of latent diffusion models (LDMs), such as Stable Diffusion, for all-in-one image …

Categories: cs.AI, cs.CV

SHARDeg: A Benchmark for Skeletal Human Action Recognition in Degraded Scenarios

Posted: May 26, 2025 | Author: jarxiv

Abstract: Computer vision (CV) models for detection, prediction, or classification tasks …

Categories: cs.CV

SpikeGen: Generative Framework for Visual Spike Stream Processing

Posted: May 26, 2025 | Author: jarxiv

Abstract: With neuromorphic vision systems such as spike cameras, clear textures under dynamic conditions …

Categories: cs.CV

LookWhere? Efficient Visual Recognition by Learning Where to Look and What to See from Self-Supervision

Posted: May 26, 2025 | Author: jarxiv

Abstract: Vision transformers are ever larger, more accurate, and more expensive to compute. The number of tokens …

Categories: cs.CV

BOTM: Echocardiography Segmentation via Bi-directional Optimal Token Matching

Posted: May 26, 2025 | Author: jarxiv

Abstract: For existing echocardiography segmentation methods, shape variation, partial observation, and 2 …

Categories: cs.CV

FDBPL: Faster Distillation-Based Prompt Learning for Region-Aware Vision-Language Models Adaptation

Posted: May 26, 2025 | Author: jarxiv

Abstract: Widely adopted for adapting vision-language models (VLMs) to downstream tasks …

Categories: cs.AI, cs.CV

Multi-Faceted Multimodal Monosemanticity

Posted: May 26, 2025 | Author: jarxiv

Abstract: Humans experience the world through multiple modalities such as vision, language, and speech …

Categories: cs.AI, cs.CV

A Foundation Model Framework for Multi-View MRI Classification of Extramural Vascular Invasion and Mesorectal Fascia Invasion in Rectal Cancer

Posted: May 26, 2025 | Author: jarxiv

Abstract: Background: For extramural vascular invasion (EVI) and mesorectal fascia invasion (MFI), accurate M …

Categories: cs.CV, eess.IV

Semantic Correspondence: Unified Benchmarking and a Strong Baseline

Posted: May 26, 2025 | Author: jarxiv

Abstract: In establishing semantic correspondence, keypoints across different images with the same …

Categories: cs.CV
