「cs.CV」カテゴリーアーカイブ

Optimal-state Dynamics Estimation for Physics-based Human Motion Capture from Videos

投稿日: 2025年5月15日作成者: jarxiv

要約単眼ビデオからの人間のモーションキャプチャは、近年大きな進歩を遂げています … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Variational Visual Question Answering

投稿日: 2025年5月15日作成者: jarxiv

要約視覚的な質問応答（VQA）のマルチモーダルモデルでは顕著な進歩にもかかわら … 続きを読む →

カテゴリー: cs.AI, cs.CV | コメントを受け付けていません

Embodied-Reasoner: Synergizing Visual Search, Reasoning, and Action for Embodied Interactive Tasks

投稿日: 2025年5月15日作成者: jarxiv

要約深い思考モデルの最近の進歩により、数学的およびコーディングタスクに関する顕 … 続きを読む →

カテゴリー: cs.CL, cs.CV | コメントを受け付けていません

LightLab: Controlling Light Sources in Images with Diffusion Models

投稿日: 2025年5月15日作成者: jarxiv

要約画像内の光源に対するきめの細かいパラメトリック制御のためのシンプルでありな … 続きを読む →

カテゴリー: cs.CV, cs.GR | コメントを受け付けていません

UWAV: Uncertainty-weighted Weakly-supervised Audio-Visual Video Parsing

投稿日: 2025年5月15日作成者: jarxiv

要約オーディオビジュアルビデオの解析（AVVP）は、両方のユニモーダルイベント … 続きを読む →

カテゴリー: cs.CV, cs.SD, eess.AS | コメントを受け付けていません

Thermal Detection of People with Mobility Restrictions for Barrier Reduction at Traffic Lights Controlled Intersections

投稿日: 2025年5月15日作成者: jarxiv

要約コンピュータービジョンの深い学習における急速な進歩により、RGBカメラベー … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

MGPATH: Vision-Language Model with Multi-Granular Prompt Learning for Few-Shot WSI Classification

投稿日: 2025年5月15日作成者: jarxiv

要約全体のスライド病理学の画像分類は、ギガピクセルの画像サイズと限られた注釈ラ … 続きを読む →

カテゴリー: cs.CV, cs.LG | コメントを受け付けていません

The RaspGrade Dataset: Towards Automatic Raspberry Ripeness Grading with Deep Learning

投稿日: 2025年5月15日作成者: jarxiv

要約この研究では、迅速で正確で非侵襲的な食品品質評価のためのコンピュータービジ … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

WaveGuard: Robust Deepfake Detection and Source Tracing via Dual-Tree Complex Wavelet and Graph Neural Networks

投稿日: 2025年5月15日作成者: jarxiv

要約 Deepfakeテクノロジーは、プライバシーの侵略や個人情報の盗難などのリ … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Leveraging Segment Anything Model for Source-Free Domain Adaptation via Dual Feature Guided Auto-Prompting

投稿日: 2025年5月15日作成者: jarxiv

要約セグメンテーション用のソースフリードメイン適応（SFDA）は、ソースモデル … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

「cs.CV」カテゴリーアーカイブ

Optimal-state Dynamics Estimation for Physics-based Human Motion Capture from Videos

Variational Visual Question Answering

Embodied-Reasoner: Synergizing Visual Search, Reasoning, and Action for Embodied Interactive Tasks

LightLab: Controlling Light Sources in Images with Diffusion Models

UWAV: Uncertainty-weighted Weakly-supervised Audio-Visual Video Parsing

Thermal Detection of People with Mobility Restrictions for Barrier Reduction at Traffic Lights Controlled Intersections

MGPATH: Vision-Language Model with Multi-Granular Prompt Learning for Few-Shot WSI Classification

The RaspGrade Dataset: Towards Automatic Raspberry Ripeness Grading with Deep Learning

WaveGuard: Robust Deepfake Detection and Source Tracing via Dual-Tree Complex Wavelet and Graph Neural Networks

Leveraging Segment Anything Model for Source-Free Domain Adaptation via Dual Feature Guided Auto-Prompting

最近の投稿

最近のコメント

アーカイブ

カテゴリー