「cs.CV」カテゴリーアーカイブ

GarmentTracking: Category-Level Garment Pose Tracking

投稿日: 2025年4月16日作成者: jarxiv

要約衣服は人間にとって重要です。完全な衣服のポーズを推定および追跡できる視覚 … 続きを読む →

カテゴリー: cs.CV, cs.RO | コメントを受け付けていません

Think or Not Think: A Study of Explicit Thinking in Rule-Based Visual Reinforcement Fine-Tuning

投稿日: 2025年4月16日作成者: jarxiv

要約このペーパーでは、マルチモーダル大手言語モデル（MLLM）のルールベースの … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Autoregressive Distillation of Diffusion Transformers

投稿日: 2025年4月16日作成者: jarxiv

要約トランスアーキテクチャを備えた拡散モデルは、高忠実度の画像と高解像度のスケ … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Cognitive Disentanglement for Referring Multi-Object Tracking

投稿日: 2025年4月16日作成者: jarxiv

要約インテリジェント輸送知覚システムにおけるマルチソース情報融合の重要なアプリ … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

EchoMask: Speech-Queried Attention-based Mask Modeling for Holistic Co-Speech Motion Generation

投稿日: 2025年4月16日作成者: jarxiv

要約マスクされたモデリングフレームワークは、共和声モーション生成に有望を示して … 続きを読む →

カテゴリー: cs.CV, cs.GR, cs.SD | コメントを受け付けていません

CFIS-YOLO: A Lightweight Multi-Scale Fusion Network for Edge-Deployable Wood Defect Detection

投稿日: 2025年4月16日作成者: jarxiv

要約木材処理産業の品質管理を確保するには、木材の欠陥検出が重要です。ただし、 … 続きを読む →

カテゴリー: cs.AI, cs.CV | コメントを受け付けていません

Context-Aware Palmprint Recognition via a Relative Similarity Metric

投稿日: 2025年4月16日作成者: jarxiv

要約既存のマッチングフレームワークの堅牢性と識別性を高める相対類似性メトリック … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Uncertainty Estimation for Trust Attribution to Speed-of-Sound Reconstruction with Variational Networks

投稿日: 2025年4月16日作成者: jarxiv

要約速度（SOS）は組織の生体力学的特性であり、そのイメージングは診断のた … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Big Brother is Watching: Proactive Deepfake Detection via Learnable Hidden Face

投稿日: 2025年4月16日作成者: jarxiv

要約 Deepfake Technologiesが進歩し続けるにつれて、受動的検 … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

PVUW 2025 Challenge Report: Advances in Pixel-level Understanding of Complex Videos in the Wild

投稿日: 2025年4月16日作成者: jarxiv

要約このレポートは、CVPR 2025と協力して開催されたWild（PVU）チ … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

「cs.CV」カテゴリーアーカイブ

GarmentTracking: Category-Level Garment Pose Tracking

Think or Not Think: A Study of Explicit Thinking in Rule-Based Visual Reinforcement Fine-Tuning

Autoregressive Distillation of Diffusion Transformers

Cognitive Disentanglement for Referring Multi-Object Tracking

EchoMask: Speech-Queried Attention-based Mask Modeling for Holistic Co-Speech Motion Generation

CFIS-YOLO: A Lightweight Multi-Scale Fusion Network for Edge-Deployable Wood Defect Detection

Context-Aware Palmprint Recognition via a Relative Similarity Metric

Uncertainty Estimation for Trust Attribution to Speed-of-Sound Reconstruction with Variational Networks

Big Brother is Watching: Proactive Deepfake Detection via Learnable Hidden Face

PVUW 2025 Challenge Report: Advances in Pixel-level Understanding of Complex Videos in the Wild

最近の投稿

最近のコメント

アーカイブ

カテゴリー