「cs.CV」カテゴリーアーカイブ

NoisyRollout: Reinforcing Visual Reasoning with Data Augmentation

投稿日: 2025年5月28日作成者: jarxiv

要約強化学習（RL）の最近の進歩により、視覚言語モデル（VLM）の推論能力が強 … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

RapidPoseTriangulation: Multi-view Multi-person Whole-body Human Pose Triangulation in a Millisecond

投稿日: 2025年5月27日作成者: jarxiv

要約マルチビューイメージングとポーズ推定の統合は、コンピュータービジョンアプリ … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

SaSi: A Self-augmented and Self-interpreted Deep Learning Approach for Few-shot Cryo-ET Particle Detection

投稿日: 2025年5月27日作成者: jarxiv

要約 Cryo-Electron断層撮影（Cryo-ET）は、ネイティブに近い州 … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.LG | コメントを受け付けていません

Multimodal Reasoning Agent for Zero-Shot Composed Image Retrieval

投稿日: 2025年5月27日作成者: jarxiv

要約 Zero-Shot Composed Image Retrieval（ZS … 続きを読む →

カテゴリー: cs.CV, cs.IR | コメントを受け付けていません

DeepEyes: Incentivizing ‘Thinking with Images’ via Reinforcement Learning

投稿日: 2025年5月27日作成者: jarxiv

要約大規模なビジョン言語モデル（VLM）は、マルチモーダルの理解と推論に強力な … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

UltraVSR: Achieving Ultra-Realistic Video Super-Resolution with Efficient One-Step Diffusion Space

投稿日: 2025年5月27日作成者: jarxiv

要約拡散モデルは、現実的な画像の詳細を生成する大きな可能性を示しています。た … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

PHI: Bridging Domain Shift in Long-Term Action Quality Assessment via Progressive Hierarchical Instruction

投稿日: 2025年5月27日作成者: jarxiv

要約長期アクション品質評価（AQA）は、長いビデオでのアクションの定量的パフォ … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Domain-Agnostic Stroke Lesion Segmentation Using Physics-Constrained Synthetic Data

投稿日: 2025年5月27日作成者: jarxiv

要約 MRIの脳卒中病変のセグメント化は、モデルの一般化可能性を制限する多様な獲 … 続きを読む →

カテゴリー: cs.CV, eess.IV, physics.med-ph | コメントを受け付けていません

ICDM: Interference Cancellation Diffusion Models for Wireless Semantic Communications

投稿日: 2025年5月27日作成者: jarxiv

要約拡散モデル（DMS）は、最近、除去能力のためにワイヤレス通信システムで大き … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.IT, math.IT | コメントを受け付けていません

From Single Images to Motion Policies via Video-Generation Environment Representations

投稿日: 2025年5月27日作成者: jarxiv

要約自律的なロボットは通常、周囲の表現を構築し、環境の幾何学に動きを適応させる … 続きを読む →

カテゴリー: cs.CV, cs.GR, cs.LG, cs.RO | コメントを受け付けていません

「cs.CV」カテゴリーアーカイブ

NoisyRollout: Reinforcing Visual Reasoning with Data Augmentation

RapidPoseTriangulation: Multi-view Multi-person Whole-body Human Pose Triangulation in a Millisecond

SaSi: A Self-augmented and Self-interpreted Deep Learning Approach for Few-shot Cryo-ET Particle Detection

Multimodal Reasoning Agent for Zero-Shot Composed Image Retrieval

DeepEyes: Incentivizing ‘Thinking with Images’ via Reinforcement Learning

UltraVSR: Achieving Ultra-Realistic Video Super-Resolution with Efficient One-Step Diffusion Space

PHI: Bridging Domain Shift in Long-Term Action Quality Assessment via Progressive Hierarchical Instruction

Domain-Agnostic Stroke Lesion Segmentation Using Physics-Constrained Synthetic Data

ICDM: Interference Cancellation Diffusion Models for Wireless Semantic Communications

From Single Images to Motion Policies via Video-Generation Environment Representations

最近の投稿

最近のコメント

アーカイブ

カテゴリー