投稿者「jarxiv」のアーカイブ

Mcity Data Engine: Iterative Model Improvement Through Open-Vocabulary Data Selection

投稿日: 2025年5月1日作成者: jarxiv

要約データが増え続ける可能性があるため、機械学習モデルのトレーニングに適したサ … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

HOT3D: Hand and Object Tracking in 3D from Egocentric Multi-View Videos

投稿日: 2025年5月1日作成者: jarxiv

要約 3Dでエゴセントリックハンドとオブジェクト追跡のために公開されているデータ … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.RO | コメントを受け付けていません

VecFontSDF: Learning to Reconstruct and Synthesize High-quality Vector Fonts via Signed Distance Functions

投稿日: 2025年5月1日作成者: jarxiv

要約フォント設計は、デジタルコンテンツデザインと最新の印刷業界で非常に重要です … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

BEVWorld: A Multimodal World Simulator for Autonomous Driving via Scene-Level BEV Latents

投稿日: 2025年5月1日作成者: jarxiv

要約世界モデルは、潜在的な将来のシナリオを予測する能力のために、自律運転に注目 … 続きを読む →

カテゴリー: cs.AI, cs.CV | コメントを受け付けていません

Diffusion-based Adversarial Identity Manipulation for Facial Privacy Protection

投稿日: 2025年5月1日作成者: jarxiv

要約フェイス認識（FR）システムの成功により、潜在的な不正な監視とソーシャルネ … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

HoloTime: Taming Video Diffusion Models for Panoramic 4D Scene Generation

投稿日: 2025年5月1日作成者: jarxiv

要約拡散モデルの急速な進歩は、通常、ユーザーエクスペリエンスにシーンレベルの4 … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Vision Transformers on the Edge: A Comprehensive Survey of Model Compression and Acceleration Strategies

投稿日: 2025年5月1日作成者: jarxiv

要約近年、視覚変圧器（VITS）は、画像分類、オブジェクト検出、セグメンテーシ … 続きを読む →

カテゴリー: cs.AR, cs.CV | コメントを受け付けていません

Visual Text Processing: A Comprehensive Review and Unified Evaluation

投稿日: 2025年5月1日作成者: jarxiv

要約視覚テキストは、ドキュメント画像とシーン画像の両方で重要なコンポーネントで … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Segmentation-Aware Generative Reinforcement Network (GRN) for Tissue Layer Segmentation in 3-D Ultrasound Images for Chronic Low-back Pain (cLBP) Assessment

投稿日: 2025年5月1日作成者: jarxiv

要約セグメンテーション損失フィードバックを統合して、単一の段階で画像生成とセグ … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.LG | コメントを受け付けていません

Garment3DGen: 3D Garment Stylization and Texture Generation

投稿日: 2025年5月1日作成者: jarxiv

要約 Garment3Dgenに、ガイダンスとして単一の入力画像を与えられたベー … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

投稿者「jarxiv」のアーカイブ

Mcity Data Engine: Iterative Model Improvement Through Open-Vocabulary Data Selection

HOT3D: Hand and Object Tracking in 3D from Egocentric Multi-View Videos

VecFontSDF: Learning to Reconstruct and Synthesize High-quality Vector Fonts via Signed Distance Functions

BEVWorld: A Multimodal World Simulator for Autonomous Driving via Scene-Level BEV Latents

Diffusion-based Adversarial Identity Manipulation for Facial Privacy Protection

HoloTime: Taming Video Diffusion Models for Panoramic 4D Scene Generation

Vision Transformers on the Edge: A Comprehensive Survey of Model Compression and Acceleration Strategies

Visual Text Processing: A Comprehensive Review and Unified Evaluation

Segmentation-Aware Generative Reinforcement Network (GRN) for Tissue Layer Segmentation in 3-D Ultrasound Images for Chronic Low-back Pain (cLBP) Assessment

Garment3DGen: 3D Garment Stylization and Texture Generation

最近の投稿

最近のコメント

アーカイブ

カテゴリー