月別アーカイブ: 2025年5月

HoloTime: Taming Video Diffusion Models for Panoramic 4D Scene Generation

投稿日: 2025年5月1日作成者: jarxiv

要約拡散モデルの急速な進歩は、通常、ユーザーエクスペリエンスにシーンレベルの4 … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Vision Transformers on the Edge: A Comprehensive Survey of Model Compression and Acceleration Strategies

投稿日: 2025年5月1日作成者: jarxiv

要約近年、視覚変圧器（VITS）は、画像分類、オブジェクト検出、セグメンテーシ … 続きを読む →

カテゴリー: cs.AR, cs.CV | コメントを受け付けていません

Visual Text Processing: A Comprehensive Review and Unified Evaluation

投稿日: 2025年5月1日作成者: jarxiv

要約視覚テキストは、ドキュメント画像とシーン画像の両方で重要なコンポーネントで … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Segmentation-Aware Generative Reinforcement Network (GRN) for Tissue Layer Segmentation in 3-D Ultrasound Images for Chronic Low-back Pain (cLBP) Assessment

投稿日: 2025年5月1日作成者: jarxiv

要約セグメンテーション損失フィードバックを統合して、単一の段階で画像生成とセグ … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.LG | コメントを受け付けていません

Garment3DGen: 3D Garment Stylization and Texture Generation

投稿日: 2025年5月1日作成者: jarxiv

要約 Garment3Dgenに、ガイダンスとして単一の入力画像を与えられたベー … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Enhancing Self-Supervised Fine-Grained Video Object Tracking with Dynamic Memory Prediction

投稿日: 2025年5月1日作成者: jarxiv

要約成功したビデオ分析は、フレーム全体のピクセルの正確な認識に依存しており、ビ … 続きを読む →

カテゴリー: cs.AI, cs.CV | コメントを受け付けていません

PixelHacker: Image Inpainting with Structural and Semantic Consistency

投稿日: 2025年5月1日作成者: jarxiv

要約画像の開始は、画像編集と画像生成の間の基本的な研究領域です。最近の最先端 … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

BiPrompt-SAM: Enhancing Image Segmentation via Explicit Selection between Point and Text Prompts

投稿日: 2025年5月1日作成者: jarxiv

要約セグメンテーションはコンピュータービジョンの基本的なタスクであり、柔軟性の … 続きを読む →

カテゴリー: cs.CV, cs.LG | コメントを受け付けていません

REHEARSE-3D: A Multi-modal Emulated Rain Dataset for 3D Point Cloud De-raining

投稿日: 2025年5月1日作成者: jarxiv

要約センサーの劣化は、自律運転において大きな課題をもたらします。大雨の間、雨 … 続きを読む →

カテゴリー: cs.CV, cs.RO | コメントを受け付けていません

Vision Transformers in Precision Agriculture: A Comprehensive Survey

投稿日: 2025年5月1日作成者: jarxiv

要約植物の病気を検出することは、現代の農業の重要な側面です。作物の健康を維持し … 続きを読む →

カテゴリー: cs.AI, cs.CV | コメントを受け付けていません

月別アーカイブ: 2025年5月

HoloTime: Taming Video Diffusion Models for Panoramic 4D Scene Generation

Vision Transformers on the Edge: A Comprehensive Survey of Model Compression and Acceleration Strategies

Visual Text Processing: A Comprehensive Review and Unified Evaluation

Segmentation-Aware Generative Reinforcement Network (GRN) for Tissue Layer Segmentation in 3-D Ultrasound Images for Chronic Low-back Pain (cLBP) Assessment

Garment3DGen: 3D Garment Stylization and Texture Generation

Enhancing Self-Supervised Fine-Grained Video Object Tracking with Dynamic Memory Prediction

PixelHacker: Image Inpainting with Structural and Semantic Consistency

BiPrompt-SAM: Enhancing Image Segmentation via Explicit Selection between Point and Text Prompts

REHEARSE-3D: A Multi-modal Emulated Rain Dataset for 3D Point Cloud De-raining

Vision Transformers in Precision Agriculture: A Comprehensive Survey

最近の投稿

最近のコメント

アーカイブ

カテゴリー