月別アーカイブ: 2025年3月

Tuning-Free Multi-Event Long Video Generation via Synchronized Coupled Sampling

投稿日: 2025年3月12日作成者: jarxiv

要約テキストからビデオへの拡散モデルの最近の進歩により、単一のプロンプトから高 … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.LG | コメントを受け付けていません

Curriculum Direct Preference Optimization for Diffusion and Consistency Models

投稿日: 2025年3月12日作成者: jarxiv

要約直接選好最適化（DPO）は、人間のフィードバック（RLHF）からの強化学習 … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.LG | コメントを受け付けていません

Vision Transformer for Intracranial Hemorrhage Classification in CT Scans Using an Entropy-Aware Fuzzy Integral Strategy for Adaptive Scan-Level Decision Fusion

投稿日: 2025年3月12日作成者: jarxiv

要約頭蓋内出血（ICH）は、脳血管の破裂によって引き起こされる重大な医学的緊急 … 続きを読む →

カテゴリー: cs.AI, cs.CV, eess.IV | コメントを受け付けていません

HO-Cap: A Capture System and Dataset for 3D Reconstruction and Pose Tracking of Hand-Object Interaction

投稿日: 2025年3月12日作成者: jarxiv

要約 3D再構成のために、データキャプチャシステムと新しいデータセット、HO-C … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

INPC: Implicit Neural Point Clouds for Radiance Field Rendering

投稿日: 2025年3月12日作成者: jarxiv

要約未結合の実世界のシーンの再構築と新しいビューの統合のための新しいアプローチ … 続きを読む →

カテゴリー: cs.CV, cs.GR, cs.LG | コメントを受け付けていません

HiP-AD: Hierarchical and Multi-Granularity Planning with Deformable Attention for Autonomous Driving in a Single Decoder

投稿日: 2025年3月12日作成者: jarxiv

要約エンドツーエンドの自律運転（E2E-AD）テクノロジーは近年大きな進歩を遂 … 続きを読む →

カテゴリー: cs.CV, cs.RO | コメントを受け付けていません

LightGen: Efficient Image Generation through Knowledge Distillation and Direct Preference Optimization

投稿日: 2025年3月12日作成者: jarxiv

要約テキストからイメージの生成の最近の進歩は、主に広範なデータセットとパラメー … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

RealmDreamer: Text-Driven 3D Scene Generation with Inpainting and Depth Diffusion

投稿日: 2025年3月12日作成者: jarxiv

要約 Textの説明から前向きな3Dシーンを生成するためのテクニックであるRea … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.GR, cs.LG | コメントを受け付けていません

SegAgent: Exploring Pixel Understanding Capabilities in MLLMs by Imitating Human Annotator Trajectories

投稿日: 2025年3月12日作成者: jarxiv

要約 MLLMは適切な画像理解機能を実証していますが、Pixelレベルの理解に苦 … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

PanoDreamer: Optimization-Based Single Image to 360 3D Scene With Diffusion

投稿日: 2025年3月12日作成者: jarxiv

要約この論文では、単一の入力画像からコヒーレント360 {\ deg} 3Dシ … 続きを読む →

カテゴリー: cs.CV, cs.GR | コメントを受け付けていません

月別アーカイブ: 2025年3月

Tuning-Free Multi-Event Long Video Generation via Synchronized Coupled Sampling

Curriculum Direct Preference Optimization for Diffusion and Consistency Models

Vision Transformer for Intracranial Hemorrhage Classification in CT Scans Using an Entropy-Aware Fuzzy Integral Strategy for Adaptive Scan-Level Decision Fusion

HO-Cap: A Capture System and Dataset for 3D Reconstruction and Pose Tracking of Hand-Object Interaction

INPC: Implicit Neural Point Clouds for Radiance Field Rendering

HiP-AD: Hierarchical and Multi-Granularity Planning with Deformable Attention for Autonomous Driving in a Single Decoder

LightGen: Efficient Image Generation through Knowledge Distillation and Direct Preference Optimization

RealmDreamer: Text-Driven 3D Scene Generation with Inpainting and Depth Diffusion

SegAgent: Exploring Pixel Understanding Capabilities in MLLMs by Imitating Human Annotator Trajectories

PanoDreamer: Optimization-Based Single Image to 360 3D Scene With Diffusion

最近の投稿

最近のコメント

アーカイブ

カテゴリー