「cs.CV」カテゴリーアーカイブ

From Reflection to Perfection: Scaling Inference-Time Optimization for Text-to-Image Diffusion Models via Reflection Tuning

投稿日: 2025年4月23日作成者: jarxiv

要約最近のテキスト間拡散モデルは、トレーニングデータとモデルパラメーターの広範 … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Survey of Video Diffusion Models: Foundations, Implementations, and Applications

投稿日: 2025年4月23日作成者: jarxiv

要約拡散モデルの最近の進歩により、ビデオ生成に革命をもたらし、従来の生成的敵対 … 続きを読む →

カテゴリー: cs.CL, cs.CV | コメントを受け付けていません

MR. Video: ‘MapReduce’ is the Principle for Long Video Understanding

投稿日: 2025年4月23日作成者: jarxiv

要約 MRを提案します。ビデオ、長いビデオを処理するためのシンプルで効果的なM … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

MMInference: Accelerating Pre-filling for Long-Context VLMs via Modality-Aware Permutation Sparse Attention

投稿日: 2025年4月23日作成者: jarxiv

要約長いコンテキスト機能と視覚的理解の統合は、ビジョン言語モデル（VLM）の前 … 続きを読む →

カテゴリー: cs.CV, cs.LG | コメントを受け付けていません

Distribution-aware Forgetting Compensation for Exemplar-Free Lifelong Person Re-identification

投稿日: 2025年4月23日作成者: jarxiv

要約生涯にわたる人の再識別（LREID）は、新しい情報に適応しながら古い知識を … 続きを読む →

カテゴリー: cs.AI, cs.CV | コメントを受け付けていません

VistaDepth: Frequency Modulation With Bias Reweighting For Enhanced Long-Range Depth Estimation

投稿日: 2025年4月23日作成者: jarxiv

要約単眼深度推定（MDE）は、単一のRGB画像からピクセルあたりの深度値を予測 … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

DRAWER: Digital Reconstruction and Articulation With Environment Realism

投稿日: 2025年4月23日作成者: jarxiv

要約現実世界のデータから仮想デジタルレプリカを作成すると、ゲームやロボット工学 … 続きを読む →

カテゴリー: cs.CV, cs.RO | コメントを受け付けていません

Phoenix: A Motion-based Self-Reflection Framework for Fine-grained Robotic Action Correction

投稿日: 2025年4月22日作成者: jarxiv

要約一般化可能な自己修正システムの構築は、ロボットが障害から回復するために重要 … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.RO | コメントを受け付けていません

GFreeDet: Exploiting Gaussian Splatting and Foundation Models for Model-free Unseen Object Detection in the BOP Challenge 2024

投稿日: 2025年4月22日作成者: jarxiv

要約 GFREEDETは、モデルのない設定でガウスのスプラッティングとビジョンフ … 続きを読む →

カテゴリー: cs.CV, cs.RO | コメントを受け付けていません

Latent Representations for Visual Proprioception in Inexpensive Robots

投稿日: 2025年4月22日作成者: jarxiv

要約ロボット操作には、ロボットの関節位置に関する明示的または暗黙的な知識が必要 … 続きを読む →

カテゴリー: cs.CV, cs.RO | コメントを受け付けていません

「cs.CV」カテゴリーアーカイブ

From Reflection to Perfection: Scaling Inference-Time Optimization for Text-to-Image Diffusion Models via Reflection Tuning

Survey of Video Diffusion Models: Foundations, Implementations, and Applications

MR. Video: ‘MapReduce’ is the Principle for Long Video Understanding

MMInference: Accelerating Pre-filling for Long-Context VLMs via Modality-Aware Permutation Sparse Attention

Distribution-aware Forgetting Compensation for Exemplar-Free Lifelong Person Re-identification

VistaDepth: Frequency Modulation With Bias Reweighting For Enhanced Long-Range Depth Estimation

DRAWER: Digital Reconstruction and Articulation With Environment Realism

Phoenix: A Motion-based Self-Reflection Framework for Fine-grained Robotic Action Correction

GFreeDet: Exploiting Gaussian Splatting and Foundation Models for Model-free Unseen Object Detection in the BOP Challenge 2024

Latent Representations for Visual Proprioception in Inexpensive Robots

最近の投稿

最近のコメント

アーカイブ

カテゴリー