"cs.CV" Category Archive

RelDenClu: A Relative Density based Biclustering Method for identifying non-linear feature relations

Posted on March 31, 2025 by jarxiv

Abstract: In many cases, when it comes to finding biclusters based on feature relations, existing biclu …

Categories: cs.CV, cs.LG

Zero4D: Training-Free 4D Video Generation From Single Video Using Off-the-Shelf Video Diffusion Model

Posted on March 31, 2025 by jarxiv

Abstract: Recently, multi-view or 4D video generation has emerged as an important research topic …

Categories: cs.CV

RAP: Retrieval-Augmented Personalization for Multimodal Large Language Models

Posted on March 31, 2025 by jarxiv

Abstract: With the development of large language models (LLMs), as general assistants, multimo …

Categories: cs.AI, cs.CL, cs.CV, cs.LG, cs.MM

VidTwin: Video VAE with Decoupled Structure and Dynamics

Posted on March 31, 2025 by jarxiv

Abstract: With recent advances in video autoencoders (Video AEs), video generation quality and …

Categories: cs.AI, cs.CV, cs.LG

Unicorn: Text-Only Data Synthesis for Vision Language Model Training

Posted on March 31, 2025 by jarxiv

Abstract: Training vision-language models (VLMs) typically requires large-scale, high-quality image-te …

Categories: cs.AI, cs.CV, cs.MM

Evaluation of Machine-generated Biomedical Images via A Tally-based Similarity Measure

Posted on March 31, 2025 by jarxiv

Abstract: Super-resolution, inpainting, whole-image generation, unpaired style transfer, net …

Categories: cs.AI, cs.CV, cs.LG, eess.IV

Understanding Co-speech Gestures in-the-wild

Posted on March 31, 2025 by jarxiv

Abstract: Co-speech gestures play an important role in non-verbal communication …

Categories: cs.CV

TranSplat: Lighting-Consistent Cross-Scene Object Transfer with 3D Gaussian Splatting

Posted on March 31, 2025 by jarxiv

Abstract: Based on the Gaussian Splatting framework (from source to target scene …

Categories: cs.CV

DSO: Aligning 3D Generators with Simulation Feedback for Physical Soundness

Posted on March 31, 2025 by jarxiv

Abstract: Most 3D object generators focus on aesthetic quality, …

Categories: cs.AI, cs.CV, cs.LG

Q-Insight: Understanding Image Quality via Visual Reinforcement Learning

Posted on March 31, 2025 by jarxiv

Abstract: Image quality assessment (IQA) focuses on the perceptual visual quality of images, image reconstruction …

Categories: cs.CV
