月別アーカイブ: 2025年5月

VidMuse: A Simple Video-to-Music Generation Framework with Long-Short-Term Modeling

投稿日: 2025年5月8日作成者: jarxiv

要約この作業では、ビデオのみで条件付けられた音楽生成を体系的に研究しています。 … 続きを読む →

カテゴリー: cs.CV, cs.LG, cs.MM, cs.SD | コメントを受け付けていません

RAFT: Robust Augmentation of FeaTures for Image Segmentation

投稿日: 2025年5月8日作成者: jarxiv

要約画像セグメンテーションは、シーンの理解のための強力なコンピュータービジョン … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Registration of 3D Point Sets Using Exponential-based Similarity Matrix

投稿日: 2025年5月8日作成者: jarxiv

要約ポイントクラウド登録は、コンピュータービジョンとロボット工学の根本的な問題 … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

LLM2CLIP: Powerful Language Model Unlocks Richer Visual Representation

投稿日: 2025年5月8日作成者: jarxiv

要約 Clipは、大規模な画像テキストペアの対照学習を介して、画像とテキスト機能 … 続きを読む →

カテゴリー: cs.CL, cs.CV | コメントを受け付けていません

Enhancing Virtual Try-On with Synthetic Pairs and Error-Aware Noise Scheduling

投稿日: 2025年5月8日作成者: jarxiv

要約標準的な製品ビューの孤立した衣服の画像と人の別の画像を考えると、仮想トライ … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Efficiency Meets Fidelity: A Novel Quantization Framework for Stable Diffusion

投稿日: 2025年5月8日作成者: jarxiv

要約安定した拡散モデル（SDM）を介したテキストから画像の生成は、顕著な能力を … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Componential Prompt-Knowledge Alignment for Domain Incremental Learning

投稿日: 2025年5月8日作成者: jarxiv

要約ドメイン増分学習（DIL）は、過去の知識を保持および利用しながら、ドメイン … 続きを読む →

カテゴリー: cs.CV, cs.LG | コメントを受け付けていません

Active Sampling for MRI-based Sequential Decision Making

投稿日: 2025年5月8日作成者: jarxiv

要約磁気共鳴画像法（MRI）の優れた診断能力にもかかわらず、ポイントオブケア（ … 続きを読む →

カテゴリー: cs.CV, cs.LG | コメントを受け付けていません

TetWeave: Isosurface Extraction using On-The-Fly Delaunay Tetrahedral Grids for Gradient-Based Mesh Optimization

投稿日: 2025年5月8日作成者: jarxiv

要約 Tetweaveを紹介します。Tetweaveは、四面体の行進に使用される … 続きを読む →

カテゴリー: cs.CV, cs.GR, I.3.5 | コメントを受け付けていません

MonoCoP: Chain-of-Prediction for Monocular 3D Object Detection

投稿日: 2025年5月8日作成者: jarxiv

要約 3D属性を正確に予測することは、単眼3Dオブジェクト検出（Mono3D）に … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

月別アーカイブ: 2025年5月

VidMuse: A Simple Video-to-Music Generation Framework with Long-Short-Term Modeling

RAFT: Robust Augmentation of FeaTures for Image Segmentation

Registration of 3D Point Sets Using Exponential-based Similarity Matrix

LLM2CLIP: Powerful Language Model Unlocks Richer Visual Representation

Enhancing Virtual Try-On with Synthetic Pairs and Error-Aware Noise Scheduling

Efficiency Meets Fidelity: A Novel Quantization Framework for Stable Diffusion

Componential Prompt-Knowledge Alignment for Domain Incremental Learning

Active Sampling for MRI-based Sequential Decision Making

TetWeave: Isosurface Extraction using On-The-Fly Delaunay Tetrahedral Grids for Gradient-Based Mesh Optimization

MonoCoP: Chain-of-Prediction for Monocular 3D Object Detection

最近の投稿

最近のコメント

アーカイブ

カテゴリー