月別アーカイブ: 2025年2月

A Survey on Image Quality Assessment: Insights, Analysis, and Future Outlook

投稿日: 2025年2月13日作成者: jarxiv

要約画質評価（IQA）は、画像中心のテクノロジーにおける極めて重要な課題を表し … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Moment of Untruth: Dealing with Negative Queries in Video Moment Retrieval

投稿日: 2025年2月13日作成者: jarxiv

要約ビデオモーメント検索は、視覚言語モデルのパフォーマンスを評価するための一般 … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Copula-based mixture model identification for subgroup clustering with imaging applications

投稿日: 2025年2月13日作成者: jarxiv

要約モデルベースのクラスタリング技術はさまざまなアプリケーション領域に広く適用 … 続きを読む →

カテゴリー: cs.CV, cs.LG | コメントを受け付けていません

Human-Centric Foundation Models: Perception, Generation and Agentic Modeling

投稿日: 2025年2月13日作成者: jarxiv

要約人間の理解と生成は、デジタル人間とヒューマノイドの実施形態をモデル化するた … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.LG, cs.MM | コメントを受け付けていません

TimeSuite: Improving MLLMs for Long Video Understanding via Grounded Tuning

投稿日: 2025年2月13日作成者: jarxiv

要約マルチモーダル大手言語モデル（MLLMS）は、短いビデオ理解で印象的なパフ … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.MM | コメントを受け付けていません

Brain Latent Progression: Individual-based Spatiotemporal Disease Progression on 3D Brain MRIs via Latent Diffusion

投稿日: 2025年2月13日作成者: jarxiv

要約縦方向の磁気共鳴イメージング（MRI）データセットの利用可能性の増加により … 続きを読む →

カテゴリー: cs.AI, cs.CV | コメントを受け付けていません

AR Glulam: Accurate Augmented Reality Using Multiple Fiducial Markers for Glulam Fabrication

投稿日: 2025年2月13日作成者: jarxiv

要約拡張現実（AR）における最近の進歩は、建築、設計、および製造におけるアプリ … 続きを読む →

カテゴリー: cs.CV, cs.ET, cs.HC | コメントを受け付けていません

A Novel Approach to for Multimodal Emotion Recognition : Multimodal semantic information fusion

投稿日: 2025年2月13日作成者: jarxiv

要約人工知能とコンピュータービジョンテクノロジーの進歩により、マルチモーダル感 … 続きを読む →

カテゴリー: cs.AI, cs.CV | コメントを受け付けていません

Ultrasound Image Generation using Latent Diffusion Models

投稿日: 2025年2月13日作成者: jarxiv

要約画像生成の拡散モデルは、多様で高品質の画像を生成する能力により、関心が高ま … 続きを読む →

カテゴリー: 68-06, cs.CV | コメントを受け付けていません

Light-A-Video: Training-free Video Relighting via Progressive Light Fusion

投稿日: 2025年2月13日作成者: jarxiv

要約大規模なデータセットと事前に訓練された拡散モデルによって駆動される画像の学 … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

月別アーカイブ: 2025年2月

A Survey on Image Quality Assessment: Insights, Analysis, and Future Outlook

Moment of Untruth: Dealing with Negative Queries in Video Moment Retrieval

Copula-based mixture model identification for subgroup clustering with imaging applications

Human-Centric Foundation Models: Perception, Generation and Agentic Modeling

TimeSuite: Improving MLLMs for Long Video Understanding via Grounded Tuning

Brain Latent Progression: Individual-based Spatiotemporal Disease Progression on 3D Brain MRIs via Latent Diffusion

AR Glulam: Accurate Augmented Reality Using Multiple Fiducial Markers for Glulam Fabrication

A Novel Approach to for Multimodal Emotion Recognition : Multimodal semantic information fusion

Ultrasound Image Generation using Latent Diffusion Models

Light-A-Video: Training-free Video Relighting via Progressive Light Fusion

最近の投稿

最近のコメント

アーカイブ

カテゴリー