「cs.CV」カテゴリーアーカイブ

Question-Answering Dense Video Events

投稿日: 2024年9月9日作成者: jarxiv

要約マルチモーダル大規模言語モデル (MLLM) は、単一イベントビデオの質 … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.MM | コメントを受け付けていません

Future Does Matter: Boosting 3D Object Detection with Temporal Motion Estimation in Point Cloud Sequences

投稿日: 2024年9月9日作成者: jarxiv

要約正確かつ堅牢な LiDAR 3D 物体検出は、自動運転における包括的なシー … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

HiSC4D: Human-centered interaction and 4D Scene Capture in Large-scale Space Using Wearable IMUs and LiDAR

投稿日: 2024年9月9日作成者: jarxiv

要約大規模な屋内と屋外のシーン、多様な人間の動き、豊かな人間と人間の相互作用、 … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.GR, cs.MM | コメントを受け付けていません

HiPrompt: Tuning-free Higher-Resolution Generation with Hierarchical MLLM Prompts

投稿日: 2024年9月9日作成者: jarxiv

要約事前トレーニングされた拡散モデルを使用した高解像度画像生成の可能性は計り知 … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Train Till You Drop: Towards Stable and Robust Source-free Unsupervised 3D Domain Adaptation

投稿日: 2024年9月9日作成者: jarxiv

要約私たちは、3D セマンティックセグメンテーションのためのソースフリーの教 … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Open-MAGVIT2: An Open-Source Project Toward Democratizing Auto-regressive Visual Generation

投稿日: 2024年9月9日作成者: jarxiv

要約 300M から 1.5B までの自動回帰画像生成モデルファミリである O … 続きを読む →

カテゴリー: cs.AI, cs.CV | コメントを受け付けていません

LAR-IQA: A Lightweight, Accurate, and Robust No-Reference Image Quality Assessment Model

投稿日: 2024年9月9日作成者: jarxiv

要約深層学習技術を使用した非参照画像品質評価 (NR-IQA) 分野の最近の進 … 続きを読む →

カテゴリー: cs.CV, cs.MM, eess.IV | コメントを受け付けていません

MSLIQA: Enhancing Learning Representations for Image Quality Assessment through Multi-Scale Learning

投稿日: 2024年9月9日作成者: jarxiv

要約非参照画像品質評価 (NR-IQA) は、歪みの多様性と注釈付きの大規模な … 続きを読む →

カテゴリー: cs.CV, cs.MM | コメントを受け付けていません

Exploring Foundation Models for Synthetic Medical Imaging: A Study on Chest X-Rays and Fine-Tuning Techniques

投稿日: 2024年9月9日作成者: jarxiv

要約機械学習は、病気の予防と治療法の特定を支援することにより、医療を大幅に進歩 … 続きを読む →

カテゴリー: cs.CV, cs.GR, eess.IV | コメントを受け付けていません

VILA-U: a Unified Foundation Model Integrating Visual Understanding and Generation

投稿日: 2024年9月9日作成者: jarxiv

要約 VILA-U は、ビデオ、画像、言語の理解と生成を統合する統合基盤モデルで … 続きを読む →

カテゴリー: cs.CV, cs.LG | コメントを受け付けていません

「cs.CV」カテゴリーアーカイブ

Question-Answering Dense Video Events

Future Does Matter: Boosting 3D Object Detection with Temporal Motion Estimation in Point Cloud Sequences

HiSC4D: Human-centered interaction and 4D Scene Capture in Large-scale Space Using Wearable IMUs and LiDAR

HiPrompt: Tuning-free Higher-Resolution Generation with Hierarchical MLLM Prompts

Train Till You Drop: Towards Stable and Robust Source-free Unsupervised 3D Domain Adaptation

Open-MAGVIT2: An Open-Source Project Toward Democratizing Auto-regressive Visual Generation

LAR-IQA: A Lightweight, Accurate, and Robust No-Reference Image Quality Assessment Model

MSLIQA: Enhancing Learning Representations for Image Quality Assessment through Multi-Scale Learning

Exploring Foundation Models for Synthetic Medical Imaging: A Study on Chest X-Rays and Fine-Tuning Techniques

VILA-U: a Unified Foundation Model Integrating Visual Understanding and Generation

最近の投稿

最近のコメント

アーカイブ

カテゴリー