"cs.AI" Category Archive

Chitrarth: Bridging Vision and Language for a Billion People

Posted on February 24, 2025 by jarxiv

Abstract: Recent multimodal foundation models, primarily in English or high-resource … Continue reading →

Categories: cs.AI, cs.CL, cs.CV

Enhancing Vehicle Make and Model Recognition with 3D Attention Modules

Posted on February 24, 2025 by jarxiv

Abstract: Vehicle make and model recognition (VMMR) is, for intelligent transportation systems, … Continue reading →

Categories: cs.AI, cs.CV

Evaluating Multimodal Generative AI with Korean Educational Standards

Posted on February 24, 2025 by jarxiv

Abstract: In this paper, using Korean national educational tests, multimodal generative AI … Continue reading →

Categories: cs.AI, cs.CL, cs.CV

Anatomy-Informed Deep Learning and Radiomics for Automated Neurofibroma Segmentation in Whole-Body MRI

Posted on February 24, 2025 by jarxiv

Abstract: Neurofibromatosis type 1 is a genetic disorder characterized by the development of neurofibromas (NFs), and … Continue reading →

Categories: cs.AI, cs.CV, eess.IV

LaRE$^2$: Latent Reconstruction Error Based Method for Diffusion-Generated Image Detection

Posted on February 24, 2025 by jarxiv

Abstract: Advances in diffusion models have dramatically improved the quality of image generation, and real images and generated … Continue reading →

Categories: cs.AI, cs.CV

MVIP — A Dataset and Methods for Application Oriented Multi-View and Multi-Modal Industrial Part Recognition

Posted on February 24, 2025 by jarxiv

Abstract: For multi-modal and multi-view application-oriented industrial part recognition, a new … Continue reading →

Categories: cs.AI, cs.CV

Q-PETR: Quant-aware Position Embedding Transformation for Multi-View 3D Object Detection

Posted on February 24, 2025 by jarxiv

Abstract: PETR-based methods dominate benchmarks in 3D perception, and modern autonomous … Continue reading →

Categories: cs.AI, cs.CV

UniDB: A Unified Diffusion Bridge Framework via Stochastic Optimal Control

Posted on February 24, 2025 by jarxiv

Abstract: Recent advances in diffusion bridge models, Doob's $h$-transform … Continue reading →

Categories: cs.AI, cs.CV, cs.SY, eess.SY

Depth-aware Fusion Method based on Image and 4D Radar Spectrum for 3D Object Detection

Posted on February 24, 2025 by jarxiv

Abstract: Safety and reliability are crucial for public acceptance of autonomous driving. Accurate and reliable … Continue reading →

Categories: cs.AI, cs.CV, cs.RO

Bridging vision language model (VLM) evaluation gaps with a framework for scalable and cost-effective benchmark generation

Posted on February 24, 2025 by jarxiv

Abstract: Reliable evaluation of AI models is critical for scientific progress and practical applications. … Continue reading →

Categories: cs.AI, cs.CL, cs.CV, cs.LG
