「cs.LG」カテゴリーアーカイブ

RFMI: Estimating Mutual Information on Rectified Flow for Text-to-Image Alignment

投稿日: 2025年3月19日作成者: jarxiv

要約フローマッチングフレームワークでトレーニングされた修正フロー（RF）モデル … 続きを読む →

カテゴリー: cs.CV, cs.LG | コメントを受け付けていません

The Shape of Attraction in UMAP: Exploring the Embedding Forces in Dimensionality Reduction

投稿日: 2025年3月19日作成者: jarxiv

要約均一なマニホールド近似と投影（UMAP）は、最も人気のあるネイバーの埋め込 … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.LG | コメントを受け付けていません

Targeted Neural Architectures in Multi-Objective Frameworks for Complete Glioma Characterization from Multimodal MRI

投稿日: 2025年3月19日作成者: jarxiv

要約脳腫瘍は、脳組織の異常な細胞増殖に起因します。診断されていない場合、それ … 続きを読む →

カテゴリー: cs.CV, cs.LG, eess.IV, I.4.6 | コメントを受け付けていません

CaReBench: A Fine-Grained Benchmark for Video Captioning and Retrieval

投稿日: 2025年3月19日作成者: jarxiv

要約ビデオキャプションや検索を含むビデオの理解は、ビデオ言語モデル（VLM）に … 続きを読む →

カテゴリー: cs.CV, cs.IR, cs.LG | コメントを受け付けていません

Advancing Medical Representation Learning Through High-Quality Data

投稿日: 2025年3月19日作成者: jarxiv

要約医学的視覚言語データセットの規模が増えているにもかかわらず、モデルのパフォ … 続きを読む →

カテゴリー: cs.CV, cs.LG, eess.IV | コメントを受け付けていません

PQPP: A Joint Benchmark for Text-to-Image Prompt and Query Performance Prediction

投稿日: 2025年3月19日作成者: jarxiv

要約テキストからイメージの生成は最近、生成的拡散モデルの視覚的に印象的な結果に … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.CV, cs.LG | コメントを受け付けていません

DUNE: Distilling a Universal Encoder from Heterogeneous 2D and 3D Teachers

投稿日: 2025年3月19日作成者: jarxiv

要約最近のマルチティーチャー蒸留方法により、複数の基礎モデルのエンコーダーが単 … 続きを読む →

カテゴリー: cs.CV, cs.LG | コメントを受け付けていません

PULASki: Learning inter-rater variability using statistical distances to improve probabilistic segmentation

投稿日: 2025年3月19日作成者: jarxiv

要約医療イメージングの領域では、セグメンテーションのための多くの監視された学習 … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.HC, cs.LG | コメントを受け付けていません

ExDDV: A New Dataset for Explainable Deepfake Detection in Video

投稿日: 2025年3月19日作成者: jarxiv

要約生成されたビデオのリアリズムと品質が増え続けると、自動ディープフェイク検出 … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.CV, cs.LG, cs.MM | コメントを受け付けていません

Cosmos World Foundation Model Platform for Physical AI

投稿日: 2025年3月19日作成者: jarxiv

要約物理的なAIは、最初にデジタルで訓練する必要があります。それ自体のデジタ … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.LG, cs.RO | コメントを受け付けていません

「cs.LG」カテゴリーアーカイブ

RFMI: Estimating Mutual Information on Rectified Flow for Text-to-Image Alignment

The Shape of Attraction in UMAP: Exploring the Embedding Forces in Dimensionality Reduction

Targeted Neural Architectures in Multi-Objective Frameworks for Complete Glioma Characterization from Multimodal MRI

CaReBench: A Fine-Grained Benchmark for Video Captioning and Retrieval

Advancing Medical Representation Learning Through High-Quality Data

PQPP: A Joint Benchmark for Text-to-Image Prompt and Query Performance Prediction

DUNE: Distilling a Universal Encoder from Heterogeneous 2D and 3D Teachers

PULASki: Learning inter-rater variability using statistical distances to improve probabilistic segmentation

ExDDV: A New Dataset for Explainable Deepfake Detection in Video

Cosmos World Foundation Model Platform for Physical AI

最近の投稿

最近のコメント

アーカイブ

カテゴリー