「cs.LG」カテゴリーアーカイブ

Unified Triplet-Level Hallucination Evaluation for Large Vision-Language Models

投稿日: 2024年10月31日作成者: jarxiv

要約視覚言語推論における優れたパフォーマンスにもかかわらず、大規模視覚言語モデ … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.CV, cs.LG | コメントを受け付けていません

Why Fine-grained Labels in Pretraining Benefit Generalization?

投稿日: 2024年10月31日作成者: jarxiv

要約最近の研究では、きめの細かいラベル付けされたデータを使用してディープニュ … 続きを読む →

カテゴリー: cs.CV, cs.LG, stat.ML | コメントを受け付けていません

Revisiting MAE pre-training for 3D medical image segmentation

投稿日: 2024年10月31日作成者: jarxiv

要約自己教師あり学習 (SSL) は、ラベル付きデータの不足に悩まされているさ … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.LG | コメントを受け付けていません

FAIR-TAT: Improving Model Fairness Using Targeted Adversarial Training

投稿日: 2024年10月31日作成者: jarxiv

要約ディープニューラルネットワークは、敵対的な攻撃や一般的な破損の影響を受 … 続きを読む →

カテゴリー: cs.CV, cs.LG | コメントを受け付けていません

Unbounded: A Generative Infinite Game of Character Life Simulation

投稿日: 2024年10月31日作成者: jarxiv

要約生成無限ゲームの概念を紹介します。これは、生成モデルを使用することで、ハー … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.CV, cs.GR, cs.LG | コメントを受け付けていません

VisualPredicator: Learning Abstract World Models with Neuro-Symbolic Predicates for Robot Planning

投稿日: 2024年10月31日作成者: jarxiv

要約広範なインテリジェントエージェントは、生の感覚運動空間の複雑さを抽象化しな … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.LG, cs.RO | コメントを受け付けていません

Fourier Amplitude and Correlation Loss: Beyond Using L2 Loss for Skillful Precipitation Nowcasting

投稿日: 2024年10月31日作成者: jarxiv

要約近年、深層学習アプローチが降水ナウキャスティングに広く採用されています。 … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.LG | コメントを受け付けていません

Weight Copy and Low-Rank Adaptation for Few-Shot Distillation of Vision Transformers

投稿日: 2024年10月31日作成者: jarxiv

要約フューショット知識蒸留は、限られたデータと計算リソースを使用して、大規模な … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.LG | コメントを受け付けていません

CARES: A Comprehensive Benchmark of Trustworthiness in Medical Vision Language Models

投稿日: 2024年10月31日作成者: jarxiv

要約人工知能は、特に Medical Large Vision Languag … 続きを読む →

カテゴリー: cs.CL, cs.CV, cs.CY, cs.LG | コメントを受け付けていません

Aligning Audio-Visual Joint Representations with an Agentic Workflow

投稿日: 2024年10月31日作成者: jarxiv

要約ビジュアルコンテンツと付随するオーディオ信号は、オーディオビジュアル ( … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.LG, cs.MM, cs.SD, eess.AS | コメントを受け付けていません

「cs.LG」カテゴリーアーカイブ

Unified Triplet-Level Hallucination Evaluation for Large Vision-Language Models

Why Fine-grained Labels in Pretraining Benefit Generalization?

Revisiting MAE pre-training for 3D medical image segmentation

FAIR-TAT: Improving Model Fairness Using Targeted Adversarial Training

Unbounded: A Generative Infinite Game of Character Life Simulation

VisualPredicator: Learning Abstract World Models with Neuro-Symbolic Predicates for Robot Planning

Fourier Amplitude and Correlation Loss: Beyond Using L2 Loss for Skillful Precipitation Nowcasting

Weight Copy and Low-Rank Adaptation for Few-Shot Distillation of Vision Transformers

CARES: A Comprehensive Benchmark of Trustworthiness in Medical Vision Language Models

Aligning Audio-Visual Joint Representations with an Agentic Workflow

最近の投稿

最近のコメント

アーカイブ

カテゴリー