「cs.LG」カテゴリーアーカイブ

DYNAMAX: Dynamic computing for Transformers and Mamba based architectures

投稿日: 2025年4月30日作成者: jarxiv

要約早期出口（EES）は、データサンプルの満足のいく予測信頼度が達成されたら、 … 続きを読む →

カテゴリー: (Primary), 68T07, cs.AI, cs.CL, cs.LG | コメントを受け付けていません

Training Plug-n-Play Knowledge Modules with Deep Context Distillation

投稿日: 2025年4月30日作成者: jarxiv

要約特に低データのシナリオで、またはプライベートドキュメントや専門文書を扱う場 … 続きを読む →

カテゴリー: cs.AI, cs.LG | コメントを受け付けていません

Wanda++: Pruning Large Language Models via Regional Gradients

投稿日: 2025年4月30日作成者: jarxiv

要約大規模な言語モデル（LLMS）剪定は、パフォーマンスへの影響を最小限に抑え … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.LG | コメントを受け付けていません

Test-time regression: a unifying framework for designing sequence models with associative memory

投稿日: 2025年4月30日作成者: jarxiv

要約シーケンスモデルは、現代の深い学習の中心にあります。しかし、急速な進歩に … 続きを読む →

カテゴリー: cs.AI, cs.LG, cs.NE, stat.ML | コメントを受け付けていません

Hubs and Spokes Learning: Efficient and Scalable Collaborative Machine Learning

投稿日: 2025年4月30日作成者: jarxiv

要約 Federated Learning（FL）と分散学習（P2PL）の強みを … 続きを読む →

カテゴリー: cs.AI, cs.DC, cs.LG | コメントを受け付けていません

Toward Efficient Exploration by Large Language Model Agents

投稿日: 2025年4月30日作成者: jarxiv

要約強化学習（RL）内の急成長エリアは、大規模な言語モデル（LLMS）を中心と … 続きを読む →

カテゴリー: cs.AI, cs.LG | コメントを受け付けていません

The Best of Both Worlds: Integrating Language Models and Diffusion Models for Video Generation

投稿日: 2025年4月30日作成者: jarxiv

要約テキストからビデオへの最近の進歩（T2V）の生成は、自己回帰言語モデルと拡 … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.CV, cs.LG | コメントを受け付けていません

UniversalRAG: Retrieval-Augmented Generation over Multiple Corpora with Diverse Modalities and Granularities

投稿日: 2025年4月30日作成者: jarxiv

要約検索された生成（RAG）は、クエリに関連する外部知識をモデルの応答に接地す … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.CV, cs.IR, cs.LG | コメントを受け付けていません

Negate or Embrace: On How Misalignment Shapes Multimodal Representation Learning

投稿日: 2025年4月30日作成者: jarxiv

要約画像テキストペアを使用したマルチモーダルコントラスト学習（MMCL）によっ … 続きを読む →

カテゴリー: cs.CV, cs.LG | コメントを受け付けていません

3D ReX: Causal Explanations in 3D Neuroimaging Classification

投稿日: 2025年4月30日作成者: jarxiv

要約説明可能性は、医療イメージングにおけるAIモデルにとって重要な問題のままで … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.LG, eess.IV | コメントを受け付けていません

「cs.LG」カテゴリーアーカイブ

DYNAMAX: Dynamic computing for Transformers and Mamba based architectures

Training Plug-n-Play Knowledge Modules with Deep Context Distillation

Wanda++: Pruning Large Language Models via Regional Gradients

Test-time regression: a unifying framework for designing sequence models with associative memory

Hubs and Spokes Learning: Efficient and Scalable Collaborative Machine Learning

Toward Efficient Exploration by Large Language Model Agents

The Best of Both Worlds: Integrating Language Models and Diffusion Models for Video Generation

UniversalRAG: Retrieval-Augmented Generation over Multiple Corpora with Diverse Modalities and Granularities

Negate or Embrace: On How Misalignment Shapes Multimodal Representation Learning

3D ReX: Causal Explanations in 3D Neuroimaging Classification

最近の投稿

最近のコメント

アーカイブ

カテゴリー