「cs.LG」カテゴリーアーカイブ

Diversity as a Reward: Fine-Tuning LLMs on a Mixture of Domain-Undetermined Data

投稿日: 2025年5月23日作成者: jarxiv

要約多様なデータセットを使用した大規模な言語モデル（LLMS）の微調整は、さま … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.LG | コメントを受け付けていません

Don’t ‘Overthink’ Passage Reranking: Is Reasoning Truly Necessary?

投稿日: 2025年5月23日作成者: jarxiv

要約複雑な自然言語のタスクにわたる推論モデルの成功により、情報検索（IR）コミ … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.IR, cs.LG | コメントを受け付けていません

Slamming: Training a Speech Language Model on One GPU in a Day

投稿日: 2025年5月23日作成者: jarxiv

要約 24時間で単一のアカデミックGPUで高品質の音声言語モデル（SLM）をトレ … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.LG, cs.SD, eess.AS | コメントを受け付けていません

Structure-Aligned Protein Language Model

投稿日: 2025年5月23日作成者: jarxiv

要約タンパク質言語モデル（PLMS）は、さまざまな下流タスクで優れている広大な … 続きを読む →

カテゴリー: cs.AI, cs.LG | コメントを受け付けていません

WGFormer: An SE(3)-Transformer Driven by Wasserstein Gradient Flows for Molecular Ground-State Conformation Prediction

投稿日: 2025年5月23日作成者: jarxiv

要約分子基質立体構造（すなわち、エネルギー最大の立体構造）を予測することは、分 … 続きを読む →

カテゴリー: cs.AI, cs.LG, physics.chem-ph, q-bio.BM | コメントを受け付けていません

Latent Principle Discovery for Language Model Self-Improvement

投稿日: 2025年5月23日作成者: jarxiv

要約言語モデル（LM）ユーザーが世代の品質を向上させることを目指している場合、 … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.LG | コメントを受け付けていません

Beyond Needle(s) in the Embodied Haystack: Environment, Architecture, and Training Considerations for Long Context Reasoning

投稿日: 2025年5月23日作成者: jarxiv

要約 $ \ infty $ -thorを紹介します。これは、具体化されたAIで … 続きを読む →

カテゴリー: cs.AI, cs.LG, cs.RO | コメントを受け付けていません

The Polar Express: Optimal Matrix Sign Methods and Their Application to the Muon Algorithm

投稿日: 2025年5月23日作成者: jarxiv

要約極性分解と関連するマトリックス記号関数を計算することは、数十年にわたって数 … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.LG, cs.NA, math.NA, math.OC | コメントを受け付けていません

FoMoH: A clinically meaningful foundation model evaluation for structured electronic health records

投稿日: 2025年5月23日作成者: jarxiv

要約財団モデルは、ダウンストリームタスクとは無関係に意味のある表現を抽出する能 … 続きを読む →

カテゴリー: cs.AI, cs.LG | コメントを受け付けていません

MixAT: Combining Continuous and Discrete Adversarial Training for LLMs

投稿日: 2025年5月23日作成者: jarxiv

要約大規模な言語モデル（LLMS）の安全性とアラインメントでの最近の努力にもか … 続きを読む →

カテゴリー: cs.AI, cs.LG, I.2.7 | コメントを受け付けていません

「cs.LG」カテゴリーアーカイブ

Diversity as a Reward: Fine-Tuning LLMs on a Mixture of Domain-Undetermined Data

Don’t ‘Overthink’ Passage Reranking: Is Reasoning Truly Necessary?

Slamming: Training a Speech Language Model on One GPU in a Day

Structure-Aligned Protein Language Model

WGFormer: An SE(3)-Transformer Driven by Wasserstein Gradient Flows for Molecular Ground-State Conformation Prediction

Latent Principle Discovery for Language Model Self-Improvement

Beyond Needle(s) in the Embodied Haystack: Environment, Architecture, and Training Considerations for Long Context Reasoning

The Polar Express: Optimal Matrix Sign Methods and Their Application to the Muon Algorithm

FoMoH: A clinically meaningful foundation model evaluation for structured electronic health records

MixAT: Combining Continuous and Discrete Adversarial Training for LLMs

最近の投稿

最近のコメント

アーカイブ

カテゴリー