「cs.LG」カテゴリーアーカイブ

Non-Halting Queries: Exploiting Fixed Points in LLMs

投稿日: 2025年2月25日作成者: jarxiv

要約自己回帰モデルの固定点を悪用する新しい脆弱性を導入し、それを使用して停止し … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.LG | コメントを受け付けていません

Aligned at the Start: Conceptual Groupings in LLM Embeddings

投稿日: 2025年2月25日作成者: jarxiv

要約このペーパーでは、焦点を見越えられている入力埋め込み、つまりトランスブロッ … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.LG | コメントを受け付けていません

Score Change of Variables

投稿日: 2025年2月25日作成者: jarxiv

要約スコア関数の変数式の一般的な変更を導き出します。スムーズで反転可能な変換$ … 続きを読む →

カテゴリー: 68T01, cs.AI, cs.LG, I.2.6, math.PR | コメントを受け付けていません

Big-Math: A Large-Scale, High-Quality Math Dataset for Reinforcement Learning in Language Models

投稿日: 2025年2月25日作成者: jarxiv

要約推論モデルへの関心の高まりにより、数学はアルゴリズムと方法論の改善の顕著な … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.LG | コメントを受け付けていません

Learning to Reason at the Frontier of Learnability

投稿日: 2025年2月25日作成者: jarxiv

要約強化学習は現在、特に数学の問題などの推論スタイルのタスクについて、大規模な … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.LG | コメントを受け付けていません

The Empirical Impact of Reducing Symmetries on the Performance of Deep Ensembles and MoE

投稿日: 2025年2月25日作成者: jarxiv

要約最近の研究では、ニューラルネットワークの対称性を減らすことで、パラメーター … 続きを読む →

カテゴリー: cs.AI, cs.LG | コメントを受け付けていません

Large Language Models are Powerful EHR Encoders

投稿日: 2025年2月25日作成者: jarxiv

要約電子健康記録（EHR）は臨床的予測の豊富な可能性を提供しますが、それらの固 … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.LG | コメントを受け付けていません

Reasoning with Latent Thoughts: On the Power of Looped Transformers

投稿日: 2025年2月25日作成者: jarxiv

要約大規模な言語モデルは、顕著な推論能力を示しており、スケーリング法則は、特に … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.LG | コメントを受け付けていません

The Geometry of Refusal in Large Language Models: Concept Cones and Representational Independence

投稿日: 2025年2月25日作成者: jarxiv

要約大規模な言語モデル（LLM）の安全性の配置は、敵対的に作られた入力を介して … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.LG | コメントを受け付けていません

LongSpec: Long-Context Speculative Decoding with Efficient Drafting and Verification

投稿日: 2025年2月25日作成者: jarxiv

要約投機的デコードは、大規模な言語モデル（LLMS）における自己回帰デコードの … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.LG | コメントを受け付けていません

「cs.LG」カテゴリーアーカイブ

Non-Halting Queries: Exploiting Fixed Points in LLMs

Aligned at the Start: Conceptual Groupings in LLM Embeddings

Score Change of Variables

Big-Math: A Large-Scale, High-Quality Math Dataset for Reinforcement Learning in Language Models

Learning to Reason at the Frontier of Learnability

The Empirical Impact of Reducing Symmetries on the Performance of Deep Ensembles and MoE

Large Language Models are Powerful EHR Encoders

Reasoning with Latent Thoughts: On the Power of Looped Transformers

The Geometry of Refusal in Large Language Models: Concept Cones and Representational Independence

LongSpec: Long-Context Speculative Decoding with Efficient Drafting and Verification

最近の投稿

最近のコメント

アーカイブ

カテゴリー