「cs.LG」カテゴリーアーカイブ

Efficient Reasoning with Hidden Thinking

投稿日: 2025年2月3日作成者: jarxiv

要約チェーンオブテーブ（COT）の推論は、マルチモーダル大手言語モデル（MLL … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.LG | コメントを受け付けていません

Can LLMs Separate Instructions From Data? And What Do We Even Mean By That?

投稿日: 2025年2月3日作成者: jarxiv

要約命令チューニングされた大手言語モデル（LLMS）は、多数の実用的なアプリケ … 続きを読む →

カテゴリー: cs.CL, cs.LG | コメントを受け付けていません

mFollowIR: a Multilingual Benchmark for Instruction Following in Retrieval

投稿日: 2025年2月3日作成者: jarxiv

要約検索システムは一般に、短くて不足しているWebスタイルのクエリに焦点を当て … 続きを読む →

カテゴリー: cs.CL, cs.IR, cs.LG | コメントを受け付けていません

Judge Decoding: Faster Speculative Sampling Requires Going Beyond Model Alignment

投稿日: 2025年2月3日作成者: jarxiv

要約大規模な言語モデル（LLMS）のパフォーマンスは、その基礎となるサイズに密 … 続きを読む →

カテゴリー: cs.CL, cs.LG | コメントを受け付けていません

FlexiGPT: Pruning and Extending Large Language Models with Low-Rank Weight Sharing

投稿日: 2025年2月3日作成者: jarxiv

要約自然言語処理（NLP）における大規模な言語モデル（LLMS）の急速な増殖は … 続きを読む →

カテゴリー: cs.CL, cs.LG | コメントを受け付けていません

GPT-4o as the Gold Standard: A Scalable and General Purpose Approach to Filter Language Model Pretraining Data

投稿日: 2025年2月3日作成者: jarxiv

要約大規模な言語モデルには膨大な量の高品質のトレーニングデータが必要ですが、W … 続きを読む →

カテゴリー: cs.CL, cs.LG | コメントを受け付けていません

SELMA: A Speech-Enabled Language Model for Virtual Assistant Interactions

投稿日: 2025年2月3日作成者: jarxiv

要約この作業では、オーディオとテキストを大規模な言語モデル（LLM）に統合する … 続きを読む →

カテゴリー: cs.CL, cs.LG, cs.SD, eess.AS | コメントを受け付けていません

Strassen Attention: Unlocking Compositional Abilities in Transformers Based on a New Lower Bound Method

投稿日: 2025年2月3日作成者: jarxiv

要約変圧器の理論的な制限を評価するための新しい方法を提案し、無限の精度で1層ソ … 続きを読む →

カテゴリー: cs.AI, cs.LG | コメントを受け付けていません

Sparse Autoencoders Reveal Universal Feature Spaces Across Large Language Models

投稿日: 2025年2月3日作成者: jarxiv

要約私たちは、大規模な言語モデル（LLMS）の特徴普遍性を調査します。これは、 … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.LG | コメントを受け付けていません

Policy Gradient Methods for Risk-Sensitive Distributional Reinforcement Learning with Provable Convergence

投稿日: 2025年2月3日作成者: jarxiv

要約リスクに敏感な強化学習（RL）は、ハイステークスアプリケーションで信頼でき … 続きを読む →

カテゴリー: cs.AI, cs.LG, math.OC | コメントを受け付けていません

「cs.LG」カテゴリーアーカイブ

Efficient Reasoning with Hidden Thinking

Can LLMs Separate Instructions From Data? And What Do We Even Mean By That?

mFollowIR: a Multilingual Benchmark for Instruction Following in Retrieval

Judge Decoding: Faster Speculative Sampling Requires Going Beyond Model Alignment

FlexiGPT: Pruning and Extending Large Language Models with Low-Rank Weight Sharing

GPT-4o as the Gold Standard: A Scalable and General Purpose Approach to Filter Language Model Pretraining Data

SELMA: A Speech-Enabled Language Model for Virtual Assistant Interactions

Strassen Attention: Unlocking Compositional Abilities in Transformers Based on a New Lower Bound Method

Sparse Autoencoders Reveal Universal Feature Spaces Across Large Language Models

Policy Gradient Methods for Risk-Sensitive Distributional Reinforcement Learning with Provable Convergence

最近の投稿

最近のコメント

アーカイブ

カテゴリー