月別アーカイブ: 2025年3月

SpeCache: Speculative Key-Value Caching for Efficient Generation of LLMs

投稿日: 2025年3月21日作成者: jarxiv

要約トランスベースの大手言語モデル（LLM）はすでに長いテキストタスクで顕著な … 続きを読む →

カテゴリー: cs.CL | コメントを受け付けていません

CodeReviewQA: The Code Review Comprehension Assessment for Large Language Models

投稿日: 2025年3月21日作成者: jarxiv

要約最先端の大規模な言語モデル（LLMS）は、印象的なコード生成機能を実証して … 続きを読む →

カテゴリー: cs.CL, cs.SE | コメントを受け付けていません

Binary-Integer-Programming Based Algorithm for Expert Load Balancing in Mixture-of-Experts Models

投稿日: 2025年3月21日作成者: jarxiv

要約 MOE（Expertsの混合）モデルの事前トレーニングの場合、主な問題の1 … 続きを読む →

カテゴリー: cs.CL, cs.LG | コメントを受け付けていません

Accurate Scene Text Recognition with Efficient Model Scaling and Cloze Self-Distillation

投稿日: 2025年3月21日作成者: jarxiv

要約スケーリングアーキテクチャは、シーンテキスト認識（STR）の改善に効果的で … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.CV | コメントを受け付けていません

Using Contextually Aligned Online Reviews to Measure LLMs’ Performance Disparities Across Language Varieties

投稿日: 2025年3月21日作成者: jarxiv

要約言語は異なる品種を持つことができます。これらの品種は、大規模な言語モデル … 続きを読む →

カテゴリー: cs.CL, cs.HC | コメントを受け付けていません

Reinforcement Learning for Reasoning in Small LLMs: What Works and What Doesn’t

投稿日: 2025年3月21日作成者: jarxiv

要約大規模な言語モデル（LLM）の推論機能を強化することは、通常、大規模な計算 … 続きを読む →

カテゴリー: cs.CL, cs.LG | コメントを受け付けていません

Graph-Guided Textual Explanation Generation Framework

投稿日: 2025年3月21日作成者: jarxiv

要約自然言語の説明（NLE）は、モデルの予測に関する推論のもっともらしい自由テ … 続きを読む →

カテゴリー: cs.CL | コメントを受け付けていません

Robust LLM safeguarding via refusal feature adversarial training

投稿日: 2025年3月21日作成者: jarxiv

要約大規模な言語モデル（LLM）は、有害な反応を引き出す可能性のある敵対的な攻 … 続きを読む →

カテゴリー: cs.CL, cs.CR, cs.LG | コメントを受け付けていません

Fin-R1: A Large Language Model for Financial Reasoning through Reinforcement Learning

投稿日: 2025年3月21日作成者: jarxiv

要約大きな言語モデルの推論は、さまざまなドメインで急速に進化しています。ただ … 続きを読む →

カテゴリー: cs.CL | コメントを受け付けていません

LLM Braces: Straightening Out LLM Predictions with Relevant Sub-Updates

投稿日: 2025年3月21日作成者: jarxiv

要約最近の発見は、変圧器ベースの大手言語モデル（LLM）の知識の多くがそのフィ … 続きを読む →

カテゴリー: cs.CL | コメントを受け付けていません

月別アーカイブ: 2025年3月

SpeCache: Speculative Key-Value Caching for Efficient Generation of LLMs

CodeReviewQA: The Code Review Comprehension Assessment for Large Language Models

Binary-Integer-Programming Based Algorithm for Expert Load Balancing in Mixture-of-Experts Models

Accurate Scene Text Recognition with Efficient Model Scaling and Cloze Self-Distillation

Using Contextually Aligned Online Reviews to Measure LLMs’ Performance Disparities Across Language Varieties

Reinforcement Learning for Reasoning in Small LLMs: What Works and What Doesn’t

Graph-Guided Textual Explanation Generation Framework

Robust LLM safeguarding via refusal feature adversarial training

Fin-R1: A Large Language Model for Financial Reasoning through Reinforcement Learning

LLM Braces: Straightening Out LLM Predictions with Relevant Sub-Updates

最近の投稿

最近のコメント

アーカイブ

カテゴリー