Archive of posts by "jarxiv"

LARGE: Legal Retrieval Augmented Generation Evaluation Tool

Posted: April 3, 2025 | Author: jarxiv

Abstract: Recently, to enhance the capabilities of large language models (LLMs), retrieval-augmented generation (RA …

Categories: cs.CL

Finding Transformer Circuits with Edge Pruning

Posted: April 3, 2025 | Author: jarxiv

Abstract: The path to interpreting language models often proceeds through the analysis of circuits. This …

Categories: cs.CL

Multilingual European Language Models: Benchmarking Approaches and Challenges

Posted: April 3, 2025 | Author: jarxiv

Abstract: Generative large language models that can solve a variety of tasks through chat interaction ( …

Categories: cs.CL

DEPT: Decoupled Embeddings for Pre-training Language Models

Posted: April 3, 2025 | Author: jarxiv

Abstract: Language model pre-training uses broad data mixtures across domains and …

Categories: cs.CL, cs.LG

Code Generation and Algorithmic Problem Solving Using Llama 3.1 405B

Posted: April 3, 2025 | Author: jarxiv

Abstract: Llama 3.1 models, such as Meta's Llama 3.1 405B, …

Categories: cs.CL, cs.SE

Is the Reversal Curse a Binding Problem? Uncovering Limitations of Transformers from a Basic Generalization Failure

Posted: April 3, 2025 | Author: jarxiv

Abstract: Despite their impressive capabilities, LLMs exhibit the Reversal Curse, a basic …

Categories: cs.CL, cs.LG

Review, Refine, Repeat: Understanding Iterative Decoding of AI Agents with Dynamic Evaluation and Selection

Posted: April 3, 2025 | Author: jarxiv

Abstract: AI agents have shown remarkable performance on a variety of tasks, but …

Categories: cs.CL

OpenCodeReasoning: Advancing Data Distillation for Competitive Coding

Posted: April 3, 2025 | Author: jarxiv

Abstract: Since the advent of reasoning-based large language models, many have sought to pass reasoning capabilities on to student models …

Categories: cs.CL

Interpretable Steering of Large Language Models with Feature Guided Activation Additions

Posted: April 3, 2025 | Author: jarxiv

Abstract: Effective and reliable control over the behavior of large language models (LLMs) is an important …

Categories: cs.AI, cs.CL, cs.LG

Epistemic Skills: Reasoning about Knowledge and Oblivion

Posted: April 3, 2025 | Author: jarxiv

Abstract: This paper, while incorporating notions of group knowledge, covers acquiring knowledge and descending into oblivion …

Categories: cs.AI, cs.CC, cs.LO
