「cs.CL」カテゴリーアーカイブ

Towards Universality: Studying Mechanistic Similarity Across Language Model Architectures

投稿日: 2024年10月11日作成者: jarxiv

要約解釈可能性における普遍性の仮説は、異なるニューラルネットワークが収束して … 続きを読む →

カテゴリー: cs.CL | コメントを受け付けていません

What Makes Large Language Models Reason in (Multi-Turn) Code Generation?

投稿日: 2024年10月11日作成者: jarxiv

要約思考連鎖などの即効性のある手法は、大規模言語モデル (LLM) の出力を向 … 続きを読む →

カテゴリー: cs.CL | コメントを受け付けていません

Think Beyond Size: Dynamic Prompting for More Effective Reasoning

投稿日: 2024年10月11日作成者: jarxiv

要約この文書では、大規模言語モデル (LLM) の推論機能の向上を目的とした新 … 続きを読む →

カテゴリー: cs.CL, cs.LG | コメントを受け付けていません

Rewarding Progress: Scaling Automated Process Verifiers for LLM Reasoning

投稿日: 2024年10月11日作成者: jarxiv

要約大規模な言語モデルで推論を改善するための有望なアプローチは、プロセス報酬モ … 続きを読む →

カテゴリー: cs.CL, cs.LG | コメントを受け付けていません

The Effect of Surprisal on Reading Times in Information Seeking and Repeated Reading

投稿日: 2024年10月11日作成者: jarxiv

要約驚きが処理の困難に及ぼす影響は、心理言語学における研究の中心的なテーマとな … 続きを読む →

カテゴリー: cs.CL | コメントを受け付けていません

GenARM: Reward Guided Generation with Autoregressive Reward Model for Test-time Alignment

投稿日: 2024年10月11日作成者: jarxiv

要約大規模言語モデル (LLM) は優れた機能を発揮しますが、人間の好みに注意 … 続きを読む →

カテゴリー: cs.CL | コメントを受け付けていません

Closing the Loop: Learning to Generate Writing Feedback via Language Model Simulated Student Revisions

投稿日: 2024年10月11日作成者: jarxiv

要約フィードバックを提供することは、生徒のライティングスキルを向上させるため … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.LG | コメントを受け付けていません

Teaching-Inspired Integrated Prompting Framework: A Novel Approach for Enhancing Reasoning in Large Language Models

投稿日: 2024年10月11日作成者: jarxiv

要約大規模言語モデル (LLM) は、さまざまなドメインにわたって優れたパフォ … 続きを読む →

カテゴリー: cs.AI, cs.CL | コメントを受け付けていません

Sparse Attention Decomposition Applied to Circuit Tracing

投稿日: 2024年10月11日作成者: jarxiv

要約多くの論文は、アテンションヘッドが互いに連携して複雑なタスクを実行すること … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.LG | コメントを受け付けていません

Paramanu: A Family of Novel Efficient Generative Foundation Language Models for Indian Languages

投稿日: 2024年10月11日作成者: jarxiv

要約インド言語の新しい言語モデル (LM) ファミリーである「Paramanu … 続きを読む →

カテゴリー: cs.AI, cs.CL | コメントを受け付けていません

「cs.CL」カテゴリーアーカイブ

Towards Universality: Studying Mechanistic Similarity Across Language Model Architectures

What Makes Large Language Models Reason in (Multi-Turn) Code Generation?

Think Beyond Size: Dynamic Prompting for More Effective Reasoning

Rewarding Progress: Scaling Automated Process Verifiers for LLM Reasoning

The Effect of Surprisal on Reading Times in Information Seeking and Repeated Reading

GenARM: Reward Guided Generation with Autoregressive Reward Model for Test-time Alignment

Closing the Loop: Learning to Generate Writing Feedback via Language Model Simulated Student Revisions

Teaching-Inspired Integrated Prompting Framework: A Novel Approach for Enhancing Reasoning in Large Language Models

Sparse Attention Decomposition Applied to Circuit Tracing

Paramanu: A Family of Novel Efficient Generative Foundation Language Models for Indian Languages

最近の投稿

最近のコメント

アーカイブ

カテゴリー