「cs.CL」カテゴリーアーカイブ

REASONING GYM: Reasoning Environments for Reinforcement Learning with Verifiable Rewards

投稿日: 2025年6月2日作成者: jarxiv

要約検証可能な報酬を伴う強化学習のための推論環境のライブラリであるReashi … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.LG | コメントを受け付けていません

HelpSteer3: Human-Annotated Feedback and Edit Data to Empower Inference-Time Scaling in Open-Ended General-Domain Tasks

投稿日: 2025年6月2日作成者: jarxiv

要約推論時間スケーリングは、OpenAI O1やDeepSeek R1などの最 … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.LG | コメントを受け付けていません

Drop Dropout on Single-Epoch Language Model Pretraining

投稿日: 2025年6月2日作成者: jarxiv

要約もともと、ドロップアウトは、過剰適合を減らすことにより、深い学習のほぼすべ … 続きを読む →

カテゴリー: cs.AI, cs.CL | コメントを受け付けていません

Middle-Layer Representation Alignment for Cross-Lingual Transfer in Fine-Tuned LLMs

投稿日: 2025年6月2日作成者: jarxiv

要約大規模な言語モデルは、微調整を通じてタスク固有のアプリケーションで顕著な能 … 続きを読む →

カテゴリー: cs.AI, cs.CL | コメントを受け付けていません

You need to MIMIC to get FAME: Solving Meeting Transcript Scarcity with a Multi-Agent Conversations

投稿日: 2025年6月2日作成者: jarxiv

要約会議の要約は、主にプライバシーの制限と高価な収集プロセスのために、限られた … 続きを読む →

カテゴリー: cs.AI, cs.CL | コメントを受け付けていません

PhySense: Principle-Based Physics Reasoning Benchmarking for Large Language Models

投稿日: 2025年6月2日作成者: jarxiv

要約大規模な言語モデル（LLM）は急速に進歩しており、物理学の問題を含む複雑な … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.LG | コメントを受け付けていません

SparQLe: Speech Queries to Text Translation Through LLMs

投稿日: 2025年6月2日作成者: jarxiv

要約大規模な言語モデル（LLMS）の影響力が高まっているため、音声表現を統合し … 続きを読む →

カテゴリー: cs.AI, cs.CL | コメントを受け付けていません

Improving Reliability and Explainability of Medical Question Answering through Atomic Fact Checking in Retrieval-Augmented LLMs

投稿日: 2025年6月2日作成者: jarxiv

要約大規模な言語モデル（LLMS）は広範な医学的知識を示しますが、幻覚と不正確 … 続きを読む →

カテゴリー: cs.AI, cs.CL | コメントを受け付けていません

RuleArena: A Benchmark for Rule-Guided Reasoning with LLMs in Real-World Scenarios

投稿日: 2025年6月2日作成者: jarxiv

要約このペーパーでは、Rulearenaを紹介します。これは、推論において複雑 … 続きを読む →

カテゴリー: cs.AI, cs.CL | コメントを受け付けていません

MiCRo: Mixture Modeling and Context-aware Routing for Personalized Preference Learning

投稿日: 2025年6月2日作成者: jarxiv

要約報酬モデリングは、大規模な言語モデル（LLM）を調整するために人間のフィー … 続きを読む →

カテゴリー: cs.AI, cs.CL | コメントを受け付けていません

「cs.CL」カテゴリーアーカイブ

REASONING GYM: Reasoning Environments for Reinforcement Learning with Verifiable Rewards

HelpSteer3: Human-Annotated Feedback and Edit Data to Empower Inference-Time Scaling in Open-Ended General-Domain Tasks

Drop Dropout on Single-Epoch Language Model Pretraining

Middle-Layer Representation Alignment for Cross-Lingual Transfer in Fine-Tuned LLMs

You need to MIMIC to get FAME: Solving Meeting Transcript Scarcity with a Multi-Agent Conversations

PhySense: Principle-Based Physics Reasoning Benchmarking for Large Language Models

SparQLe: Speech Queries to Text Translation Through LLMs

Improving Reliability and Explainability of Medical Question Answering through Atomic Fact Checking in Retrieval-Augmented LLMs

RuleArena: A Benchmark for Rule-Guided Reasoning with LLMs in Real-World Scenarios

MiCRo: Mixture Modeling and Context-aware Routing for Personalized Preference Learning

最近の投稿

最近のコメント

アーカイブ

カテゴリー