「cs.CL」カテゴリーアーカイブ

Stochastic Chameleons: Irrelevant Context Hallucinations Reveal Class-Based (Mis)Generalization in LLMs

投稿日: 2025年5月29日作成者: jarxiv

要約 NLPベンチマーク上の大規模な言語モデル（LLMS）の広範な成功には、LL … 続きを読む →

カテゴリー: cs.CL | コメントを受け付けていません

Characterizing Bias: Benchmarking Large Language Models in Simplified versus Traditional Chinese

投稿日: 2025年5月29日作成者: jarxiv

要約大規模な言語モデル（LLM）の能力は、単純化された中国語と伝統的な中国語の … 続きを読む →

カテゴリー: cs.CL, cs.CY | コメントを受け付けていません

WebDancer: Towards Autonomous Information Seeking Agency

投稿日: 2025年5月29日作成者: jarxiv

要約複雑な現実世界の問題に対処するには、詳細な情報探索とマルチステップの推論が … 続きを読む →

カテゴリー: cs.CL | コメントを受け付けていません

The Climb Carves Wisdom Deeper Than the Summit: On the Noisy Rewards in Learning to Reason

投稿日: 2025年5月29日作成者: jarxiv

要約強化学習（RL）を通じて推論するためのトレーニング後の大手言語モデル（LL … 続きを読む →

カテゴリー: cs.CL | コメントを受け付けていません

GuessArena: Guess Who I Am? A Self-Adaptive Framework for Evaluating LLMs in Domain-Specific Knowledge and Reasoning

投稿日: 2025年5月29日作成者: jarxiv

要約大規模な言語モデル（LLMS）の評価は、伝統的に静的ベンチマークに依存して … 続きを読む →

カテゴリー: cs.CL | コメントを受け付けていません

AutoL2S: Auto Long-Short Reasoning for Efficient Large Language Models

投稿日: 2025年5月29日作成者: jarxiv

要約推論対応の大規模な言語モデル（LLMS）は、複雑な推論タスクで強力なパフォ … 続きを読む →

カテゴリー: cs.CL, cs.LG | コメントを受け付けていません

How Do LLMs Perform Two-Hop Reasoning in Context?

投稿日: 2025年5月29日作成者: jarxiv

要約「ソクラテスは人間です。すべての人間は致命的です。したがって、ソクラテ … 続きを読む →

カテゴリー: cs.AI, cs.CL | コメントを受け付けていません

SynLogic: Synthesizing Verifiable Reasoning Data at Scale for Learning Logical Reasoning and Beyond

投稿日: 2025年5月29日作成者: jarxiv

要約 Openai-O1やDeepseek R1などの最近の進歩により、大規模な … 続きを読む →

カテゴリー: cs.AI, cs.CL | コメントを受け付けていません

Moderating Harm: Benchmarking Large Language Models for Cyberbullying Detection in YouTube Comments

投稿日: 2025年5月29日作成者: jarxiv

要約オンラインプラットフォームが成長するにつれて、コメントセクションは、ユーザ … 続きを読む →

カテゴリー: cs.AI, cs.CL | コメントを受け付けていません

ClaimPKG: Enhancing Claim Verification via Pseudo-Subgraph Generation with Lightweight Specialized LLM

投稿日: 2025年5月29日作成者: jarxiv

要約知識グラフ（KGS）を統合して、大規模な言語モデル（LLM）の推論能力を強 … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.DB | コメントを受け付けていません

「cs.CL」カテゴリーアーカイブ

Stochastic Chameleons: Irrelevant Context Hallucinations Reveal Class-Based (Mis)Generalization in LLMs

Characterizing Bias: Benchmarking Large Language Models in Simplified versus Traditional Chinese

WebDancer: Towards Autonomous Information Seeking Agency

The Climb Carves Wisdom Deeper Than the Summit: On the Noisy Rewards in Learning to Reason

GuessArena: Guess Who I Am? A Self-Adaptive Framework for Evaluating LLMs in Domain-Specific Knowledge and Reasoning

AutoL2S: Auto Long-Short Reasoning for Efficient Large Language Models

How Do LLMs Perform Two-Hop Reasoning in Context?

SynLogic: Synthesizing Verifiable Reasoning Data at Scale for Learning Logical Reasoning and Beyond

Moderating Harm: Benchmarking Large Language Models for Cyberbullying Detection in YouTube Comments

ClaimPKG: Enhancing Claim Verification via Pseudo-Subgraph Generation with Lightweight Specialized LLM

最近の投稿

最近のコメント

アーカイブ

カテゴリー