「cs.CL」カテゴリーアーカイブ

Quantifying the Reasoning Abilities of LLMs on Real-world Clinical Cases

投稿日: 2025年3月11日作成者: jarxiv

要約 Deepseek-R1やOpenai-O3などの推論強化大型言語モデル（L … 続きを読む →

カテゴリー: cs.CL | コメントを受け付けていません

Folded Context Condensation in Path Integral Formalism for Infinite Context Transformers

投稿日: 2025年3月11日作成者: jarxiv

要約この作業では、パス積分形式のフレームワーク内でコアメカニズムを再解釈するこ … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.LG, cs.NE, hep-ph | コメントを受け付けていません

Token-Supervised Value Models for Enhancing Mathematical Problem-Solving Capabilities of Large Language Models

投稿日: 2025年3月11日作成者: jarxiv

要約大規模な言語モデル（LLM）の数学的問題解決能力を改善するためのテスト時間 … 続きを読む →

カテゴリー: cs.AI, cs.CL | コメントを受け付けていません

Language Models Fail to Introspect About Their Knowledge of Language

投稿日: 2025年3月11日作成者: jarxiv

要約大規模な言語モデル（LLM）が自分の内部状態について内省できるかどうかに最 … 続きを読む →

カテゴリー: cs.AI, cs.CL | コメントを受け付けていません

TokenButler: Token Importance is Predictable

投稿日: 2025年3月11日作成者: jarxiv

要約大規模な言語モデル（LLMS）は、キー価値（kV）キャッシュに依存してトー … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.LG | コメントを受け付けていません

LMM-R1: Empowering 3B LMMs with Strong Reasoning Abilities Through Two-Stage Rule-Based RL

投稿日: 2025年3月11日作成者: jarxiv

要約大規模なマルチモーダルモデル（LMMS）の推論の強化は、特に建築的制約が推 … 続きを読む →

カテゴリー: cs.AI, cs.CL | コメントを受け付けていません

KSOD: Knowledge Supplement for LLMs On Demand

投稿日: 2025年3月11日作成者: jarxiv

要約大規模な言語モデル（LLM）は、さまざまなタスクで顕著な機能を実証していま … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.LG | コメントを受け付けていません

MA-LoT: Multi-Agent Lean-based Long Chain-of-Thought Reasoning enhances Formal Theorem Proving

投稿日: 2025年3月11日作成者: jarxiv

要約 LEANのようなコンピューターで検証可能な言語を使用して数学的問題を解決す … 続きを読む →

カテゴリー: cs.AI, cs.CL | コメントを受け付けていません

Optimizing Test-Time Compute via Meta Reinforcement Fine-Tuning

投稿日: 2025年3月11日作成者: jarxiv

要約テスト時間計算を効果的に使用するトレーニングモデルは、LLMSの推論パフォ … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.LG | コメントを受け付けていません

Sparrow: Data-Efficient Video-LLM with Text-to-Image Augmentation

投稿日: 2025年3月11日作成者: jarxiv

要約近年、ビジョン理解ドメインにおけるマルチモーダル大手言語モデル（MLLM） … 続きを読む →

カテゴリー: cs.CL, cs.CV, cs.LG | コメントを受け付けていません

「cs.CL」カテゴリーアーカイブ

Quantifying the Reasoning Abilities of LLMs on Real-world Clinical Cases

Folded Context Condensation in Path Integral Formalism for Infinite Context Transformers

Token-Supervised Value Models for Enhancing Mathematical Problem-Solving Capabilities of Large Language Models

Language Models Fail to Introspect About Their Knowledge of Language

TokenButler: Token Importance is Predictable

LMM-R1: Empowering 3B LMMs with Strong Reasoning Abilities Through Two-Stage Rule-Based RL

KSOD: Knowledge Supplement for LLMs On Demand

MA-LoT: Multi-Agent Lean-based Long Chain-of-Thought Reasoning enhances Formal Theorem Proving

Optimizing Test-Time Compute via Meta Reinforcement Fine-Tuning

Sparrow: Data-Efficient Video-LLM with Text-to-Image Augmentation

最近の投稿

最近のコメント

アーカイブ

カテゴリー