「cs.CL」カテゴリーアーカイブ

SATBench: Benchmarking LLMs’ Logical Reasoning via Automated Puzzle Generation from SAT Formulas

投稿日: 2025年5月21日作成者: jarxiv

要約 Satbenchを紹介します。これは、ブールの満足度（SAT）の問題から派 … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.LG, cs.LO | コメントを受け付けていません

TiEBe: Tracking Language Model Recall of Notable Worldwide Events Through Time

投稿日: 2025年5月21日作成者: jarxiv

要約知識の状況が進化し、大規模な言語モデル（LLM）がますます広くなるにつれて … 続きを読む →

カテゴリー: cs.AI, cs.CL | コメントを受け付けていません

TinyV: Reducing False Negatives in Verification Improves RL for LLM Reasoning

投稿日: 2025年5月21日作成者: jarxiv

要約 Rehnection Learning（RL）は、報酬信号でポリシーを最適 … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.LG | コメントを受け付けていません

Debating for Better Reasoning: An Unsupervised Multimodal Approach

投稿日: 2025年5月21日作成者: jarxiv

要約大規模な言語モデル（LLM）が多様なドメインとモダリティにわたって専門知識 … 続きを読む →

カテゴリー: cs.AI, cs.CL | コメントを受け付けていません

Will AI Tell Lies to Save Sick Children? Litmus-Testing AI Values Prioritization with AIRiskDilemmas

投稿日: 2025年5月21日作成者: jarxiv

要約 AIのリスクの検出は、より強力なモデルが出現し、これらの検出試行を回避する … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.CY, cs.HC, cs.LG | コメントを受け付けていません

SAFEPATH: Preventing Harmful Reasoning in Chain-of-Thought via Early Alignment

投稿日: 2025年5月21日作成者: jarxiv

要約大規模な推論モデル（LRM）は、複雑な問題解決のための強力なツールになりま … 続きを読む →

カテゴリー: cs.AI, cs.CL | コメントを受け付けていません

ContextAgent: Context-Aware Proactive LLM Agents with Open-World Sensory Perceptions

投稿日: 2025年5月21日作成者: jarxiv

要約大規模な言語モデル（LLMS）の最近の進歩により、インテリジェントなエージ … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.HC | コメントを受け付けていません

NExT-Search: Rebuilding User Feedback Ecosystem for Generative AI Search

投稿日: 2025年5月21日作成者: jarxiv

要約生成AI検索は、複雑なクエリに対するエンドツーエンドの回答を提供し、複数の … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.HC, cs.IR | コメントを受け付けていません

Mind the Gap: Bridging Thought Leap for Improved Chain-of-Thought Tuning

投稿日: 2025年5月21日作成者: jarxiv

要約大規模な言語モデル（LLMS）は、チェーンオブシャート（COT）の推論を通 … 続きを読む →

カテゴリー: cs.AI, cs.CL | コメントを受け付けていません

Does Acceleration Cause Hidden Instability in Vision Language Models? Uncovering Instance-Level Divergence Through a Large-Scale Empirical Study

投稿日: 2025年5月21日作成者: jarxiv

要約 Vision-Language Models（VLMS）は、広範囲にわたる … 続きを読む →

カテゴリー: cs.CL, cs.CV | コメントを受け付けていません

「cs.CL」カテゴリーアーカイブ

SATBench: Benchmarking LLMs’ Logical Reasoning via Automated Puzzle Generation from SAT Formulas

TiEBe: Tracking Language Model Recall of Notable Worldwide Events Through Time

TinyV: Reducing False Negatives in Verification Improves RL for LLM Reasoning

Debating for Better Reasoning: An Unsupervised Multimodal Approach

Will AI Tell Lies to Save Sick Children? Litmus-Testing AI Values Prioritization with AIRiskDilemmas

SAFEPATH: Preventing Harmful Reasoning in Chain-of-Thought via Early Alignment

ContextAgent: Context-Aware Proactive LLM Agents with Open-World Sensory Perceptions

NExT-Search: Rebuilding User Feedback Ecosystem for Generative AI Search

Mind the Gap: Bridging Thought Leap for Improved Chain-of-Thought Tuning

Does Acceleration Cause Hidden Instability in Vision Language Models? Uncovering Instance-Level Divergence Through a Large-Scale Empirical Study

最近の投稿

最近のコメント

アーカイブ

カテゴリー