「cs.CL」カテゴリーアーカイブ

Quality Estimation based Feedback Training for Improving Pronoun Translation

投稿日: 2025年1月7日作成者: jarxiv

要約代名詞の翻訳は、ニューラル機械翻訳 (NMT) における長年の課題であり、 … 続きを読む →

カテゴリー: cs.AI, cs.CL | コメントを受け付けていません

Automating the Generation of Prompts for LLM-based Action Choice in PDDL Planning

投稿日: 2025年1月7日作成者: jarxiv

要約大規模言語モデル (LLM) は、さまざまな NLP タスクに革命をもたら … 続きを読む →

カテゴリー: cs.AI, cs.CL | コメントを受け付けていません

Quantization Meets Reasoning: Exploring LLM Low-Bit Quantization Degradation for Mathematical Reasoning

投稿日: 2025年1月7日作成者: jarxiv

要約大規模な言語モデルは、MATH などの複雑な数学的推論ベンチマークで大幅な … 続きを読む →

カテゴリー: cs.AI, cs.CL | コメントを受け付けていません

Decoupling Knowledge and Reasoning in Transformers: A Modular Architecture with Generalized Cross-Attention

投稿日: 2025年1月7日作成者: jarxiv

要約 Transformer はさまざまな分野で目覚ましい成功を収めてきましたが … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.LG | コメントを受け付けていません

LangFair: A Python Package for Assessing Bias and Fairness in Large Language Model Use Cases

投稿日: 2025年1月7日作成者: jarxiv

要約大規模言語モデル (LLM) はさまざまな形でバイアスを示し、性別、人種、 … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.CY, cs.LG | コメントを受け付けていません

Lived Experience Not Found: LLMs Struggle to Align with Experts on Addressing Adverse Drug Reactions from Psychiatric Medication Use

投稿日: 2025年1月7日作成者: jarxiv

要約精神科治療薬による薬物副作用（ADR）は、メンタルヘルス患者の入院の主な原 … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.CY | コメントを受け付けていません

PRMBench: A Fine-grained and Challenging Benchmark for Process-Level Reward Models

投稿日: 2025年1月7日作成者: jarxiv

要約プロセスレベルの報酬モデル (PRM) は、複雑な推論および意思決定タス … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.LG | コメントを受け付けていません

ForecastBench: A Dynamic Benchmark of AI Forecasting Capabilities

投稿日: 2025年1月7日作成者: jarxiv

要約将来の出来事の予測は、情報に基づいた意思決定に不可欠な情報です。機械学習 … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.LG | コメントを受け付けていません

Are Your LLMs Capable of Stable Reasoning?

投稿日: 2025年1月7日作成者: jarxiv

要約大規模言語モデル (LLM) の急速な進歩により、複雑な推論タスクにおける … 続きを読む →

カテゴリー: cs.AI, cs.CL | コメントを受け付けていません

The Two-Hop Curse: LLMs trained on A$\rightarrow$B, B$\rightarrow$C fail to learn A$\rightarrow$C

投稿日: 2025年1月7日作成者: jarxiv

要約 [注意: このバージョンは古いです。最近の研究はいくつかの重要な主張に矛 … 続きを読む →

カテゴリー: cs.AI, cs.CL | コメントを受け付けていません

「cs.CL」カテゴリーアーカイブ

Quality Estimation based Feedback Training for Improving Pronoun Translation

Automating the Generation of Prompts for LLM-based Action Choice in PDDL Planning

Quantization Meets Reasoning: Exploring LLM Low-Bit Quantization Degradation for Mathematical Reasoning

Decoupling Knowledge and Reasoning in Transformers: A Modular Architecture with Generalized Cross-Attention

LangFair: A Python Package for Assessing Bias and Fairness in Large Language Model Use Cases

Lived Experience Not Found: LLMs Struggle to Align with Experts on Addressing Adverse Drug Reactions from Psychiatric Medication Use

PRMBench: A Fine-grained and Challenging Benchmark for Process-Level Reward Models

ForecastBench: A Dynamic Benchmark of AI Forecasting Capabilities

Are Your LLMs Capable of Stable Reasoning?

The Two-Hop Curse: LLMs trained on A$\rightarrow$B, B$\rightarrow$C fail to learn A$\rightarrow$C

最近の投稿

最近のコメント

アーカイブ

カテゴリー