月別アーカイブ: 2025年4月

ScholarCopilot: Training Large Language Models for Academic Writing with Accurate Citations

投稿日: 2025年4月4日作成者: jarxiv

要約アカデミックライティングでは、首尾一貫したテキスト生成と関連文献の正確な引 … 続きを読む →

カテゴリー: cs.CL | コメントを受け付けていません

LLM for Complex Reasoning Task: An Exploratory Study in Fermi Problems

投稿日: 2025年4月4日作成者: jarxiv

要約フェルミ問題(FP)は、人間のような論理と数値推論を必要とする数学的推論課 … 続きを読む →

カテゴリー: cs.CL | コメントを受け付けていません

Limitations of Religious Data and the Importance of the Target Domain: Towards Machine Translation for Guinea-Bissau Creole

投稿日: 2025年4月4日作成者: jarxiv

要約ギニアビサウ・クレオール語（Kiriol）の機械翻訳のための新しいデータセ … 続きを読む →

カテゴリー: cs.CL | コメントを受け付けていません

The Hidden Space of Safety: Understanding Preference-Tuned LLMs in Multilingual context

投稿日: 2025年4月4日作成者: jarxiv

要約アライメントチューニングにより、大規模な言語モデルは推論、命令追従、有害な … 続きを読む →

カテゴリー: cs.CL | コメントを受け付けていません

ERPO: Advancing Safety Alignment via Ex-Ante Reasoning Preference Optimization

投稿日: 2025年4月4日作成者: jarxiv

要約近年の大規模言語モデル（LLM）の進歩により、人工知能の進歩が加速している … 続きを読む →

カテゴリー: cs.CL | コメントを受け付けていません

Why do LLMs attend to the first token?

投稿日: 2025年4月4日作成者: jarxiv

要約大規模言語モデル(LLM)は、シーケンスの最初のトークンに集中する傾向があ … 続きを読む →

カテゴリー: cs.CL | コメントを受け付けていません

Enhancing LLM Robustness to Perturbed Instructions: An Empirical Study

投稿日: 2025年4月4日作成者: jarxiv

要約大規模言語モデル（LLM）は入力の摂動に対して非常に脆弱である。LLMのロ … 続きを読む →

カテゴリー: cs.CL | コメントを受け付けていません

Measuring Large Language Models Capacity to Annotate Journalistic Sourcing

投稿日: 2025年4月4日作成者: jarxiv

要約 2022年後半にChatGPTが発表されて以来、大規模言語モデルの能力とそ … 続きを読む →

カテゴリー: cs.CL, cs.CY | コメントを受け付けていません

MultiBLiMP 1.0: A Massively Multilingual Benchmark of Linguistic Minimal Pairs

投稿日: 2025年4月4日作成者: jarxiv

要約 101の言語、6つの言語現象をカバーし、125,000以上のミニマルペアを … 続きを読む →

カテゴリー: cs.CL | コメントを受け付けていません

A Framework for Robust Cognitive Evaluation of LLMs

投稿日: 2025年4月4日作成者: jarxiv

要約大規模言語モデル（LLM）における創発的な認知能力は広く観察されているが、 … 続きを読む →

カテゴリー: cs.CL | コメントを受け付けていません

月別アーカイブ: 2025年4月

ScholarCopilot: Training Large Language Models for Academic Writing with Accurate Citations

LLM for Complex Reasoning Task: An Exploratory Study in Fermi Problems

Limitations of Religious Data and the Importance of the Target Domain: Towards Machine Translation for Guinea-Bissau Creole

The Hidden Space of Safety: Understanding Preference-Tuned LLMs in Multilingual context

ERPO: Advancing Safety Alignment via Ex-Ante Reasoning Preference Optimization

Why do LLMs attend to the first token?

Enhancing LLM Robustness to Perturbed Instructions: An Empirical Study

Measuring Large Language Models Capacity to Annotate Journalistic Sourcing

MultiBLiMP 1.0: A Massively Multilingual Benchmark of Linguistic Minimal Pairs

A Framework for Robust Cognitive Evaluation of LLMs

最近の投稿

最近のコメント

アーカイブ

カテゴリー