「cs.CL」カテゴリーアーカイブ

The Hidden Space of Safety: Understanding Preference-Tuned LLMs in Multilingual context

投稿日: 2025年4月4日作成者: jarxiv

要約アライメントチューニングにより、大規模な言語モデルは推論、命令追従、有害な … 続きを読む →

カテゴリー: cs.CL | コメントを受け付けていません

ERPO: Advancing Safety Alignment via Ex-Ante Reasoning Preference Optimization

投稿日: 2025年4月4日作成者: jarxiv

要約近年の大規模言語モデル（LLM）の進歩により、人工知能の進歩が加速している … 続きを読む →

カテゴリー: cs.CL | コメントを受け付けていません

Why do LLMs attend to the first token?

投稿日: 2025年4月4日作成者: jarxiv

要約大規模言語モデル(LLM)は、シーケンスの最初のトークンに集中する傾向があ … 続きを読む →

カテゴリー: cs.CL | コメントを受け付けていません

Enhancing LLM Robustness to Perturbed Instructions: An Empirical Study

投稿日: 2025年4月4日作成者: jarxiv

要約大規模言語モデル（LLM）は入力の摂動に対して非常に脆弱である。LLMのロ … 続きを読む →

カテゴリー: cs.CL | コメントを受け付けていません

Measuring Large Language Models Capacity to Annotate Journalistic Sourcing

投稿日: 2025年4月4日作成者: jarxiv

要約 2022年後半にChatGPTが発表されて以来、大規模言語モデルの能力とそ … 続きを読む →

カテゴリー: cs.CL, cs.CY | コメントを受け付けていません

MultiBLiMP 1.0: A Massively Multilingual Benchmark of Linguistic Minimal Pairs

投稿日: 2025年4月4日作成者: jarxiv

要約 101の言語、6つの言語現象をカバーし、125,000以上のミニマルペアを … 続きを読む →

カテゴリー: cs.CL | コメントを受け付けていません

A Framework for Robust Cognitive Evaluation of LLMs

投稿日: 2025年4月4日作成者: jarxiv

要約大規模言語モデル（LLM）における創発的な認知能力は広く観察されているが、 … 続きを読む →

カテゴリー: cs.CL | コメントを受け付けていません

BIRD: A Trustworthy Bayesian Inference Framework for Large Language Models

投稿日: 2025年4月4日作成者: jarxiv

要約予測モデルは、実世界のタスクにおいてしばしば不完全な情報を扱う必要がある。 … 続きを読む →

カテゴリー: cs.CL | コメントを受け付けていません

A Survey of Large Language Models in Mental Health Disorder Detection on Social Media

投稿日: 2025年4月4日作成者: jarxiv

要約メンタルヘルス問題の検出と介入は、世界的に重要な研究テーマであり、ソーシャ … 続きを読む →

カテゴリー: cs.CL, I.2.7 | コメントを受け付けていません

Measuring temporal effects of agent knowledge by date-controlled tool use

投稿日: 2025年4月4日作成者: jarxiv

要約知識の蓄積と更新には、時間的な進行が不可欠である。ウェブ検索はエージェント … 続きを読む →

カテゴリー: cs.CL, cs.IR | コメントを受け付けていません

「cs.CL」カテゴリーアーカイブ

The Hidden Space of Safety: Understanding Preference-Tuned LLMs in Multilingual context

ERPO: Advancing Safety Alignment via Ex-Ante Reasoning Preference Optimization

Why do LLMs attend to the first token?

Enhancing LLM Robustness to Perturbed Instructions: An Empirical Study

Measuring Large Language Models Capacity to Annotate Journalistic Sourcing

MultiBLiMP 1.0: A Massively Multilingual Benchmark of Linguistic Minimal Pairs

A Framework for Robust Cognitive Evaluation of LLMs

BIRD: A Trustworthy Bayesian Inference Framework for Large Language Models

A Survey of Large Language Models in Mental Health Disorder Detection on Social Media

Measuring temporal effects of agent knowledge by date-controlled tool use

最近の投稿

最近のコメント

アーカイブ

カテゴリー