「cs.CL」カテゴリーアーカイブ

Does Writing with Language Models Reduce Content Diversity?

投稿日: 2024年7月2日作成者: jarxiv

要約大規模言語モデル (LLM) により、モデル支援を利用した共同執筆が急増し … 続きを読む →

カテゴリー: cs.CL, cs.CY, cs.HC, cs.LG | コメントを受け付けていません

Predicting Text Preference Via Structured Comparative Reasoning

投稿日: 2024年7月2日作成者: jarxiv

要約比較推論はテキストの好みの予測において重要な役割を果たします。ただし、大 … 続きを読む →

カテゴリー: cs.CL | コメントを受け付けていません

Safe and Responsible Large Language Model : Can We Balance Bias Reduction and Language Understanding in Large Language Models?

投稿日: 2024年7月2日作成者: jarxiv

要約大規模言語モデル (LLM) により、さまざまな NLP タスクが大幅に進 … 続きを読む →

カテゴリー: cs.CL | コメントを受け付けていません

WaterBench: Towards Holistic Evaluation of Watermarks for Large Language Models

投稿日: 2024年7月2日作成者: jarxiv

要約大規模言語モデル (LLM) の潜在的な誤用を軽減するために、最近の研究で … 続きを読む →

カテゴリー: cs.AI, cs.CL | コメントを受け付けていません

RouteLLM: Learning to Route LLMs with Preference Data

投稿日: 2024年7月2日作成者: jarxiv

要約大規模言語モデル (LLM) は、幅広いタスクにわたって優れた機能を発揮し … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.LG | コメントを受け付けていません

Benchmarking Mental State Representations in Language Models

投稿日: 2024年7月2日作成者: jarxiv

要約心の理論による推論を必要とするタスクに対する言語モデル (LM) の生成パ … 続きを読む →

カテゴリー: cs.AI, cs.CL | コメントを受け付けていません

Assessing the Brittleness of Safety Alignment via Pruning and Low-Rank Modifications

投稿日: 2024年7月2日作成者: jarxiv

要約大規模言語モデル (LLM) は、ジェイルブレイクや、さらには悪意のない微 … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.LG | コメントを受け付けていません

Is one brick enough to break the wall of spoken dialogue state tracking?

投稿日: 2024年7月2日作成者: jarxiv

要約タスク指向対話 (TOD) システムでは、ユーザーの要求に対するシステムの … 続きを読む →

カテゴリー: cs.AI, cs.CL, eess.AS, eess.SP | コメントを受け付けていません

How Reliable Are Automatic Evaluation Methods for Instruction-Tuned LLMs?

投稿日: 2024年7月2日作成者: jarxiv

要約命令調整された大規模言語モデル (LLM) の取り組みでは、人間による評価 … 続きを読む →

カテゴリー: cs.AI, cs.CL | コメントを受け付けていません

Textual Similarity as a Key Metric in Machine Translation Quality Estimation

投稿日: 2024年7月2日作成者: jarxiv

要約機械翻訳 (MT) 品質評価 (QE) は、参考テキストなしで翻訳の信頼性 … 続きを読む →

カテゴリー: cs.AI, cs.CL | コメントを受け付けていません

「cs.CL」カテゴリーアーカイブ

Does Writing with Language Models Reduce Content Diversity?

Predicting Text Preference Via Structured Comparative Reasoning

Safe and Responsible Large Language Model : Can We Balance Bias Reduction and Language Understanding in Large Language Models?

WaterBench: Towards Holistic Evaluation of Watermarks for Large Language Models

RouteLLM: Learning to Route LLMs with Preference Data

Benchmarking Mental State Representations in Language Models

Assessing the Brittleness of Safety Alignment via Pruning and Low-Rank Modifications

Is one brick enough to break the wall of spoken dialogue state tracking?

How Reliable Are Automatic Evaluation Methods for Instruction-Tuned LLMs?

Textual Similarity as a Key Metric in Machine Translation Quality Estimation

最近の投稿

最近のコメント

アーカイブ

カテゴリー