"cs.CL" Category Archive

Towards Building Multilingual Language Model for Medicine

Posted on: February 22, 2024 | Author: jarxiv

Summary: This paper, to benefit a wider, linguistically diverse audience across different regions, … Continue reading →

Category: cs.CL | Comments are closed

Are You Sure? Challenging LLMs Leads to Performance Drops in The FlipFlop Experiment

Posted on: February 22, 2024 | Author: jarxiv

Summary: Owing to the interactive nature of large language models (LLMs), in theory … Continue reading →

Category: cs.CL | Comments are closed

Analysing The Impact of Sequence Composition on Language Model Pre-Training

Posted on: February 22, 2024 | Author: jarxiv

Summary: In most language model pre-training frameworks, multiple documents … Continue reading →

Category: cs.CL | Comments are closed

Hallucinations or Attention Misdirection? The Path to Strategic Value Extraction in Business Using Large Language Models

Posted on: February 22, 2024 | Author: jarxiv

Summary: For large language models with the transformer architecture, text generation … Continue reading →

Category: cs.CL | Comments are closed

OlympiadBench: A Challenging Benchmark for Promoting AGI with Olympiad-Level Bilingual Multimodal Scientific Problems

Posted on: February 22, 2024 | Author: jarxiv

Summary: Owing to recent advances, large language models (LLMs) and large multimodal … Continue reading →

Category: cs.CL | Comments are closed

Is LLM-as-a-Judge Robust? Investigating Universal Adversarial Attacks on Zero-shot LLM Assessment

Posted on: February 22, 2024 | Author: jarxiv

Summary: Large language models (LLMs) are powerful zero-shot assessment tools, and written … Continue reading →

Category: cs.CL | Comments are closed

Coercing LLMs to do and reveal (almost) anything

Posted on: February 22, 2024 | Author: jarxiv

Summary: Recently, through adversarial attacks on large language models (LLMs), models … Continue reading →

Categories: cs.CL, cs.CR, cs.LG | Comments are closed

CriticBench: Evaluating Large Language Models as Critic

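Posted on: February 22, 2024 | Author: jarxiv

Summary: For scalable oversight and self-improvement of large language models (LLMs), critique ability … Continue reading →

Categories: cs.AI, cs.CL | Comments are closed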

CRITIC: Large Language Models Can Self-Correct with Tool-Interactive Critiquing

Posted on: February 22, 2024 | Author: jarxiv

Summary: Recent developments in large language models (LLMs) have been remarkable. … Continue reading →

Categories: cs.AI, cs.CL | Comments are closed

ToRA: A Tool-Integrated Reasoning Agent for Mathematical Problem Solving

Posted on: February 22, 2024 | Author: jarxiv

Summary: Large language models have made significant progress on a variety of language tasks … Continue reading →

Categories: cs.AI, cs.CL | Comments are closed
