Category archive: cs.CL

Guiding Large Language Models to Post-Edit Machine Translation with Error Annotations

Posted on April 12, 2024 by jarxiv

Abstract: In machine translation (MT), large language models (LLMs) … dedicated supervised …

Categories: cs.AI, cs.CL

Source-Aware Training Enables Knowledge Attribution in Language Models

Posted on April 12, 2024 by jarxiv

Abstract: Large language models (LLMs) learn a vast amount of knowledge during pretraining …

Categories: cs.AI, cs.CL

High-Dimension Human Value Representation in Large Language Models

Posted on April 12, 2024 by jarxiv

Abstract: As large language models (LLMs) are widely applied across a variety of tasks and fields …

Categories: cs.AI, cs.CL

Me LLaMA: Foundation Large Language Models for Medical Applications

Posted on April 12, 2024 by jarxiv

Abstract: Recent … of large language models (LLMs) such as ChatGPT and LLaMA …

Categories: cs.AI, cs.CL

A Systematic Comparison of Syllogistic Reasoning in Humans and Language Models

Posted on April 12, 2024 by jarxiv

Abstract: A central component of rational behavior is logical inference, that is, which conclusions follow from a set of premises …

Categories: cs.AI, cs.CL, cs.LG

DesignQA: A Multimodal Benchmark for Evaluating Large Language Models’ Understanding of Engineering Documentation

Posted on April 12, 2024 by jarxiv

Abstract: In this study, for understanding and applying the engineering requirements in technical documentation, multimo …

Categories: cs.AI, cs.CL

Rho-1: Not All Tokens Are What You Need

Posted on April 12, 2024 by jarxiv

Abstract: In previous language model pretraining methods, the next-token prediction loss, for all …

Categories: cs.AI, cs.CL

OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer Environments

Posted on April 12, 2024 by jarxiv

Abstract: Autonomous agents that carry out complex computer tasks with minimal human intervention …

Categories: cs.AI, cs.CL

LLoCO: Learning Long Contexts Offline

Posted on April 12, 2024 by jarxiv

Abstract: For large language models (LLMs), processing long contexts … self- …

Categories: cs.AI, cs.CL, cs.LG

Manipulating Large Language Models to Increase Product Visibility

Posted on April 12, 2024 by jarxiv

Abstract: To provide natural-language responses tailored to user queries, large language models …

Categories: cs.AI, cs.CL, cs.IR
