「cs.CL」カテゴリーアーカイブ

Filtered Direct Preference Optimization

投稿日: 2024年4月24日作成者: jarxiv

要約人間のフィードバックからの強化学習 (RLHF) は、言語モデルを人間の好 … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.LG | コメントを受け付けていません

Achieving >97% on GSM8K: Deeply Understanding the Problems Makes LLMs Perfect Reasoners

投稿日: 2024年4月24日作成者: jarxiv

要約思考連鎖促進戦略により、さまざまな NLP タスクにわたる大規模言語モデル … 続きを読む →

カテゴリー: cs.AI, cs.CL | コメントを受け付けていません

Convergences and Divergences between Automatic Assessment and Human Evaluation: Insights from Comparing ChatGPT-Generated Translation and Neural Machine Translation

投稿日: 2024年4月24日作成者: jarxiv

要約大規模な言語モデルは、並列機械翻訳 (NMT) システムと比較して、さらに … 続きを読む →

カテゴリー: cs.AI, cs.CL | コメントを受け付けていません

WangchanLion and WangchanX MRC Eval

投稿日: 2024年4月24日作成者: jarxiv

要約この技術レポートでは、タイ語の機械読解 (MRC) に焦点を当てた命令の微 … 続きを読む →

カテゴリー: cs.AI, cs.CL | コメントを受け付けていません

Transformers Can Represent $n$-gram Language Models

投稿日: 2024年4月24日作成者: jarxiv

要約既存の研究の多くは、計算の形式的なモデルを使用してその表現能力を記述するこ … 続きを読む →

カテゴリー: cs.AI, cs.CC, cs.CL, cs.FL, cs.LG | コメントを受け付けていません

Deferred NAM: Low-latency Top-K Context Injection via Deferred Context Encoding for Non-Streaming ASR

投稿日: 2024年4月24日作成者: jarxiv

要約コンテキストバイアスにより、音声認識プログラムは、連絡先名などの重要なフ … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.LG, cs.NE, eess.AS | コメントを受け付けていません

Aurora-M: The First Open Source Multilingual Language Model Red-teamed according to the U.S. Executive Order

投稿日: 2024年4月24日作成者: jarxiv

要約事前トレーニングされた言語モデルはいくつかの AI アプリケーションを支え … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.LG | コメントを受け付けていません

Multi-Head Mixture-of-Experts

投稿日: 2024年4月24日作成者: jarxiv

要約 Sparse Mixtures of Experts (SMoE) は、ト … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.LG | コメントを受け付けていません

MPIrigen: MPI Code Generation through Domain-Specific Language Models

投稿日: 2024年4月24日作成者: jarxiv

要約多数のノードにわたって計算を拡張することが不可欠であることから、特にメッセ … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.DC, cs.LG, cs.SE | コメントを受け付けていません

The WMDP Benchmark: Measuring and Reducing Malicious Use With Unlearning

投稿日: 2024年4月24日作成者: jarxiv

要約人工知能に関するホワイトハウス大統領令は、生物兵器、サイバー兵器、化学兵器 … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.CY, cs.LG | コメントを受け付けていません

「cs.CL」カテゴリーアーカイブ

Filtered Direct Preference Optimization

Achieving >97% on GSM8K: Deeply Understanding the Problems Makes LLMs Perfect Reasoners

Convergences and Divergences between Automatic Assessment and Human Evaluation: Insights from Comparing ChatGPT-Generated Translation and Neural Machine Translation

WangchanLion and WangchanX MRC Eval

Transformers Can Represent $n$-gram Language Models

Deferred NAM: Low-latency Top-K Context Injection via Deferred Context Encoding for Non-Streaming ASR

Aurora-M: The First Open Source Multilingual Language Model Red-teamed according to the U.S. Executive Order

Multi-Head Mixture-of-Experts

MPIrigen: MPI Code Generation through Domain-Specific Language Models

The WMDP Benchmark: Measuring and Reducing Malicious Use With Unlearning

最近の投稿

最近のコメント

アーカイブ

カテゴリー