「cs.CL」カテゴリーアーカイブ

Mitigating the Language Mismatch and Repetition Issues in LLM-based Machine Translation via Model Editing

投稿日: 2024年10月10日作成者: jarxiv

要約大規模言語モデル (LLM) は最近 NLP 分野に革命をもたらしましたが … 続きを読む →

カテゴリー: cs.CL, cs.LG | コメントを受け付けていません

Predictability maximization and the origins of word order harmony

投稿日: 2024年10月10日作成者: jarxiv

要約私たちは、情報理論の観点から、頭部とその従属部分の順序配置に関する言語問題 … 続きを読む →

カテゴリー: cs.CL, physics.soc-ph, q-bio.NC | コメントを受け付けていません

Data Selection via Optimal Control for Language Models

投稿日: 2024年10月10日作成者: jarxiv

要約この研究では、下流で使用するための LM の機能を強化するために、大量のコ … 続きを読む →

カテゴリー: cs.CL | コメントを受け付けていません

Counterfactuals As a Means for Evaluating Faithfulness of Attribution Methods in Autoregressive Language Models

投稿日: 2024年10月10日作成者: jarxiv

要約自己回帰言語モデルが広く採用されているにもかかわらず、説明可能性評価の研究 … 続きを読む →

カテゴリー: cs.CL | コメントを受け付けていません

Not All Contexts Are Equal: Teaching LLMs Credibility-aware Generation

投稿日: 2024年10月10日作成者: jarxiv

要約大規模な言語モデルの急速な開発により、外部知識を統合して知識のボトルネック … 続きを読む →

カテゴリー: cs.CL | コメントを受け付けていません

Stanceformer: Target-Aware Transformer for Stance Detection

投稿日: 2024年10月10日作成者: jarxiv

要約スタンス検出のタスクには、特定の主題またはターゲットに対するテキスト内で表 … 続きを読む →

カテゴリー: cs.CL | コメントを受け付けていません

MLE-bench: Evaluating Machine Learning Agents on Machine Learning Engineering

投稿日: 2024年10月10日作成者: jarxiv

要約 AI エージェントが機械学習エンジニアリングでどの程度優れたパフォーマンス … 続きを読む →

カテゴリー: cs.CL | コメントを受け付けていません

Axis Tour: Word Tour Determines the Order of Axes in ICA-transformed Embeddings

投稿日: 2024年10月10日作成者: jarxiv

要約単語の埋め込みは自然言語処理において最も重要なコンポーネントの 1 つです … 続きを読む →

カテゴリー: cs.CL | コメントを受け付けていません

Unleashing Multi-Hop Reasoning Potential in Large Language Models through Repetition of Misordered Context

投稿日: 2024年10月10日作成者: jarxiv

要約マルチホップ推論は、特定のコンテキスト内のサポート文書に基づいた複数ステッ … 続きを読む →

カテゴリー: cs.CL | コメントを受け付けていません

Private prediction for large-scale synthetic text generation

投稿日: 2024年10月10日作成者: jarxiv

要約私たちは、大規模言語モデル (LLM) を使用し、プライベート予測を通じて … 続きを読む →

カテゴリー: cs.CL, cs.CR, cs.LG | コメントを受け付けていません

「cs.CL」カテゴリーアーカイブ

Mitigating the Language Mismatch and Repetition Issues in LLM-based Machine Translation via Model Editing

Predictability maximization and the origins of word order harmony

Data Selection via Optimal Control for Language Models

Counterfactuals As a Means for Evaluating Faithfulness of Attribution Methods in Autoregressive Language Models

Not All Contexts Are Equal: Teaching LLMs Credibility-aware Generation

Stanceformer: Target-Aware Transformer for Stance Detection

MLE-bench: Evaluating Machine Learning Agents on Machine Learning Engineering

Axis Tour: Word Tour Determines the Order of Axes in ICA-transformed Embeddings

Unleashing Multi-Hop Reasoning Potential in Large Language Models through Repetition of Misordered Context

Private prediction for large-scale synthetic text generation

最近の投稿

最近のコメント

アーカイブ

カテゴリー