「cs.CL」カテゴリーアーカイブ

Alleviating Hallucinations in Large Language Models with Scepticism Modeling

投稿日: 2024年9月11日作成者: jarxiv

要約幻覚は大規模言語モデル (LLM) にとって大きな課題であり、さまざまな分 … 続きを読む →

カテゴリー: cs.CL, cs.LG | コメントを受け付けていません

Exploring Italian sentence embeddings properties through multi-tasking

投稿日: 2024年9月11日作成者: jarxiv

要約マルチタスク設定において、既存の LLM がイタリア語の抽象言語情報をどの … 続きを読む →

カテゴリー: 68T50, cs.CL, I.2.7 | コメントを受け付けていません

TeXBLEU: Automatic Metric for Evaluate LaTeX Format

投稿日: 2024年9月11日作成者: jarxiv

要約 LaTeX は、特に科学、技術、数学、コンピューターサイエンスの分野で、 … 続きを読む →

カテゴリー: cs.CL | コメントを受け付けていません

Sortformer: Seamless Integration of Speaker Diarization and ASR by Bridging Timestamps and Tokens

投稿日: 2024年9月11日作成者: jarxiv

要約私たちは、既存のエンドツーエンドのダイアライゼーションモデルと比較して型 … 続きを読む →

カテゴリー: cs.CL, cs.LG, cs.SD, eess.AS | コメントを受け付けていません

SORSA: Singular Values and Orthonormal Regularized Singular Vectors Adaptation of Large Language Models

投稿日: 2024年9月11日作成者: jarxiv

要約大規模言語モデル (LLM) の急速な進歩には、そのパラメーターサイズの … 続きを読む →

カテゴリー: cs.CL, cs.LG | コメントを受け付けていません

Mitigating the Influence of Distractor Tasks in LMs with Prior-Aware Decoding

投稿日: 2024年9月11日作成者: jarxiv

要約言語モデル (LM) の広範な機能は、気を散らすタスクに対する感度によって … 続きを読む →

カテゴリー: cs.CL | コメントを受け付けていません

E2LLM: Encoder Elongated Large Language Models for Long-Context Understanding and Reasoning

投稿日: 2024年9月11日作成者: jarxiv

要約大規模言語モデル (LLM) の領域では、複数ラウンドの対話、コード生成、 … 続きを読む →

カテゴリー: cs.CL | コメントを受け付けていません

QueryBuilder: Human-in-the-Loop Query Development for Information Retrieval

投稿日: 2024年9月11日作成者: jarxiv

要約多くの場合、情報検索 (IR) システムのユーザーは、包括的な情報ニーズ … 続きを読む →

カテゴリー: cs.CL, cs.IR, cs.LG | コメントを受け付けていません

HexaCoder: Secure Code Generation via Oracle-Guided Synthetic Training Data

投稿日: 2024年9月11日作成者: jarxiv

要約大規模言語モデル (LLM) は、自動コード生成の大きな可能性を示しており … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.CR, cs.LG, cs.SE | コメントを受け付けていません

An Effective Context-Balanced Adaptation Approach for Long-Tailed Speech Recognition

投稿日: 2024年9月11日作成者: jarxiv

要約エンドツーエンド (E2E) 自動音声認識 (ASR) モデルは、さまざま … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.SD, eess.AS | コメントを受け付けていません

「cs.CL」カテゴリーアーカイブ

Alleviating Hallucinations in Large Language Models with Scepticism Modeling

Exploring Italian sentence embeddings properties through multi-tasking

TeXBLEU: Automatic Metric for Evaluate LaTeX Format

Sortformer: Seamless Integration of Speaker Diarization and ASR by Bridging Timestamps and Tokens

SORSA: Singular Values and Orthonormal Regularized Singular Vectors Adaptation of Large Language Models

Mitigating the Influence of Distractor Tasks in LMs with Prior-Aware Decoding

E2LLM: Encoder Elongated Large Language Models for Long-Context Understanding and Reasoning

QueryBuilder: Human-in-the-Loop Query Development for Information Retrieval

HexaCoder: Secure Code Generation via Oracle-Guided Synthetic Training Data

An Effective Context-Balanced Adaptation Approach for Long-Tailed Speech Recognition

最近の投稿

最近のコメント

アーカイブ

カテゴリー