"cs.CL" Category Archive

HELMET: How to Evaluate Long-Context Language Models Effectively and Thoroughly

Posted: October 4, 2024 | Author: jarxiv

Abstract: Numerous benchmarks for evaluating long-context language models (LCLMs) …

Categories: cs.AI, cs.CL

Lookback Lens: Detecting and Mitigating Contextual Hallucinations in Large Language Models Using Only Attention Maps

Posted: October 4, 2024 | Author: jarxiv

Abstract: When asked to summarize a passage or answer a question, large language models …

Categories: cs.AI, cs.CL, cs.LG

Selective Attention Improves Transformer

Posted: October 4, 2024 | Author: jarxiv

Abstract: Unneeded elements in the attention context degrade performance. We …

Categories: cs.AI, cs.CL, cs.LG

LLMs Know More Than They Show: On the Intrinsic Representation of LLM Hallucinations

Posted: October 4, 2024 | Author: jarxiv

Abstract: For large language models (LLMs), factual errors, biases, reasoning failures, and the like, collectively termed "hallucinations" …

Categories: 68T50, cs.AI, cs.CL, I.2.7

Domain-Specific Retrieval-Augmented Generation Using Vector Stores, Knowledge Graphs, and Tensor Factorization

Posted: October 4, 2024 | Author: jarxiv

Abstract: Large language models (LLMs) are pre-trained on large corpora and, on question answering ( …

Categories: cs.AI, cs.CL, cs.IR, cs.SE

Large Language Models as Markov Chains

Posted: October 4, 2024 | Author: jarxiv

Abstract: Large language models (LLMs), across a wide range of natural language processing tasks and …

Categories: cs.AI, cs.CL, cs.LG, stat.ML

Adaptive Inference-Time Compute: LLMs Can Predict if They Can Do Better, Even Mid-Generation

Posted: October 4, 2024 | Author: jarxiv

Abstract: For improving the performance of large language models (LLMs), inference-time computation is a powerful …

Categories: cs.AI, cs.CL, cs.LG

Unified Multi-Modal Interleaved Document Representation for Information Retrieval

Posted: October 4, 2024 | Author: jarxiv

Abstract: For information retrieval (IR) methods, identifying relevant documents in response to a given query …

Categories: cs.AI, cs.CL, cs.IR

Justice or Prejudice? Quantifying Biases in LLM-as-a-Judge

Posted: October 4, 2024 | Author: jarxiv

Abstract: LLM-as-a-Judge, as an evaluation method across various benchmarks, widely …

Categories: cs.AI, cs.CL

Salient Information Prompting to Steer Content in Prompt-based Abstractive Summarization

Posted: October 4, 2024 | Author: jarxiv

Abstract: Using prompting techniques, large language models (LLMs), across domains …

Categories: cs.AI, cs.CL, cs.LG
