「cs.CL」カテゴリーアーカイブ

Optimizing Contextual Speech Recognition Using Vector Quantization for Efficient Retrieval

投稿日: 2024年11月5日作成者: jarxiv

要約ニューラルコンテキストバイアスにより、音声認識モデルがコンテキストに関 … 続きを読む →

カテゴリー: cs.CL, eess.AS | コメントを受け付けていません

The LLM Language Network: A Neuroscientific Approach for Identifying Causally Task-Relevant Units

投稿日: 2024年11月5日作成者: jarxiv

要約大規模言語モデル (LLM) は、言語タスクだけでなく、論理的推論や社会的 … 続きを読む →

カテゴリー: cs.CL, cs.LG | コメントを受け付けていません

MdEval: Massively Multilingual Code Debugging

投稿日: 2024年11月5日作成者: jarxiv

要約コード大規模言語モデル (LLM) は、バグのあるコードスニペットに基づ … 続きを読む →

カテゴリー: cs.CL | コメントを受け付けていません

LLM-Ref: Enhancing Reference Handling in Technical Writing with Large Language Models

投稿日: 2024年11月5日作成者: jarxiv

要約大規模言語モデル (LLM) はデータ合成には優れていますが、ドメイン固有 … 続きを読む →

カテゴリー: cs.CL, I.2.7 | コメントを受け付けていません

Sparsing Law: Towards Large Language Models with Greater Activation Sparsity

投稿日: 2024年11月5日作成者: jarxiv

要約アクティベーションの希薄性は、アクティベーション出力内に、除去できる寄与度 … 続きを読む →

カテゴリー: cs.CL, cs.LG, I.2.7, stat.ML | コメントを受け付けていません

WebRL: Training LLM Web Agents via Self-Evolving Online Curriculum Reinforcement Learning

投稿日: 2024年11月5日作成者: jarxiv

要約大規模言語モデル (LLM) は、特に Web ベースのタスクにおいて自律 … 続きを読む →

カテゴリー: cs.CL | コメントを受け付けていません

Seq-VCR: Preventing Collapse in Intermediate Transformer Representations for Enhanced Reasoning

投稿日: 2024年11月5日作成者: jarxiv

要約デコーダーのみのトランスフォーマーは、複雑な推論タスク、特に複数の連続操作 … 続きを読む →

カテゴリー: cs.CL, cs.LG | コメントを受け付けていません

AmbigNLG: Addressing Task Ambiguity in Instruction for NLG

投稿日: 2024年11月5日作成者: jarxiv

要約自然言語生成 (NLG) の命令におけるタスクの曖昧さの課題に取り組むため … 続きを読む →

カテゴリー: cs.CL | コメントを受け付けていません

Attacking Vision-Language Computer Agents via Pop-ups

投稿日: 2024年11月5日作成者: jarxiv

要約大規模ビジョンおよび言語モデル (VLM) を活用した自律型エージェントは … 続きを読む →

カテゴリー: cs.CL | コメントを受け付けていません

Tool Learning with Large Language Models: A Survey

投稿日: 2024年11月5日作成者: jarxiv

要約最近、大規模言語モデル (LLM) を使用したツール学習が、LLM の機能 … 続きを読む →

カテゴリー: cs.AI, cs.CL | コメントを受け付けていません

「cs.CL」カテゴリーアーカイブ

Optimizing Contextual Speech Recognition Using Vector Quantization for Efficient Retrieval

The LLM Language Network: A Neuroscientific Approach for Identifying Causally Task-Relevant Units

MdEval: Massively Multilingual Code Debugging

LLM-Ref: Enhancing Reference Handling in Technical Writing with Large Language Models

Sparsing Law: Towards Large Language Models with Greater Activation Sparsity

WebRL: Training LLM Web Agents via Self-Evolving Online Curriculum Reinforcement Learning

Seq-VCR: Preventing Collapse in Intermediate Transformer Representations for Enhanced Reasoning

AmbigNLG: Addressing Task Ambiguity in Instruction for NLG

Attacking Vision-Language Computer Agents via Pop-ups

Tool Learning with Large Language Models: A Survey

最近の投稿

最近のコメント

アーカイブ

カテゴリー