Category archive: cs.CL

Bridging the Gap between Different Vocabularies for LLM Ensemble

Posted: April 16, 2024 | Author: jarxiv

Abstract: Ensembling various large language models (LLMs), with their mutually complementary …

Categories: cs.CL

A Novel Paradigm Boosting Translation Capabilities of Large Language Models

Posted: April 16, 2024 | Author: jarxiv

Abstract: In this paper, in the context of machine translation (MT) tasks, large language models …

Categories: cs.CL

Learning Planning-based Reasoning by Trajectories Collection and Process Reward Synthesizing

Posted: April 16, 2024 | Author: jarxiv

Abstract: Large language models (LLMs), through the generation of step-by-step rationales, complex …

Categories: cs.AI, cs.CL

State Space Model for New-Generation Network Alternative to Transformers: A Survey

Posted: April 16, 2024 | Author: jarxiv

Abstract: In the post-deep-learning era, the Transformer architecture …

Categories: cs.AI, cs.CL, cs.CV, cs.LG, cs.MM

DiagGPT: An LLM-based and Multi-agent Dialogue System with Automatic Topic Management for Flexible Task-Oriented Dialogue

Posted: April 16, 2024 | Author: jarxiv

Abstract: An important application of large language models (LLMs) such as ChatGPT is …

Categories: cs.AI, cs.CL

Triad: A Framework Leveraging a Multi-Role LLM-based Agent to Solve Knowledge Base Question Answering

Posted: April 16, 2024 | Author: jarxiv

Abstract: Recent progress in LLM-based agents has, across a variety of tasks, …

Categories: 68T50, cs.AI, cs.CL, I.2.7

Neuron-level LLM Patching for Code Generation

Posted: April 16, 2024 | Author: jarxiv

Abstract: Large language models (LLMs) in software engineering, particularly …

Categories: cs.CL, cs.LG, cs.SE

Prepacking: A Simple Method for Fast Prefilling and Increased Throughput in Large Language Models

Posted: April 16, 2024 | Author: jarxiv

Abstract: During inference for Transformer-based large language models (LLMs), prefilling …

Categories: cs.AI, cs.CL, cs.LG

Large Language Models as Optimizers

Posted: April 16, 2024 | Author: jarxiv

Abstract: Optimization is ubiquitous. For various problems, derivative-based algorithms …

Categories: cs.AI, cs.CL, cs.LG

Wisdom of Instruction-Tuned Language Model Crowds. Exploring Model Label Variation

Posted: April 16, 2024 | Author: jarxiv

Abstract: Large language models (LLMs) show excellent text classification capabilities, and zero-shot …

Categories: cs.CL
