「cs.CL」カテゴリーアーカイブ

1-bit AI Infra: Part 1.1, Fast and Lossless BitNet b1.58 Inference on CPUs

投稿日: 2024年10月24日作成者: jarxiv

要約 BitNet や BitNet b1.58 などの 1 ビットラージ言語モ … 続きを読む →

カテゴリー: cs.CL | コメントを受け付けていません

GRAMMAR: Grounded and Modular Methodology for Assessment of Closed-Domain Retrieval-Augmented Language Model

投稿日: 2024年10月24日作成者: jarxiv

要約検索拡張生成 (RAG) システムは、クローズドドメインおよび社内のナレ … 続きを読む →

カテゴリー: cs.AI, cs.CL | コメントを受け付けていません

Leveraging the Domain Adaptation of Retrieval Augmented Generation Models for Question Answering and Reducing Hallucination

投稿日: 2024年10月24日作成者: jarxiv

要約大規模言語モデルの継続的な進歩により、さまざまな NLP タスクにわたって … 続きを読む →

カテゴリー: cs.CL, cs.HC | コメントを受け付けていません

A Review of Prominent Paradigms for LLM-Based Agents: Tool Use (Including RAG), Planning, and Feedback Learning

投稿日: 2024年10月24日作成者: jarxiv

要約現在、ツールの使用、計画、フィードバック学習は、さまざまなタスクにわたって … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.SE | コメントを受け付けていません

OmniFlatten: An End-to-end GPT Model for Seamless Voice Conversation

投稿日: 2024年10月24日作成者: jarxiv

要約全二重音声対話システムは、人間と人間のやりとりを厳密に反映した同時双方向通 … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.SD, eess.AS | コメントを受け付けていません

Understanding When Tree of Thoughts Succeeds: Larger Models Excel in Generation, Not Discrimination

投稿日: 2024年10月24日作成者: jarxiv

要約 Tree of Thoughts (ToT) は、推論ステップを提案するジ … 続きを読む →

カテゴリー: cs.CL | コメントを受け付けていません

Can Language Models Induce Grammatical Knowledge from Indirect Evidence?

投稿日: 2024年10月24日作成者: jarxiv

要約文の受容性を判断するための文法知識を誘導する言語モデルには、どのような種類 … 続きを読む →

カテゴリー: cs.CL | コメントを受け付けていません

RaTEScore: A Metric for Radiology Report Generation

投稿日: 2024年10月24日作成者: jarxiv

要約この論文では、AI モデルによって生成された医療レポートの品質を評価するた … 続きを読む →

カテゴリー: cs.CL | コメントを受け付けていません

Do Large Language Models Have an English Accent? Evaluating and Improving the Naturalness of Multilingual LLMs

投稿日: 2024年10月24日作成者: jarxiv

要約現在の大規模言語モデル (LLM) は主に英語を主言語として設計されており … 続きを読む →

カテゴリー: cs.AI, cs.CL | コメントを受け付けていません

Interpreting Context Look-ups in Transformers: Investigating Attention-MLP Interactions

投稿日: 2024年10月24日作成者: jarxiv

要約大規模言語モデル (LLM) の内部動作を理解することは、LLM の理論的 … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.LG | コメントを受け付けていません

「cs.CL」カテゴリーアーカイブ

1-bit AI Infra: Part 1.1, Fast and Lossless BitNet b1.58 Inference on CPUs

GRAMMAR: Grounded and Modular Methodology for Assessment of Closed-Domain Retrieval-Augmented Language Model

Leveraging the Domain Adaptation of Retrieval Augmented Generation Models for Question Answering and Reducing Hallucination

A Review of Prominent Paradigms for LLM-Based Agents: Tool Use (Including RAG), Planning, and Feedback Learning

OmniFlatten: An End-to-end GPT Model for Seamless Voice Conversation

Understanding When Tree of Thoughts Succeeds: Larger Models Excel in Generation, Not Discrimination

Can Language Models Induce Grammatical Knowledge from Indirect Evidence?

RaTEScore: A Metric for Radiology Report Generation

Do Large Language Models Have an English Accent? Evaluating and Improving the Naturalness of Multilingual LLMs

Interpreting Context Look-ups in Transformers: Investigating Attention-MLP Interactions

最近の投稿

最近のコメント

アーカイブ

カテゴリー