「cs.CL」カテゴリーアーカイブ

SWITCH: Studying with Teacher for Knowledge Distillation of Large Language Models

投稿日: 2024年10月28日作成者: jarxiv

要約大規模言語モデル (LLM) は成功を収めていますが、依然として高い推論コ … 続きを読む →

カテゴリー: cs.CL | コメントを受け付けていません

StructRAG: Boosting Knowledge Intensive Reasoning of LLMs via Inference-time Hybrid Information Structurization

投稿日: 2024年10月28日作成者: jarxiv

要約検索拡張生成 (RAG) は、多くの知識ベースのタスクにおいて大規模言語モ … 続きを読む →

カテゴリー: cs.AI, cs.CL | コメントを受け付けていません

Detection of Human and Machine-Authored Fake News in Urdu

投稿日: 2024年10月28日作成者: jarxiv

要約ソーシャルメディアの台頭によりフェイクニュースの拡散が増幅され、現在ではC … 続きを読む →

カテゴリー: cs.CL, cs.LG | コメントを受け付けていません

Large Language Models Still Exhibit Bias in Long Text

投稿日: 2024年10月28日作成者: jarxiv

要約大規模言語モデル (LLM) の既存の公平性ベンチマークは、主に多肢選択式 … 続きを読む →

カテゴリー: cs.CL | コメントを受け付けていません

Mirror Matrix on the Wall: coding and vector notation as tools for introspection

投稿日: 2024年10月28日作成者: jarxiv

要約 GNU Octave によって採用されたベクトル表記は、Kenneth E … 続きを読む →

カテゴリー: cs.CL, cs.SE | コメントを受け付けていません

On the Robustness of Editing Large Language Models

投稿日: 2024年10月28日作成者: jarxiv

要約大規模言語モデル (LLM) は、コミュニケーション型 AI の構築におい … 続きを読む →

カテゴリー: cs.CL | コメントを受け付けていません

ChunkRAG: Novel LLM-Chunk Filtering Method for RAG Systems

投稿日: 2024年10月28日作成者: jarxiv

要約大規模言語モデル (LLM) を使用する検索拡張生成 (RAG) システム … 続きを読む →

カテゴリー: cs.CL | コメントを受け付けていません

Steering Knowledge Selection Behaviours in LLMs via SAE-Based Representation Engineering

投稿日: 2024年10月28日作成者: jarxiv

要約大規模言語モデル (LLM) は、パラメーターに大量の事実の知識を保存でき … 続きを読む →

カテゴリー: cs.CL | コメントを受け付けていません

A distributional simplicity bias in the learning dynamics of transformers

投稿日: 2024年10月28日作成者: jarxiv

要約過剰パラメータ化されたニューラルネットワークが効果的に一般化する驚くべき … 続きを読む →

カテゴリー: cs.CL | コメントを受け付けていません

Demonstration-based learning for few-shot biomedical named entity recognition under machine reading comprehension

投稿日: 2024年10月28日作成者: jarxiv

要約深層学習技術は大きな成果を上げていますが、手動でラベル付けされた大量のデー … 続きを読む →

カテゴリー: cs.CL | コメントを受け付けていません

「cs.CL」カテゴリーアーカイブ

SWITCH: Studying with Teacher for Knowledge Distillation of Large Language Models

StructRAG: Boosting Knowledge Intensive Reasoning of LLMs via Inference-time Hybrid Information Structurization

Detection of Human and Machine-Authored Fake News in Urdu

Large Language Models Still Exhibit Bias in Long Text

Mirror Matrix on the Wall: coding and vector notation as tools for introspection

On the Robustness of Editing Large Language Models

ChunkRAG: Novel LLM-Chunk Filtering Method for RAG Systems

Steering Knowledge Selection Behaviours in LLMs via SAE-Based Representation Engineering

A distributional simplicity bias in the learning dynamics of transformers

Demonstration-based learning for few-shot biomedical named entity recognition under machine reading comprehension

最近の投稿

最近のコメント

アーカイブ

カテゴリー