「cs.CL」カテゴリーアーカイブ

Data Mixture Inference: What do BPE Tokenizers Reveal about their Training Data?

投稿日: 2024年9月6日作成者: jarxiv

要約現在の最強の言語モデルの事前トレーニングデータは不透明です。特に、さま … 続きを読む →

カテゴリー: cs.CL, cs.LG | コメントを受け付けていません

LAST: Language Model Aware Speech Tokenization

投稿日: 2024年9月6日作成者: jarxiv

要約音声トークン化は音声言語モデル (LM) の基礎として機能し、音声言語モデ … 続きを読む →

カテゴリー: cs.CL, cs.SD, eess.AS | コメントを受け付けていません

RAG based Question-Answering for Contextual Response Prediction System

投稿日: 2024年9月6日作成者: jarxiv

要約大規模言語モデル (LLM) は、効果的な質問応答システムとしての可能性を … 続きを読む →

カテゴリー: cs.CL, cs.IR | コメントを受け付けていません

Attention Heads of Large Language Models: A Survey

投稿日: 2024年9月6日作成者: jarxiv

要約 ChatGPT の出現以来、大規模言語モデル (LLM) はさまざまなタス … 続きを読む →

カテゴリー: cs.CL | コメントを受け付けていません

CogniDual Framework: Self-Training Large Language Models within a Dual-System Theoretical Framework for Improving Cognitive Tasks

投稿日: 2024年9月6日作成者: jarxiv

要約認知心理学では、知覚、注意、記憶、言語、問題解決、意思決定、推論を研究しま … 続きを読む →

カテゴリー: cs.AI, cs.CL | コメントを受け付けていません

A review on the use of large language models as virtual tutors

投稿日: 2024年9月6日作成者: jarxiv

要約 Transformer アーキテクチャは、自然言語処理の長期的な依存関係の … 続きを読む →

カテゴリー: cs.AI, cs.CL | コメントを受け付けていません

Exposing and Explaining Fake News On-the-Fly

投稿日: 2024年9月6日作成者: jarxiv

要約ソーシャルメディアプラットフォームにより、情報の迅速な普及と消費が可能 … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.SI | コメントを受け付けていません

Temporal Order Preserved Optimal Transport-based Cross-modal Knowledge Transfer Learning for ASR

投稿日: 2024年9月6日作成者: jarxiv

要約言語知識を事前学習済み言語モデル (PLM) から音響モデルに転送すると、 … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.SD, eess.AS | コメントを受け付けていません

Fine-tuning large language models for domain adaptation: Exploration of training strategies, scaling, model merging and synergistic capabilities

投稿日: 2024年9月6日作成者: jarxiv

要約材料科学や工学などの分野におけるドメインアプリケーション向けの大規模言語 … 続きを読む →

カテゴリー: cond-mat.mtrl-sci, cs.AI, cs.CL | コメントを受け付けていません

Unleashing the potential of prompt engineering in Large Language Models: a comprehensive review

投稿日: 2024年9月6日作成者: jarxiv

要約この包括的なレビューでは、大規模言語モデル (LLM) の機能を解放する際 … 続きを読む →

カテゴリー: cs.AI, cs.CL, I.2.7 | コメントを受け付けていません

「cs.CL」カテゴリーアーカイブ

Data Mixture Inference: What do BPE Tokenizers Reveal about their Training Data?

LAST: Language Model Aware Speech Tokenization

RAG based Question-Answering for Contextual Response Prediction System

Attention Heads of Large Language Models: A Survey

CogniDual Framework: Self-Training Large Language Models within a Dual-System Theoretical Framework for Improving Cognitive Tasks

A review on the use of large language models as virtual tutors

Exposing and Explaining Fake News On-the-Fly

Temporal Order Preserved Optimal Transport-based Cross-modal Knowledge Transfer Learning for ASR

Fine-tuning large language models for domain adaptation: Exploration of training strategies, scaling, model merging and synergistic capabilities

Unleashing the potential of prompt engineering in Large Language Models: a comprehensive review

最近の投稿

最近のコメント

アーカイブ

カテゴリー