「cs.CL」カテゴリーアーカイブ

VerAs: Verify then Assess STEM Lab Reports

投稿日: 2024年4月26日作成者: jarxiv

要約 STEM 教育では批判的思考スキルにますます重点が置かれているため、探究ス … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.LG | コメントを受け付けていません

Layer Skip: Enabling Early Exit Inference and Self-Speculative Decoding

投稿日: 2024年4月26日作成者: jarxiv

要約大規模言語モデル (LLM) の推論を高速化するエンドツーエンドのソリュー … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.LG | コメントを受け付けていません

Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context

投稿日: 2024年4月26日作成者: jarxiv

要約このレポートでは、Gemini ファミリの最新モデルである Gemini … 続きを読む →

カテゴリー: cs.AI, cs.CL | コメントを受け付けていません

Automatic Speech Recognition System-Independent Word Error Rate Estimatio

投稿日: 2024年4月26日作成者: jarxiv

要約単語誤り率 (WER) は、自動音声認識 (ASR) システムによって生成 … 続きを読む →

カテゴリー: cs.CL, cs.SD, eess.AS | コメントを受け付けていません

Dataset of Quotation Attribution in German News Articles

投稿日: 2024年4月26日作成者: jarxiv

要約誰が誰に何を言ったかを抽出することは、オンラインニュース記事などの今日の … 続きを読む →

カテゴリー: cs.CL | コメントを受け付けていません

Prefix Text as a Yarn: Eliciting Non-English Alignment in Foundation Language Model

投稿日: 2024年4月26日作成者: jarxiv

要約教師あり微調整 (SFT) は、基盤となる大規模言語モデル (LLM) の … 続きを読む →

カテゴリー: cs.AI, cs.CL | コメントを受け付けていません

REBEL: Reinforcement Learning via Regressing Relative Rewards

投稿日: 2024年4月26日作成者: jarxiv

要約近接ポリシー最適化 (PPO) は、もともと連続制御問題のために開発されま … 続きを読む →

カテゴリー: cs.CL, cs.CV, cs.LG | コメントを受け付けていません

Modeling Selective Feature Attention for Representation-based Siamese Text Matching

投稿日: 2024年4月26日作成者: jarxiv

要約表現ベースのシャムネットワークは、導入コストと推論コストが低いため、軽量 … 続きを読む →

カテゴリー: cs.CL | コメントを受け付けていません

Continual Learning of Large Language Models: A Comprehensive Survey

投稿日: 2024年4月26日作成者: jarxiv

要約事前に収集された静的な一般的なデータセットでトレーニングされた大規模言語モ … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.LG | コメントを受け付けていません

Weak-to-Strong Extrapolation Expedites Alignment

投稿日: 2024年4月26日作成者: jarxiv

要約大規模言語モデル (LLM) の機能は、理想的にはデータとコンピューティン … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.LG | コメントを受け付けていません

「cs.CL」カテゴリーアーカイブ

VerAs: Verify then Assess STEM Lab Reports

Layer Skip: Enabling Early Exit Inference and Self-Speculative Decoding

Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context

Automatic Speech Recognition System-Independent Word Error Rate Estimatio

Dataset of Quotation Attribution in German News Articles

Prefix Text as a Yarn: Eliciting Non-English Alignment in Foundation Language Model

REBEL: Reinforcement Learning via Regressing Relative Rewards

Modeling Selective Feature Attention for Representation-based Siamese Text Matching

Continual Learning of Large Language Models: A Comprehensive Survey

Weak-to-Strong Extrapolation Expedites Alignment

最近の投稿

最近のコメント

アーカイブ

カテゴリー