Monthly Archives: July 2024

On the Effect of Purely Synthetic Training Data for Different Automatic Speech Recognition Architectures

Posted on: July 26, 2024 | Author: jarxiv

Abstract: In this study, synthetic data for training automatic speech recognition (ASR) … Continue reading →

Categories: cs.CL, cs.LG, cs.SD, eess.AS | Comments are closed

Keep the Cost Down: A Review on Methods to Optimize LLM’s KV-Cache Consumption

Posted on: July 26, 2024 | Author: jarxiv

Abstract: Large language models, epitomized by the release of ChatGPT in late 2022, … Continue reading →

Categories: cs.CL | Comments are closed

GermanPartiesQA: Benchmarking Commercial Large Language Models for Political Bias and Sycophancy

Posted on: July 26, 2024 | Author: jarxiv

Abstract: LLMs are changing how humans create and interact with content, and the public's political … Continue reading →

Categories: cs.CL, cs.CY, K.4 | Comments are closed

Resolving Discrepancies in Compute-Optimal Scaling of Language Models

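Posted on: July 26, 2024 | Author: jarxiv

Abstract: Regarding the optimal model size for a given compute budget, Kaplan et al. and Hoffmann et al. … Continue reading →

Categories: cs.CL, cs.LG | Comments are closed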

PATCH! Psychometrics-AssisTed benCHmarking of Large Language Models: A Case Study of Proficiency in 8th Grade Mathematics

Posted on: July 26, 2024 | Author: jarxiv

Abstract: Existing benchmarks for large (multimodal) language models (LLMs) … Continue reading →

Categories: cs.CL, cs.CY | Comments are closed

Improving Stance Detection by Leveraging Measurement Knowledge from Social Sciences: A Case Study of Dutch Political Tweets and Traditional Gender Role Division

Posted on: July 26, 2024 | Author: jarxiv

Abstract: In stance detection (SD), the viewpoint of a text's author toward a target ( … Continue reading →

Categories: cs.CL, cs.CY, cs.IR | Comments are closed

I can listen but cannot read: An evaluation of two-tower multimodal systems for instrument recognition

Posted on: July 26, 2024 | Author: jarxiv

Abstract: In music two-tower multimodal systems, the audio and text modalities … Continue reading →

Categories: cs.CL, cs.IR, cs.LG, cs.SD, eess.AS | Comments are closed

Tracking linguistic information in transformer-based sentence embeddings through targeted sparsification

Posted on: July 26, 2024 | Author: jarxiv

Abstract: Analyses of transformer-based models have shown that, from text input, a variety of … Continue reading →

Categories: 68T50, cs.CL, I.2.7 | Comments are closed

The FIGNEWS Shared Task on News Media Narratives

Posted on: July 26, 2024 | Author: jarxiv

Abstract: The ArabicNLP 2024 conference, co-located with ACL 2024, … Continue reading →

Categories: cs.CL | Comments are closed

Block Verification Accelerates Speculative Decoding

Posted on: July 26, 2024 | Author: jarxiv

Abstract: For losslessly speeding up large language models during inference, speculative decoding … Continue reading →

Categories: cs.CL, cs.DS, cs.IT, cs.LG, math.IT | Comments are closed
