Category archive: cs.CL

Griffin: Mixing Gated Linear Recurrences with Local Attention for Efficient Language Models

Posted on March 1, 2024 by jarxiv

Summary: Recurrent neural networks (RNNs) have fast inference and, on long sequences, …

Categories: cs.CL, cs.LG

Redefining Digital Health Interfaces with Large Language Models

Posted on March 1, 2024 by jarxiv

Summary: Digital health tools have the potential to significantly improve the delivery of healthcare services …

Categories: cs.CL

Heavy-Tailed Class Imbalance and Why Adam Outperforms Gradient Descent on Language Models

Posted on March 1, 2024 by jarxiv

Summary: Adam outperforms gradient descent when optimizing large language transformers …

Categories: cs.CL, cs.LG, math.OC, stat.ML

Sequoia: Scalable, Robust, and Hardware-aware Speculative Decoding

Posted on March 1, 2024 by jarxiv

Summary: As the use of large language models (LLMs) grows, using these models …

Categories: cs.CL

Accelerating materials discovery for polymer solar cells: Data-driven insights enabled by natural language processing

Posted on March 1, 2024 by jarxiv

Summary: Property data for polymer solar cells is extracted from the literature, and various active learning …

Categories: cond-mat.mtrl-sci, cs.CL, physics.app-ph

Robust Guidance for Unsupervised Data Selection: Capturing Perplexing Named Entities for Domain-Specific Machine Translation

Posted on March 1, 2024 by jarxiv

Summary: Using extensive datasets makes it possible to train multilingual machine translation models …

Categories: cs.AI, cs.CL

‘It Felt Like Having a Second Mind’: Investigating Human-AI Co-creativity in Prewriting with Large Language Models

Posted on March 1, 2024 by jarxiv

Summary: Prewriting is the process of discovering and developing ideas before the first draft …

Categories: cs.AI, cs.CL, cs.HC

LLM Inference Unveiled: Survey and Roofline Model Insights

Posted on March 1, 2024 by jarxiv

Summary: The field of efficient large language model (LLM) inference is evolving rapidly, and opportunities …

Categories: cs.AI, cs.CL

OpenMedLM: Prompt engineering can out-perform fine-tuning in medical question-answering with open-source large language models

Posted on March 1, 2024 by jarxiv

Summary: LLMs are increasingly capable of performing a variety of specialized tasks, and …

Categories: cs.AI, cs.CL, cs.IR

Wisdom of the Silicon Crowd: LLM Ensemble Prediction Capabilities Match Human Crowd Accuracy

Posted on March 1, 2024 by jarxiv

Summary: Human forecasting accuracy in practice relies on the "wisdom of the crowd" effect, with a crowd of individual forecasters …

Categories: cs.AI, cs.CL, cs.CY, cs.LG

