"cs.CL" Category Archive

Bridge-Coder: Unlocking LLMs’ Potential to Overcome Language Gaps in Low-Resource Code

Posted: October 25, 2024 | Author: jarxiv

Abstract: Large language models (LLMs), for Python and other high-resource programming …

Category: cs.CL

Scaling Law with Learning Rate Annealing

Posted: October 25, 2024 | Author: jarxiv

Abstract: The cross-entropy loss curves of neural language models, across the training steps as a whole, …

Category: cs.CL

Does Data Contamination Detection Work (Well) for LLMs? A Survey and Evaluation on Detection Assumptions

Posted: October 25, 2024 | Author: jarxiv

Abstract: Large language models (LLMs), across a wide range of benchmarks, excellent …

Category: cs.CL

RET-LLM: Towards a General Read-Write Memory for Large Language Models

Posted: October 25, 2024 | Author: jarxiv

Abstract: Large language models (LLMs), with their extensive parameters and comprehensive data …

Category: cs.CL

LongGenBench: Long-context Generation Benchmark

Posted: October 25, 2024 | Author: jarxiv

Abstract: Current long-context benchmarks focus mainly on retrieval-based tests …

Categories: cs.AI, cs.CL

GPT vs RETRO: Exploring the Intersection of Retrieval and Parameter-Efficient Fine-Tuning

Posted: October 25, 2024 | Author: jarxiv

Abstract: Parameter-efficient fine-tuning (PEFT) and retrieval-augmented generation (RAG) …

Categories: cs.AI, cs.CL, cs.IR, cs.LG

RE-RAG: Improving Open-Domain QA Performance and Interpretability with Relevance Estimator in Retrieval-Augmented Generation

Posted: October 25, 2024 | Author: jarxiv

Abstract: The retrieval-augmented generation (RAG) framework, parametric knowledge and external knowledge …

Categories: cs.AI, cs.CL

From English-Centric to Effective Bilingual: LLMs with Custom Tokenizers for Underrepresented Languages

Posted: October 25, 2024 | Author: jarxiv

Abstract: In this paper, supporting English and any target language, a bilingual …

Categories: cs.AI, cs.CL

Composing Global Optimizers to Reasoning Tasks via Algebraic Objects in Neural Nets

Posted: October 25, 2024 | Author: jarxiv

Abstract: We, trained on Abelian-group reasoning tasks (e.g., modular addition), quadratic-activation …

Categories: cs.AI, cs.CL, cs.LG, math.AC, math.RA

Demystifying Large Language Models for Medicine: A Primer

Posted: October 25, 2024 | Author: jarxiv

Abstract: Large language models (LLMs), across a variety of contexts, human-like …

Categories: cs.AI, cs.CL
