「cs.LG」カテゴリーアーカイブ

Fine-Tuning and Prompt Optimization: Two Great Steps that Work Better Together

投稿日: 2024年10月8日作成者: jarxiv

要約自然言語処理 (NLP) システムは、検索拡張生成 (RAG) などの洗練 … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.LG | コメントを受け付けていません

Representation noising effectively prevents harmful fine-tuning on LLMs

投稿日: 2024年10月8日作成者: jarxiv

要約オープンソースの大規模言語モデル (LLM) をリリースすると、悪意のある … 続きを読む →

カテゴリー: cs.CL, cs.LG | コメントを受け付けていません

When ‘A Helpful Assistant’ Is Not Really Helpful: Personas in System Prompts Do Not Improve Performances of Large Language Models

投稿日: 2024年10月8日作成者: jarxiv

要約プロンプトは、人間が大規模言語モデル (LLM) と対話する主要な方法とし … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.CY, cs.HC, cs.LG | コメントを受け付けていません

Efficient Model-Agnostic Multi-Group Equivariant Networks

投稿日: 2024年10月8日作成者: jarxiv

要約 equitune (Basu et al., 2023b) やその一般化 … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.LG | コメントを受け付けていません

MetaMetrics: Calibrating Metrics For Generation Tasks Using Human Preferences

投稿日: 2024年10月8日作成者: jarxiv

要約パフォーマンス評価指標の品質を理解することは、モデルの出力が人間の好みに確 … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.CV, cs.LG | コメントを受け付けていません

Creative Beam Search: LLM-as-a-Judge For Improving Response Generation

投稿日: 2024年10月8日作成者: jarxiv

要約大規模な言語モデルは、人工的な創造性を含むいくつかの分野に革命をもたらして … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.HC, cs.LG | コメントを受け付けていません

Understanding Warmup-Stable-Decay Learning Rates: A River Valley Loss Landscape Perspective

投稿日: 2024年10月8日作成者: jarxiv

要約一般的なコサイン学習率のスケジュールはステップの総数に依存するため、現在、 … 続きを読む →

カテゴリー: cs.CL, cs.LG, stat.ML | コメントを受け付けていません

Density estimation with LLMs: a geometric investigation of in-context learning trajectories

投稿日: 2024年10月8日作成者: jarxiv

要約大規模言語モデル (LLM) は、時系列予測を含むさまざまなタスクにわたっ … 続きを読む →

カテゴリー: cs.CL, cs.LG, stat.ML | コメントを受け付けていません

Precise Model Benchmarking with Only a Few Observations

投稿日: 2024年10月8日作成者: jarxiv

要約大規模な質問応答データセット内の特定のトピックに属する質問に対する大規模言 … 続きを読む →

カテゴリー: cs.CL, cs.CV, cs.LG, stat.AP | コメントを受け付けていません

Cookbook: A framework for improving LLM generative abilities via programmatic data generating templates

投稿日: 2024年10月8日作成者: jarxiv

要約命令データセットで大規模言語モデル (LLM) を微調整することは、生成機 … 続きを読む →

カテゴリー: cs.CL, cs.LG | コメントを受け付けていません

「cs.LG」カテゴリーアーカイブ

Fine-Tuning and Prompt Optimization: Two Great Steps that Work Better Together

Representation noising effectively prevents harmful fine-tuning on LLMs

When ‘A Helpful Assistant’ Is Not Really Helpful: Personas in System Prompts Do Not Improve Performances of Large Language Models

Efficient Model-Agnostic Multi-Group Equivariant Networks

MetaMetrics: Calibrating Metrics For Generation Tasks Using Human Preferences

Creative Beam Search: LLM-as-a-Judge For Improving Response Generation

Understanding Warmup-Stable-Decay Learning Rates: A River Valley Loss Landscape Perspective

Density estimation with LLMs: a geometric investigation of in-context learning trajectories

Precise Model Benchmarking with Only a Few Observations

Cookbook: A framework for improving LLM generative abilities via programmatic data generating templates

最近の投稿

最近のコメント

アーカイブ

カテゴリー