"cs.CL" Category Archive

Towards a Theoretical Understanding of Synthetic Data in LLM Post-Training: A Reverse-Bottleneck Perspective

Posted: October 3, 2024 | Author: jarxiv

Abstract: With high-quality, specific data in short supply, synthetic data in large language models (L …

Categories: cs.AI, cs.CL, cs.LG

Auto-Demo Prompting: Leveraging Generated Outputs as Demonstrations for Enhanced Batch Prompting

Posted: October 3, 2024 | Author: jarxiv

Abstract: Batch prompting, which aims to improve computational efficiency by processing multiple inputs simultaneously …

Categories: cs.AI, cs.CL

Evaluating Robustness of Reward Models for Mathematical Reasoning

Posted: October 3, 2024 | Author: jarxiv

Abstract: Reward models in reinforcement learning from human feedback (RLHF) systems …

Categories: cs.AI, cs.CL, cs.LG

Learning Dynamics of LLM Finetuning

Posted: October 3, 2024 | Author: jarxiv

Abstract: How learning specific training examples influences the model's predictions on other examples …

Categories: cs.AI, cs.CL, cs.LG

README: Bridging Medical Jargon and Lay Understanding for Patient Education through Data-Centric NLP

Posted: October 3, 2024 | Author: jarxiv

Abstract: Advances in medicine have shifted the focus to patient-centered approaches, particularly self-care and patient education …

Categories: cs.AI, cs.CL

Controllable Preference Optimization: Toward Controllable Multi-Objective Alignment

Posted: October 3, 2024 | Author: jarxiv

Abstract: Alignment in artificial intelligence pursues the consistency between model responses and human preferences and values …

Categories: cs.AI, cs.CL, cs.SY, eess.SY

Scaling Optimal LR Across Token Horizons

Posted: October 3, 2024 | Author: jarxiv

Abstract: For state-of-the-art LLMs, model size, dataset size, cluster si …

Categories: cs.AI, cs.CL, cs.LG

DeFine: Enhancing LLM Decision-Making with Factor Profiles and Analogical Reasoning

Posted: October 3, 2024 | Author: jarxiv

Abstract: Because LLMs can reason over long contexts and identify key factors, decision-making …

Categories: cs.AI, cs.CL

Composing Global Optimizers to Reasoning Tasks via Algebraic Objects in Neural Nets

Posted: October 3, 2024 | Author: jarxiv

Abstract: For networks with quadratic activations trained on reasoning tasks in Abelian groups (e.g., modular addition), we …

Categories: cs.AI, cs.CL, cs.LG, math.AC, math.RA

Social Conjuring: Multi-User Runtime Collaboration with AI in Building Virtual 3D Worlds

Posted: October 3, 2024 | Author: jarxiv

Abstract: Generative artificial intelligence has shown promise in prompting virtual worlds into existence, but …

Categories: cs.AI, cs.CL, cs.ET, cs.HC
