"cs.AI" Category Archive

A Survey Analyzing Generalization in Deep Reinforcement Learning

Posted: October 31, 2024 | Author: jarxiv

Abstract: Reinforcement learning research uses deep neural networks for high-dimensional state …

Categories: cs.AI, cs.LG, stat.ML

Exploring Design Choices for Building Language-Specific LLMs

Posted: October 31, 2024 | Author: jarxiv

Abstract: Although large language models (LLMs) are advancing rapidly, the majority of …

Categories: cs.AI, cs.CL, cs.LG

Aequitas Flow: Streamlining Fair ML Experimentation

Posted: October 31, 2024 | Author: jarxiv

Abstract: Aequitas Flow is an end-to-end fair machine learning (ML) …

Categories: cs.AI, cs.CY, cs.LG

ReasoningRec: Bridging Personalized Recommendations and Human-Interpretable Explanations through LLM Reasoning

Posted: October 31, 2024 | Author: jarxiv

Abstract: This paper leverages large language models (LLMs) for recommendations and human …

Categories: cs.AI, cs.IR

Certification for Differentially Private Prediction in Gradient-Based Training

Posted: October 31, 2024 | Author: jarxiv

Abstract: Differential privacy upper-bounds the information leakage of machine learning models, but meaningful privacy …

Categories: cs.AI, cs.LG

Instigating Cooperation among LLM Agents Using Adaptive Information Modulation

Posted: October 31, 2024 | Author: jarxiv

Abstract: This paper uses LLM agents as proxies for human strategic behavior, with reinforcement learning …

Categories: cs.AI, cs.CL, cs.CY, cs.GT

Position Coupling: Improving Length Generalization of Arithmetic Transformers Using Task Structure

Posted: October 31, 2024 | Author: jarxiv

Abstract: Even for simple arithmetic tasks such as integer addition, Transformers …

Categories: cs.AI, cs.CL, cs.LG

Kinetix: Investigating the Training of General Agents through Open-Ended Physics-Based Control Tasks

Posted: October 31, 2024 | Author: jarxiv

Abstract: Large models trained with self-supervised learning on offline datasets …

Categories: cs.AI, cs.LG

Grounding by Trying: LLMs with Reinforcement Learning-Enhanced Retrieval

Posted: October 31, 2024 | Author: jarxiv

Abstract: Hallucinations in large language models (LLMs), as LLMs retrieve information and real information …

Categories: cs.AI, cs.LG

Bandits with Preference Feedback: A Stackelberg Game Perspective

Posted: October 31, 2024 | Author: jarxiv

Abstract: Bandits with preference feedback rely not on direct value queries but on pairwise …

Categories: cs.AI, cs.GT, cs.LG, stat.ML
