「I.2.6」カテゴリーアーカイブ

Towards Understanding Sycophancy in Language Models

投稿日: 2023年10月30日作成者: jarxiv

要約人間のフィードバックは、AI アシスタントの微調整によく利用されます。し … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.LG, I.2.6, stat.ML | コメントを受け付けていません

No-Regret Online Reinforcement Learning with Adversarial Losses and Transitions

投稿日: 2023年10月27日作成者: jarxiv

要約敵対的なマルコフ決定プロセス用の既存のオンライン学習アルゴリズムは、たとえ … 続きを読む →

カテゴリー: cs.LG, I.2.6, stat.ML | コメントを受け付けていません

Improved Best-of-Both-Worlds Guarantees for Multi-Armed Bandits: FTRL with General Regularizers and Multiple Optimal Arms

投稿日: 2023年10月27日作成者: jarxiv

要約私たちは、確率的設定と敵対的設定の両方で同時に最適に実行する適応型マルチア … 続きを読む →

カテゴリー: cs.LG, I.2.6, stat.ML | コメントを受け付けていません

Optimization dependent generalization bound for ReLU networks based on sensitivity in the tangent bundle

投稿日: 2023年10月27日作成者: jarxiv

要約ディープラーニングの最近の進歩により、ディープニューラルネットワークの … 続きを読む →

カテゴリー: 68, cs.AI, cs.LG, I.2.6 | コメントを受け付けていません

SEEDS: Exponential SDE Solvers for Fast High-Quality Sampling from Diffusion Models

投稿日: 2023年10月27日作成者: jarxiv

要約拡散確率モデル (DPM) として知られる強力なクラスの生成モデルが注目を … 続きを読む →

カテゴリー: cs.CV, cs.LG, cs.NA, I.2.6, math.NA | コメントを受け付けていません

Necessary and Sufficient Conditions for Optimal Decision Trees using Dynamic Programming

投稿日: 2023年10月26日作成者: jarxiv

要約デシジョンツリーのグローバル最適化は、精度、サイズ、ひいては人間の理解可 … 続きを読む →

カテゴリー: 68T09, 68T20, 90C39, cs.AI, cs.DS, cs.LG, I.2.6 | コメントを受け付けていません

Towards Understanding Sycophancy in Language Models

投稿日: 2023年10月25日作成者: jarxiv

要約ヒューマンフィードバックからの強化学習 (RLHF) は、高品質の AI … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.LG, I.2.6, stat.ML | コメントを受け付けていません

Towards Understanding Sycophancy in Language Models

投稿日: 2023年10月23日作成者: jarxiv

要約ヒューマンフィードバックからの強化学習 (RLHF) は、高品質の AI … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.LG, I.2.6, stat.ML | コメントを受け付けていません

Exact and efficient solutions of the LMC Multitask Gaussian Process model

投稿日: 2023年10月19日作成者: jarxiv

要約共領域化の線形モデル (LMC) は、回帰または分類のためのマルチタスク … 続きを読む →

カテゴリー: cs.LG, I.2.6, stat.ML | コメントを受け付けていません

Machine Learning-based Nutrient Application’s Timeline Recommendation for Smart Agriculture: A Large-Scale Data Mining Approach

投稿日: 2023年10月19日作成者: jarxiv

要約この研究は、作物栽培における肥料散布を監視する際のデータ分析の重要な役割に … 続きを読む →

カテゴリー: cs.AI, cs.LG, I.2.6 | コメントを受け付けていません

「I.2.6」カテゴリーアーカイブ

Towards Understanding Sycophancy in Language Models

No-Regret Online Reinforcement Learning with Adversarial Losses and Transitions

Improved Best-of-Both-Worlds Guarantees for Multi-Armed Bandits: FTRL with General Regularizers and Multiple Optimal Arms

Optimization dependent generalization bound for ReLU networks based on sensitivity in the tangent bundle

SEEDS: Exponential SDE Solvers for Fast High-Quality Sampling from Diffusion Models

Necessary and Sufficient Conditions for Optimal Decision Trees using Dynamic Programming

Towards Understanding Sycophancy in Language Models

Towards Understanding Sycophancy in Language Models

Exact and efficient solutions of the LMC Multitask Gaussian Process model

Machine Learning-based Nutrient Application’s Timeline Recommendation for Smart Agriculture: A Large-Scale Data Mining Approach

最近の投稿

最近のコメント

アーカイブ

カテゴリー