「math.OC」カテゴリーアーカイブ

A Method to Improve the Performance of Reinforcement Learning Based on the Y Operator for a Class of Stochastic Differential Equation-Based Child-Mother Systems

投稿日: 2024年1月2日作成者: jarxiv

要約この論文では、確率微分方程式(SDE)によって支配されるシステムに対するA … 続きを読む →

カテゴリー: cs.AI, math.OC | コメントを受け付けていません

Decision-focused predictions via pessimistic bilevel optimization: a computational study

投稿日: 2024年1月1日作成者: jarxiv

要約最適化パラメータの不確実性への対処は、長年にわたる重要な課題です。通常、 … 続きを読む →

カテゴリー: 90C30, cs.LG, math.OC | コメントを受け付けていません

On the Robustness of Decision-Focused Learning

投稿日: 2023年12月29日作成者: jarxiv

要約意思決定焦点学習 (DFL) は、不完全な最適化問題の欠落パラメーターを予 … 続きを読む →

カテゴリー: 68Txx, cs.LG, math.OC | コメントを受け付けていません

Cumulative Regret Analysis of the Piyavskii–Shubert Algorithm and Its Variants for Global Optimization

投稿日: 2023年12月29日作成者: jarxiv

要約私たちは大域的最適化の問題を研究し、Piyavskii-Shubert ア … 続きを読む →

カテゴリー: cs.LG, math.OC | コメントを受け付けていません

Symmetry Breaking in Symmetric Tensor Decomposition

投稿日: 2023年12月29日作成者: jarxiv

要約このノートでは、対称テンソルのランク分解の計算に関連する高度に非凸の最適化 … 続きを読む →

カテゴリー: cs.LG, math.OC | コメントを受け付けていません

Resilient Constrained Reinforcement Learning

投稿日: 2023年12月29日作成者: jarxiv

要約私たちは、トレーニング前に複数の制約仕様が識別されない、制約付き強化学習 … 続きを読む →

カテゴリー: cs.LG, cs.SY, eess.SY, math.OC | コメントを受け付けていません

Bayesian Design Principles for Frequentist Sequential Learning

投稿日: 2023年12月29日作成者: jarxiv

要約我々は、効率的なバンディット学習アルゴリズムと強化学習アルゴリズムを統一ベ … 続きを読む →

カテゴリー: cs.LG, math.OC, math.ST, stat.TH | コメントを受け付けていません

Implicitly normalized forecaster with clipping for linear and non-linear heavy-tailed multi-armed bandits

投稿日: 2023年12月27日作成者: jarxiv

要約 Implicitly Normalized Forecaster (INF … 続きを読む →

カテゴリー: cs.LG, math.OC, stat.ML | コメントを受け付けていません

Bayesian Design Principles for Frequentist Sequential Learning

投稿日: 2023年12月27日作成者: jarxiv

要約我々は、効率的なバンディット学習アルゴリズムと強化学習アルゴリズムを統一ベ … 続きを読む →

カテゴリー: cs.LG, math.OC, math.ST, stat.TH | コメントを受け付けていません

Bridging the Gaps: Learning Verifiable Model-Free Quadratic Programming Controllers Inspired by Model Predictive Control

投稿日: 2023年12月27日作成者: jarxiv

要約このペーパーでは、モデル予測制御 (MPC) からインスピレーションを得た … 続きを読む →

カテゴリー: cs.LG, cs.RO, cs.SY, eess.SY, math.OC | コメントを受け付けていません

「math.OC」カテゴリーアーカイブ

A Method to Improve the Performance of Reinforcement Learning Based on the Y Operator for a Class of Stochastic Differential Equation-Based Child-Mother Systems

Decision-focused predictions via pessimistic bilevel optimization: a computational study

On the Robustness of Decision-Focused Learning

Cumulative Regret Analysis of the Piyavskii–Shubert Algorithm and Its Variants for Global Optimization

Symmetry Breaking in Symmetric Tensor Decomposition

Resilient Constrained Reinforcement Learning

Bayesian Design Principles for Frequentist Sequential Learning

Implicitly normalized forecaster with clipping for linear and non-linear heavy-tailed multi-armed bandits

Bayesian Design Principles for Frequentist Sequential Learning

Bridging the Gaps: Learning Verifiable Model-Free Quadratic Programming Controllers Inspired by Model Predictive Control

最近の投稿

最近のコメント

アーカイブ

カテゴリー