Category archive: math.OC

Policy Gradient Methods for Discrete Time Linear Quadratic Regulator With Random Parameters

Posted: March 4, 2024 | Author: jarxiv

Abstract: In this paper, with random parameters that are independent and identically distributed over time, …

Categories: cs.LG, math.OC

New Characterizations and Efficient Local Search for General Integer Linear Programming

Posted: March 4, 2024 | Author: jarxiv

Abstract: Integer linear programming (ILP) models a wide range of practical combinatorial optimization problems and …

Categories: 90C06, 90C10, cs.AI, I.2.8, math.OC

Dimensionless Policies based on the Buckingham $π$ Theorem: Is This a Good Way to Generalize Numerical Results?

Posted: March 1, 2024 | Author: jarxiv

Abstract: The context (the list of variables that defines a motion control problem) …

Categories: 00A73, 68T40, 70Q05 (Primary), 93C85, cs.AI, cs.RO, cs.SY, eess.SY, math.OC

Efficient Model-Free Exploration in Low-Rank MDPs

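Posted: March 1, 2024 | Author: jarxiv

Abstract: A major challenge in reinforcement learning, for exploring high-dimensional domains that require generalization and function approximation, …

Categories: cs.LG, math.OC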

Convex Hulls of Reachable Sets

Posted: March 1, 2024 | Author: jarxiv

Abstract: For reachable nonlinear systems with bounded disturbances and uncertain initial conditions, we …

Categories: cs.LG, cs.RO, cs.SY, eess.SY, math.OC

Heavy-Tailed Class Imbalance and Why Adam Outperforms Gradient Descent on Language Models

Posted: March 1, 2024 | Author: jarxiv

Abstract: In the optimization of large language transformers, Adam, compared with gradient descent, …

Categories: cs.CL, cs.LG, math.OC, stat.ML

Training Dynamics of Multi-Head Softmax Attention for In-Context Learning: Emergence, Convergence, and Optimality

Posted: March 1, 2024 | Author: jarxiv

Abstract: For in-context learning of multi-task linear regression, multi-head softma …

Categories: cs.AI, cs.LG, math.OC, math.ST, stat.ML, stat.TH

Taming Nonconvex Stochastic Mirror Descent with General Bregman Divergence

Posted: February 28, 2024 | Author: jarxiv

Abstract: In this paper, in modern nonconvex optimization settings, stochastic mirror descent (SMD …

Categories: 90C15, 90C26, cs.LG, G.1.6, math.OC

Robustly Learning Single-Index Models via Alignment Sharpness

Posted: February 28, 2024 | Author: jarxiv

Abstract: Under the $L_2^2$ loss in the agnostic model, single-index …

Categories: cs.DS, cs.LG, math.OC, math.ST, stat.ML, stat.TH

Variational Learning is Effective for Large Deep Networks

Posted: February 28, 2024 | Author: jarxiv

Abstract: The common belief that variational learning is ineffective for large neural networks …

Categories: cs.AI, cs.CL, cs.LG, math.OC, stat.ML
