「60F17」カテゴリーアーカイブ

The ODE Method for Asymptotic Statistics in Stochastic Approximation and Reinforcement Learning

投稿日: 2024年11月8日作成者: jarxiv

要約この論文は $d$ 次元の確率的近似再帰 $$ \theta_{n+1}= … 続きを読む →

カテゴリー: 60F17, 62L20, 68T05, cs.LG, math.ST, stat.TH | コメントを受け付けていません

The ODE Method for Asymptotic Statistics in Stochastic Approximation and Reinforcement Learning

投稿日: 2024年2月22日作成者: jarxiv

要約この論文は、確率的近似再帰 \[ \theta_{n+1}= \theta … 続きを読む →

カテゴリー: 60F17, 62L20, 68T05, cs.LG, math.ST, stat.TH | コメントを受け付けていません