「37H99」カテゴリーアーカイブ

On the continuity and smoothness of the value function in reinforcement learning and optimal control

投稿日: 2024年3月22日作成者: jarxiv

要約価値関数は、強化学習と最適制御の両方においてエージェントが受け取る将来の累 … 続きを読む →

カテゴリー: 37H99, 37N35, 93E03, cs.AI, cs.SY, eess.SY, I.2.8 | コメントを受け付けていません

Beyond expectations: Residual Dynamic Mode Decomposition and Variance for Stochastic Dynamical Systems

投稿日: 2023年11月13日作成者: jarxiv

要約コープマン演算子は非線形力学システムを線形化し、そのスペクトル情報を非常に … 続きを読む →

カテゴリー: 37H99, 37M10, 37N25, 47A10, 47B33, 65P99, cs.LG, cs.NA, math.DS, math.NA, math.SP, nlin.CD | コメントを受け付けていません

Beyond expectations: Residual Dynamic Mode Decomposition and Variance for Stochastic Dynamical Systems

投稿日: 2023年8月22日作成者: jarxiv

要約コープマン演算子は非線形力学システムを線形化し、そのスペクトル情報を非常に … 続きを読む →

カテゴリー: 37H99, 37M10, 37N25, 47A10, 47B33, 65P99, cs.LG, cs.NA, math.DS, math.NA, math.SP, nlin.CD | コメントを受け付けていません