「93E03」カテゴリーアーカイブ

On the continuity and smoothness of the value function in reinforcement learning and optimal control

投稿日: 2024年3月22日作成者: jarxiv

要約価値関数は、強化学習と最適制御の両方においてエージェントが受け取る将来の累 … 続きを読む →

カテゴリー: 37H99, 37N35, 93E03, cs.AI, cs.SY, eess.SY, I.2.8 | コメントを受け付けていません