Solving the swing-up and balance task for the Acrobot and Pendubot with SAC

要約

私たちは、IJCAI 2023 での AI オリンピック競技会に参加するためのペンデュボットとアクロボットのスイングアップとバランスタスクのソリューションを紹介します。私たちのソリューションは、ソフトアクター Crtic (SAC) の強化学習 (RL) アルゴリズムに基づいています。
二重振り子をトップ位置で安定させるためのリニア二次レギュレータ(LQR)コントローラのスイングアップと引力領域への進入に関する方針。
当社のコントローラーは、ペンデュボットとアクロボットの両方の問題シナリオにおいて、パフォーマンスと堅牢性において競争力のあるスコアを達成しています。

要約(オリジナル)

We present a solution of the swing-up and balance task for the pendubot and acrobot for the participation in the AI Olympics competition at IJCAI 2023. Our solution is based on the Soft Actor Crtic (SAC) reinforcement learning (RL) algorithm for training a policy for the swing-up and entering the region of attraction of a linear quadratic regulator(LQR) controller for stabilizing the double pendulum at the top position. Our controller achieves competitive scores in performance and robustness for both, pendubot and acrobot, problem scenarios.

arxiv情報

著者	Chi Zhang,Akhil Sathuluri,Markus Zimmermann
発行日	2023-12-18 16:04:54+00:00
arxivサイト	arxiv_id(pdf)

提供元, 利用サービス

arxiv.jp, Google

Solving the swing-up and balance task for the Acrobot and Pendubot with SAC

要約

要約(オリジナル)

arxiv情報

提供元, 利用サービス

最近の投稿

最近のコメント

アーカイブ

カテゴリー