Learning Hybrid Policies for MPC with Application to Drone Flight in Unknown Dynamic Environments

要約

近年、ドローンは現実世界のさまざまなタスクに応用されることが増えています。
モデル予測制御 (MPC) は、モデル化エラー/不確実性および外乱に対する堅牢性により、ドローン飛行制御の実用的な方法として浮上しています。
ただし、MPC は手動で調整されたパラメーターに敏感であるため、未知の環境ダイナミクスに直面すると、パフォーマンスが急激に低下する可能性があります。
この論文では、未知のダイナミクスを特徴とするスイングゲートを横切るドローンを制御するという課題に取り組みます。
この論文では、高レベルの決定変数を活用して不確実な環境条件に適応する、hyMPC と呼ばれるパラメーター化された MPC アプローチを紹介します。
これらの決定変数を導出するために、高レベルのガウスポリシーのトレーニングを目的とした新しいポリシー検索フレームワークが提示されます。
その後、ガウスポリシーの繰り返し実行を通じて収集されたデータに基づいてトレーニングされたニューラルネットワークポリシーの力を利用して、リアルタイムの意思決定変数を提供します。
hyMPC の有効性は数値シミュレーションによって検証され、スイングゲートを通過する 20 回のドローン飛行テストで 100% の成功率を達成し、環境力学に関する限られた事前知識でも安全で正確な飛行を達成できる能力を実証しました。

要約(オリジナル)

In recent years, drones have found increased applications in a wide array of real-world tasks. Model predictive control (MPC) has emerged as a practical method for drone flight control, owing to its robustness against modeling errors/uncertainties and external disturbances. However, MPC’s sensitivity to manually tuned parameters can lead to rapid performance degradation when faced with unknown environmental dynamics. This paper addresses the challenge of controlling a drone as it traverses a swinging gate characterized by unknown dynamics. This paper introduces a parameterized MPC approach named hyMPC that leverages high-level decision variables to adapt to uncertain environmental conditions. To derive these decision variables, a novel policy search framework aimed at training a high-level Gaussian policy is presented. Subsequently, we harness the power of neural network policies, trained on data gathered through the repeated execution of the Gaussian policy, to provide real-time decision variables. The effectiveness of hyMPC is validated through numerical simulations, achieving a 100\% success rate in 20 drone flight tests traversing a swinging gate, demonstrating its capability to achieve safe and precise flight with limited prior knowledge of environmental dynamics.

arxiv情報

著者	Zhaohan Feng,Jie Chen,Wei Xiao,Jian Sun,Bin Xin,Gang Wang
発行日	2024-01-25 07:41:36+00:00
arxivサイト	arxiv_id(pdf)

提供元, 利用サービス

arxiv.jp, Google

Learning Hybrid Policies for MPC with Application to Drone Flight in Unknown Dynamic Environments

要約

要約(オリジナル)

arxiv情報

提供元, 利用サービス

最近の投稿

最近のコメント

アーカイブ

カテゴリー