Receding-Constraint Model Predictive Control using a Learned Approximate Control-Invariant Set

要約

近年、高度なモデルベースおよびデータ駆動型の制御手法により、複雑なロボットシステムの可能性が解き放たれており、この傾向は近い将来、指数関数的な速度で続くことが予想されます。
しかし、これらの高度な制御手法による安全性の確保には依然として課題が残っています。
コントローラー (モデル予測コントローラーまたは強化学習ポリシー) を安全にするためのよく知られたツールは、いわゆる制御不変セット (別名セーフセット) です。
残念ながら、非線形システムの場合、一般にそのようなセットを正確に計算することはできません。
近似制御不変集合を計算するための数値アルゴリズムは存在しますが、集合が正確でない場合、古典的な理論的制御手法は機能しません。
この文書では、この問題に対処するための最近の取り組みについて説明します。
我々は、古典的な手法よりも弱い仮定のもとで再帰的な実現可能性や安全性を保証できる新しいモデル予測制御スキームを提案します。
特に、再帰的実現可能性は、安全集合制約を地平線を越えて後方に移動させ、その集合が制御の不変性よりも弱い条件を満たすと仮定することによって保証されます。
代わりに、安全セット上のさらに弱い仮定の下で安全性が保証され、制約違反のリスクが検出されるたびに安全なタスク中止戦略がトリガーされます。
私たちはシミュレートされたロボットマニピュレーターでアプローチを評価し、追跡コスト、完了したタスクの数、および計算時間の点で妥当なパフォーマンスを維持しながら、最先端のアプローチよりも制約違反が少ないことを経験的に実証しました。

要約(オリジナル)

In recent years, advanced model-based and data-driven control methods are unlocking the potential of complex robotics systems, and we can expect this trend to continue at an exponential rate in the near future. However, ensuring safety with these advanced control methods remains a challenge. A well-known tool to make controllers (either Model Predictive Controllers or Reinforcement Learning policies) safe, is the so-called control-invariant set (a.k.a. safe set). Unfortunately, for nonlinear systems, such a set cannot be exactly computed in general. Numerical algorithms exist for computing approximate control-invariant sets, but classic theoretic control methods break down if the set is not exact. This paper presents our recent efforts to address this issue. We present a novel Model Predictive Control scheme that can guarantee recursive feasibility and/or safety under weaker assumptions than classic methods. In particular, recursive feasibility is guaranteed by making the safe-set constraint move backward over the horizon, and assuming that such set satisfies a condition that is weaker than control invariance. Safety is instead guaranteed under an even weaker assumption on the safe set, triggering a safe task-abortion strategy whenever a risk of constraint violation is detected. We evaluated our approach on a simulated robot manipulator, empirically demonstrating that it leads to less constraint violations than state-of-the-art approaches, while retaining reasonable performance in terms of tracking cost, number of completed tasks, and computation time.

arxiv情報

著者	Gianni Lunardi,Asia La Rocca,Matteo Saveriano,Andrea Del Prete
発行日	2024-08-28 11:41:21+00:00
arxivサイト	arxiv_id(pdf)

提供元, 利用サービス

arxiv.jp, Google

Receding-Constraint Model Predictive Control using a Learned Approximate Control-Invariant Set

要約

要約(オリジナル)

arxiv情報

提供元, 利用サービス

最近の投稿

最近のコメント

アーカイブ

カテゴリー