Counter-example guided Imitation Learning of Feedback Controllers from Temporal Logic Specifications

要約

信号時間論理 (STL) を使用して表現された制御要件の模倣学習のための新しい方法を提案します。
より具体的には、複雑なコントローラーを模倣するようにニューラルネットワークをトレーニングする問題に焦点を当てます。
学習プロセスは、反例とカバレッジ測定に基づいた効率的なデータ集約によって導かれます。
さらに、STL 要件のパラメータ化とパラメータ推定を通じて、学習したコントローラのパフォーマンスを評価する方法を紹介します。
飛行ロボットのケーススタディで私たちのアプローチを実証します。

要約(オリジナル)

We present a novel method for imitation learning for control requirements expressed using Signal Temporal Logic (STL). More concretely we focus on the problem of training a neural network to imitate a complex controller. The learning process is guided by efficient data aggregation based on counter-examples and a coverage measure. Moreover, we introduce a method to evaluate the performance of the learned controller via parameterization and parameter estimation of the STL requirements. We demonstrate our approach with a flying robot case study.

arxiv情報

著者	Thao Dang,Alexandre Donzé,Inzemamul Haque,Nikolaos Kekatos,Indranil Saha
発行日	2024-03-25 10:09:42+00:00
arxivサイト	arxiv_id(pdf)

提供元, 利用サービス

arxiv.jp, Google

Counter-example guided Imitation Learning of Feedback Controllers from Temporal Logic Specifications

要約

要約(オリジナル)

arxiv情報

提供元, 利用サービス

最近の投稿

最近のコメント

アーカイブ

カテゴリー