Safe Learning of PDDL Domains with Conditional Effects — Extended Version

要約

さまざまなタイプの計画上の問題を解決するために、強力なドメインに依存しないプランナーが開発されています。
これらのプランナーは、多くの場合、何らかの計画領域記述言語で指定された、代理エージェントのアクションのモデルを必要とします。
このようなアクションモデルを手動で設計することは、非常に困難な作業であることで知られています。
別の方法は、観察からアクションモデルを自動的に学習することです。
このようなアクションモデルで作成されたすべての計画が実際の未知のアクションモデルと一致する場合、そのアクションモデルは安全であると呼ばれます。
このような安全なアクションモデルを学習するためのアルゴリズムは存在しますが、多くの計画問題で一般的な構成要素である条件付きまたは普遍的な効果を持つ領域を処理することはできません。
私たちは、条件付き効果を伴う自明ではない安全なアクションモデルを学習するには、指数関数的な数のサンプルが必要になる可能性があることを証明します。
次に、そのような学習が扱いやすい合理的な仮定を特定し、それを実行できる最初のアルゴリズムである条件付き効果の SAM 学習 (Conditional-SAM) を提案します。
私たちはConditional-SAMを理論的に解析し、実験的に評価します。
私たちの結果は、Conditional-SAM によって学習されたアクションモデルを使用して、実験されたほとんどのドメインでテストセットの問題のほとんどを完全に解決できることを示しています。

要約(オリジナル)

Powerful domain-independent planners have been developed to solve various types of planning problems. These planners often require a model of the acting agent’s actions, given in some planning domain description language. Manually designing such an action model is a notoriously challenging task. An alternative is to automatically learn action models from observation. Such an action model is called safe if every plan created with it is consistent with the real, unknown action model. Algorithms for learning such safe action models exist, yet they cannot handle domains with conditional or universal effects, which are common constructs in many planning problems. We prove that learning non-trivial safe action models with conditional effects may require an exponential number of samples. Then, we identify reasonable assumptions under which such learning is tractable and propose SAM Learning of Conditional Effects (Conditional-SAM), the first algorithm capable of doing so. We analyze Conditional-SAM theoretically and evaluate it experimentally. Our results show that the action models learned by Conditional-SAM can be used to solve perfectly most of the test set problems in most of the experimented domains.

arxiv情報

著者	Argaman Mordoch,Enrico Scala,Roni Stern,Brendan Juba
発行日	2024-03-22 14:49:49+00:00
arxivサイト	arxiv_id(pdf)

提供元, 利用サービス

arxiv.jp, Google

Safe Learning of PDDL Domains with Conditional Effects — Extended Version

要約

要約(オリジナル)

arxiv情報

提供元, 利用サービス

最近の投稿

最近のコメント

アーカイブ

カテゴリー