Hierarchical Policy Blending As Optimal Transport

要約

最適なトランスポート (HiPBOT) として階層的なポリシーブレンディングを提示します。
この階層的なフレームワークは、低レベルのリアクティブエキスパートポリシーの重みを適応させ、エキスパートポリシーとエージェントのプロダクトのパラメーター空間に先読み計画レイヤーを追加します。
私たちの高レベルのプランナーは、不均衡な最適輸送を介してポリシーのブレンドを実現し、基礎となるリーマン運動ポリシーのスケーリングを統合し、それらのリーマン行列を効果的に調整し、エキスパートとエージェント間の優先順位を決定し、安全性とタスクの成功を保証します。
低次元のナビゲーションから高次元の全身制御まで、さまざまなアプリケーションシナリオでの実験結果は、HiPBOT の有効性と効率性を示しています。HiPBOT は、確率論的推論を実行するか、ツリー構造を定義する最先端のベースラインよりも優れています。
ロボット制御への最適な搬送の新しいアプリケーションへの道を開きます。
https://sites.google.com/view/hipobot のその他の資料

要約(オリジナル)

We present hierarchical policy blending as optimal transport (HiPBOT). This hierarchical framework adapts the weights of low-level reactive expert policies, adding a look-ahead planning layer on the parameter space of a product of expert policies and agents. Our high-level planner realizes a policy blending via unbalanced optimal transport, consolidating the scaling of underlying Riemannian motion policies, effectively adjusting their Riemannian matrix, and deciding over the priorities between experts and agents, guaranteeing safety and task success. Our experimental results in a range of application scenarios from low-dimensional navigation to high-dimensional whole-body control showcase the efficacy and efficiency of HiPBOT, which outperforms state-of-the-art baselines that either perform probabilistic inference or define a tree structure of experts, paving the way for new applications of optimal transport to robot control. More material at https://sites.google.com/view/hipobot

arxiv情報

著者	An T. Le,Kay Hansel,Jan Peters,Georgia Chalvatzaki
発行日	2023-03-16 10:45:07+00:00
arxivサイト	arxiv_id(pdf)

提供元, 利用サービス

arxiv.jp, Google

Hierarchical Policy Blending As Optimal Transport

要約

要約(オリジナル)

arxiv情報

提供元, 利用サービス

最近の投稿

最近のコメント

アーカイブ

カテゴリー