Uncertainty-Aware Deployment of Pre-trained Language-Conditioned Imitation Learning Policies

要約

さまざまなタスクやロボットプラットフォームからのデータに基づいてトレーニングされた大規模なロボットポリシーは、汎用ロボットの実現に大きな期待を持っています。
しかし、新しい環境条件に対する信頼性の高い一般化は依然として大きな課題です。
この課題に対処するために、事前に訓練された言語条件付き模倣学習エージェントの不確実性を認識した展開のための新しいアプローチを提案します。
具体的には、温度スケーリングを使用してこれらのモデルを校正し、候補アクションのローカル情報を集約することで、校正されたモデルを利用して不確実性を考慮した意思決定を行います。
このような 3 つの事前トレーニング済みモデルを使用してシミュレーションでアプローチを実装し、タスクの完了率を大幅に向上させる可能性を示します。
付属のコードはリンクからアクセスできます: https://github.com/BobWu1998/uncertainty_quant_all.git

要約(オリジナル)

Large-scale robotic policies trained on data from diverse tasks and robotic platforms hold great promise for enabling general-purpose robots; however, reliable generalization to new environment conditions remains a major challenge. Toward addressing this challenge, we propose a novel approach for uncertainty-aware deployment of pre-trained language-conditioned imitation learning agents. Specifically, we use temperature scaling to calibrate these models and exploit the calibrated model to make uncertainty-aware decisions by aggregating the local information of candidate actions. We implement our approach in simulation using three such pre-trained models, and showcase its potential to significantly enhance task completion rates. The accompanying code is accessible at the link: https://github.com/BobWu1998/uncertainty_quant_all.git

arxiv情報

著者	Bo Wu,Bruce D. Lee,Kostas Daniilidis,Bernadette Bucher,Nikolai Matni
発行日	2024-03-27 03:19:36+00:00
arxivサイト	arxiv_id(pdf)

提供元, 利用サービス

arxiv.jp, Google

Uncertainty-Aware Deployment of Pre-trained Language-Conditioned Imitation Learning Policies

要約

要約(オリジナル)

arxiv情報

提供元, 利用サービス

最近の投稿

最近のコメント

アーカイブ

カテゴリー