PTDRL: Parameter Tuning using Deep Reinforcement Learning

要約

ロボットが安全かつ高速に移動できるようにするさまざまな自律ナビゲーションアルゴリズムが存在します。
ただし、これらのアルゴリズムの多くは、新しい環境に直面したときにパラメータの再調整が必要になります。
この論文では、特定のナビゲーションシステムに対して期待される報酬を最大化するパラメータを固定パラメータセットから適応的に選択するパラメータ調整戦略である PTDRL を提案します。
私たちの学習戦略は、さまざまな環境、さまざまなプラットフォーム、さまざまなユーザーの好みに使用できます。
具体的には、古典的な動作計画アルゴリズムをナビゲーションシステムとして使用し、その動作を最適化するためにパラメータをトレーニングすることで、屋内空間におけるソーシャルナビゲーションの問題に取り組んでいます。
実験結果は、PTDRL が他のオンラインパラメーター調整戦略よりも優れたパフォーマンスを発揮できることを示しています。

要約(オリジナル)

A variety of autonomous navigation algorithms exist that allow robots to move around in a safe and fast manner. However, many of these algorithms require parameter re-tuning when facing new environments. In this paper, we propose PTDRL, a parameter-tuning strategy that adaptively selects from a fixed set of parameters those that maximize the expected reward for a given navigation system. Our learning strategy can be used for different environments, different platforms, and different user preferences. Specifically, we attend to the problem of social navigation in indoor spaces, using a classical motion planning algorithm as our navigation system and training its parameters to optimize its behavior. Experimental results show that PTDRL can outperform other online parameter-tuning strategies.

arxiv情報

著者	Elias Goldsztejn,Tal Feiner,Ronen Brafman
発行日	2023-06-19 10:36:53+00:00
arxivサイト	arxiv_id(pdf)

提供元, 利用サービス

arxiv.jp, Google

PTDRL: Parameter Tuning using Deep Reinforcement Learning

要約

要約(オリジナル)

arxiv情報

提供元, 利用サービス

最近の投稿

最近のコメント

アーカイブ

カテゴリー