Context-aware Mamba-based Reinforcement Learning for social robot navigation

要約

ソーシャルロボットナビゲーション (SRN) は、社会的に受け入れられる方法で歩行者が多い環境をナビゲートすることを伴う関連問題です。
これは、歩行者が多い環境でソーシャルロボットを効果的にするために不可欠な部分です。
このようなロボットの使用例は、コンパニオンロボットから倉庫ロボット、自律型車椅子までさまざまです。
近年、深層強化学習はソーシャルロボットナビゲーションの研究に利用されることが増えています。
私たちの取り組みでは、CAMRL (Context-Aware Mamba-based Reinforcement Learning) を導入しています。
Mamba は、シーケンスタスクにおいてトランスフォーマーに匹敵する結果を達成した、深層学習ベースの新しい状態空間モデル (SSM) です。
CAMRL は Mamba を使用してロボットの次のアクションを決定します。これにより、ニューラルネットワークによって予測される次の状態の値が最大化され、割り当てられた報酬に基づいてロボットが効果的にナビゲートできるようになります。
当社は、ORCA と SFM に基づくさまざまな密度と環境動作を含む厳密なテストデータセットを使用して、既存のソリューション (CADRL、LSTM-RL、SARL) と並行して CAMRL を評価します。これにより、CAMRL がより高い成功率を達成し、衝突を最小限に抑え、
歩行者からのより安全な距離。
この研究では、新しい SRN プランナーを導入し、ロボットナビゲーションのためのディープステート宇宙モデルの可能性を示しています。

要約(オリジナル)

Social robot navigation (SRN) is a relevant problem that involves navigating a pedestrian-rich environment in a socially acceptable manner. It is an essential part of making social robots effective in pedestrian-rich settings. The use cases of such robots could vary from companion robots to warehouse robots to autonomous wheelchairs. In recent years, deep reinforcement learning has been increasingly used in research on social robot navigation. Our work introduces CAMRL (Context-Aware Mamba-based Reinforcement Learning). Mamba is a new deep learning-based State Space Model (SSM) that has achieved results comparable to transformers in sequencing tasks. CAMRL uses Mamba to determine the robot’s next action, which maximizes the value of the next state predicted by the neural network, enabling the robot to navigate effectively based on the rewards assigned. We evaluate CAMRL alongside existing solutions (CADRL, LSTM-RL, SARL) using a rigorous testing dataset which involves a variety of densities and environment behaviors based on ORCA and SFM, thus, demonstrating that CAMRL achieves higher success rates, minimizes collisions, and maintains safer distances from pedestrians. This work introduces a new SRN planner, showcasing the potential for deep-state space models for robot navigation.

arxiv情報

著者	Syed Muhammad Mustafa,Omema Rizvi,Zain Ahmed Usmani,Abdul Basit Memon
発行日	2024-08-05 17:55:20+00:00
arxivサイト	arxiv_id(pdf)

提供元, 利用サービス

arxiv.jp, Google

Context-aware Mamba-based Reinforcement Learning for social robot navigation

要約

要約(オリジナル)

arxiv情報

提供元, 利用サービス

最近の投稿

最近のコメント

アーカイブ

カテゴリー