Robust Reinforcement Learning-Based Locomotion for Resource-Constrained Quadrupeds with Exteroceptive Sensing

要約

コンパクトな四角形ロボットは、実際のシナリオでの展開にますます適していることが証明されています。
サイズが小さくなると、人間の環境への簡単な統合が促進されます。
それにもかかわらず、特に地形の認識の高い計算的要求のため、不均一な地形のリアルタイムの移動は依然として困難です。
このペーパーでは、慎重な深度センサーの選択に支えられたリアルタイムの標高マッピングを活用する、挑戦的な地形におけるリソースが制約した小規模四足動物のための堅牢な強化学習ベースの外部機密運動コントローラーを提示します。
ポリシーと状態推定器の両方を同時にトレーニングします。これらの推定器は、オプションで視覚inertial匂い（VIO）と融合した、標高マッピングの臭気源源を一緒に提供します。
Vioがなくても堅牢性を維持するための追加のフライトセンサーを配置することの重要性を実証し、したがって、計算リソースを解放する可能性があります。
提案されたコントローラーは、高さが最大17.5 cmまでのステップを完璧に通過し、Vioの有無にかかわらず22.5 cmのステップで80％の成功率を達成できることを実験的に実証します。
提案されたコントローラーは、それぞれ最大1.0 m/sと1.5 RAD/sの正確な前方およびヨー速度追跡も実現します。
github.com/eth-pbl/elmap-rl-controllerでトレーニングコードをオープンソーシングします。

要約(オリジナル)

Compact quadrupedal robots are proving increasingly suitable for deployment in real-world scenarios. Their smaller size fosters easy integration into human environments. Nevertheless, real-time locomotion on uneven terrains remains challenging, particularly due to the high computational demands of terrain perception. This paper presents a robust reinforcement learning-based exteroceptive locomotion controller for resource-constrained small-scale quadrupeds in challenging terrains, which exploits real-time elevation mapping, supported by a careful depth sensor selection. We concurrently train both a policy and a state estimator, which together provide an odometry source for elevation mapping, optionally fused with visual-inertial odometry (VIO). We demonstrate the importance of positioning an additional time-of-flight sensor for maintaining robustness even without VIO, thus having the potential to free up computational resources. We experimentally demonstrate that the proposed controller can flawlessly traverse steps up to 17.5 cm in height and achieve an 80% success rate on 22.5 cm steps, both with and without VIO. The proposed controller also achieves accurate forward and yaw velocity tracking of up to 1.0 m/s and 1.5 rad/s respectively. We open-source our training code at github.com/ETH-PBL/elmap-rl-controller.

arxiv情報

著者	Davide Plozza,Patricia Apostol,Paul Joseph,Simon Schläpfer,Michele Magno
発行日	2025-05-18 20:29:23+00:00
arxivサイト	arxiv_id(pdf)

提供元, 利用サービス

arxiv.jp, Google

Robust Reinforcement Learning-Based Locomotion for Resource-Constrained Quadrupeds with Exteroceptive Sensing

要約

要約(オリジナル)

arxiv情報

提供元, 利用サービス

最近の投稿

最近のコメント

アーカイブ

カテゴリー