Efficient Learning of Urban Driving Policies Using Bird’s-Eye-View State Representations


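The authors' PPO-based approach, called RecurrDriveNet, is demonstrated on a simulated autonomous driving task in CARLA, where it outperforms traditional frame-stacking methods while requiring only one million experiences for training.
By interacting safely with other road users, RecurrDriveNet causes less than one infraction per driven kilometer.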


Autonomous driving involves complex decision-making in highly interactive environments, requiring thoughtful negotiation with other traffic participants. While reinforcement learning provides a way to learn such interaction behavior, efficient learning critically depends on scalable state representations. Contrary to imitation learning methods, high-dimensional state representations still constitute a major bottleneck for deep reinforcement learning methods in autonomous driving. In this paper, we study the challenges of constructing bird’s-eye-view representations for autonomous driving and propose a recurrent learning architecture for long-horizon driving. Our PPO-based approach, called RecurrDriveNet, is demonstrated on a simulated autonomous driving task in CARLA, where it outperforms traditional frame-stacking methods while only requiring one million experiences for training. RecurrDriveNet causes less than one infraction per driven kilometer by interacting safely with other road users.


Authors: Raphael Trumpp, Martin Büchner, Abhinav Valada, Marco Caccamo
Published: 2023-05-31 14:38:00+00:00

Category: cs.RO