Robo-taxi Fleet Coordination at Scale via Reinforcement Learning

要約

一般的に自律的なモビリティオンデマンド（AMOD）システムとして知られているオンデマンド輸送サービスを提供するロボ税の艦隊は、汚染、エネルギー消費、都市の混雑の削減など、社会的利益に大きな約束を抱えています。
ただし、これらのシステムを大規模に調整することは重要な課題であり、既存の調整アルゴリズムがシステムの最大限の可能性を活用できないことが多いためです。
この作業では、数学的モデリングをデータ駆動型の手法と統合する新しい意思決定フレームワークを紹介します。
特に、強化学習のレンズを通じてAMOD調整の問題を提示し、グラフ表現学習、強化学習、および古典的な運用研究ツールの主な強みを活用するグラフネットワークベースのフレームワークを提案します。
多様なシミュレーションの忠実度とシナリオ全体の広範な評価は、アプローチの柔軟性を示しており、以前の方法と比較して優れたシステムパフォーマンス、計算効率、および一般化可能性を実現します。
最後に、この分野での研究努力を民主化する必要性に動機付けられ、アクセス可能なシミュレーションプラットフォームを提供し、方法論を比較するための標準化された検証プロセスを確立するように設計されたオープンソースコードベースとともに、ネットワークレベルの調整のための公開されたベンチマーク、データセット、およびシミュレーターをリリースします。
https://github.com/stanfordasl/rl4amodで利用可能なコード

要約(オリジナル)

Fleets of robo-taxis offering on-demand transportation services, commonly known as Autonomous Mobility-on-Demand (AMoD) systems, hold significant promise for societal benefits, such as reducing pollution, energy consumption, and urban congestion. However, orchestrating these systems at scale remains a critical challenge, with existing coordination algorithms often failing to exploit the systems’ full potential. This work introduces a novel decision-making framework that unites mathematical modeling with data-driven techniques. In particular, we present the AMoD coordination problem through the lens of reinforcement learning and propose a graph network-based framework that exploits the main strengths of graph representation learning, reinforcement learning, and classical operations research tools. Extensive evaluations across diverse simulation fidelities and scenarios demonstrate the flexibility of our approach, achieving superior system performance, computational efficiency, and generalizability compared to prior methods. Finally, motivated by the need to democratize research efforts in this area, we release publicly available benchmarks, datasets, and simulators for network-level coordination alongside an open-source codebase designed to provide accessible simulation platforms and establish a standardized validation process for comparing methodologies. Code available at: https://github.com/StanfordASL/RL4AMOD

arxiv情報

著者	Luigi Tresca,Carolin Schmidt,James Harrison,Filipe Rodrigues,Gioele Zardini,Daniele Gammelli,Marco Pavone
発行日	2025-04-08 15:19:41+00:00
arxivサイト	arxiv_id(pdf)

提供元, 利用サービス

arxiv.jp, Google

Robo-taxi Fleet Coordination at Scale via Reinforcement Learning

要約

要約(オリジナル)

arxiv情報

提供元, 利用サービス

最近の投稿

最近のコメント

アーカイブ

カテゴリー