Multi-agent Reinforcement Learning vs. Fixed-Time Control for Traffic Signal Optimization: A Simulation Study

要約

特に交差点での都市交通渋滞は、旅行時間、燃料消費、排出量に大きな影響を与えます。
従来の固定時間信号制御システムは、しばしば動的なトラフィックパターンを効果的に管理するための適応性を欠いています。
この研究では、シミュレートされた環境内の複数の交差点にわたる交通信号の調整を最適化するためのマルチエージェント補強学習（MARL）の適用を調査します。
Pygameを利用して、シミュレーションが開発され、相互接続された交差点のネットワークをランダムに生成された車両フローとモデル化して、現実的なトラフィックの変動を反映しました。
分散化されたMARLコントローラーが実装されました。各トラフィック信号は自律剤として動作し、近隣のエージェントからのローカル観測と情報に基づいて決定を下します。
パフォーマンスは、平均車両待機時間や全体的なスループットなどのメトリックを使用して、ベースライン固定時間コントローラーに対して評価されました。
MARLアプローチは、平均待機時間の短縮やスループットの改善など、統計的に有意な改善を実証しました。
これらの調査結果は、MARLベースの動的制御戦略が都市交通管理の効率を改善するための大きな約束を抱えていることを示唆しています。
スケーラビリティと現実世界の実装の課題に対処するために、さらに研究が推奨されます。

要約(オリジナル)

Urban traffic congestion, particularly at intersections, significantly impacts travel time, fuel consumption, and emissions. Traditional fixed-time signal control systems often lack the adaptability to manage dynamic traffic patterns effectively. This study explores the application of multi-agent reinforcement learning (MARL) to optimize traffic signal coordination across multiple intersections within a simulated environment. Utilizing Pygame, a simulation was developed to model a network of interconnected intersections with randomly generated vehicle flows to reflect realistic traffic variability. A decentralized MARL controller was implemented, in which each traffic signal operates as an autonomous agent, making decisions based on local observations and information from neighboring agents. Performance was evaluated against a baseline fixed-time controller using metrics such as average vehicle wait time and overall throughput. The MARL approach demonstrated statistically significant improvements, including reduced average waiting times and improved throughput. These findings suggest that MARL-based dynamic control strategies hold substantial promise for improving urban traffic management efficiency. More research is recommended to address scalability and real-world implementation challenges.

arxiv情報

著者	Saahil Mahato
発行日	2025-05-20 15:59:44+00:00
arxivサイト	arxiv_id(pdf)

提供元, 利用サービス

arxiv.jp, Google

Multi-agent Reinforcement Learning vs. Fixed-Time Control for Traffic Signal Optimization: A Simulation Study

要約

要約(オリジナル)

arxiv情報

提供元, 利用サービス

最近の投稿

最近のコメント

アーカイブ

カテゴリー