Anytime Multi-Agent Path Finding with an Adaptive Delay-Based Heuristic

要約

Anytime Multi-Agent Path Finding (MAPF) は、マルチエージェントシステムにおけるスケーラブルなパス最適化への有望なアプローチです。
MAPF-LNS は大規模近傍検索 (LNS) に基づいており、高速な初期解が、解の選択されたパスを破棄および修復することによって繰り返し最適化される現在の最先端のアプローチです。
現在の MAPF-LNS バリアントは通常、適応選択メカニズムを使用して、複数の破壊ヒューリスティックから選択します。
ただし、有望な破壊ヒューリスティックを決定するには、MAPF-LNS はかなりの探索時間を必要とします。
一般的な破壊ヒューリスティックは非適応型であるため、これらのヒューリスティックによって引き起こされるパフォーマンスのボトルネックは、適応型ヒューリスティック選択だけでは克服できず、ソリューションコストの観点から MAPF-LNS の全体的な有効性が制限されます。
この論文では、MAPF-LNS の単一破壊ヒューリスティックのバリアントとして、成功ベースの自己学習で強化された適応遅延ベースの破壊と修復 (ADDRESS) を提案します。
ADDRESS は、制限付きトンプソンサンプリングを最も遅延したエージェントの上位 K セットに適用して、適応型 LNS 近傍生成用のシードエージェントを選択します。
MAPF ベンチマークセットの複数のマップで ADDRESS を評価し、元の MAPF-LNS やその他の最先端の手法と比較して、最大 1,000 のエージェントを使用する大規模なシナリオで少なくとも 50% のコスト改善を実証しました。

要約(オリジナル)

Anytime multi-agent path finding (MAPF) is a promising approach to scalable path optimization in multi-agent systems. MAPF-LNS, based on Large Neighborhood Search (LNS), is the current state-of-the-art approach where a fast initial solution is iteratively optimized by destroying and repairing selected paths of the solution. Current MAPF-LNS variants commonly use an adaptive selection mechanism to choose among multiple destroy heuristics. However, to determine promising destroy heuristics, MAPF-LNS requires a considerable amount of exploration time. As common destroy heuristics are non-adaptive, any performance bottleneck caused by these heuristics cannot be overcome via adaptive heuristic selection alone, thus limiting the overall effectiveness of MAPF-LNS in terms of solution cost. In this paper, we propose Adaptive Delay-based Destroy-and-Repair Enhanced with Success-based Self-Learning (ADDRESS) as a single-destroy-heuristic variant of MAPF-LNS. ADDRESS applies restricted Thompson Sampling to the top-K set of the most delayed agents to select a seed agent for adaptive LNS neighborhood generation. We evaluate ADDRESS in multiple maps from the MAPF benchmark set and demonstrate cost improvements by at least 50% in large-scale scenarios with up to a thousand agents, compared with the original MAPF-LNS and other state-of-the-art methods.

arxiv情報

著者	Thomy Phan,Benran Zhang,Shao-Hung Chan,Sven Koenig
発行日	2024-12-17 17:29:09+00:00
arxivサイト	arxiv_id(pdf)

提供元, 利用サービス

arxiv.jp, Google

Anytime Multi-Agent Path Finding with an Adaptive Delay-Based Heuristic

要約

要約(オリジナル)

arxiv情報

提供元, 利用サービス

最近の投稿

最近のコメント

アーカイブ

カテゴリー