Decision-Oriented Intervention Cost Prediction for Multi-robot Persistent Monitoring

要約

この論文では、あるクラスの配車ルート問題に対する微分可能な意思決定指向の学習手法を紹介します。
具体的には、無人航空機 (UAV) と無人地上車両 (UGV) のチームが環境を継続的に監視するシナリオを検討します。
UGV は、消耗した UAV を充電するために迂回するために人間によって時々引き継がれます。
目標は、介入コストを削減しながら環境を効率的に監視できるように、UGV のルートを選択することです。
前者は単調なサブモジュール関数としてモデル化されるのに対し、後者は UGV のルートの線形関数です。
前者は既知であるが、後者は学習する必要があるコンテキスト (風や地形の状態など) に依存するシナリオを考えます。
通常、最初にコスト関数の予測を学習し、次に最適化問題を解決します。
ただし、予測に使用される損失関数は、適切なルートを見つけるという最終目標と一致していない可能性があります。
私たちは、タスクの最適化を予測フェーズの微分可能層として組み込んだ \emph{意思決定指向学習} フレームワークを提案します。
タスクの最適化 (非単調サブモジュラー関数) を微分可能にするために、微分可能コストスケール貪欲アルゴリズムを提案します。
数値シミュレーションを通じて、提案されたフレームワークの有効性を実証します。
結果は、提案されたフレームワークが従来のアプローチよりも優れたパフォーマンスをもたらす可能性があることを示しています。

要約(オリジナル)

In this paper, we present a differentiable, decision-oriented learning technique for a class of vehicle routing problems. Specifically, we consider a scenario where a team of Unmanned Aerial Vehicles (UAVs) and Unmanned Ground Vehicles (UGVs) are persistently monitoring an environment. The UGVs are occasionally taken over by humans to take detours to recharge the depleted UAVs. The goal is to select routes for the UGVs so that they can efficiently monitor the environment while reducing the cost of interventions. The former is modeled as a monotone, submodular function whereas the latter is a linear function of the routes of the UGVs. We consider a scenario where the former is known but the latter depends on the context (e.g., wind and terrain conditions) that must be learned. Typically, we first learn to predict the cost function and then solve the optimization problem. However, the loss function used in prediction may be misaligned with our final goal of finding good routes. We propose a \emph{decision-oriented learning} framework that incorporates task optimization as a differentiable layer in the prediction phase. To make the task optimization (which is a non-monotone submodular function) differentiable, we propose the Differentiable Cost Scaled Greedy algorithm. We demonstrate the efficacy of the proposed framework through numerical simulations. The results show that the proposed framework can result in better performance than the traditional approach.

arxiv情報

著者	Guangyao Shi,Chak Lam Shek,Nare Karapetyan,Pratap Tokekar
発行日	2023-10-02 18:11:01+00:00
arxivサイト	arxiv_id(pdf)

提供元, 利用サービス

arxiv.jp, Google

Decision-Oriented Intervention Cost Prediction for Multi-robot Persistent Monitoring

要約

要約(オリジナル)

arxiv情報

提供元, 利用サービス

最近の投稿

最近のコメント

アーカイブ

カテゴリー