Learning a Generic Value-Selection Heuristic Inside a Constraint Programming Solver

要約

制約プログラミングは、組み合わせ問題を解決するための効率的なアプローチであることが知られています。
ソルバーにおける重要な設計上の選択は分岐ヒューリスティックです。これは、探索を最小限の時間で最適なソリューションに導くように設計されています。
ただし、これらのヒューリスティックの開発は時間のかかるプロセスであり、問題固有の専門知識が必要です。
この観察により、機械学習を使用して専門家の介入なしに効率的なヒューリスティックを自動的に学習する多くの取り組みが動機付けられました。
私たちの知る限り、これはまだ未解決の研究課題です。
いくつかの一般的な変数選択ヒューリスティックが文献で入手可能ですが、一般的な値選択ヒューリスティックのオプションはさらに希少です。
この論文では、制約プログラミングソルバー内で値選択ヒューリスティックを取得するために使用できる一般的な学習手順を導入することで、この問題に取り組むことを提案します。
これは、ディープ Q ラーニングアルゴリズム、調整された報酬信号、異種グラフニューラルネットワークアーキテクチャの組み合わせによって実現されました。
グラフの色分け、最大独立集合、および最大カット問題に関する実験では、私たちのフレームワークが汎用的でありながら、大量のバックトラックを必要とせずに、最適性に近いより良い解決策を見つけることができることを示しています。

要約(オリジナル)

Constraint programming is known for being an efficient approach for solving combinatorial problems. Important design choices in a solver are the branching heuristics, which are designed to lead the search to the best solutions in a minimum amount of time. However, developing these heuristics is a time-consuming process that requires problem-specific expertise. This observation has motivated many efforts to use machine learning to automatically learn efficient heuristics without expert intervention. To the best of our knowledge, it is still an open research question. Although several generic variable-selection heuristics are available in the literature, the options for a generic value-selection heuristic are more scarce. In this paper, we propose to tackle this issue by introducing a generic learning procedure that can be used to obtain a value-selection heuristic inside a constraint programming solver. This has been achieved thanks to the combination of a deep Q-learning algorithm, a tailored reward signal, and a heterogeneous graph neural network architecture. Experiments on graph coloring, maximum independent set, and maximum cut problems show that our framework is able to find better solutions close to optimality without requiring a large amounts of backtracks while being generic.

arxiv情報

著者	Tom Marty,Tristan François,Pierre Tessier,Louis Gauthier,Louis-Martin Rousseau,Quentin Cappart
発行日	2023-10-02 16:59:40+00:00
arxivサイト	arxiv_id(pdf)

提供元, 利用サービス

arxiv.jp, Google

Learning a Generic Value-Selection Heuristic Inside a Constraint Programming Solver

要約

要約(オリジナル)

arxiv情報

提供元, 利用サービス

最近の投稿

最近のコメント

アーカイブ

カテゴリー