Learning to Optimize Permutation Flow Shop Scheduling via Graph-based Imitation Learning

要約

ジョブの最適な並べ替えを見つけることを目的とした並べ替えフローショップスケジューリング (PFSS) は、製造システムで広く使用されています。
大規模な PFSS 問題を解く場合、ヒューリスティックなどの従来の最適化アルゴリズムでは、解の精度と計算効率の両方の要求をほとんど満たすことができなかったため、最近では学習ベースの手法がより注目を集めています。
強化学習法によって問題を解決しようとする研究もありますが、トレーニング中に収束が遅いという問題があり、解決策に関してはまだ十分な精度が得られません。
そのために、より安定して正確に収束を加速する、専門家主導の模倣学習を通じてモデルをトレーニングすることを提案します。
さらに、入力ジョブのより良い特徴表現を抽出するために、グラフ構造をエンコーダーとして組み込みます。
広範な実験により、私たちが提案したモデルが大幅に向上し、最大 1000 のジョブを伴う大規模な問題において優れた一般化可能性を示すことが明らかになりました。
最先端の強化学習手法と比較して、私たちのモデルのネットワークパラメーターはわずか 37\% に減少し、エキスパートソリューションに対するモデルの解の差は平均 6.8\% から 1.3\% に減少しました。
。
コードは \url{https://github.com/longkangli/PFSS-IL} から入手できます。

要約(オリジナル)

The permutation flow shop scheduling (PFSS), aiming at finding the optimal permutation of jobs, is widely used in manufacturing systems. When solving large-scale PFSS problems, traditional optimization algorithms such as heuristics could hardly meet the demands of both solution accuracy and computational efficiency, thus learning-based methods have recently garnered more attention. Some work attempts to solve the problems by reinforcement learning methods, which suffer from slow convergence issues during training and are still not accurate enough regarding the solutions. To that end, we propose to train the model via expert-driven imitation learning, which accelerates convergence more stably and accurately. Moreover, in order to extract better feature representations of input jobs, we incorporate the graph structure as the encoder. The extensive experiments reveal that our proposed model obtains significant promotion and presents excellent generalizability in large-scale problems with up to 1000 jobs. Compared to the state-of-the-art reinforcement learning method, our model’s network parameters are reduced to only 37\% of theirs, and the solution gap of our model towards the expert solutions decreases from 6.8\% to 1.3\% on average. The code is available at: \url{https://github.com/longkangli/PFSS-IL}.

arxiv情報

著者	Longkang Li,Siyuan Liang,Zihao Zhu,Chris Ding,Hongyuan Zha,Baoyuan Wu
発行日	2023-12-14 17:59:55+00:00
arxivサイト	arxiv_id(pdf)

提供元, 利用サービス

arxiv.jp, Google

Learning to Optimize Permutation Flow Shop Scheduling via Graph-based Imitation Learning

要約

要約(オリジナル)

arxiv情報

提供元, 利用サービス

最近の投稿

最近のコメント

アーカイブ

カテゴリー