Self-Supervision is All You Need for Solving Rubik’s Cube

要約

既存の組み合わせ検索方法は複雑なことが多く、ある程度の専門知識が必要です。
この研究では、ルービックキューブに代表される、事前定義された目標を持つ組み合わせ問題を解決するためのシンプルで効率的な深層学習手法を紹介します。
このような問題に対しては、目標状態から分岐するランダムスクランブルでディープニューラルネットワークをトレーニングするだけで、最適に近い解決策を達成するのに十分であることを示します。
Rubik’s Cube、15 Puzzle、および 7$\times$7 Lights Out でテストしたところ、トレーニングデータが大幅に少ないにもかかわらず、私たちの手法は以前の最先端の手法 DeepCubeA を上回り、ソリューションの最適性と計算コストの間のトレードオフが改善されました。
。
さらに、モデルのサイズとトレーニングデータ量に関するルービックキューブソルバーのスケーリング則を調査します。

要約(オリジナル)

Existing combinatorial search methods are often complex and require some level of expertise. This work introduces a simple and efficient deep learning method for solving combinatorial problems with a predefined goal, represented by Rubik’s Cube. We demonstrate that, for such problems, training a deep neural network on random scrambles branching from the goal state is sufficient to achieve near-optimal solutions. When tested on Rubik’s Cube, 15 Puzzle, and 7$\times$7 Lights Out, our method outperformed the previous state-of-the-art method DeepCubeA, improving the trade-off between solution optimality and computational cost, despite significantly less training data. Furthermore, we investigate the scaling law of our Rubik’s Cube solver with respect to model size and training data volume.

arxiv情報

著者	Kyo Takano
発行日	2023-05-23 17:55:46+00:00
arxivサイト	arxiv_id(pdf)

提供元, 利用サービス

arxiv.jp, Google

Self-Supervision is All You Need for Solving Rubik’s Cube

要約

要約(オリジナル)

arxiv情報

提供元, 利用サービス

最近の投稿

最近のコメント

アーカイブ

カテゴリー