You have to test out these very long sequences of moves ... More information: Ali Shehper et al, What makes math problems hard for reinforcement learning: a case study, arXiv (2024).