Széchenyi Plan Plus | Government of Hungary. Funded by the European Union. NextGeneration EU.

EN HU
  • Discover
    • News
    • Events
    • Report
  • Research & development
    • Areas of application
    • Research topics
  • Resources
    • Publications
    • Lead researchers
  • Partners
    • Consortium members
    • International partners
    • Industry contacts
    • University contacts
  1. Home
  2. Publications
Machine Learning & Knowledge Extraction (Vol. 7, No. 3) / 10 September 2025

MCTS-Based Policy Improvement for Reinforcement Learning

Curriculum Learning (CL) is a potent field in Machine Learning that provides several excellent techniques for enhancing the performance of the training process given the same data points, regardless of the training method used. In this research, we propose a novel Monte Carlo Tree Search (MCTS)-based technique that enhances model performance, articulating the utilization of MCTS in Curriculum Learning. The proposed approach leverages MCTS to optimize the sequence of batches during the training process. First, we demonstrate the application of our method in Reinforcement Learning, where sparse rewards often diminish convergence and deteriorate performance. By leveraging the strategic planning and exploration capabilities of MCTS, our method systematically identifies and selects trajectories that are more informative and have a higher potential to enhance policy improvement. This MCTS-guided batch optimization focuses the learning process on valuable experiences, accelerating convergence and improving overall performance. We evaluate our approach on standard RL benchmarks, demonstrating that it outperforms conventional batch selection methods regarding learning speed and policy effectiveness. The results highlight the potential of combining MCTS with CL to optimize batch selection, offering a promising direction for future research in efficient Reinforcement Learning.

Url
https://doi.org/10.3390/make7030098
Authors
Csippán, Gy.
Péter, I.
Kővári, B.
Bécsi, T.
Institutes

Kapcsolat

Prof. Dr. Péter Gáspár

H-1111 Budapest, Kende u. 13-17.

+36 1 279 6000

autonom@nemzetilabor.hu

© 2020-2023 National Laboratory for Autonomous Systems, Budapest