Széchenyi Plan Plus | Government of Hungary. Funded by the European Union. NextGeneration EU.

Electronics (Vol. 13 – Advancements in Cross-Disciplinary AI: Theory and Application – 2nd Edition) / 25 May 2024

Beyond Trial and Error: Lane Keeping with Monte Carlo Tree Search-Driven Optimization of Reinforcement Learning

In recent years, Reinforcement Learning (RL) has excelled in autonomous vehicle control, in part because it requires neither dedicated training data nor explicit identification of a mathematical model. In lane keeping in particular, different reward strategies yield a spectrum of realizable policies, and the challenge lies in identifying the behavior that maximizes performance. Traditional approaches train exhaustively, by trial and error, across all conceivable reward functions, a process that is notoriously time-consuming and costly. In contrast, Monte Carlo Tree Search (MCTS) can predict the quality of a reward function through Monte Carlo simulations, eliminating the need for exhaustive training on every available reward function. The results of the MCTS simulations can then be used to train only the most suitable RL models, which alters the training pipeline and alleviates the resource-heavy nature of traditional RL processes. This paper validates the theoretical framework behind this property of the MCTS algorithm, emphasizes its generality by demonstrating cross-algorithmic and cross-environmental capabilities, and showcases its potential to reduce training costs.
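The screening idea described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the toy one-dimensional lane model, the four candidate reward functions, the in-lane-time score, and all names and constants below are assumptions made for illustration. The planner is a simple open-loop UCT variant of MCTS, and only the best-scoring reward functions would be passed on to (much more expensive) RL training.

```python
import math
import random

ACTIONS = (-1, 0, 1)   # steer left, keep straight, steer right
LANE_LIMIT = 3         # |offset| above this is a lane departure (assumed)
HORIZON = 20           # steps per episode (assumed)
PLAN_DEPTH = 5         # MCTS look-ahead depth (assumed)

# Hypothetical candidate reward functions of the in-lane lateral offset.
REWARDS = {
    "linear_centering": lambda off: 1.0 - abs(off) / (LANE_LIMIT + 1),
    "center_bonus": lambda off: 1.0 if off == 0 else 0.0,
    "survival_only": lambda off: 1.0,
    "edge_seeking": lambda off: abs(off) / LANE_LIMIT,  # deliberately poor
}

def step(offset, action, rng):
    """Toy dynamics: steering command plus random lateral drift."""
    return offset + action + rng.choice((-1, 0, 1))

def departed(offset):
    return abs(offset) > LANE_LIMIT

def gain(offset, reward):
    """Reward is paid only while the vehicle is still in the lane."""
    return 0.0 if departed(offset) else reward(offset)

class Node:
    def __init__(self):
        self.visits = 0
        self.value = 0.0     # sum of simulated returns through this node
        self.children = {}   # action -> Node

def ucb(parent, action, c):
    child = parent.children[action]
    explore = math.sqrt(math.log(parent.visits) / child.visits)
    return child.value / child.visits + c * explore

def mcts_action(root_offset, depth, reward, rng, n_sim=60, c=1.4):
    """Open-loop UCT: nodes are action sequences, states are resampled."""
    root = Node()
    for _ in range(n_sim):
        node, offset, path, ret, d = root, root_offset, [root], 0.0, depth
        while d > 0 and not departed(offset):
            untried = [a for a in ACTIONS if a not in node.children]
            if untried:                       # expansion
                action = rng.choice(untried)
                node.children[action] = Node()
            else:                             # selection
                action = max(ACTIONS, key=lambda a: ucb(node, a, c))
            offset = step(offset, action, rng)
            ret += gain(offset, reward)
            node = node.children[action]
            path.append(node)
            d -= 1
            if untried:
                break
        while d > 0 and not departed(offset):  # random rollout
            offset = step(offset, rng.choice(ACTIONS), rng)
            ret += gain(offset, reward)
            d -= 1
        for visited in path:                   # backpropagation
            visited.visits += 1
            visited.value += ret
    return max(root.children, key=lambda a: root.children[a].visits)

def evaluate(reward, episodes=5, seed=0):
    """Score a reward function by the fraction of time MCTS stays in lane."""
    rng = random.Random(seed)
    in_lane_steps = 0
    for _ in range(episodes):
        offset = 0
        for t in range(HORIZON):
            depth = min(PLAN_DEPTH, HORIZON - t)
            offset = step(offset, mcts_action(offset, depth, reward, rng), rng)
            if departed(offset):
                break
            in_lane_steps += 1
    return in_lane_steps / (episodes * HORIZON)

def select_for_training(k=2):
    """Rank all candidates by MCTS score and keep the top k for RL training."""
    scores = {name: evaluate(fn) for name, fn in REWARDS.items()}
    ranked = sorted(scores, key=scores.get, reverse=True)
    return ranked[:k], scores
```

Calling `select_for_training(k=2)` returns the names of the two best-scoring candidates together with all scores. The exploration constant is not normalized to each reward's scale, which a more careful version would likely address.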

URL
https://doi.org/10.3390/electronics13112058
Authors
Kővári, B.
Pelenczei, B.
Knáb, I.
Bécsi, T.
Areas of application

Autonomous Road Vehicles

Contact

Prof. Dr. Péter Gáspár

H-1111 Budapest, Kende u. 13-17.

+36 1 279 6000

autonom@nemzetilabor.hu

© 2020-2023 National Laboratory for Autonomous Systems, Budapest