Gaussian Process Aggregation for Root-Parallel Monte Carlo Tree Search with Continuous Actions
2025年12月10日
4 authors
摘要
Monte Carlo Tree Search is a cornerstone algorithm for online planning, and its root-parallel variant is widely used when wall clock time is limited but best performance is desired. In environments with continuous action spaces, how to best aggregate statistics from different threads is an important yet underexplored question. In this work, we introduce a method that uses Gaussian Process Regression to obtain value estimates for promising actions that were not trialed in the environment. We perform a systematic evaluation across 6 different domains, demonstrating that our approach outperforms existing aggregation strategies while requiring a modest increase in inference time.
分类
作者
Junlin XiaoVictor-Alexandru DarvariuBruno LacerdaNick Hawes