Kimura, Tomoaki, The University of Electro-Communications
Vol 9, No 1 (2020) - Articles
Development of AlphaZero-based Reinforcement Learning Algorithm for Solving Partially Observable Markov Decision Process (POMDP) Problem
ISSN: 2186-5140