Tendai Moyo
Reinforcement learning for game AI: the agent learned to beat the game in 2 hours! I used the PPO algorithm with curriculum learning. Reward shaping was the hardest part. The emergent behaviors are fascinating, including strategies I never thought of! Now I'm applying RL to inventory optimization. #reinforcementlearning #gameai #ml #ppo
1 month ago
