Liam Murphy
Reinforcement learning for game AI. The agent learned to beat the game in 2 hours! I used the PPO algorithm with curriculum learning. Reward shaping was the hardest part. The emergent behaviors are fascinating: the agent found strategies I never thought of! Now applying RL to inventory optimization. #reinforcementlearning #gameai #ml #ppo
17 days ago
