[OpenAI] Reinforcement Learning with Prediction-Based Rewards

Nov 1, 2018

quote

We’ve developed Random Network Distillation (RND), a prediction-based method for encouraging reinforcement learning agents to explore their environments through curiosity, which for the first time[1]

There is an anonymous ICLR submission concurrent with our own work which exceeds human performance, though not to the same extent.

https://openai.com/blog/reinforcement-learning-with-prediction-based-rewards/

← Back to all articles Quick Navigation: Next:[ j ] – Prev:[ k ] – List:[ l ]

Brain Networks Laboratory (Choe Lab)

[OpenAI] Reinforcement Learning with Prediction-Based Rewards

quote