author: Sergey Levine Bernhard Schölkopf Zoubin Ghahramani
https://papers.nips.cc/paper/6974-interpolated-policy-gradient-merging-on-policy-and-off-policy-gradient-estimation-for-deep-reinforcement-learning.pdf
Paulina is supported by:
About Paulina
Help