author: Jan Leike, Dario Amodei, Geoffrey Irving, Shane Legg
https://papers.nips.cc/paper/8025-reward-learning-from-human-preferences-and-demonstrations-in-atari.pdf