Search filters

Reinforcement Learning in Factored MDPs: Oracle-Efficient Algorithms and Tighter Regret Bounds for the Non-Episodic Setting

Image Image of a generic work. The text above it indicates that there is no free image of the work available, and that if you own one, you can click on the placeholder link to upload it.
Description scholarly article by Ziping Xu & Ambuj Tewari published November 2020 in Advances in Neural Information Processing Systems 33
Author/s

author: Ambuj Tewari 

Publication date November 2020
Language English
Country of origin
Wikipedia link
Access work

https://proceedings.neurips.cc/paper/2020/file/d3b1fb02964aa64e257f9f26a31f72cf-Paper.pdf

Copyright status
Missing/wrong data? Edit Wikidata item