Search filters

Authors whose works are in public domain in at least one jurisdiction

List of works by Csaba Szepesvári

CoinDICE: Off-Policy Confidence Interval Estimation

Combinatorial Cascading Bandits

Detecting Overfitting via Adversarial Examples

Differentiable Meta-Learning of Bandit Policies

scientific article published in November 2020

Efficient Planning in Large MDPs with Weak Linear Function Approximation

Escaping the Gravitational Pull of Softmax

Following the Leader and Fast Rates in Linear Prediction: Curved Constraint Sets and Other Regularities

scientific article published in January 2016

ImpatientCapsAndRuns: Approximately Optimal Algorithm Configuration from an Infinite Pool

Linear Multi-Resource Allocation with Semi-Bandit Feedback

Mixing Time Estimation in Reversible Markov Chains from a Single Sample Path

Model Selection in Contextual Stochastic Bandit Problems

scientific article published in November 2020

Multi-view Matrix Factorization for Linear Dynamical System Estimation

scientific article published in January 2017

Online Algorithm for Unsupervised Sequential Selection with Contextual Information

scientific article published in November 2020

Online Learning in Markov Decision Processes with Adversarially Chosen Transition Probability Distributions

Online Learning with Gaussian Payoffs and Side Observations

PAC-Bayes Analysis Beyond the Usual Bounds

scientific article published in November 2020

PAC-Bayes bounds for stable algorithms with instance-dependent priors

scholarly article by Omar Rivasplata et al published 2018 in Advances in Neural Information Processing Systems 31

SDP Relaxation with Randomized Rounding for Energy Disaggregation

scientific article published in January 2016

Think out of the \"Box\": Generically-Constrained Asynchronous Composite Optimization and Hedging

article by Pooria Joulani et al published 2019 in Advances in Neural Information Processing Systems 32

TopRank: A practical algorithm for online stochastic ranking

Universal Option Models

scientific article published in January 2014

Variational Policy Gradient Method for Reinforcement Learning with General Utilities

scientific article published in November 2020