List of works by Satinder Singh

A Nonlinear Predictive State Representation

scientific article published in January 2004

A Self-Tuning Actor-Critic Algorithm

scientific article published in November 2020

A social reinforcement learning agent

scholarly article published 2001

Action-Conditional Video Prediction using Deep Networks in Atari Games

An Efficient, Exact Algorithm for Solving Tree-Structured Graphical Games

An MDP-Based Approach to Online Mechanism Design

scientific article published in January 2004

Analytical Mean Squared Error Curves in Temporal Difference Learning

Approximately Efficient Online Mechanism Design

scientific article published in January 2005

Artificial intelligence: Learning to play Go from scratch

scientific article published in October 2017

Completing State Representations using Spectral Learning

Computational rationality: linking mechanism and behavior through bounded utility maximization

scientific article published on 20 March 2014

Confirming the theoretical structure of expert-developed text messages to improve adherence to anti-hypertensive medications

scientific article published on 3 October 2015

Constraint satisfaction algorithms for graphical games

scholarly article published 2007

Convergence of Stochastic Iterative Dynamic Programming Algorithms

scientific article published in January 1994

Deep Learning for Real-Time Atari Game Play Using Offline Monte-Carlo Tree Search Planning

scientific article published in January 2014

Discovering Reinforcement Learning Algorithms

scientific article published in November 2020

Discovery of Useful Questions as Auxiliary Tasks

Ecologically valid long-term mood monitoring of individuals with bipolar disorder using speech

scientific article

Experimental Results on Learning Stochastic Memoryless Policies for Partially Observable Markov Decision Processes

Finite-Sample Convergence Rates for Q-Learning and Indirect Algorithms

scholarly article by Michael J. Kearns & Satinder Singh published 1999 in Advances in Neural Information Processing Systems 11

Hindsight Credit Assignment

How to Dynamically Merge Markov Decision Processes

scientific article published in January 1998

Improved Switching among Temporally Abstract Actions

Improving Policies without Measuring Merits

scientific article published in January 1996

Intrinsically Motivated Reinforcement Learning

scientific article published in January 2005

Learning and discovery of predictive state representations in dynamical systems with reset

scientific article published in 2004

Learning payoff functions in infinite games

Learning to Play No-Press Diplomacy with Best Response Policy Iteration

Linking Context to Evaluation in the Design of Safety Critical Interfaces

Mastering the game of Stratego with model-free multiagent reinforcement learning

scientific article published on 2 December 2022

Maximizing the value of mobile health monitoring by avoiding redundant patient reports: prediction of depression-related symptoms and adherence problems in automated health assessment services

scientific article published on 5 July 2013

Meta-Gradient Reinforcement Learning with an Objective Discovered Online

scientific article published in November 2020

Modeling Information Diffusion in Networks with Unobserved Links

No-Press Diplomacy: Modeling Multi-Agent Gameplay

Off-policy Learning with Options and Recognizers

scientific article published in January 2006

On Efficiency in Hierarchical Reinforcement Learning

scientific article published in November 2020

On Learning Intrinsic Rewards for Policy Gradient Methods

Optimizing Admission Control while Ensuring Quality of Service in Multimedia Networks via Reinforcement Learning

scholarly article by Timothy X. Brown et al. published 1999 in Advances in Neural Information Processing Systems 11

Patient-Centered Pain Care Using Artificial Intelligence and Mobile Health Tools: Protocol for a Randomized Study Funded by the US Department of Veterans Affairs Health Services Research and Development Program

scientific article published on 7 April 2016

Policy Gradient Methods for Reinforcement Learning with Function Approximation

scholarly article by Richard S. Sutton et al. published 2000 in Advances in Neural Information Processing Systems 12

Predicting Lifetimes in Dynamically Allocated Memory

Reinforcement Learning Algorithm for Partially Observable Markov Decision Problems

scientific article published in January 1995

Reinforcement Learning for Dynamic Channel Allocation in Cellular Telephone Systems

scholarly article by Satinder Singh & Dimitri P. Bertsekas published 1997 in Advances in Neural Information Processing Systems 9

Reinforcement Learning for Spoken Dialogue Systems

scholarly article by Satinder Singh et al. published 2000 in Advances in Neural Information Processing Systems 12

Reinforcement Learning with Soft State Aggregation

scientific article published in January 1995

Repeated Inverse Reinforcement Learning

scientific article published in January 2017

Reward Design via Online Gradient Ascent

article published in 2010

Reward Mapping for Transfer in Long-Lived Agents

Robust Reinforcement Learning in Motion Planning

scientific article published in January 1994

Simple Local Models for Complex Dynamical Systems

article by Erik Talvitie & Satinder Singh published 2009 in Advances in Neural Information Processing Systems 21

Strategic Interactions in the TAC 2003 Supply Chain Tournament

The Efficient Learning of Multiple Task Sequences

scientific article published in January 1992

The Value Equivalence Principle for Model-Based Reinforcement Learning

scientific article published in November 2020

The adaptive nature of eye movements in linguistic tasks: how payoff and architecture shape speed-accuracy trade-offs

scientific article

The potential impact of intelligent systems for mobile health self-management support: Monte Carlo simulations of text message support for medication adherence

scientific article

Utility maximization and bounds on human information processing

scientific article published on 20 March 2014

Value Prediction Network

scientific article published in January 2017