Bounds and dynamics for empirical game theoretic analysis University of Alberta | Publication | 2019-12-01 | Karl Tuyls, Julien Perolat, Marc Lanctot, Edward Hughes, Richard Everett, Joel Z. Leibo, Csaba Szepesvári, Thore Graepel |
A modular analysis of adaptive (non-)convex optimization: Optimism, composite objectives, variance reduction, and variational bounds University of Alberta | Publication | 2020-02-01 | |
Mixing time estimation in reversible Markov chains from a single sample path University of Alberta | Publication | 2019-08-01 | Daniel Hsu, Aryeh Kontorovich, David A. Levin, Yuval Peres, Csaba Szepesvári, Geoffrey Wolfer |
Stochastic Optimization in a Cumulative Prospect Theory Framework University of Alberta | Publication | 2018-09-01 | |
A Linearly Relaxed Approximate Linear Program for Markov Decision Processes University of Alberta | Publication | 2018-04-01 | |
Sequential Learning for Multi-Channel Wireless Network Monitoring With Channel Switching Costs University of Alberta | Publication | 2014-11-01 | |
Guest Editors' introduction University of Alberta | Publication | 2014-01-01 | |
Pseudo-MDPs and factored linear action models University of Alberta | Publication | 2014-12-01 | |
Online Markov Decision Processes Under Bandit Feedback University of Alberta | Publication | 2014-03-01 | |
Partial Monitoring - Classification, Regret Bounds, and Algorithms University of Alberta | Publication | 2014-11-01 | |
Alignment based kernel learning with a continuous set of base kernels University of Alberta | Publication | 2013-05-01 | |
Toward a classification of finite partial-monitoring games University of Alberta | Publication | 2013-02-01 | |
Partial Monitoring with Side Information University of Alberta | Publication | 2012-01-01 | |
The grand challenge of computer Go University of Alberta | Publication | 2012-03-01 | Sylvain Gelly, Levente Kocsis, Marc Schoenauer, Michèle Sebag, David Silver, Csaba Szepesvári, Olivier Teytaud |
Regularized least-squares regression: Learning from a sequence University of Alberta | Publication | 2012-02-01 | |
Model selection in reinforcement learning University of Alberta | Publication | 2011-06-01 | |
Editors' Introduction University of Alberta | Publication | 2011-01-01 | |
Sequential learning for optimal monitoring of multi-channel wireless networks University of Alberta | Publication | 2011-04-01 | |
Extending rapidly-exploring random trees for asymptotically optimal anytime motion planning University of Alberta | Publication | 2010-10-01 | |
Algorithms for Reinforcement Learning University of Alberta | Publication | 2010-01-01 | |
Active learning in heteroscedastic noise University of Alberta | Publication | 2010-06-01 | |
Models of active learning in group-structured state spaces University of Alberta | Publication | 2010-04-01 | |
Training parsers by inverse reinforcement learning University of Alberta | Publication | 2009-04-01 | |
LMS-2: Towards an algorithm that is as cheap as LMS and almost as efficient as RLS University of Alberta | Publication | 2009-12-01 | |
Regularized Fitted Q-Iteration for planning in continuous-space Markovian decision problems University of Alberta | Publication | 2009-01-01 | |
Model-based and model-free reinforcement learning for visual servoing University of Alberta | Publication | 2009-05-01 | |
Learning to segment from a few well-selected training images University of Alberta | Publication | 2009-01-01 | |
Learning when to stop thinking and do something! University of Alberta | Publication | 2009-01-01 | |
Fast gradient-descent methods for temporal-difference learning with linear function approximation University of Alberta | Publication | 2009-01-01 | |
Exploration–exploitation tradeoff using variance estimates in multi-armed bandits University of Alberta | Publication | 2009-04-01 | |
Active Learning of Group-Structured Environments University of Alberta | Publication | 2008-01-01 | |
Active Learning in Multi-armed Bandits University of Alberta | Publication | 2008-01-01 | |
Regularized Fitted Q-Iteration: Application to Planning University of Alberta | Publication | 2008-01-01 | |
Learning near-optimal policies with Bellman-residual minimization based fitted policy iteration and a single sample path University of Alberta | Publication | 2007-11-01 | |
Value-Iteration Based Fitted Policy Iteration: Learning with a Single Trajectory University of Alberta | Publication | 2007-04-01 | |
Manifold-adaptive dimension estimation University of Alberta | Publication | 2007-01-01 | |
RSPSA: Enhanced Parameter Optimization in Games University of Alberta | Publication | 2006-01-01 | |
Universal parameter optimisation in games based on SPSA University of Alberta | Publication | 2006-03-01 | |
Local Importance Sampling: A Novel Technique to Enhance Particle Filtering University of Alberta | Publication | 2006-04-01 | |
X-mHMM: An Efficient Algorithm for Training Mixtures of HMMs When the Number of Mixtures Is Unknown University of Alberta | Publication | 2018-01-01 | |
Log-optimal currency portfolios and control Lyapunov exponents University of Alberta | Publication | 2018-01-01 | |
Efficient object tracking in video sequences by means of LS-N-IPS University of Alberta | Publication | 2018-01-01 | |
Uncertainty, performance, and model dependency in approximate adaptive nonlinear control University of Alberta | Publication | 2000-01-01 | |
Computer Aided Diagnosis of Clustered Microcalcifications Using Artificial Neural Nets University of Alberta | Publication | 2000-01-01 | Erich Sorantin, Ferdinand Schmidt, Heinz Mayer, Michael Becker, Csaba Szepesvári, Ewald Graif, Peter Winkler |
A Unified Analysis of Value-Function-Based Reinforcement-Learning Algorithms University of Alberta | Publication | 1999-11-01 | |
An automatic method for the identification and interpretation of clustered microcalcifications in mammograms University of Alberta | Publication | 1999-01-01 | Ferdinand Schmidt, Erich Sorantin, Csaba Szepesvári, Ewald Graif, Michael Becker, Heinz Mayer, Karin Hartwagner |
Parallel and robust skeletonization built on self-organizing elements University of Alberta | Publication | 1999-01-01 | |
The SBASE protein domain library, release 6.0: a collection of annotated protein sequence segments University of Alberta | Publication | 1999-01-01 | |
An integrated architecture for motion-control and path-planning University of Alberta | Publication | 1998-12-01 | |
Neurocontroller using dynamic state feedback for compensatory control University of Alberta | Publication | 1997-12-01 | |
Robust control using inverse dynamics neurocontrollers University of Alberta | Publication | 1997-12-01 | |
Approximate geometry representations and sensory fusion University of Alberta | Publication | 1996-07-01 | |
SELF-ORGANIZING MULTI-RESOLUTION GRID FOR MOTION PLANNING AND CONTROL University of Alberta | Publication | 1996-12-01 | |
Behavior of an Adaptive Self-organizing Autonomous Agent Working with Cues and Competing Concepts University of Alberta | Publication | 1993-09-01 | |
Decision-theoretic Clustering of Strategies University of Alberta | Publication | 2015-01-01 | |
A Randomized Mirror Descent Algorithm for Large Scale Multiple Kernel Learning University of Alberta | Publication | 2013-06-01 | |