| Bounds and dynamics for empirical game theoretic analysis University of Alberta | Publication | 2019-12-01 | Karl Tuyls, Julien Perolat, Marc Lanctot, Edward Hughes, Richard Everett, Joel Z. Leibo, Csaba Szepesvári, Thore Graepel | |
| A modular analysis of adaptive (non-)convex optimization: Optimism, composite objectives, variance reduction, and variational bounds University of Alberta | Publication | 2020-02-01 | | |
| Mixing time estimation in reversible Markov chains from a single sample path University of Alberta | Publication | 2019-08-01 | Daniel Hsu, Aryeh Kontorovich, David A. Levin, Yuval Peres, Csaba Szepesvári, Geoffrey Wolfer | |
| Stochastic Optimization in a Cumulative Prospect Theory Framework University of Alberta | Publication | 2018-09-01 | | |
| A Linearly Relaxed Approximate Linear Program for Markov Decision Processes University of Alberta | Publication | 2018-04-01 | | |
| Sequential Learning for Multi-Channel Wireless Network Monitoring With Channel Switching Costs University of Alberta | Publication | 2014-11-01 | | |
| Guest Editors' introduction University of Alberta | Publication | 2014-01-01 | | |
| Pseudo-MDPs and factored linear action models University of Alberta | Publication | 2014-12-01 | | |
| Online Markov Decision Processes Under Bandit Feedback University of Alberta | Publication | 2014-03-01 | | |
| Partial Monitoring\textemdash Classification, Regret Bounds, and Algorithms University of Alberta | Publication | 2014-11-01 | | |
| Alignment based kernel learning with a continuous set of base kernels University of Alberta | Publication | 2013-05-01 | | |
| Toward a classification of finite partial-monitoring games University of Alberta | Publication | 2013-02-01 | | |
| Partial Monitoring with Side Information University of Alberta | Publication | 2012-01-01 | | |
| The grand challenge of computer Go University of Alberta | Publication | 2012-03-01 | Sylvain Gelly, Levente Kocsis, Marc Schoenauer, Michè le Sebag, David Silver, Csaba Szepesvári, Olivier Teytaud | |
| Regularized least-squares regression: Learning from a sequence University of Alberta | Publication | 2012-02-01 | | |
| Model selection in reinforcement learning University of Alberta | Publication | 2011-06-01 | | |
| Editors' Introduction University of Alberta | Publication | 2011-01-01 | | |
| Sequential learning for optimal monitoring of multi-channel wireless networks University of Alberta | Publication | 2011-04-01 | | |
| Extending rapidly-exploring random trees for asymptotically optimal anytime motion planning University of Alberta | Publication | 2010-10-01 | | |
| Algorithms for Reinforcement Learning University of Alberta | Publication | 2010-01-01 | | |
| Active learning in heteroscedastic noise University of Alberta | Publication | 2010-06-01 | | |
| Models of active learning in group-structured state spaces University of Alberta | Publication | 2010-04-01 | | |
| Training parsers by inverse reinforcement learning University of Alberta | Publication | 2009-04-01 | | |
| LMS -2: Towards an algorithm that is as cheap as LMS and almost as efficient as RLS University of Alberta | Publication | 2009-12-01 | | |
| Regularized Fitted Q-Iteration for planning in continuous-space Markovian decision problems University of Alberta | Publication | 2009-01-01 | | |
| Model-based and model-free reinforcement learning for visual servoing University of Alberta | Publication | 2009-05-01 | | |
| Learning to segment from a few well-selected training images University of Alberta | Publication | 2009-01-01 | | |
| Learning when to stop thinking and do something! University of Alberta | Publication | 2009-01-01 | | |
| Fast gradient-descent methods for temporal-difference learning with linear function approximation University of Alberta | Publication | 2009-01-01 | | |
| Exploration\textendash exploitation tradeoff using variance estimates in multi-armed bandits University of Alberta | Publication | 2009-04-01 | | |
| Active Learning of Group-Structured Environments University of Alberta | Publication | 2008-01-01 | | |
| Active Learning in Multi-armed Bandits University of Alberta | Publication | 2008-01-01 | | |
| Regularized Fitted Q-Iteration: Application to Planning University of Alberta | Publication | 2008-01-01 | | |
| Learning near-optimal policies with Bellman-residual minimization based fitted policy iteration and a single sample path University of Alberta | Publication | 2007-11-01 | | |
| Value-Iteration Based Fitted Policy Iteration: Learning with a Single Trajectory University of Alberta | Publication | 2007-04-01 | | |
| Manifold-adaptive dimension estimation University of Alberta | Publication | 2007-01-01 | | |
| RSPSA : Enhanced Parameter Optimization in Games University of Alberta | Publication | 2006-01-01 | | |
| Universal parameter optimisation in games based on SPSA University of Alberta | Publication | 2006-03-01 | | |
| Local Importance Sampling: A Novel Technique to Enhance Particle Filtering University of Alberta | Publication | 2006-04-01 | | |
| X-mHMM : An Efficient Algorithm for Training Mixtures of HMMs When the Number of Mixtures Is Unknown University of Alberta | Publication | 2018-01-01 | | |
| Log-optimal currency portfolios and control Lyapunov exponents University of Alberta | Publication | 2018-01-01 | | |
| Efficient object tracking in video sequences by means of LS -N-IPS University of Alberta | Publication | 2018-01-01 | | |
| Uncertainty, performance, and model dependency in approximate adaptive nonlinear control University of Alberta | Publication | 2000-01-01 | | |
| Computer Aided Diagnosis of Clustered Microcalcifications Using Artificial Neural Nets University of Alberta | Publication | 2000-01-01 | Erich Sorantin, Ferdinand Schmidt, Heinz Mayer, Michael Becker, Csaba Szepesvári, Ewald Graif, Peter Winkler | |
| A Unified Analysis of Value-Function-Based Reinforcement-Learning Algorithms University of Alberta | Publication | 1999-11-01 | | |
| An automatic method for the identification and interpretation of clustered microcalcifications in mammograms University of Alberta | Publication | 1999-01-01 | Ferdinand Schmidt, Erich Sorantin, Csaba Szepesvári, Ewald Graif, Michael Becker, Heinz Mayer, Karin Hartwagner | |
| Parallel and robust skeletonization built on self-organizing elements University of Alberta | Publication | 1999-01-01 | | |
| The SBASE protein domain library, release 6.0: a collection of annotated protein sequence segments University of Alberta | Publication | 1999-01-01 | | |
| An integrated architecture for motion-control and path-planning University of Alberta | Publication | 1998-12-01 | | |
| Neurocontroller using dynamic state feedback for compensatory control University of Alberta | Publication | 1997-12-01 | | |
| Robust control using inverse dynamics neurocontrollers University of Alberta | Publication | 1997-12-01 | | |
| Approximate geometry representations and sensory fusion University of Alberta | Publication | 1996-07-01 | | |
| SELF -ORGANIZING MULTI-RESOLUTION GRID FOR MOTION PLANNING AND CONTROL University of Alberta | Publication | 1996-12-01 | | |
| Behavior of an Adaptive Self-organizing Autonomous Agent Working with Cues and Competing Concepts University of Alberta | Publication | 1993-09-01 | | |
| Decision-theoretic Clustering of Strategies University of Alberta | Publication | 2015-01-01 | | |
| A Randomized Mirror Descent Algorithm for Large Scale Multiple Kernel Learning University of Alberta | Publication | 2013-06-01 | | |