Alignment based kernel learning with a continuous set of base kernels University of Alberta | Publication | 2013-05-01 | |
Partition Tree Weighting University of Alberta | Publication | 2013-01-01 | |
The Hanabi challenge: A new frontier for AI research University of Alberta | Publication | 2020-03-01 | Nolan Bard, Jakob N. Foerster, Sarath Chandar, Neil Burch, Marc Lanctot, H. Francis Song, Emilio Parisotto, Vincent Dumoulin, Subhodeep Moitra, Edward Hughes, Iain Dunning, Shibl Mourad, Hugo Larochelle, Marc G. Bellemare, Michael Bowling |
Heads-up limit hold'em poker is solved University of Alberta | Publication | 2017-10-01 | |
DeepStack: Expert-level artificial intelligence in heads-up no-limit poker University of Alberta | Publication | 2017-03-01 | Matej Moravčík, Martin Schmid, Neil Burch, Viliam Lisý, Dustin Morrill, Nolan Bard, Trevor Davis, Kevin Waugh, Michael Johanson, Michael Bowling |
Do poker players know how good they are? Accuracy of poker skill estimation in online and offline players University of Alberta | Publication | 2014-02-01 | |
Multidisciplinary students and instructors University of Alberta | Publication | 2008-01-01 | |
Robust game play against unknown opponents University of Alberta | Publication | 2006-01-01 | |
Learning predictive state representations using non-blind policies University of Alberta | Publication | 2006-01-01 | Michael Bowling, Peter McCracken, Michael James, James Neufeld, Dana Wilkinson |
Game-Tree Search with Adaptation in Stochastic Imperfect-Information Games University of Alberta | Publication | 2006-01-01 | |
Multi-Agent Planning in the Presence of Multiple Goals University of Alberta | Publication | 2006-02-01 | |
Action respecting embedding University of Alberta | Publication | 2005-01-01 | |
STP: Skills, tactics, and plays for multi-robot control in adversarial environments University of Alberta | Publication | 2005-02-01 | |
Multiagent learning using a variable learning rate University of Alberta | Publication | 2002-04-01 | |
The CMUnited-98 champion small-robot team University of Alberta | Publication | 1998-01-01 | |
Optimal estimation of multivariate ARMA models University of Alberta | Publication | 2015-01-01 | |
Efficient Nash Equilibrium Approximation through Monte Carlo Counterfactual Regret Minimization University of Alberta | Publication | 2011-12-01 | Michael B. Johanson, Nolan D. Bard, Michael Bowling, Marc R. Lanctot, Richard G. Gibson |
Finding Optimal Abstract Strategies in Extensive Form Games University of Alberta | Publication | 2012-03-01 | |
Online Implicit Agent Modelling University of Alberta | Publication | 2013-05-01 | |
Evaluating State-Space Abstractions in Extensive-Form Games University of Alberta | Publication | 2013-05-01 | |
Solving Imperfect Information Games Using Decomposition University of Alberta | Publication | 2014-04-01 | |
Asymmetric Abstractions for Adversarial Settings University of Alberta | Publication | 2014-05-01 | |
Heads-up Limit Hold'em Poker is Solved University of Alberta | Publication | 2015-01-01 | |
Solving Heads-Up Limit Texas Hold'em University of Alberta | Publication | 2015-04-01 | |
No-Regret Learning in Extensive-Form Games with Imperfect Recall University of Alberta | Publication | 2012-04-01 | |
Using Response Functions to Measure Strategy Strength University of Alberta | Publication | 2014-04-01 | |
AIVAT: A New Variance Reduction Technique for Agent Evaluation in Imperfect Information Games University of Alberta | Publication | 2018-02-01 | Martin Schmid, Neil E. Burch, Matej Moravcik, Dustin R. Morrill, Michael Bowling |
Decision-theoretic Clustering of Strategies University of Alberta | Publication | 2015-01-01 | |
Variance reduction via antithetic Markov chains University of Alberta | Publication | 2015-05-01 | |
The Forget-me-not Process University of Alberta | Publication | 2016-12-01 | Kieran Milan, Joel Veness, James Kirkpatrick, Michael Bowling, Anna Koop, Demis Hassabis |
Baseline: Practical Control Variates for Agent Evaluation in Zero-Sum Domains University of Alberta | Publication | 2013-05-01 | |
Investigating Contingency Awareness using Atari 2600 Games University of Alberta | Publication | 2012-03-01 | |
A Laplacian Framework for Option Discovery in Reinforcement Learning University of Alberta | Publication | 2017-08-01 | |
Variance Reduction in Monte Carlo Tree Search University of Alberta | Publication | 2011-08-01 | |
Online Monte Carlo Counterfactual Regret Minimization for Search in Imperfect Information Games University of Alberta | Publication | 2015-01-01 | |
Solving Games with Functional Regret Estimation University of Alberta | Publication | 2014-11-01 | |
Tractable Objectives for Robust Policy Optimization University of Alberta | Publication | 2012-12-01 | |
Automating Collusion Detection in Sequential Games University of Alberta | Publication | 2013-07-01 | |
Improving Exploration in UCT Using Local Manifolds University of Alberta | Publication | 2014-11-01 | |
Counterfactual Regret Minimization in Sequential Security Games University of Alberta | Publication | 2015-11-01 | |
Solving Large Extensive-Form Games with Strategy Constraints University of Alberta | Publication | 2019-01-01 | |
Policy Tree: Adaptive Representation for Policy Gradient University of Alberta | Publication | 2014-11-01 | |
State of the Art Control of Atari Games Using Shallow Reinforcement Learning University of Alberta | Publication | 2016-01-01 | |
Learning Purposeful Behaviour in the Absence of Rewards University of Alberta | Publication | 2016-05-01 | |
Revisiting the Arcade Learning Environment: Evaluation Protocols and Open Problems for General Agents University of Alberta | Publication | 2018-03-01 | Marlos C. Machado, Marc Bellemare, Erik Talvitie, Joel Veness, Matthew Hausknecht, Michael Bowling |
Monte Carlo Tree Search in Continuous Action Spaces with Execution Uncertainty University of Alberta | Publication | 2016-04-01 | |
A Randomized Mirror Descent Algorithm for Large Scale Multiple Kernel Learning University of Alberta | Publication | 2013-06-01 | |
Context Tree Switching University of Alberta | Publication | 2011-12-01 | |
The Arcade Learning Environment: An Evaluation Platform for General Agents University of Alberta | Publication | 2013-06-01 | |
Sketch-Based Linear Value Function Approximation University of Alberta | Publication | 2012-12-01 | |
Bayesian Learning of Recursively Factored Environments University of Alberta | Publication | 2013-06-01 | |
Equilibrium Approximation Quality of Current No-Limit Poker Bots University of Alberta | Publication | 2017-02-01 | |
Variance Reduction in Monte Carlo Regret Minimization for Extensive Games using Baselines University of Alberta | Publication | 2018-10-01 | Martin Schmid, Neil Burch, Marc Lanctot, Matej Moravcik, Rudolf Kadlec, Michael Bowling |
On Local Regret University of Alberta | Publication | 2012-04-01 | |
Linear Fitted-Q Iteration with Multiple Reward Functions University of Alberta | Publication | 2012-11-01 | |
Subset Selection of Search Heuristics University of Alberta | Publication | 2013-08-01 | |
Approximate Linear Programming for Constrained Partially Observable Markov Decision Processes University of Alberta | Publication | 2014-11-01 | Pascal Poupart, Aarti Malhotra, Pei Pei, Kee-Eung Kim, Bongseok Goh, Michael Bowling |
Computing Treatments for Type-1 Diabetes Mellitus University of Alberta | Publication | 2013-01-01 | |
Bayesian Action Decoder for Deep Multi-Agent Reinforcement Learning University of Alberta | Publication | 2019-06-01 | Jakob N Foerster, H. Francis Song, Edward Hughes, Neil Burch, Iain Dunning, Shimon Whiteson, Matthew Botvinick, Michael Bowling |
Actor-Critic Policy Optimization in Partially Observable Multiagent Environments University of Alberta | Publication | 2018-12-01 | Sriram Srinivasan, Marc Lanctot, Vinicius Flores Zambaldi, Julien Perolat, Karl Tuyls, Remi Munos, Michael Bowling |