Szepesvári, Csaba

Profile

Csaba Szepesvári, an Amii Fellow and a Professor at the Department of Computing Science at the University of Alberta, and the team-lead for the "Foundations" team at DeepMind focuses on the development of principled approaches to AI with the use machine learning. He has extensive industrial and academic experience and holds various editorial positions at leading journals and conferences. He is the co-inventor of UCT, a widely successful Monte-Carlo tree search algorithm, which ignited much work in AI, and won the 2016 test-of-time award at ECML/PKDD. He has over 200 publications at first-tier conferences and journals, and has been the PC chair of both major machine learning theory conferences, in addition to serving as the action editor of the major machine learning journals.

Outputs

Title	Category	Date	Authors
Bounds and dynamics for empirical game theoretic analysis University of Alberta	Publication	2019-12-01	Karl Tuyls, Julien Perolat, Marc Lanctot, Edward Hughes, Richard Everett, Joel Z. Leibo, Csaba Szepesvári, Thore Graepel
A modular analysis of adaptive (non-)convex optimization: Optimism, composite objectives, variance reduction, and variational bounds University of Alberta	Publication	2020-02-01	Pooria Joulani, Andrá s György, Csaba Szepesvári
Mixing time estimation in reversible Markov chains from a single sample path University of Alberta	Publication	2019-08-01	Daniel Hsu, Aryeh Kontorovich, David A. Levin, Yuval Peres, Csaba Szepesvári, Geoffrey Wolfer
Stochastic Optimization in a Cumulative Prospect Theory Framework University of Alberta	Publication	2018-09-01	Cheng Jie, Prashanth L.A., Michael Fu, Steve Marcus, Csaba Szepesvári
A Linearly Relaxed Approximate Linear Program for Markov Decision Processes University of Alberta	Publication	2018-04-01	Chandrashekar Lakshminarayanan, Shalabh Bhatnagar, Csaba Szepesvári
Sequential Learning for Multi-Channel Wireless Network Monitoring With Channel Switching Costs University of Alberta	Publication	2014-11-01	Thanh Le, Csaba Szepesvári, Rong Zheng
Guest Editors' introduction University of Alberta	Publication	2014-01-01	Jyrki Kivinen, Csaba Szepesvári, Thomas Zeugmann
Pseudo-MDPs and factored linear action models University of Alberta	Publication	2014-12-01	Hengshuai Yao, Csaba Szepesvári, Bernardo Avila Pires, Xinhua Zhang
Online Markov Decision Processes Under Bandit Feedback University of Alberta	Publication	2014-03-01	Gergely Neu, Andras Gyorgy, Csaba Szepesvári, Andras Antos
Partial Monitoring\textemdash Classification, Regret Bounds, and Algorithms University of Alberta	Publication	2014-11-01	Gá bor Bartók, Dean P. Foster, Dávid Pál, Alexander Rakhlin, Csaba Szepesvári
Alignment based kernel learning with a continuous set of base kernels University of Alberta	Publication	2013-05-01	Arash Afkanpour, Csaba Szepesvári, Michael Bowling
Toward a classification of finite partial-monitoring games University of Alberta	Publication	2013-02-01	Andrá s Antos, Gábor Bartók, Dávid Pál, Csaba Szepesvári
Partial Monitoring with Side Information University of Alberta	Publication	2012-01-01	Gá bor Bartók, Csaba Szepesvári
The grand challenge of computer Go University of Alberta	Publication	2012-03-01	Sylvain Gelly, Levente Kocsis, Marc Schoenauer, Michè le Sebag, David Silver, Csaba Szepesvári, Olivier Teytaud
Regularized least-squares regression: Learning from a sequence University of Alberta	Publication	2012-02-01	Amir-massoud Farahmand, Csaba Szepesvári
Model selection in reinforcement learning University of Alberta	Publication	2011-06-01	Amir-massoud Farahmand, Csaba Szepesvári
Editors' Introduction University of Alberta	Publication	2011-01-01	Jyrki Kivinen, Csaba Szepesvári, Esko Ukkonen, Thomas Zeugmann
Sequential learning for optimal monitoring of multi-channel wireless networks University of Alberta	Publication	2011-04-01	Pallavi Arora, Csaba Szepesvári, Rong Zheng
Extending rapidly-exploring random trees for asymptotically optimal anytime motion planning University of Alberta	Publication	2010-10-01	Yasin Abbasi-Yadkori, Joseph Modayil, Csaba Szepesvári
Algorithms for Reinforcement Learning University of Alberta	Publication	2010-01-01	Csaba Szepesvári
Active learning in heteroscedastic noise University of Alberta	Publication	2010-06-01	Andrá s Antos, Varun Grover, Csaba Szepesvári
Models of active learning in group-structured state spaces University of Alberta	Publication	2010-04-01	Gá bor Bartók, Csaba Szepesvári, Sandra Zilles
Training parsers by inverse reinforcement learning University of Alberta	Publication	2009-04-01	Gergely Neu, Csaba Szepesvári
LMS -2: Towards an algorithm that is as cheap as LMS and almost as efficient as RLS University of Alberta	Publication	2009-12-01	Hengshuai Yao, Shalabh Bhatnagar, Csaba Szepesvári
Regularized Fitted Q-Iteration for planning in continuous-space Markovian decision problems University of Alberta	Publication	2009-01-01	Amir massoud Farahmand, Mohammad Ghavamzadeh, Csaba Szepesvári, Shie Mannor
Model-based and model-free reinforcement learning for visual servoing University of Alberta	Publication	2009-05-01	A.M. Farahmand, A. Shademan, Martin Jagersand, Csaba Szepesvári
Learning to segment from a few well-selected training images University of Alberta	Publication	2009-01-01	Alireza Farhangfar, Russell Greiner, Csaba Szepesvári
Learning when to stop thinking and do something! University of Alberta	Publication	2009-01-01	Barnabá s Póczos, Yasin Abbasi-Yadkori, Csaba Szepesvári, Russell Greiner, Nathan Sturtevant
Fast gradient-descent methods for temporal-difference learning with linear function approximation University of Alberta	Publication	2009-01-01	Richard Sutton, Hamid Reza Maei, Doina Precup, Shalabh Bhatnagar, David Silver, Csaba Szepesvári, Eric Wiewiora
Exploration\textendash exploitation tradeoff using variance estimates in multi-armed bandits University of Alberta	Publication	2009-04-01	Jean-Yves Audibert, Ré mi Munos, Csaba Szepesvári
Active Learning of Group-Structured Environments University of Alberta	Publication	2008-01-01	Gá bor Bartók, Csaba Szepesvári, Sandra Zilles
Active Learning in Multi-armed Bandits University of Alberta	Publication	2008-01-01	Andrá s Antos, Varun Grover, Csaba Szepesvári
Regularized Fitted Q-Iteration: Application to Planning University of Alberta	Publication	2008-01-01	Amir massoud Farahmand, Mohammad Ghavamzadeh, Csaba Szepesvári, Shie Mannor
Learning near-optimal policies with Bellman-residual minimization based fitted policy iteration and a single sample path University of Alberta	Publication	2007-11-01	Andrá s Antos, Csaba Szepesvári, Rémi Munos
Value-Iteration Based Fitted Policy Iteration: Learning with a Single Trajectory University of Alberta	Publication	2007-04-01	Andras Antos, Csaba Szepesvári, Remi Munos
Manifold-adaptive dimension estimation University of Alberta	Publication	2007-01-01	Amir massoud Farahmand, Csaba Szepesvári, Jean-Yves Audibert
RSPSA : Enhanced Parameter Optimization in Games University of Alberta	Publication	2006-01-01	Levente Kocsis, Csaba Szepesvári, Mark H. M. Winands
Universal parameter optimisation in games based on SPSA University of Alberta	Publication	2006-03-01	Levente Kocsis, Csaba Szepesvári
Local Importance Sampling: A Novel Technique to Enhance Particle Filtering University of Alberta	Publication	2006-04-01	Pé ter Torma, Csaba Szepesvári
X-mHMM : An Efficient Algorithm for Training Mixtures of HMMs When the Number of Mixtures Is Unknown University of Alberta	Publication	2018-01-01	Z. Szamonek, Csaba Szepesvári
Log-optimal currency portfolios and control Lyapunov exponents University of Alberta	Publication	2018-01-01	L. Gerencseer, M. Rasonyi, Csaba Szepesvári, Z.S. Vago
Efficient object tracking in video sequences by means of LS -N-IPS University of Alberta	Publication	2018-01-01	P. Torma, Csaba Szepesvári
Uncertainty, performance, and model dependency in approximate adaptive nonlinear control University of Alberta	Publication	2000-01-01	M. French, Csaba Szepesvári, E. Rogers
Computer Aided Diagnosis of Clustered Microcalcifications Using Artificial Neural Nets University of Alberta	Publication	2000-01-01	Erich Sorantin, Ferdinand Schmidt, Heinz Mayer, Michael Becker, Csaba Szepesvári, Ewald Graif, Peter Winkler
A Unified Analysis of Value-Function-Based Reinforcement-Learning Algorithms University of Alberta	Publication	1999-11-01	Csaba Szepesvári, Michael L. Littman
An automatic method for the identification and interpretation of clustered microcalcifications in mammograms University of Alberta	Publication	1999-01-01	Ferdinand Schmidt, Erich Sorantin, Csaba Szepesvári, Ewald Graif, Michael Becker, Heinz Mayer, Karin Hartwagner
Parallel and robust skeletonization built on self-organizing elements University of Alberta	Publication	1999-01-01	Zsolt Kalmá r, Zsolt Marczell, Csaba Szepesvári, András Lörincz
The SBASE protein domain library, release 6.0: a collection of annotated protein sequence segments University of Alberta	Publication	1999-01-01	J. Murvai, K. Vlahovicek, E. Barta, Csaba Szepesvári, C. Acatrinei, S. Pongor
An integrated architecture for motion-control and path-planning University of Alberta	Publication	1998-12-01	Csaba Szepesvári, Andr�s L?rincz
Neurocontroller using dynamic state feedback for compensatory control University of Alberta	Publication	1997-12-01	Csaba Szepesvári, Szabolcs Cimmer, András L\Horincz
Robust control using inverse dynamics neurocontrollers University of Alberta	Publication	1997-12-01	Csaba Szepesvári, András L\Horincz
Approximate geometry representations and sensory fusion University of Alberta	Publication	1996-07-01	Csaba Szepesvári, András L\Horincz
SELF -ORGANIZING MULTI-RESOLUTION GRID FOR MOTION PLANNING AND CONTROL University of Alberta	Publication	1996-12-01	TIBOR FOMIN, TAMÁ S ROZGONYI, Csaba Szepesvári, ANDRÁS L\HORINCZ
Behavior of an Adaptive Self-organizing Autonomous Agent Working with Cues and Competing Concepts University of Alberta	Publication	1993-09-01	Csaba Szepesvári, Andràs Lórincz
Decision-theoretic Clustering of Strategies University of Alberta	Publication	2015-01-01	Nolan D. Bard, Deon Nicholas, Csaba Szepesvári, Michael Bowling
A Randomized Mirror Descent Algorithm for Large Scale Multiple Kernel Learning University of Alberta	Publication	2013-06-01	Arash Afkhanpour, Andras Gyorgy, Csaba Szepesvári, Michael Bowling

Csaba Szepesvári elected as a Fellow of the Association for the Advancement of Artificial Intelligence | Computing Science

Csaba Szepesvári elected as a Fellow of the Association for the Advancement of Artificial Intelligence Professor Csaba Szepesvári is recognized for his significant contributions to reinforcement ...

Published: January 23, 2023

Recognizing and retaining AI excellence: Five UAlberta researchers named CIFAR AI Chairs | Faculty of Science

Recognizing and retaining AI excellence: Five UAlberta researchers named CIFAR AI Chairs Canadian, Edmonton, and University of Alberta leadership in artificial intelligence further cemented with ...

Published: December 9, 2019

First NamePrénom:
Last NameNom:
EmailCourriel:
SubjectObjet:

First NamePrénom:
Last NameNom:
EmailCourriel:
Phone #:

Szepesvári, Csaba

From AI4Society Forum

Szepesvári, Csaba

Profile

Outputs

Csaba Szepesvári elected as a Fellow of the Association for the Advancement of Artificial Intelligence | Computing Science

Recognizing and retaining AI excellence: Five UAlberta researchers named CIFAR AI Chairs | Faculty of Science