DeepLearn 2022 Summer
6th International Gran Canaria School
on Deep Learning
Las Palmas de Gran Canaria, Spain · July 25-29, 2022
Sean Meyn

University of Florida

[introductory/intermediate] Reinforcement Learning: Fundamentals, and Roadmaps for Successful Design

Summary

The theory of reinforcement learning (RL) is currently grounded in the theory of optimal control, typically Markov decision processes (MDPs). The dream of RL is automatic control that is truly automatic: without any knowledge of physics or biology or medicine, an RL algorithm tunes itself to become a super controller, giving the smoothest ride into space and the most expert micro-surgeon! We are currently far from practical autonomous driving or surgery, but the science is progressing quickly. With so much enthusiasm we expect many breakthroughs in the near future.

This lecture series will present the basics of algorithm design, and should be of interest to both newcomers and experienced researchers in RL. It “begins at the beginning”, with algorithm design grounded in refinements of the “ODE method”, and with emphasis on fast convergence. Also fundamental in RL are information-theoretic concepts used to investigate the design of “exploration” for learning. The course will visit this topic, but in far less depth.
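
As a rough illustration of the “ODE method” idea (a sketch only, not taken from the monograph), the code below runs a scalar stochastic approximation recursion next to an Euler discretization of its noise-free mean-flow ODE. The vector field `mean_flow`, its root at 1, the Gaussian noise, and the step size 1/n are all assumptions chosen for illustration.

```python
import random


def mean_flow(theta):
    """Hypothetical mean vector field f_bar(theta) = -(theta - 1); its root is theta* = 1."""
    return -(theta - 1.0)


def stochastic_approximation(theta0, n_steps, seed=0):
    """Noisy recursion theta_{n} = theta_{n-1} + (1/n) * (f_bar(theta_{n-1}) + W_n)."""
    rng = random.Random(seed)
    theta = theta0
    for n in range(1, n_steps + 1):
        step = 1.0 / n                      # assumed vanishing step size
        noisy_observation = mean_flow(theta) + rng.gauss(0.0, 1.0)
        theta += step * noisy_observation
    return theta


def euler_ode(theta0, n_steps):
    """Same recursion with the noise removed: an Euler scheme for d(theta)/dt = f_bar(theta)."""
    theta = theta0
    for n in range(1, n_steps + 1):
        theta += (1.0 / n) * mean_flow(theta)
    return theta
```

Calling `stochastic_approximation(5.0, 20000)` and `euler_ode(5.0, 20000)` lets one compare the noisy estimate with the deterministic trajectory; in this toy setting both settle near the root of `mean_flow`, and the gap between them is driven by the noise.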

Syllabus

The content will closely follow the new monograph by Sean Meyn “Control Systems and Reinforcement Learning”, covering the following topics:

1. Revising the ODE method (drawing from chapters 4 and 8):

This is a survey that is relevant to a much broader community of researchers in machine learning. The key message: Algorithm acceleration is possible once we better understand the nonlinear dynamics associated with the algorithms we construct. There are many new techniques to gain understanding.

2. Variance matters (from chapters 7 and 8):

How nonlinear dynamics coupled with random disturbances impact convergence rates.

3. Stochastic control and TD learning (chapters 9 and 10):

Putting algorithm design to work: TD and Q-learning are often very slow to converge. We will learn why, and how these algorithms can be accelerated.

4. Zap and Deep Q-learning (chapters 9 and 10, and recent literature):

New techniques to tame complex nonlinear algorithm dynamics.
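
For readers new to the algorithms named in topics 3 and 4, here is a minimal sketch of standard tabular Q-learning (not Zap Q-learning, and not code from the monograph) on a hypothetical two-state, two-action MDP. The transition table, rewards, discount factor, constant step size, and uniformly random exploration are all assumptions made for illustration; value iteration is included only as a reference solution.

```python
import random

GAMMA = 0.9                              # assumed discount factor
NEXT_STATE = [[0, 1], [1, 0]]            # NEXT_STATE[s][a]: action 0 stays, action 1 switches
REWARD = [[0.0, 1.0], [2.0, 0.0]]        # REWARD[s][a], hypothetical values


def q_value_iteration(n_iters=2000):
    """Reference Q* computed by repeatedly applying the Bellman optimality operator."""
    q = [[0.0, 0.0], [0.0, 0.0]]
    for _ in range(n_iters):
        q = [[REWARD[s][a] + GAMMA * max(q[NEXT_STATE[s][a]])
              for a in range(2)] for s in range(2)]
    return q


def q_learning(n_steps=50000, step_size=0.1, seed=0):
    """Tabular Q-learning along a single trajectory with uniformly random actions."""
    rng = random.Random(seed)
    q = [[0.0, 0.0], [0.0, 0.0]]
    s = 0
    for _ in range(n_steps):
        a = rng.randrange(2)                              # pure exploration policy
        s_next = NEXT_STATE[s][a]
        # Temporal-difference error for the observed transition (s, a, s_next).
        td_error = REWARD[s][a] + GAMMA * max(q[s_next]) - q[s][a]
        q[s][a] += step_size * td_error
        s = s_next
    return q
```

Comparing `q_learning()` with `q_value_iteration()` entry by entry gives a feel for how close the learned table gets. The constant step size works here because this toy model is deterministic; with random rewards or transitions, the choice of step size has a strong effect on convergence speed.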

References

  • S. Meyn. Control Systems and Reinforcement Learning. To be published by Cambridge University Press. Pre-publication version online: https://meyn.ece.ufl.edu/2021/08/01/control-systems-and-reinforcement-learning/
  • A. M. Devraj, A. Busic, and S. Meyn. Fundamental design principles for reinforcement learning algorithms. In K. G. Vamvoudakis, Y. Wan, F. L. Lewis, and D. Cansever, editors, Handbook on Reinforcement Learning and Control, Studies in Systems, Decision and Control series (SSDC, volume 325). Springer, 2021.
  • Theory of Reinforcement Learning Boot Camp, Aug 31, 2020 to Sep 4, 2020. Video available online: https://simons.berkeley.edu/workshops/rl-2020-bc
  • S. Chen, A. M. Devraj, F. Lu, A. Busic, and S. Meyn. Zap Q-Learning with nonlinear function approximation. In H. Larochelle, M. Ranzato, R. Hadsell, M. F. Balcan, and H. Lin, editors, Advances in Neural Information Processing Systems, volume 33, pages 16879–16890, 2020. Also available as arXiv e-prints 1910.05405.

Pre-requisites

Reinforcement learning has several foundations, including algorithm design and Markov Decision Processes (which are roughly equivalent to stochastic control theory). The course will present a fresh look at these foundations and how they lead to standard algorithms for reinforcement learning, as well as recent techniques designed to improve reliability. The control theory will be reviewed in the lectures. It is most important that students come with a good grasp of stochastic process fundamentals, as well as the basics of ordinary differential equations and matrix algebra.

Short bio

Sean Meyn was raised by the beach in Southern California. Following his BA in mathematics at UCLA, he moved on to pursue a PhD with Peter Caines at McGill University. After about 20 years as a professor of ECE at the University of Illinois, in 2012 he moved to beautiful Gainesville. He is now Professor and Robert C. Pittman Eminent Scholar Chair in the Department of Electrical and Computer Engineering at the University of Florida, and director of the Laboratory for Cognition and Control. He also holds an Inria International Chair to support research with colleagues in France. His interests span many aspects of stochastic control, stochastic processes, information theory, and optimization. For the past decade, his applied research has focused on engineering, markets, and policy in energy systems.

Other Courses

  • Wahid Bhimji
  • Joachim M. Buhmann
  • Kate Saenko
  • Arindam Banerjee
  • Pierre Baldi
  • Mikhail Belkin
  • Arthur Gretton
  • Phillip Isola
  • Mohit Iyyer
  • Irwin King
  • Tor Lattimore
  • Vincent Lepetit
  • Dimitris N. Metaxas
  • Louis-Philippe Morency
  • Wojciech Samek
  • Clarisa Sánchez
  • Björn W. Schuller
  • Jonathon Shlens
  • Johan Suykens
  • A. Murat Tekalp
  • Alexandre Tkatchenko
  • Li Xiong
  • Ming Yuan

CO-ORGANIZERS

Universidad de Las Palmas de Gran Canaria

Universitat Rovira i Virgili

Institute for Research Development, Training and Advice – IRDTA, Brussels/London

© IRDTA 2021. All Rights Reserved.