DeepLearn 2024
11th International School on Deep Learning
(and the Future of Artificial Intelligence)
Porto - Maia, Portugal · July 15-19, 2024
Registration
Downloads
  • Call DeepLearn 2024
  • Poster DeepLearn 2024
  • Lecture Materials
  • Home
  • Lecturers
  • Schedule
  • Sponsoring
  • News
  • Info
    • Travel
    • Accommodation
    • UMaia / UP students and staff
    • Visa
    • Code of conduct
    • Testimonials
  • Home
  • Lecturers
  • Schedule
  • Sponsoring
  • News
  • Info
    • Travel
    • Accommodation
    • UMaia / UP students and staff
    • Visa
    • Code of conduct
    • Testimonials
deeeplearn-speakers-cui

Peng Cui

Tsinghua University

[intermediate/advanced] Stable Learning for Out-of-Distribution Generalization: Invariance, Causality and Heterogeneity

Summary

The traditional framework of machine learning (ML) operates under the assumption that training and testing datasets are independent and identically distributed (i.i.d.). This assumption, however, often proves inadequate in real-world scenarios where distributional shifts between training and test data can significantly impair model performance post-deployment. Such phenomena underscore the critical importance of addressing the Out-of-Distribution (OOD) generalization problem, an emerging topic of ML research that focuses on scenarios wherein the test distributions differ from the training ones.

This course aims to provide a comprehensive view on the stable learning framework, which aims to enhance the model’s OOD generalization ability from three perspectives, including invariance, causality, and heterogeneity. Invariance is at the core of stable learning, which seeks invariant prediction mechanisms that hold across different domains / distributions. This course will share the recent progress as well as the drawbacks of invariant learning. Then, we will move on to the causality, serving as a foundation of invariance from causal inference. The course will introduce essential concepts, methodologies, and the latest advancements of causality, and demonstrate how feature decorrelation can lead to the generalization out of distribution.

Beyond model-centric strategies, this course will delve into heterogeneity-aware ML, as another way to pursue invariance by leveraging the “variance” within data. This data-centric approach aims to enhance generalization under distributional shifts by modeling and leveraging data heterogeneity throughout the whole ML pipeline. Attendees will learn about the types of data heterogeneity, alongside quantitative metrics and algorithms that consider heterogeneous data. Real-world applications, including healthcare, autonomous control systems (like self-driving cars), and finance, will serve as practical examples throughout the course.

Finally, we will discuss the potential directions in this field, covering understanding real-world distribution shifts, scaling methods to large language models (LLMs) / foundation models, and benchmarking OOD generalization capabilities.

At the end of the course, attendees will gain a comprehensive understanding of OOD generalization’s foundational principles, including key methodologies, recent developments, limitations, and exciting prospects for future research in the field.

Syllabus

  • Background: performance degradation of ML models in real-world applications, the causes of poor OOD generalization performances.
  • OOD generalization problem: the problem setting of OOD generalization, differences with related fields, typical methodologies.
  • Stable learning: framework, core ideas.
  • Invariance: concepts, recent progress, and drawbacks in practice.
  • Causality: essential concepts, methodologies, and the latest advancements.
  • Heterogeneity: quantitative metrics, typical algorithms built on heterogeneous data, benchmarks.
  • Future directions: the pattern of distribution shifts, OOD generalization benchmarks.

References

Cui, P., & Athey, S. (2022). Stable learning establishes some common ground between causal inference and machine learning. Nature Machine Intelligence, 4(2), 110-115.

Shen, Z., Cui, P., Zhang, T., & Kuang, K. (2020, April). Stable learning via sample reweighting. In Proceedings of the AAAI Conference on Artificial Intelligence (Vol. 34, No. 04, pp. 5692-5699).

Zhang, X., Cui, P., Xu, R., Zhou, L., He, Y., & Shen, Z. (2021). Deep stable learning for out-of-distribution generalization. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (pp. 5372-5382).

Liu, J., Hu, Z., Cui, P., Li, B., & Shen, Z. (2021, July). Heterogeneous risk minimization. In International Conference on Machine Learning (pp. 6804-6814). PMLR.

Liu, J., Wu, J., Pi, R., Xu, R., Zhang, X., Li, B., & Cui, P. (2022, September). Measure the Predictive Heterogeneity. In The Eleventh International Conference on Learning Representations.

Liu, J., Wang, T., Cui, P., & Namkoong, H. (2024). On the need for a language describing distribution shifts: Illustrations on tabular datasets. Advances in Neural Information Processing Systems, 36.

Pre-requisites

General machine learning knowledge.

Short bio

Peng Cui is an Associate Professor with tenure in Tsinghua University. He is interested in research on stable prediction, decision-making based on causal principles, and network representation learning at a large scale. Since 2016, he has been exploring how to combine causal statistics with machine learning methods, and developed a theoretical framework for stable learning inspired by causality. His research results have been widely used in industrial domains such as intelligent health care and the Internet economy. He has published more than 100 papers in top artificial intelligence conferences and received 7 awards for his papers from international conferences or journals. He is an associate editor of international journals such as IEEE TKDE, ACM TOMM, ACM TIST, IEEE TBD, KAIS, etc., and has been area chair or senior PC member of top conferences like NeurIPS, ICML, UAI, etc. He has won the second prize of the National Natural Science Award in China, the first prize of the Natural Science Award of the Ministry of Education in China, the CCF-IEEE CS Young Scientist Award, and he is a distinguished member of the ACM.

Other Courses

deeeplearn-speakers-hanJiawei Han
Katia SycaraKatia Sycara
deeeplearn-speakers-beniniLuca Benini
Gustau Camps-VallsGustau Camps-Valls
Nitesh ChawlaNitesh Chawla
Daniel CremersDaniel Cremers
deeeplearn-speakers-gleyzerSergei V. Gleyzer
deeeplearn-speakers-heYulan He
Frank HutterFrank Hutter
deeeplearn-speakers-karypisGeorge Karypis
deeeplearn-speakers-neyHermann Ney
Massimiliano PontilMassimiliano Pontil
Elisa RicciElisa Ricci
Wojciech SamekWojciech Samek
Xinghua Mindy ShiXinghua Mindy Shi
Michalis VazirgiannisMichalis Vazirgiannis
James ZouJames Zou

DeepLearn 2023 Summer

CO-ORGANIZERS

University of Maia

Institute for Research Development, Training and Advice – IRDTA, Brussels/London

Active links
    Past links
    • DeepLearn 2023 Summer
    • DeepLearn 2023 Spring
    • DeepLearn 2023 Winter
    • DeepLearn 2022 Autumn
    • DeepLearn 2022 Summer
    • DeepLearn 2022 Spring
    • DeepLearn 2021 Summer
    • DeepLearn 2019
    • DeepLearn 2018
    • DeepLearn 2017
    © IRDTA 2023. All Rights Reserved.
    We use cookies on our website to give you the most relevant experience by remembering your preferences and repeat visits. By clicking “Accept All”, you consent to the use of ALL the cookies. However, you may visit "Cookie Settings" to provide a controlled consent.
    Cookie SettingsAccept All
    Manage consent

    Privacy Overview

    This website uses cookies to improve your experience while you navigate through the website. Out of these, the cookies that are categorized as necessary are stored on your browser as they are essential for the working of basic functionalities of the website. We also use third-party cookies that help us analyze and understand how you use this website. These cookies will be stored in your browser only with your consent. You also have the option to opt-out of these cookies. But opting out of some of these cookies may affect your browsing experience.
    Necessary
    Always Enabled
    Necessary cookies are absolutely essential for the website to function properly. These cookies ensure basic functionalities and security features of the website, anonymously.
    CookieDurationDescription
    cookielawinfo-checkbox-advertisement1 yearThe cookie is set by GDPR cookie consent to record the user consent for the cookies in the category "Advertisement".
    cookielawinfo-checkbox-analytics11 monthsThis cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Analytics".
    cookielawinfo-checkbox-functional11 monthsThe cookie is set by GDPR cookie consent to record the user consent for the cookies in the category "Functional".
    cookielawinfo-checkbox-necessary11 monthsThis cookie is set by GDPR Cookie Consent plugin. The cookies is used to store the user consent for the cookies in the category "Necessary".
    cookielawinfo-checkbox-others11 monthsThis cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Other.
    cookielawinfo-checkbox-performance11 monthsThis cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Performance".
    PHPSESSIDsessionThis cookie is native to PHP applications. The cookie is used to store and identify a users' unique session ID for the purpose of managing user session on the website. The cookie is a session cookies and is deleted when all the browser windows are closed.
    viewed_cookie_policy11 monthsThe cookie is set by the GDPR Cookie Consent plugin and is used to store whether or not user has consented to the use of cookies. It does not store any personal data.
    Functional
    Functional cookies help to perform certain functionalities like sharing the content of the website on social media platforms, collect feedbacks, and other third-party features.
    Performance
    Performance cookies are used to understand and analyze the key performance indexes of the website which helps in delivering a better user experience for the visitors.
    Analytics
    Analytical cookies are used to understand how visitors interact with the website. These cookies help provide information on metrics the number of visitors, bounce rate, traffic source, etc.
    CookieDurationDescription
    _ga2 yearsThis cookie is installed by Google Analytics. The cookie is used to calculate visitor, session, campaign data and keep track of site usage for the site's analytics report. The cookies store information anonymously and assign a randomly generated number to identify unique visitors.
    _gat_gtag_UA_74880351_91 minuteThis cookie is set by Google and is used to distinguish users.
    _gid1 dayThis cookie is installed by Google Analytics. The cookie is used to store information of how visitors use a website and helps in creating an analytics report of how the website is doing. The data collected including the number visitors, the source where they have come from, and the pages visted in an anonymous form.
    Advertisement
    Advertisement cookies are used to provide visitors with relevant ads and marketing campaigns. These cookies track visitors across websites and collect information to provide customized ads.
    Others
    Other uncategorized cookies are those that are being analyzed and have not been classified into a category as yet.
    SAVE & ACCEPT
    Powered by CookieYes Logo