Yvan Saeys
[introductory/intermediate] Interpreting Machine Learning Models
Summary
In this series of lectures we will cover a wide range of topics related to the interpretation of machine learning models. While some machine learning models can be easily interpreted, others are essentially black boxes, and in general there is a trade-off between the predictive performance of a model and the ease with which it can be interpreted. To cope with this, many techniques have been introduced that shed light on the inner workings of machine learning models and allow insights to be extracted from them.
We will start with a general introduction to machine learning interpretability, relating it to the trustworthiness of machine learning models. We will discuss the different types of interpretability methods, including model-based and post-hoc interpretability methods, as well as model distillation techniques.
Next we will cover several interpretability methods in detail, starting with model-based techniques, followed by post-hoc interpretability methods (including LIME and SHAP) and instance-based explanations (including counterfactuals). A large part of the lectures will also be devoted to feature attribution techniques for interpreting deep learning models (including class activation maps, DeepLIFT, and several gradient-based methods). Alongside the theory underlying the methods, examples on real-world data sets will be used to illustrate the different interpretability methods.
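To give a concrete flavour of the kind of post-hoc explanation covered in the lectures, the minimal sketch below computes SHAP feature attributions for a random forest trained on a small tabular data set. The choice of the Python `shap` package, scikit-learn, and the diabetes data set is purely illustrative and not part of the course material.

    # Minimal sketch of a post-hoc explanation with SHAP (illustrative only;
    # the lectures may use different tools, models, and data sets).
    from sklearn.datasets import load_diabetes
    from sklearn.ensemble import RandomForestRegressor
    import shap

    # Train a "black-box" model on a small tabular regression task.
    X, y = load_diabetes(return_X_y=True, as_frame=True)
    model = RandomForestRegressor(n_estimators=100, random_state=0).fit(X, y)

    # SHAP values are per-sample, per-feature contributions to a prediction,
    # measured relative to the model's average prediction.
    explainer = shap.TreeExplainer(model)
    shap_values = explainer.shap_values(X)  # shape: (n_samples, n_features)

    # A simple global summary: mean absolute SHAP value per feature.
    importance = abs(shap_values).mean(axis=0)
    for name, score in sorted(zip(X.columns, importance), key=lambda t: -t[1]):
        print(f"{name:10s} {score:6.2f}")

The per-sample SHAP values explain individual predictions, while averaging their absolute values gives a simple global ranking of feature importance.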
Syllabus
- Introduction, general context, and overview of the field
- Model-based interpretability methods
- Post-hoc interpretability methods
- Interpreting deep learning models
- Model distillation
- Evaluating interpretability methods
- Case studies on real-world data sets
References
- C. Molnar. Interpretable Machine Learning: A Guide for Making Black Box Models Explainable.
- F. Doshi-Velez and B. Kim (2017). Towards a Rigorous Science of Interpretable Machine Learning. arXiv:1702.08608.
- G. Hinton, O. Vinyals, and J. Dean (2015). Distilling the Knowledge in a Neural Network. NIPS Deep Learning and Representation Learning Workshop. https://arxiv.org/abs/1503.02531
- M. Ribeiro, S. Singh, and C. Guestrin (2016). "Why Should I Trust You?": Explaining the Predictions of Any Classifier. Proceedings of the 2016 Conference of the North American Chapter of the Association for Computational Linguistics: Demonstrations. http://dx.doi.org/10.18653/v1/n16-3020
- W. J. Murdoch, C. Singh, K. Kumbier, R. Abbasi-Asl, and B. Yu (2019). Definitions, methods, and applications in interpretable machine learning. Proceedings of the National Academy of Sciences 116(44):22071-22080. doi:10.1073/pnas.1900654116.
- R. R. Selvaraju, M. Cogswell, A. Das, R. Vedantam, D. Parikh, and D. Batra (2017). Grad-CAM: Visual Explanations from Deep Networks via Gradient-Based Localization. 2017 IEEE International Conference on Computer Vision (ICCV), pp. 618-626. doi:10.1109/ICCV.2017.74.
Pre-requisites
Familiarity with basic machine learning problems (classification, regression, clustering, dimensionality reduction) and approaches (linear models, decision trees, random forests, neural networks).
Short bio
Yvan Saeys received a PhD in computer science from Ghent University. After spending time abroad at the University of the Basque Country and Claude Bernard University Lyon, he returned to Belgium, where he is currently a professor of computer science at the Department of Applied Mathematics, Computer Science and Statistics of Ghent University, and a principal investigator at the Flanders Institute for Biotechnology (VIB), where he heads the "Data Mining and Modeling for Biomedicine" group. His research interests include the robustness and interpretability of machine learning methods and their application to computational biology and medicine.
At Ghent University he teaches several machine learning related courses, including Artificial Intelligence, Machine Learning, Big Data Science, and Data Mining. He has published more than 180 papers in computer science and computational biology journals, serves on the editorial board of the journal "Information Fusion", and reviews for many computer science and biotechnology journals.
Methods he developed have obtained the best results in several machine learning competitions: they won the DREAM5 Network Inference challenge, the most important benchmark for assessing machine learning techniques in systems biology, and provided the best algorithm in the FlowCAP IV challenge on predicting progression from HIV to AIDS. He was recently named a "Highly Cited Researcher" in the Cross-Field category (multiple highly cited papers ranking in the top 1% by citations, as analyzed by Clarivate Analytics) and received the FWO/AstraZeneca Foundation Award "Patient care in the AI era".