Sergei V. Gleyzer

University of Alabama

[introductory/intermediate] Machine Learning Fundamentals and Their Applications to Very Large Scientific Data: Rare Signal and Feature Extraction, End-to-End Deep Learning, Uncertainty Estimation and Realtime Machine Learning Applications in Software and Hardware

Summary

Deep learning has become one of the most widely used tools in modern science and engineering, leading to breakthroughs in many areas ranging from computer vision to natural language processing to physics and medicine. This mini-course will introduce the basics of machine learning and classification theory based on statistical learning and describe two classes of popular algorithms in depth: decision and rule-based methods (decision trees, decision rules, bagging and boosting, random forests) and deep neural network-based models of various types (fully-connected, convolutional, recurrent/LSTM, graph neural networks and transformers). The course will focus on practical applications in the analysis of large scientific data, on interpretability and uncertainty estimation, and on how best to extract meaningful features. It will also cover the implementation of realtime deep learning algorithms in software and hardware. No previous machine learning background is required.

Syllabus

Introduction to Machine Learning: Theoretical Foundation, Classification Theory
Practical Applications and Examples in Sciences and Engineering with Large Scientific Data
Tree-based Algorithms: decision trees, rules, bagging, boosting, random forests
Deep Learning Methods: theory, loss functions, fully-connected networks, convolutional, recurrent, graph neural networks and geometric deep learning, transformers
Fundamentals of Feature Extraction and End-to-end Deep Learning
Uncertainty Estimation and Machine Learning Model Interpretations
Symbolic Machine Learning
Realtime Implementation of Deep Learning in Software and Hardware

References

I. Goodfellow, Y. Bengio and A. Courville, “Deep Learning”, MIT Press, 2016

G. James et al., “An Introduction to Statistical Learning”, Springer, 2013

A. Géron, “Hands-On Machine Learning with Scikit-Learn, Keras, and TensorFlow: Concepts, Tools, and Techniques to Build Intelligent Systems”, O’Reilly, 2019

C.M. Bishop, “Pattern Recognition and Machine Learning”, Springer, 2006

J.R. Quinlan, “C4.5: Programs for Machine Learning”, Morgan Kaufmann, 1992

Pre-requisites

None.

Short bio

Sergei Gleyzer is a particle physicist working at the interface of machine learning, physics and astronomy. His work aims at more intelligent systems that can extract meaningful information from the data collected by the Large Hadron Collider (LHC), the world’s highest-energy particle physics experiment, located at the CERN laboratory near Geneva, Switzerland, and by the Vera Rubin Observatory in Chile. He is a co-discoverer of the Higgs boson and the founder of several major machine learning initiatives, such as the Inter-experimental Machine Learning Working Group and the Compact Muon Solenoid experiment’s Machine Learning Forum. Professor Gleyzer is working on applying advanced machine learning methods to searches for new physics, such as dark matter. He also teaches a popular machine learning course at the University of Alabama and is the founder of the Machine Learning for Science (ML4SCI) Open Source Foundation.
