Shih-Chieh Hsu
[intermediate/advanced] Real-Time Artificial Intelligence for Science and Engineering
Summary
Artificial Intelligence (AI) applications have exploded over the past decades across a wealth of research domains and industries. With edge computing, real-time inference of deep neural networks on custom hardware has become increasingly relevant to these applications. The Large Hadron Collider (LHC) experiments at CERN run AI algorithms on field-programmable gate arrays (FPGAs) to detect rare physics events among millions of proton-proton collisions every second. Smartphone companies are incorporating AI chips into their designs for on-device inference to improve user experience and tighten data security. The autonomous vehicle industry is turning to application-specific integrated circuits (ASICs) to achieve low latency. The latency typically acceptable for real-time inference in these applications ranges from O(1) microsecond down to nanoseconds, and resources are strictly limited. To address this challenge, software tools have been developed to exploit specialized hardware for inference acceleration. These tools can improve the overall latency and throughput of inference, reduce computing complexity, and significantly lower the cost for users to develop optimized workflows.
Syllabus
In this lecture, I will give an overview of the challenges the physics community faces across latency and throughput regimes, and of the tools and resources that address them. I will introduce state-of-the-art techniques for model compression, such as pruning and quantization. Tutorials will help you get familiar with these techniques using the hls4ml library. This library converts pre-trained machine learning models into FPGA firmware targeting ultra-low-latency inference to stay within the strict constraints imposed by the CERN particle detectors.
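As a toy illustration of the two compression techniques named above, the sketch below implements magnitude-based pruning and signed fixed-point quantization (the numeric format hls4ml's ap_fixed<W,I> types use, where W is the total bit width and I the integer bits including sign) in plain Python. The function names are hypothetical and for intuition only; hls4ml itself operates on whole models and generates FPGA firmware.

```python
def prune_by_magnitude(weights, sparsity):
    """Zero out the smallest-magnitude fraction `sparsity` of the weights."""
    order = sorted(range(len(weights)), key=lambda i: abs(weights[i]))
    pruned = list(weights)
    for i in order[:int(len(weights) * sparsity)]:
        pruned[i] = 0.0  # pruned weights cost no multipliers on the FPGA
    return pruned

def quantize_fixed(x, total_bits=8, int_bits=1):
    """Round x onto a signed fixed-point grid, ap_fixed<W,I> style."""
    frac_bits = total_bits - int_bits
    scale = 2 ** frac_bits
    lo = -(2 ** (total_bits - 1)) / scale        # most negative code
    hi = (2 ** (total_bits - 1) - 1) / scale     # most positive code
    return min(max(round(x * scale) / scale, lo), hi)

weights = [0.5, -0.1, 0.9, 0.05]
print(prune_by_magnitude(weights, 0.5))   # [0.5, 0.0, 0.9, 0.0]
print(quantize_fixed(0.3, 8, 1))          # 0.296875 (nearest 8-bit code)
print(quantize_fixed(5.0, 8, 1))          # 0.9921875 (saturates at the top)
```

In practice, pruning and quantization are applied during training (e.g. quantization-aware training) so the network can recover the accuracy lost to the coarser representation; the tutorials cover these workflows.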
References
Lectures will be based on the following papers:
P. Harris et al., “Physics Community Needs, Tools, and Resources for Machine Learning,” https://arxiv.org/abs/2203.16255
E. E. Khoda et al., “Ultra-low latency recurrent neural network inference on FPGAs for physics applications with hls4ml,” https://arxiv.org/abs/2207.00559
A. Elabd et al., “Graph Neural Networks for Charged Particle Tracking on FPGAs,” Front. Big Data 5 (2022) 828666
F. Fahim et al., “hls4ml: An Open-Source Codesign Workflow to Empower Scientific Low-Power Machine Learning Devices,” TinyML Research Symposium 2021 https://arxiv.org/abs/2103.05579
J. Duarte et al., “Low-latency machine learning inference on FPGAs,” NeurIPS ML4PS 2019 74
Tutorials will be based on the hls4ml library https://github.com/fastmachinelearning/hls4ml-tutorial
Pre-requisites
Basic knowledge of machine learning and neural networks.
Short bio
Shih-Chieh Hsu earned an MS degree in Physics from National Taiwan University and a PhD in Physics from the University of California San Diego. He is currently an Associate Professor in Physics and Adjunct Associate Professor in Electrical and Computer Engineering at the University of Washington, and Director of the NSF HDR Institute: Accelerated Artificial Intelligence Algorithms for Data-Driven Discovery (A3D3). He works on experimental particle physics using proton-proton collision data from the Large Hadron Collider. His research interests range from dark matter searches with the ATLAS experiment and neutrino cross-section measurements with the FASER experiment to innovative Artificial Intelligence algorithms for data-intensive discovery and accelerated machine learning with heterogeneous computing. He is a recipient of a DOE Early Career Award and an Undergraduate Research Mentor Award.