Kunal Talwar
[introductory/intermediate] Foundations of Differentially Private Learning
Summary
Machine Learning models can memorize training data and have been shown to allow extraction of information found in individual training examples. Differential Privacy is a standard definition of privacy in statistical settings; it allows training models that capture global trends while provably preventing the learning of information specific to one or a few training examples. Recent advances have shown that, in many settings, one can learn with differential privacy with little or no impact on accuracy. These include theoretical results proving that convex models can be learned privately with little loss in accuracy over a large range of parameters, and practical results showing that neural network models can be trained privately with accuracy close to that of non-private methods. Similarly, learning popular items from a discrete set can be done privately at a small cost in accuracy. This course will study the foundational techniques used to solve these problems.
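For reference, the standard definition studied in the first part of the course (and in the Dwork-Roth book listed under References) says that a randomized algorithm M is (ε, δ)-differentially private if, for every pair of datasets D and D' differing in a single example and every set S of possible outputs,

    Pr[M(D) ∈ S] ≤ e^ε · Pr[M(D') ∈ S] + δ.

Smaller ε and δ mean that the output distribution reveals less about any single training example.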
Syllabus
- Differential Privacy definition, properties
- Basic Algorithms, Composition
- Convex Optimization with Differential Privacy
- Privacy Amplification, Moments Accountant, Differentially Private SGD (a brief sketch of DP-SGD appears after this list)
- Frequency Estimation, Shuffling, Compression
- Principal Component Analysis
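To make the Differentially Private SGD item above concrete, here is a minimal NumPy sketch of one DP-SGD update, assuming per-example gradients are already available as a (batch, dim) array; the function name dp_sgd_step and all constants are illustrative and not taken from the course materials. Each per-example gradient is clipped to a fixed L2 norm, the clipped gradients are summed, Gaussian noise calibrated to the clipping norm is added, and the noisy sum is averaged to form the descent direction.

import numpy as np

def dp_sgd_step(params, per_example_grads, clip_norm, noise_multiplier, lr, rng):
    # Clip each example's gradient to L2 norm at most clip_norm.
    norms = np.linalg.norm(per_example_grads, axis=1, keepdims=True)
    clipped = per_example_grads * np.minimum(1.0, clip_norm / np.maximum(norms, 1e-12))
    # Add Gaussian noise scaled to the clipping norm, then average over the batch.
    noise = rng.normal(0.0, noise_multiplier * clip_norm, size=params.shape)
    noisy_grad = (clipped.sum(axis=0) + noise) / per_example_grads.shape[0]
    return params - lr * noisy_grad

# Example usage with random data (all dimensions and constants are arbitrary).
rng = np.random.default_rng(0)
params = np.zeros(10)
per_example_grads = rng.normal(size=(32, 10))
params = dp_sgd_step(params, per_example_grads, clip_norm=1.0,
                     noise_multiplier=1.1, lr=0.1, rng=rng)

The clipping norm, noise multiplier, sampling rate, and number of steps together determine the overall privacy guarantee, which tools such as the moments accountant track across training.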
References
Cynthia Dwork, Aaron Roth. The Algorithmic Foundations of Differential Privacy. https://www.cis.upenn.edu/~aaroth/privacybook.html
Pre-requisites
Basic understanding of probability and statistics, calculus, linear algebra, and machine learning.
Short bio
Kunal Talwar is a Research Scientist at Apple, leading a research group focusing on foundations of ML and Private Data Analysis. His research interests span many areas of Computer Science, including Differential Privacy, Machine Learning, Algorithms, and Data Structures. He received his B.Tech. from IIT Delhi (2000) and his PhD from UC Berkeley (2004). Prior to joining Apple, he worked at Microsoft Research in Silicon Valley from 2005 to 2014, and at Google Brain from 2014 to 2019. He has made major contributions to Differential Privacy, Metric Embeddings, and Discrepancy Theory. His work has been recognized by the Privacy Enhancing Technologies award in 2009 and the ICLR Best Paper award in 2017.