DeepLearn 2023 Spring
9th International School
on Deep Learning
Bari, Italy · April 03-07, 2023
Registration
Downloads
  • Call DeepLearn 2023 Spring
  • Poster DeepLearn 2023 Spring
  • Lecture Materials
  • Home
  • Schedule
  • Lecturers
  • Sponsoring
  • News
  • Info
    • Accommodation
    • Travel to Bari
    • Code of conduct
    • Visa
    • Testimonials
  • Home
  • Schedule
  • Lecturers
  • Sponsoring
  • News
  • Info
    • Accommodation
    • Travel to Bari
    • Code of conduct
    • Visa
    • Testimonials
Babak Ehteshami Bejnordi

Babak Ehteshami Bejnordi

Qualcomm AI Research

[intermediate/advanced] Conditional Computation for Efficient Deep Learning with Applications to Computer Vision, Multi-Task Learning, and Continual Learning

Summary

To date, most AI systems focus on solving a single task or a narrow domain at a time. Developing an AI system that can solve hundreds of tasks can lead to increased efficiency, better generalization, and greater adaptability, making it a powerful tool for a wide range of applications. However, solving hundreds of tasks requires building a very large capacity model. With our current dense architectures which require the whole network to activate regardless of the input or task, this is prohibitively expensive.

Conditional computation is a technique that enables neural networks to perform different computations depending on the input or task at hand. This is achieved by routing each input through different branches or subnetworks within the network such that only the relevant parts of the network are executed. There are several benefits to using conditional computation in neural networks. Conditional computation leads to improved training and inference costs because only a fraction of the model branches is used for each example. In addition, because of their modular nature and their ability to activate/deactivate different branches, such models can be modified or extended more easily making them particularly suited for multi-task learning and continual learning settings. Finally, conditional computation can help interpretability by making it easier to understand how the model is making decisions and to identify what factors are most important for a particular task.

Syllabus

  • Introduction to conditional computation
  • Various approaches for implementation of conditional computing such as the mixture of experts, early exiting, dynamic token pruning, etc.
  • Conditional computation in language and vision
  • Various routing algorithms to learn how to route the input to the appropriate pathway in the network
  • The chicken and egg problem in learning expert modules and learning how to route at the same time
  • Example state-of-the-art solutions
  • An introduction to multi-task learning (MTL) and continual learning (CL)
  • The issue of task-interference
  • How conditional computation enables learning more efficient and accurate MTL models
  • State-of-the-art conditional compute models for MTL and CL
  • Summary

References

Bengio, Yoshua, Nicholas Léonard, and Aaron Courville. “Estimating or propagating gradients through stochastic neurons for conditional computation.” arXiv preprint arXiv:1308.3432 (2013).

Bengio, Emmanuel, et al. “Conditional computation in neural networks for faster models.” arXiv preprint arXiv:1511.06297 (2015).

Shazeer, Noam, et al. “Outrageously large neural networks.” Proceedings of the International Conference on Learning Representations. ICLR 2017.

Veit, Andreas, and Serge Belongie. “Convolutional networks with adaptive inference graphs.” Proceedings of the European Conference on Computer Vision. ECCV 2018.

Ehteshami Bejnordi, Babak, Tijmen Blankevoort, and Max Welling. “Batch-shaping for learning conditional channel gated networks.” Proceedings of the International Conference on Learning Representations. ICLR 2020.

Teerapittayanon, Surat, Bradley McDanel, and H. T. Kung. “Branchynet: Fast inference via early exiting from deep neural networks.” CoRR abs/1709.01686 (2017). arXiv preprint arXiv:1709.01686 (2017).

Ghodrati, Amir, Babak Ehteshami Bejnordi, and Amirhossein Habibian. “Frameexit: Conditional early exiting for efficient video recognition.” Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 2021.

Raychaudhuri, Dripta S., et al. “Controllable Dynamic Multi-Task Architectures.” Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 2022.

Lin, Min, Jie Fu, and Yoshua Bengio. “Conditional computation for continual learning.” arXiv preprint arXiv:1906.06635 (2019).

Abati, Davide, et al. “Conditional channel gated networks for task-aware continual learning.” Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 2020.

Barham, Paul, et al. “Pathways: Asynchronous distributed dataflow for ML.” Proceedings of Machine Learning and Systems 4 (2022): 430-449.

Pre-requisites

Basic knowledge of machine learning and deep learning. A basic understanding of transformers and convolutional neural networks is preferred but not mandatory.

Short bio

https://scholar.google.com/citations?user=Qk-AMk0AAAAJ&hl=en

https://www.linkedin.com/in/babakint/

http://babakint.com/

Babak Ehteshami Bejnordi is a Research Scientist at Qualcomm AI Research in the Netherlands, leading a research group focusing on conditional computation for efficient deep learning. His research interests are in Conditional Computation, Efficient Deep Learning for Computer Vision, Multi-Task Learning, and Continual Learning. Babak obtained his Ph.D. in machine learning for breast cancer diagnosis from Radboud University in the Netherlands. During his Ph.D., he organized the CAMELYON16 challenge on breast cancer metastases detection which demonstrated one of the first medical diagnostic tasks in which AI algorithms outperform expert pathologists. Before joining Qualcomm he was a visiting researcher at Harvard University, BeckLab, and a member of the Broad Institute of MIT and Harvard. He has been the organizer of the Qualcomm Innovation Fellowship Program in Europe since 2019.

Other Courses

speakers-gleyzerSergei V. Gleyzer
speakers-kumarVipin Kumar
speakers-goldbergerJacob Goldberger
Christoph LampertChristoph Lampert
speakers-jingbianYingbin Liang
Xiaoming LiuXiaoming Liu
Michael MahoneyMichael Mahoney
Liza MijovicLiza Mijovic
William S. NobleWilliam S. Noble
Bhiksha RajBhiksha Raj
Holger Rauhut‪Holger Rauhut
Bart ter Haar RomenyBart ter Haar Romeny
Tara SainathTara Sainath
Martin SchultzMartin Schultz
Adi Laurentiu TarcaAdi Laurentiu Tarca
Emma TolleyEmma Tolley
Michalis VazirgiannisMichalis Vazirgiannis
Atlas WangAtlas Wang
Guo-Wei WeiGuo-Wei Wei
Lei XingLei Xing
Xiaowei XuXiaowei Xu

DeepLearn 2023 Spring

CO-ORGANIZERS

Department of Computer Science
University of Bari “Aldo Moro”

Institute for Research Development, Training and Advice – IRDTA, Brussels/London

Active links
  • DeepLearn 2023 Summer – 10th International Gran Canaria School on Deep Learning
  • BigDat 2023 Summer – 7th International School on Big Data

Photos by: Ph. Eufemia Lella

Past links
  • DeepLearn 2023 Winter
  • DeepLearn 2022 Autumn
  • DeepLearn 2022 Summer
  • DeepLearn 2022 Spring
  • DeepLearn 2021 Summer
  • DeepLearn 2019
  • DeepLearn 2018
  • DeepLearn 2017
© IRDTA 2021. All Rights Reserved.
We use cookies on our website to give you the most relevant experience by remembering your preferences and repeat visits. By clicking “Accept All”, you consent to the use of ALL the cookies. However, you may visit "Cookie Settings" to provide a controlled consent.
Cookie SettingsAccept All
Manage consent

Privacy Overview

This website uses cookies to improve your experience while you navigate through the website. Out of these, the cookies that are categorized as necessary are stored on your browser as they are essential for the working of basic functionalities of the website. We also use third-party cookies that help us analyze and understand how you use this website. These cookies will be stored in your browser only with your consent. You also have the option to opt-out of these cookies. But opting out of some of these cookies may affect your browsing experience.
Necessary
Always Enabled
Necessary cookies are absolutely essential for the website to function properly. These cookies ensure basic functionalities and security features of the website, anonymously.
CookieDurationDescription
cookielawinfo-checkbox-advertisement1 yearThe cookie is set by GDPR cookie consent to record the user consent for the cookies in the category "Advertisement".
cookielawinfo-checkbox-analytics11 monthsThis cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Analytics".
cookielawinfo-checkbox-functional11 monthsThe cookie is set by GDPR cookie consent to record the user consent for the cookies in the category "Functional".
cookielawinfo-checkbox-necessary11 monthsThis cookie is set by GDPR Cookie Consent plugin. The cookies is used to store the user consent for the cookies in the category "Necessary".
cookielawinfo-checkbox-others11 monthsThis cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Other.
cookielawinfo-checkbox-performance11 monthsThis cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Performance".
PHPSESSIDsessionThis cookie is native to PHP applications. The cookie is used to store and identify a users' unique session ID for the purpose of managing user session on the website. The cookie is a session cookies and is deleted when all the browser windows are closed.
viewed_cookie_policy11 monthsThe cookie is set by the GDPR Cookie Consent plugin and is used to store whether or not user has consented to the use of cookies. It does not store any personal data.
Functional
Functional cookies help to perform certain functionalities like sharing the content of the website on social media platforms, collect feedbacks, and other third-party features.
Performance
Performance cookies are used to understand and analyze the key performance indexes of the website which helps in delivering a better user experience for the visitors.
Analytics
Analytical cookies are used to understand how visitors interact with the website. These cookies help provide information on metrics the number of visitors, bounce rate, traffic source, etc.
CookieDurationDescription
_ga2 yearsThis cookie is installed by Google Analytics. The cookie is used to calculate visitor, session, campaign data and keep track of site usage for the site's analytics report. The cookies store information anonymously and assign a randomly generated number to identify unique visitors.
_gat_gtag_UA_74880351_91 minuteThis cookie is set by Google and is used to distinguish users.
_gid1 dayThis cookie is installed by Google Analytics. The cookie is used to store information of how visitors use a website and helps in creating an analytics report of how the website is doing. The data collected including the number visitors, the source where they have come from, and the pages visted in an anonymous form.
Advertisement
Advertisement cookies are used to provide visitors with relevant ads and marketing campaigns. These cookies track visitors across websites and collect information to provide customized ads.
Others
Other uncategorized cookies are those that are being analyzed and have not been classified into a category as yet.
SAVE & ACCEPT
Powered by CookieYes Logo