Rex Ying
[intermediate/advanced] Multimodal Foundation Models for Graph-Structured Data: Framework and Scientific Applications
Summary
Foundation models have demonstrated remarkable capabilities across a diverse range of tasks and modalities, such as natural language, images, time series, graphs, and tabular data. They leverage abundant unlabeled data, mapping inputs into an embedding space (usually Euclidean) such that the embeddings can be readily used for a variety of generative and predictive downstream tasks with few demonstrations or labels. However, these data modalities often exhibit complex, non-Euclidean structural properties, including hierarchy, power-law distributions, and clustering. In this course, we will demonstrate the underlying rationale for non-Euclidean representation learning. We will further introduce research breakthroughs that leverage non-Euclidean geometry, such as hyperbolic geometry, as a powerful embedding technique to be integrated into large language models, multimodal foundation models, and graph foundation models.
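To make the rationale concrete, below is a minimal sketch (our own illustration, not taken from the course materials; it assumes only NumPy, and the function name and example points are our choices) of the geodesic distance on the Poincaré ball used in the Nickel and Kiela reference listed below. Distances grow without bound near the boundary of the ball, which is what allows hyperbolic space to embed trees and other hierarchies with low distortion.

```python
# Illustrative sketch of the Poincaré-ball distance (Nickel & Kiela, 2017);
# the function name and the example points are our own choices.
import numpy as np

def poincare_distance(u: np.ndarray, v: np.ndarray) -> float:
    """Geodesic distance between points u, v inside the open unit ball."""
    sq_u = np.dot(u, u)              # ||u||^2, must be < 1
    sq_v = np.dot(v, v)              # ||v||^2, must be < 1
    sq_diff = np.dot(u - v, u - v)   # ||u - v||^2
    # d(u, v) = arcosh(1 + 2 ||u - v||^2 / ((1 - ||u||^2)(1 - ||v||^2)))
    return float(np.arccosh(1.0 + 2.0 * sq_diff / ((1.0 - sq_u) * (1.0 - sq_v))))

root = np.array([0.0, 0.0])     # a hierarchy root sits near the origin
leaf_a = np.array([0.95, 0.0])  # leaves are pushed toward the boundary
leaf_b = np.array([0.0, 0.95])
print(poincare_distance(root, leaf_a))    # ~3.66
print(poincare_distance(leaf_a, leaf_b))  # ~6.64, vs. a Euclidean gap of ~1.34
```

The two leaves are close in Euclidean terms but far apart hyperbolically, mirroring how siblings deep in a tree share an ancestor yet lie on distant branches.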
Syllabus
- Non-Euclidean Representation Learning
- Hyperbolic Deep Learning Architectures (see the sketch after this list)
- Non-Euclidean Training and Fine-tuning of Foundation Models
- Tools, Benchmarks, and Applications
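As a taste of the "Hyperbolic Deep Learning Architectures" topic, the sketch below (again our own illustration, not course code; variable names and the test vector are assumptions) shows the exponential and logarithmic maps at the origin of the Poincaré ball with curvature -c. Architectures such as the hyperbolic graph convolutional network of Chami et al. (2019) use maps of this kind to move Euclidean features onto the manifold and back.

```python
# Illustrative exp/log maps at the origin of the Poincaré ball (curvature -c);
# names and the test vector are our own choices.
import numpy as np

def exp_map_zero(v: np.ndarray, c: float = 1.0) -> np.ndarray:
    """Lift a Euclidean (tangent) vector v onto the ball of curvature -c."""
    norm = np.linalg.norm(v)
    if norm == 0.0:
        return v  # the origin maps to itself
    return np.tanh(np.sqrt(c) * norm) * v / (np.sqrt(c) * norm)

def log_map_zero(y: np.ndarray, c: float = 1.0) -> np.ndarray:
    """Inverse map: pull a point y on the ball back to the tangent space at 0."""
    norm = np.linalg.norm(y)
    if norm == 0.0:
        return y
    return np.arctanh(np.sqrt(c) * norm) * y / (np.sqrt(c) * norm)

x = np.array([3.0, -4.0])               # an ordinary Euclidean feature vector
h = exp_map_zero(x)                     # now strictly inside the unit ball
assert np.linalg.norm(h) < 1.0
assert np.allclose(log_map_zero(h), x)  # the round trip recovers the input
```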
References
Nickel, Maximillian, and Douwe Kiela. “Poincaré embeddings for learning hierarchical representations.” Advances in Neural Information Processing Systems 30 (2017).
Gu, Albert, et al. “Learning mixed-curvature representations in product spaces.” International Conference on Learning Representations. 2019.
Chami, Ines, et al. “Hyperbolic graph convolutional neural networks.” Advances in Neural Information Processing Systems 32 (2019).
Desai, Karan, et al. “Hyperbolic image-text representations.” International Conference on Machine Learning. PMLR, 2023.
Yang, Menglin, et al. “Hypformer: Exploring efficient transformer fully in hyperbolic space.” Proceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining. 2024.
Yang, Menglin, et al. “Hyperbolic fine-tuning for large language models.” arXiv:2410.04010 (2024).
Pre-requisites
Basics of deep learning and representation learning; familiarity with foundation models.
Short bio
Rex Ying is an assistant professor in the Department of Computer Science at Yale University. His research focuses on algorithms for foundation models, multimodal models, graph neural networks, geometric deep learning, and trustworthy deep learning. He is interested in using graphs and geometry to make representation learning more expressive and trustworthy in large-scale settings. Rex has built multimodal foundation models in the engineering, natural science, social science, and financial domains. He won the Best Dissertation Award at KDD 2022 and an Amazon Research Award in 2024. His research is supported in part by the National Science Foundation, the Gordon and Betty Moore Foundation, Snap Research, and the Amazon Research Award.