Xiang Wang
[advanced] Large Language Models for User Behavior Modeling: Cross-Modal Interpretation, Preference Optimization, and Agentic Simulation
Summary
This lecture examines the pivotal role of large language models (LLMs) in advancing user behavior modeling, with a specific focus on personalized recommendation systems. It highlights how LLMs enhance the understanding of user intent, optimize behavioral preferences, and simulate intricate user interactions, organized around three foundational pillars:
- Cross-Modal Interpretation: This part explores how LLMs process diverse behavioral modalities (e.g., clicks, purchases, and views) alongside textual data, enabling a unified understanding of user behaviors across multiple input formats (see the first sketch after this list).
- Preference Alignment: It introduces state-of-the-art approaches such as reinforcement learning from human feedback (RLHF) and direct preference optimization (DPO), framing user behaviors as preference data to refine and personalize recommendations effectively (see the second sketch after this list).
- Agentic Simulation: The lecture further discusses decomposing complex user modeling tasks into manageable subtasks, each addressed by specialized LLMs operating collaboratively within a network of agentic experts (see the third sketch after this list).
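To make the cross-modal pillar concrete, here is a toy sketch that serializes a heterogeneous click/purchase/view history into a single textual prompt. All names are illustrative assumptions, not APIs from the referenced papers; works such as LLaRA additionally fuse learned item embeddings with text rather than relying on titles alone.

```python
from dataclasses import dataclass

@dataclass
class Interaction:
    item_title: str  # textual side of the behavior
    action: str      # behavioral modality: "clicked", "purchased", "viewed"

def history_to_prompt(history: list[Interaction], k: int = 10) -> str:
    """Serialize the last k interactions into a next-item prompt.

    Casts heterogeneous behaviors into one textual format an LLM
    can consume, so clicks, purchases, and views share a vocabulary.
    """
    lines = [f'- {x.action} "{x.item_title}"' for x in history[-k:]]
    return ("The user recently:\n" + "\n".join(lines) +
            "\nWhich item is the user most likely to interact with next?")
```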
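For the preference-alignment pillar, below is a minimal PyTorch sketch of the standard DPO objective, treating interacted items as "chosen" and sampled negatives as "rejected"; the tensor names are assumptions for illustration. The referenced S-DPO paper generalizes the single rejected item to a softmax over multiple negatives.

```python
import torch
import torch.nn.functional as F

def dpo_loss(policy_chosen_logps: torch.Tensor,
             policy_rejected_logps: torch.Tensor,
             ref_chosen_logps: torch.Tensor,
             ref_rejected_logps: torch.Tensor,
             beta: float = 0.1) -> torch.Tensor:
    """Standard DPO loss over (chosen, rejected) preference pairs.

    Each input holds summed token log-probabilities, shape (batch,),
    under the trainable policy or the frozen reference model.
    """
    # Log-ratio of policy to reference for each side of the pair.
    chosen_logratio = policy_chosen_logps - ref_chosen_logps
    rejected_logratio = policy_rejected_logps - ref_rejected_logps
    # -log sigmoid(beta * margin): push the chosen item above the rejected one.
    return -F.logsigmoid(beta * (chosen_logratio - rejected_logratio)).mean()
```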
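Finally, the agentic pillar can be sketched as role-specialized prompts routed by a small orchestrator. The `llm` stub and role prompts below are hypothetical stand-ins, not the lecture's framework.

```python
def llm(prompt: str) -> str:
    # Stand-in for a real chat-completion call; returns a canned
    # string so the sketch runs end-to-end.
    return f"[LLM response to: {prompt[:40]}...]"

# Each subtask gets its own specialized prompt (hypothetical roles).
ROLES = {
    "profiler": "Summarize this user's tastes from their history:\n{history}",
    "memory":   "Given the profile '{profile}', recall relevant past sessions.",
    "actor":    "Acting as this user (profile: '{profile}'), react to: {item}",
}

def simulate_user(history: str, item: str) -> str:
    """Decompose user simulation into subtasks handled by expert agents."""
    profile = llm(ROLES["profiler"].format(history=history))
    context = llm(ROLES["memory"].format(profile=profile))
    return llm(ROLES["actor"].format(profile=profile, item=item)
               + f"\nContext: {context}")
```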
The lecture provides both theoretical insights and practical applications, bridging the gap between LLM research and real-world recommender systems.
Syllabus
- Background: role of LLMs in recommendation systems.
- Introduction: key pillars (cross-modal interpretation, preference optimization, agentic simulation).
- Cross-Modal Interpretation: concepts, paradigms, and core ideas of fine-tuning.
- Preference Alignment: concepts, methodologies, and recent progress in preference optimization.
- Agentic Simulation: concepts, frameworks, and the latest advances in LLM agents.
- Key takeaways and future trends.
References
LLaRA: Large Language-Recommendation Assistant. In SIGIR 2024.
On Generative Agents in Recommendation. In SIGIR 2024.
On Softmax Direct Preference Optimization for Recommendation. In NeurIPS 2024.
Customizing Language Models with Instance-wise LoRA for Sequential Recommendation. In NeurIPS 2024.
β-DPO: Direct Preference Optimization with Dynamic β. In NeurIPS 2024.
Language Representations Can Be What Recommenders Need: Findings and Potentials.
Pre-requisites
Machine learning basics.
Short bio
Xiang Wang is a professor at the University of Science and Technology of China (USTC). He obtained his Ph.D. from the National University of Singapore in 2019. His research focuses on multi-modal large language models, user modeling, and data mining. He has authored over 100 publications in top-tier artificial intelligence conferences, including more than 10 papers recognized as highly influential and 3 papers selected as Best Paper Finalists. He has served as an Area Chair for leading conferences such as NeurIPS and ICML. Dr. Wang’s achievements include the SIGIR Early Career Award and recognition as one of MIT Technology Review’s Innovators Under 35 (TR35) in China.