Furong Huang

University of Maryland

[advanced] Generative AI Agents

Summary

This advanced course introduces the foundations and frontiers of generative AI agents: autonomous systems built on large language and vision models that can reason, plan, act, use tools, and adapt in complex environments. The course connects theoretical foundations with emerging practice, covering alignment and post-training, reasoning and self-improvement, multi-agent workflows, web and code agents, and agent safety. The emphasis is on understanding the algorithmic building blocks of modern agent systems, how they are evaluated, and where the key research challenges remain.

Syllabus

Foundations of generative AI agents: agent architectures, memory, planning, action, and tool use
Alignment and post-training for agents: RL/RLHF, preference optimization, and test-time alignment
Reasoning and self-improvement: chain-of-thought, search-based reasoning, reflection, and ensemble strategies
Agentic workflows and multi-agent systems: design patterns, communication graphs, and role allocation
Web and code agents: tool-using agents, software engineering agents, and benchmark environments
Robustness, safety, and evaluation: adversarial vulnerabilities, benchmarking, and AI-generated content detection/watermarking
Beyond digital environments: world models and agents for robotics and simulation

References

Sutton, R.S. and Barto, A.G. Reinforcement Learning: An Introduction. Second edition, 2018. https://web.stanford.edu/class/psych209/Readings/SuttonBartoIPRLBook2ndEd.pdf

Wang, L. et al. A Survey on Large Language Model based Autonomous Agents. 2024. https://arxiv.org/abs/2308.11432

Rafailov, R. et al. Direct Preference Optimization: Your Language Model is Secretly a Reward Model. 2023. https://arxiv.org/abs/2305.18290

Yao, S. et al. ReAct: Synergizing Reasoning and Acting in Language Models. 2023. https://arxiv.org/abs/2210.03629

Schick, T. et al. Toolformer: Language Models Can Teach Themselves to Use Tools. 2023. https://arxiv.org/abs/2302.04761

Wu, Q. et al. AutoGen: Enabling Next-Gen LLM Applications via Multi-Agent Conversation. 2023. https://arxiv.org/abs/2308.08155

Zhou, S. et al. WebArena: A Realistic Web Environment for Building Autonomous Agents. 2024. https://arxiv.org/abs/2307.13854

Jimenez, C.E. et al. SWE-bench: Can Language Models Resolve Real-World GitHub Issues? 2024. https://arxiv.org/abs/2310.06770

Pre-requisites

Students should have a solid foundation in machine learning and deep learning, including neural networks and basic reinforcement learning. They should also be comfortable with linear algebra, calculus, probability, optimization, and Python-based implementation (e.g., PyTorch or TensorFlow). Familiarity with large language models is helpful, but not strictly required.

Short bio

Furong Huang is an Associate Professor in the Department of Computer Science at the University of Maryland, College Park, with affiliations in UMIACS, the Maryland Robotics Center, AMSC, and ECE. Her research bridges trustworthy machine learning, sequential decision-making, and generative AI, with a strong emphasis on building reliable, interpretable, and aligned foundation models for robotics and autonomous systems. Her work combines theoretical rigor with practical impact in robust, adaptable, and safe intelligent systems.
