Sitemap

A list of all the posts and pages found on the site. For you robots out there, there is an XML version available for digesting as well.

Pages

Posts

portfolio

GUIAgentDebugger Permalink

Published:

Self-evolving VLM-agent debugging framework with a GUI-agent error taxonomy and dual-layer memory.

publications

Artificial Leviathan: Exploring Social Evolution of LLM Agents Through the Lens of Hobbesian Social Contract Theory

Published in The First Workshop on AI Behavioral Science, ACM SIGKDD 2024, 2024

A simulated LLM agent society for studying emergent social contracts through Hobbesian social contract theory.

Recommended citation: Gordon Dai*, Weijia Zhang*, Jinhan Li, Siqi Yang, Srihas Rao, Arthur Caetano, and Misha Sra. (2024). "Artificial Leviathan: Exploring Social Evolution of LLM Agents Through the Lens of Hobbesian Social Contract Theory." The First Workshop on AI Behavioral Science, ACM SIGKDD 2024.
Download Paper

Where LLM Agents Fail and How They Can Learn From Failures

Published in arXiv preprint arXiv:2509.25370, 2025

A study of LLM agent failures and how agents can learn from failed trajectories.

Recommended citation: Kunlun Zhu, Zijia Liu, Bingxuan Li, Muxin Tian, Yingxuan Yang, Jiaxun Zhang, Pengrui Han, Qipeng Xie, Fuyang Cui, Weijia Zhang, Xiaoteng Ma, Xiaodong Yu, Gowtham Ramesh, Jialian Wu, Zicheng Liu, Pan Lu, James Zou, and Jiaxuan You. (2025). "Where LLM Agents Fail and How They Can Learn From Failures." arXiv preprint arXiv:2509.25370.
Download Paper

SeeingEye: Agentic Information Flow Unlocks Multimodal Reasoning in Text-only LLMs

Published in arXiv preprint arXiv:2510.25092; under review at EMNLP 2026, 2025

Agentic information flow for unlocking multimodal reasoning in text-only LLMs.

Recommended citation: Weijia Zhang*, Zijia Liu*, Haoru Li*, Haoqi Chen*, and Jiaxuan You. (2025). "SeeingEye: Agentic Information Flow Unlocks Multimodal Reasoning in Text-only LLMs." arXiv preprint arXiv:2510.25092. Under review at EMNLP 2026.
Download Paper

Understanding and Mitigating the Bias Inheritance in LLM-based Data Augmentation on Downstream Tasks

Published in ACL 2026, Oral, 2026

A systematic study of how bias can be inherited through LLM-generated synthetic data and how mitigation strategies behave across tasks.

Recommended citation: Miaomiao Li, Hao Chen, Yang Wang, Tingyuan Zhu, Weijia Zhang, Kaijie Zhu, Kam-Fai Wong, and Jindong Wang. (2026). "Understanding and Mitigating the Bias Inheritance in LLM-based Data Augmentation on Downstream Tasks." ACL 2026. Oral.
Download Paper

CUADebug: Diagnosing and Repairing Computer-Use Agent Failures

Published in Under review at EMNLP 2026, 2026

A framework for diagnosing and repairing computer-use agent failures with a CUA-specific error taxonomy, benchmark, and tool-augmented debugger.

Recommended citation: Weijia Zhang et al. (2026). "CUADebug: Diagnosing and Repairing Computer-Use Agent Failures." Under review at EMNLP 2026.
Download Paper

talks

teaching

Exec and Workshop Lead

Workshop leadership, UIUC ACM Gamebuilders, 2023

Led weekly workshops on game development topics including computer graphics, Blender, Unity, C#, game AI, and performance optimization.

Course Assistant, CS 233: Computer Architecture

Course assistant, University of Illinois Urbana-Champaign, 2024

Held office hours and one-on-one help sessions on pipelining, caching, memory hierarchy, and related topics. Co-created homework puzzles and helped build and debug the Spimbot MIPS assembly competition.