Sitemap
A list of all the posts and pages found on the site. For you robots out there, there is an XML version available for digesting as well.
Pages
Posts
portfolio
OpenManus & OpenManus-RL
Core contributor to OpenManus-RL in an open-source agent ecosystem with 60,000+ GitHub stars.
GUIAgentDebugger
Self-evolving VLM-agent debugging framework with a GUI-agent error taxonomy and dual-layer memory.
publications
Artificial Leviathan: Exploring Social Evolution of LLM Agents Through the Lens of Hobbesian Social Contract Theory
Published in The First Workshop on AI Behavioral Science, ACM SIGKDD 2024, 2024
A simulated LLM agent society for studying emergent social contracts through Hobbesian social contract theory.
Recommended citation: Gordon Dai*, Weijia Zhang*, Jinhan Li, Siqi Yang, Srihas Rao, Arthur Caetano, and Misha Sra. (2024). "Artificial Leviathan: Exploring Social Evolution of LLM Agents Through the Lens of Hobbesian Social Contract Theory." The First Workshop on AI Behavioral Science, ACM SIGKDD 2024.
Download Paper
Where LLM Agents Fail and How They Can Learn From Failures
Published in arXiv preprint arXiv:2509.25370, 2025
A study of LLM agent failures and how agents can learn from failed trajectories.
Recommended citation: Kunlun Zhu, Zijia Liu, Bingxuan Li, Muxin Tian, Yingxuan Yang, Jiaxun Zhang, Pengrui Han, Qipeng Xie, Fuyang Cui, Weijia Zhang, Xiaoteng Ma, Xiaodong Yu, Gowtham Ramesh, Jialian Wu, Zicheng Liu, Pan Lu, James Zou, and Jiaxuan You. (2025). "Where LLM Agents Fail and How They Can Learn From Failures." arXiv preprint arXiv:2509.25370.
Download Paper
SeeingEye: Agentic Information Flow Unlocks Multimodal Reasoning in Text-only LLMs
Published in arXiv preprint arXiv:2510.25092; under review at EMNLP 2026, 2025
Agentic information flow for unlocking multimodal reasoning in text-only LLMs.
Recommended citation: Weijia Zhang*, Zijia Liu*, Haoru Li*, Haoqi Chen*, and Jiaxuan You. (2025). "SeeingEye: Agentic Information Flow Unlocks Multimodal Reasoning in Text-only LLMs." arXiv preprint arXiv:2510.25092. Under review at EMNLP 2026.
Download Paper
Understanding and Mitigating the Bias Inheritance in LLM-based Data Augmentation on Downstream Tasks
Published in ACL 2026, Oral, 2026
A systematic study of how bias can be inherited through LLM-generated synthetic data and how mitigation strategies behave across tasks.
Recommended citation: Miaomiao Li, Hao Chen, Yang Wang, Tingyuan Zhu, Weijia Zhang, Kaijie Zhu, Kam-Fai Wong, and Jindong Wang. (2026). "Understanding and Mitigating the Bias Inheritance in LLM-based Data Augmentation on Downstream Tasks." ACL 2026. Oral.
Download Paper
CUADebug: Diagnosing and Repairing Computer-Use Agent Failures
Under review at EMNLP 2026, 2026
A framework for diagnosing and repairing computer-use agent failures with a CUA-specific error taxonomy, benchmark, and tool-augmented debugger.
Recommended citation: Weijia Zhang et al. (2026). "CUADebug: Diagnosing and Repairing Computer-Use Agent Failures." Under review at EMNLP 2026.
Download Paper
Every Act Has Its Price: Compressed Moral Composition in Frontier LLMs
Under review at EMNLP 2026, 2026
A study of compressed moral composition in frontier LLMs.
Recommended citation: Weijia Zhang, Ruiqi Chen, Yunze Xiao, and Weihao Xuan. (2026). "Every Act Has Its Price: Compressed Moral Composition in Frontier LLMs." Under review at EMNLP 2026.
Download Paper
How Much Vision Does Multimodal Reasoning Need? Vision-Stripping for Multimodal Benchmarks
Under review at NeurIPS 2026, 2026
A vision-stripping study of how much vision multimodal reasoning benchmarks need.
Recommended citation: Weijia Zhang, Zijia Liu, Tianyi Zhang, Ruiqi Chen, Lian Zhang, Haoru Li, Haoqi Chen, and Jiaxuan You. (2026). "How Much Vision Does Multimodal Reasoning Need? Vision-Stripping for Multimodal Benchmarks." Under review at NeurIPS 2026.
Download Paper
talks
teaching
Exec and Workshop Lead
Workshop leadership, UIUC ACM Gamebuilders, 2023
Led weekly workshops on game development topics including computer graphics, Blender, Unity, C#, game AI, and performance optimization.
Course Assistant, CS 233: Computer Architecture
Course assistant, University of Illinois Urbana-Champaign, 2024
Held office hours and one-on-one help sessions on pipelining, caching, memory hierarchy, and related topics. Co-created homework puzzles and helped build and debug the Spimbot MIPS assembly competition.




