Weijia Zhang

Weijia Zhang

2172003915zhangwj.charlie@gmail.comlinkedin.com/in/weijia-charlie-zhangcharliedreemur.github.ioGoogle Scholar

Education

Yale UniversityAug 2026 - May 2028
M.S. in Computer Science, Thesis Track with Full ScholarshipIncoming Student
University of Illinois Urbana-Champaign (UIUC)Aug 2022 - May 2026
B.S. in Computer Science and MathematicsGPA: 3.7/4.0; Dean’s List 2025

Work Experience

Microsoft Research Asia (MSRA)Research InternJuly 2025 - Sep 2025
  • Conducted VLM/LLM agent research for Excel Copilot, focusing on spreadsheet text-analysis workflows and Office.js code generation for the TextAnalysis API.
  • Built a TextAnalysisSFT data-generation pipeline: mined 2,000+ Kaggle spreadsheet samples, filtered heavy-text sheets, generated query/Office.js pairs, and validated outputs with SheetEngine; internal validation reported a 75% relative improvement in Office Script code-generation accuracy.
Reborn NetworkAI EngineerMay 2023 - July 2023
  • Developed a Unity VR role-playing agent game with real-time text, voice, and VR-action interaction under a sub-second latency target.
  • Added RAG/vector-store memory for persistent character context, improving human-rated dialogue coherence from 2/5 to 4/5.
  • Designed a reusable character-card schema for UGC personas, cutting persona setup effort by roughly 3x.
Tencent, WeChat GroupSoftware EngineerAug 2024 - Sep 2024
  • Built a cross-platform Android/iOS performance analysis tool for WeChat Mini Programs, helping 200+ partner teams diagnose hardware bottlenecks.
  • Implemented a Unity Mono memory profiler and allocation tracing workflow; identified 40+ hidden allocation sites and supported memory-leak crash investigation.

Selected Publications

  • Weijia Zhang, Zijia Liu, Tianyi Zhang, Ruiqi Chen, Lian Zhang, Haoru Li, Haoqi Chen, Jiaxuan You. “How Much Vision Does Multimodal Reasoning Need? Vision-Stripping for Multimodal Benchmarks.” Under review at NeurIPS 2026.
  • Weijia Zhang, Ruiqi Chen, Yunze Xiao, Weihao Xuan. “Every Act Has Its Price: Compressed Moral Composition in Frontier LLMs.” Under review at EMNLP 2026.
  • Weijia Zhang, Zijia Liu, Haoru Li, Haoqi Chen, Jiaxuan You. “SeeingEye: Agentic Information Flow Unlocks Multimodal Reasoning in Text-only LLMs.” arXiv preprint, under review at EMNLP 2026.
  • Gordon Dai, Weijia Zhang, Jinhan Li, Siqi Yang, Srihas Rao, Arthur Caetano, Misha Sra. “Artificial Leviathan: Exploring Social Evolution of LLM Agents Through the Lens of Hobbesian Social Contract Theory.” AI Behavioral Science Workshop, ACM SIGKDD 2024.
  • Miaomiao Li, Hao Chen, Yang Wang, Tingyuan Zhu, Weijia Zhang, Kaijie Zhu, Kam-Fai Wong, Jindong Wang. “Understanding and Mitigating the Bias Inheritance in LLM-based Data Augmentation on Downstream Tasks.” ACL 2026 Oral.
  • Kunlun Zhu, Zijia Liu, Bingxuan Li, Muxin Tian, Yingxuan Yang, Jiaxun Zhang, Pengrui Han, Qipeng Xie, Fuyang Cui, Weijia Zhang, Xiaoteng Ma, Xiaodong Yu, Gowtham Ramesh, Jialian Wu, Zicheng Liu, Pan Lu, James Zou, Jiaxuan You. “Where LLM Agents Fail and How They Can Learn From Failures.” arXiv preprint.

Research Projects

CUADebug / GUIAgentDebuggerFirst Author / LeadJan 2026 - Present
  • Designed a CUA root-cause taxonomy with 4 top-level modules and 29 subtypes, covering perception, grounding and interaction, task reasoning/control, and external/system failures.
  • Built CUAErrorBench from 184 human-annotated OSWorld failure trajectories across Claude, Gemini, and Qwen agents, with labels for root step, subtype, evidence, correction, and confidence.
  • Developed a tool-augmented RCA debugger that inspects paired before/after screenshots, action traces, and execution status, then submits structured root-cause evidence and repair recipes.
  • Added episodic-memory retrieval for reusable debugging lessons; in continual re-rollout experiments, improved success from 12.20% direct continuation to 25.86%, approaching 29.21% with human RCA.
OpenManus & OpenManus-RLResearcher / Core ContributorJan 2025 - July 2025
  • Collaborated with the official OpenManus team on OpenManus-RL, the follow-up to the 60k+ star OpenManus open-source agent ecosystem.
  • Implemented ReAct-style reasoning and multi-agent collaboration workflows, including an MCP-based tool-library interface for automated tool invocation and output handling.
  • Built SFT data and step-level reward signals for AlfWorld and WebShop agent benchmarks to support planning and memory experiments.

Activities & Awards

C.W. Gear Outstanding Undergraduate Student (2 people per year)May 2026
UIUC Dean’s List2025
RecSys 2025 ReviewerAug 2025
UIUC MechMania 28 AI Hackathon, 2nd PlaceSep 2022

Skills

Programming: Python, C/C++, C#, Java, JavaScript/TypeScript, HTML/CSS, SQL, Rust
Frameworks: VERL, vLLM, LangGraph, LangChain, PyTorch, TensorFlow
AI Focus: LLM Agents, Multimodal Reasoning, SFT, RL/Post-training, RAG, Evaluation