Projects

Systems and studies that shaped my current research agenda.

These projects span improvement methods for GUI agents, domain-specific benchmarks, multi-app automation, data construction, and reinforcement learning pipelines.

GUIDE framework overview
Mar 2025 - Present Shanghai Jiao Tong University / BIGAI

GUIDE

A plug-and-play framework for reducing planning and grounding bias in domain-specific GUI agents by retrieving and distilling task-relevant tutorial knowledge.

GUI AgentsRetrieval-Augmented SystemsGroundingPlanning
Read project details
MatToolBench execution architecture
Oct 2025 - Present Shanghai Jiao Tong University / Suzhou Laboratory

MatToolBench

A real-environment benchmark for multimodal agents working with professional materials science software, covering GUI operation, code execution, and cross-tool workflows.

BenchmarkingReal-Environment EvaluationScientific SoftwareAgent Infrastructure
Read project details
Oct 2024 - Apr 2025 BIGAI / Shanghai Jiao Tong University

TongUI

A large-scale trajectory data construction effort that transforms multimodal web tutorials into agent training data across operating systems and application types.

Data ConstructionEvaluationMultimodal Training
Read project details
Mar 2025 - May 2025 BIGAI

Multi-App macOS Agent

A multi-application desktop agent for macOS workflows, including browser use, calendar management, and messaging coordination.

Desktop AgentsCross-App AutomationmacOS
Read project details
Jun 2025 - Aug 2025 BIGAI

GUI Agent Reinforcement Learning

Early-stage reinforcement learning work for GUI agents, including task generation and training-method adaptation.

Reinforcement LearningTask GenerationAgent Training
Read project details
Apr 2024 - Nov 2024 Shanghai Jiao Tong University · X-LANCE Lab

Mobile-Env

A benchmark effort for evaluating LLM-based GUI interaction in mobile environments with isolated tasks, replay infrastructure, and behavior analysis.

Mobile AgentsBenchmarkingEvaluation
Read project details