Projects

Systems and studies that shaped my current research agenda.

These projects span improvement methods for GUI agents, domain-specific benchmarks, multi-app automation, data construction, and reinforcement learning pipelines.

May 2026 - Present Shanghai Jiao Tong University · X-LANCE Lab

ASIL

An agent-native interface for software-operating agents that replaces screenshot-and-click control with structured state and semantic actions.

GUI AgentsSoftware InteractionAgentic RLBenchmarking

Read project details

Mar 2025 - Present Shanghai Jiao Tong University / BIGAI

GUIDE

A plug-and-play framework for reducing planning and grounding bias in domain-specific GUI agents by retrieving and distilling task-relevant tutorial knowledge.

GUI AgentsRetrieval-Augmented SystemsGroundingPlanning

Read project details

Oct 2025 - Present Shanghai Jiao Tong University / Suzhou Laboratory

MatToolBench

A real-environment benchmark for multimodal agents working with professional materials science software, covering GUI operation, code execution, and cross-tool workflows.

BenchmarkingReal-Environment EvaluationScientific SoftwareAgent Infrastructure

Read project details

Oct 2024 - Apr 2025 BIGAI / Shanghai Jiao Tong University

TongUI

A large-scale trajectory data construction effort that transforms multimodal web tutorials into agent training data across operating systems and application types.

Data ConstructionEvaluationMultimodal Training

Read project details

Mar 2025 - May 2025 BIGAI

Multi-App macOS Agent

A multi-application desktop agent for macOS workflows, including browser use, calendar management, and messaging coordination.

Desktop AgentsCross-App AutomationmacOS

Read project details

Jun 2025 - Aug 2025 BIGAI

GUI Agent Reinforcement Learning

Early-stage reinforcement learning work for GUI agents, including task generation and training-method adaptation.

Reinforcement LearningTask GenerationAgent Training

Read project details

Apr 2024 - Nov 2024 Shanghai Jiao Tong University · X-LANCE Lab

Mobile-Env

A benchmark effort for evaluating LLM-based GUI interaction in mobile environments with isolated tasks, replay infrastructure, and behavior analysis.

Mobile AgentsBenchmarkingEvaluation

Read project details