About

A research profile grounded in real software interaction.

I am interested in the moment when agent reasoning has to meet practical interfaces: messy software, professional tools, shifting workflows, and the need for reliable evaluation.

Research direction

GUI agents, LLM agents, RL, and broader general-purpose agent systems.

My research sits at the intersection of multimodal reasoning, real-environment evaluation, and agent interaction systems. I am especially interested in how agent capabilities can scale from software interfaces toward more general task competence in the real world.

Across recent projects, I have worked on plug-and-play improvement methods for GUI agents, real-environment benchmarks, large-scale trajectory data construction, multi-app desktop automation, and reinforcement learning pipelines for agent behavior.

My current work focuses on making agent systems more dependable in real environments by combining method design, benchmark construction, and systems-oriented implementation.

Interests

Topics I am currently excited about.

GUI agentsLLM agentsRLAGI
Timeline

Education and research experience.

2025 - Present

PhD Student

Shanghai Jiao Tong University · X-LANCE Lab

Researching multimodal agents with an emphasis on GUI interaction, domain adaptation, and real-environment evaluation.

2025

Research Intern

Beijing Institute for General Artificial Intelligence

Worked on multi-app desktop agents, early-stage reinforcement learning pipelines, and practical software interaction systems.

2024 - 2025

Research Collaborator

Shanghai Jiao Tong University / BIGAI

Contributed to large-scale trajectory data construction and mobile GUI evaluation benchmarks.

2021 - 2025

B.Eng. in Computer Science

Shanghai Jiao Tong University

Built a strong foundation in systems, algorithms, architecture, and machine learning while moving into agent research.

Contact

Best ways to get in touch.