About Me
I am a 3rd-year undergraduate student at Tsinghua University, majoring in Electronic Engineering. My research interests focus on 3D Vision, Video Generation, and Multimodal Large Language Models (MLLMs).
Currently, I am an Undergraduate Research Assistant at the Lab of Prof. Yueqi Duan at Tsinghua University. I am passionate about building intelligent systems that can perceive and understand the physical world in 3D.
Education
- Tsinghua University, Beijing, China (Sept 2023 – Present)
- B.Eng. in Electronic Engineering
- GPA: 3.96/4.0 (Rank: 6/265, top 3%)
- Selected Courses: Advanced Calculus, Linear Algebra, Probability and Stochastic Process, Signals and Systems, Programming Basics (C/C++), Data Structures and Algorithms, Media and Recognition.
Research Interests
- 3D Vision & Spatial Intelligence: Understanding 3D structures and spatial relationships from 2D/3D inputs.
- Video Generation: Developing controllable and high-fidelity video generation frameworks.
- Multimodal Models: Finetuning and scaling MLLMs for enhanced perception and reasoning.
News
- [Mar. 2025] Our paper Video-T1: Test-Time Scaling for Video Generation has been accepted to ICCV 2025!
- [Feb. 2026] Released preprint: Spatial-TTT: Streaming Visual-based Spatial Intelligence with Test-Time Training.