About Me

I am an Assistant Professor at AI Thrust, Information Hub of Hong Kong University of Science and Technology (Guangzhou). I am also an Affiliated Assistant Professor of the department of Computer Science & Engineering (Clear Water Bay Campus). I am a faculty member of the Deep Vision Lab. Previously, I was a Postdoctoral Associate at Computer Science & Artificial Intelligence Lab of Massachusetts Institute of Technology, where I had the privilege of working with Prof. Dina Katabi. I earned my Ph.D. degree from the Chinese University of Hong Kong, under the mentorship of Prof. Jiaya Jia. I am honored as Distinguished Young Scholars (Overseas). For more information about my research group, please visit EnVision-Research.

Interests
  • Computer Vision
  • Generative Models
  • AI+X
📚 My Research
My research focuses on visual generative models, exploring their fundamental principles with the aim of improving their quality, efficiency, diversity, and controllability. Beyond foundational research, I am dedicated to applying these models to solve real-world challenges in sectors such as autonomous driving, smart manufacturing, and content creation. My overarching goal is to advance the field of generative models by tackling sophisticated real-world challenges, thereby pushing the boundaries of academic research in different disciplines. Please find the collection of our open-source code at .
Recent News
Recent Publications
TiViBench: Benchmarking Think-in-Video Reasoning for Video Generative Models. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2026.
Dual-balancing for multi-task learning. Neural Networks, 2026.
T-Rex-Omni: Integrating Negative Visual Prompt in Generic Object Detection. AAAI Conference on Artificial Intelligence (AAAI), 2026.
UniCalli: A Unified Diffusion Framework for Column-Level Generation and Recognition of Chinese Calligraphy. International Conference on Learning Representations (ICLR), 2026.
STANCE: Motion Coherent Video Generation Via Sparse-to-Dense Anchored Encoding. arXiv preprint arXiv:2510.14588, 2025.