Ying-Cong Chen

Assistant Professor

Hong Kong University of Science and Technology (Guangzhou)

About Me

I am an Assistant Professor at AI Thrust, Information Hub of Hong Kong University of Science and Technology (Guangzhou). I am also an Affiliated Assistant Professor of the department of Computer Science & Engineering (Clear Water Bay Campus). I am a faculty member of the Deep Vision Lab. Previously, I was a Postdoctoral Associate at Computer Science & Artificial Intelligence Lab of Massachusetts Institute of Technology, where I had the privilege of working with Prof. Dina Katabi. I earned my Ph.D. degree from the Chinese University of Hong Kong, under the mentorship of Prof. Jiaya Jia. I am honored as Distinguished Young Scholars (Overseas). For more information about my research group, please visit EnVision-Research.

Interests

Computer Vision
Generative Models
AI+X

📚 My Research

My research focuses on visual generative models, exploring their fundamental principles with the aim of improving their quality, efficiency, diversity, and controllability. Beyond foundational research, I am dedicated to applying these models to solve real-world challenges in sectors such as autonomous driving, smart manufacturing, and content creation. My overarching goal is to advance the field of generative models by tackling sophisticated real-world challenges, thereby pushing the boundaries of academic research in different disciplines. Please find the collection of our open-source code at .

Our CVPR paper "Kiss3DGen: Repurposing Image Diffusion Models for 3D Asset Generation" is getting noticed

Mar 13, 2025

Jiantao Lin, Xin Yang, Meixi Chen, Yingjie Xu, Dongyu Yan, Leyi Wu, Xinli Xu, Lie Xu, Shunsi Zhang, Ying-Cong Chen

Mar 13, 2025

Our CVPR paper "TransPixeler: Advancing Text-to-Video Generation with Transparency" is getting noticed

Jan 9, 2025

Luozhou Wang, Yijun Li, Zhifei Chen, Jui-Hsien Wang, Zhifei Zhang, He Zhang, Zhe Lin, Ying-Cong Chen

Jan 9, 2025

Our arxiv paper "Lotus Diffusion-based Visual Foundation Model for High-quality Dense Prediction" is getting noticed

Sep 27, 2024

Jing He, Haodong Li, Wei Yin, Yixun Liang, Leheng Li, Kaiqiang Zhou, Hongbo Zhang, Bingbing Liu, Ying-Cong Chen

Sep 27, 2024

See all

Recent Publications

Jing He, Haodong Li, Yongzhe Hu, Guibao Shen, Yingjie Cai, Weichao Qiu, Ying-Cong Chen. DisEnvisioner: Disentangled and Enriched Visual Prompt for Customized Image Generation. International Conference on Learning Representations, 2025.

Cite

Jiantao Lin, Xin Yang, Meixi Chen, Yingjie Xu, Dongyu Yan, Leyi Wu, Xinli Xu, Lie Xu, Shunsi Zhang, Ying-Cong Chen. Kiss3DGen: Repurposing Image Diffusion Models for 3D Asset Generation. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025.

Cite

Jing He, Haodong Li, Wei Yin, Yixun Liang, Leheng Li, Kaiqiang Zhou, Hongbo Zhang, Bingbing Liu, Ying-Cong Chen. Lotus: Diffusion-based visual foundation model for high-quality dense prediction. International Conference on Learning Representations, 2025.

Cite

Haoyu Chen, Xiaojie Xu, Wenbo Li, Jingjing Ren, Tian Ye, Songhua Liu, Ying-Cong Chen, Lei Zhu, Xinchao Wang. POSTA: A Go-to Framework for Customized Artistic Poster Generation. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025.

Cite

Zhen Yang, Guibao Shen, Liang Hou, Mushui Liu, Luozhou Wang, Xin Tao, Pengfei Wan, Di Zhang, Ying-Cong Chen. RectifiedHR: Enable Efficient High-Resolution Image Generation via Energy Rectification. arXiv preprint arXiv:2503.02537, 2025.

Cite

See all publications

Playground

Transpixar

Advancing Text-to-Video Generation with Transparency

Jan 9, 2025

Depth Prediction

Lotus: Diffusion-based Visual Foundation Model for High-quality Dense Prediction

Dec 4, 2024

See all