👋 I am a first-year research master’s student at Tsinghua University. My research interests span avatars, AI agents, AIGC, and embodied AI, especially in areas that have a strong connection to humans.
And would like to realize AGI for the benefit of humanity through generative AI.
My google scholar is here
🎓 I graduated first in my college (rank 1/233) with a B.S. in Software Engineering from Harbin Engineering University. I am now a first-year master’s student at Tsinghua University, supervised by Prof Ruqi Huang, and expect to graduate in Fall 2027.
🧑💻 I closely collaborated with Prof. Hao Zhao at AIR, Tsinghua University, Prof. Xiaoxiao Long at Nanjing University, Dr. Jianjin Xu at Carnegie Mellon University, Dr. Junting Dong at Shanghai AI Laboratory, and Zijiao Zeng at Tencent Games.
我将在 2025 年 6 月 10–15 日 前往美国田纳西州参加 CVPR 2025,并现场介绍我们的论文 DRiVE。
🔥 我正在积极寻找 2027 年秋季入学的博士(PhD)机会,研究方向包括 Avatar、AIGC、具身智能!
如有合适的机会,欢迎联系我:junhao-c24@mails.tsinghua.edu.cn
I will be in Tennessee, USA, June 10–15, 2025 to attend CVPR 2025 and present our paper DRiVE.
🔥 I am actively seeking PhD position starting Fall 2027 in Avatar, AIGC, and Embodied AI !
Feel free to reach out: junhao-c24@mails.tsinghua.edu.cn
🙋♂️ If you are interested in working with me, feel free to drop me an email. yisuanwang AT gmail DOT com.
😥 Click here to enter emo time !
⬅️ Never place your mouse over the left avatar!
🔥 News
- 2025.05: 🎉 We released DanceTog, this work generates identity-preserving multi-person interactive dance videos with controllable motion and appearance!
- 2025.05: 🎉 IW-bench has been accepted by ACL 2025! 🇦🇹See you from July 27th to August 1st, 2025 in Vienna, Austria!
- 2025.02: 🎉 DRiVE has been accepted by CVPR 2025! 🇺🇸See you from June 11th to June 15th, 2025 at the Music City Center, Nashville, TN.
- 2024.11: 🎉 Idea23D has been accepted by COLING 2025! 🇦🇪 See you in Abu Dhabi, UAE, from January 19 to 24, 2025!
- 2024.11: 🎉 We released DRiVE, generate skeleton and skinning with clothes and hair for 3d gaussian avatar!
- 2024.09: 🎉 We released IW-bench, evaluating Large Multimodal Models for Converting Image-to-Web!
📝 Publications
🧑🎨 AIGC & Controllable World Model
DanceTogether! Identity-Preserving Multi-Person Interactive Video Generation
Junhao Chen, Mingjin Chen, Jianjin Xu, Xiang Li, Junting Dong †, Mingze Sun, Puhua Jiang, Hongxiang Li, Yuhang Yang, Hao Zhao, Xiaoxiao Long, Ruqi Huang †
- This work generates identity-preserving multi-person interactive dance videos with controllable motion and appearance!
DRiVE: Diffusion-based Rigging Empowers Generation of Versatile and Expressive Characters
Mingze Sun *, Junhao Chen *, Junting Dong †, Yurun Chen, Xinyu Jiang, Shiwei Mao, Puhua Jiang, Jingbo Wang, Bo Dai, Ruqi Huang †
- This work generates skeleton and skinning with clothes and hair for 3d gaussian avatar!
Idea23D: Collaborative LMM Agents Enable 3D Model Generation from Interleaved Multimodal Inputs
Junhao Chen *, Xiang Li *, Xiaojun Ye, Chao Li, Zhaoxin Fan †, Hao Zhao †
- This work enables automated 3D model design and generation for people!
Ultraman: Single Image 3D Human Reconstruction with Ultra Speed and Detail
Mingjin Chen *, Junhao Chen *, Huan-ang Gao, Xiaoxue Chen, Zhaoxin Fan, Hao Zhao †
- This work converts a single image of the human body into a lifelike 3D model!
👀 Multi-modal
IW-Bench: Evaluating Large Multimodal Models for Converting Image-to-Web
Hongcheng Guo, Wei Zhang, Junhao Chen, Yaonan Gu, Jian Yang, Junjia Du, Shaosheng Cao, Binyuan Hui, Tianyu Liu, Jianxin Ma, Chang Zhou, Zhoujun Li
- This work is a benchmark for evaluating MLLM image-2-html code generation capabilities.

MMAD: Multi-modal Movie Audio Description
Xiaojun Ye, Junhao Chen, Xiang Li, Haidong Xin, Chao Li, Sheng Zhou †, Jiajun Bu
- This work has unlocked a whole new experience of watching movies for the visually impaired.

FineStyler: Text-guided Instance-level Fine-grained Image Style Transfer
Junhao Chen, Rong Peng, Xiang Li, Jingbo Sun, Hao Zhao, Ruqi Huang
- This work enables fine-grained stylization of a single image through text-guidance!
🎙 NLP & LLM

Towards Energy-Efficient Sentiment Classification with Spiking Neural Networks
Junhao Chen, Xiaojun Ye, Jingbo Sun, Chao Li †
- This work applies a pulsed neural network to a natural language sentiment categorization task, reaching the leading edge in terms of energy consumption.

ZhuJiu: A Multi-dimensional, Multi-faceted Chinese Benchmark for Large Language Models
Baoli Zhang *, Haining Xie *, Pengfan Du, Junhao Chen, Pengfei Cao, Yubo Chen †, Shengping Liu, Kang Liu, Jun Zhao
[🏆Leaderboard ]
[📜Paper]
[🎥Video]
- This work serves as a benchmark for evaluating the Chinese language capabilities of large language models.
🎖 Honors and Awards
Innovation and Entrepreneurship Competition Award Cumulative Awards National *10, Provincial *45, School-level *11, totaling 66.
Honors awards cumulative awards national *6, provincial *2, school-level *20, a total of 28.
Competition awards and individual honors total 94 (as of 11, 18, 2024).
List of all awards received.
Selected Awards and Honors
📖 Educations
- 2024.08 - 2027.06, M.Eng. in Data Science @ Shenzhen International Graduate School (SIGS), Tsinghua University, Shenzhen.
- 2021.06 - 2024.06, B.Eng. in Software Engineering (rank 1 / 223) @ College of Software, Harbin Engineering University, Harbin.
- 2020.09 - 2021.06, Undergraduate, College of Electromechanical Engineering, Harbin Engineering University, Harbin.
💻 Experiences
- 2024.06 - 2024.08, Lightillusions, Shenzhen.
- 2023.08 - 2024.01, DISCOVERLab@Institute for AI Industry Research (AIR), Tsinghua University, Wuxi.
- 2023.04 - 2023.08, Research Group of Speech and Language Technology, National Laboratory of Pattern Recognition@Institute of Automation, Chinese Academy of Sciences, Remote.