Zhaojian Yu
Hi, I’m Zhaojian Yu, currently a second-year M.S. student at SIGS, Tsinghua University, advised by Prof. Xiao-Ping Zhang. Before that, I received my bachelor’s degree in Computer Science from Jinan University in Jun. 2023. My currect research interest lies in LLMs, and its application in various areas. My email is yzj23@mails.tsinghua.edu.cn.
Education
Aug. 2023 - Jun. 2026 (Expected) M.Sc., iDI, SIGS, Tsinghua University, Beijing, China.
Sep. 2019 - Jun. 2023 B.Sc., School of Cybersecurity, Jinan University, Guangzhou, China.
Publications
(* indicates equal contribution)
- [ACL 2024] WaveCoder: Widespread And Versatile Enhanced Instruction Tuning with Refined Data Generation [model]
Zhaojian Yu, Xin Zhang, Ning Shang, Yangyu Huang, Can Xu, Yishujie Zhao, Wenxiang Hu, Qiufeng Yin
WaveCoder is the first SOTA open-source Code LLM with the closest capabilities to GPT-4 on mutiple tasks.
Preprint
[Arxiv 2024] HumanEval Pro and MBPP Pro: Evaluating Large Language Models on Self-invoking Code Generation
Zhaojian Yu, Yilun Zhao, Arman Cohan, Xiao-Ping Zhang
We present HumanEval Pro and MBPP Pro, two expanded versions of the traditional HumanEval and MBPP benchmarks to evaluate LLMs on self-invoking code generation task. Self-invoking code generation, a new task designed to evaluate the progressive reasoning and problem-solving capabilities of LLMs. In this task, models are presented with a base problem and a related, more complex problem. They must solve the base problem and then utilize its solution to address the more complex one.[Arxiv 2024] OpenCarbonEval: A Unified Carbon Emission Estimation Framework in Large-Scale AI Models
Zhaojian Yu*, Yinghao Wu*, Zhuotao Deng, Yansong Tang, Xiao-Ping Zhang
OpenCarbonEval is a unified framework for integrating large-scale models across diverse modalities to predict carbon emissions, which could provide AI service providers and users with a means to estimate emissions beforehand and help mitigate the environmental pressure associated with these models.
Experience
- (May. 2023 - May. 2024) Reserch Intern, Microsoft Research Asia, Beijing, China.
Working on code large language models, focusing on its alignment.
Community Service
Reviewer:
- ICLR 2025
- IEEE Transactions on Computers