Zhaojian Yu

Hi, I’m Zhaojian Yu, currently a second-year M.S. student at SIGS, Tsinghua University, advised by Prof. Xiao-Ping Zhang. Before that, I received my bachelor’s degree in Computer Science from Jinan University in Jun. 2023. My currect research interest lies in LLMs, and its application in various areas. My email is yzj23@mails.tsinghua.edu.cn.

Education

Aug. 2023 - Jun. 2026 (Expected) M.Sc., iDI, SIGS, Tsinghua University, Beijing, China.
Sep. 2019 - Jun. 2023 B.Sc., School of Cybersecurity, Jinan University, Guangzhou, China.

Publications

(* indicates equal contribution)

[ACL 2024] WaveCoder: Widespread And Versatile Enhanced Instruction Tuning with Refined Data Generation [model]
Zhaojian Yu, Xin Zhang, Ning Shang, Yangyu Huang, Can Xu, Yishujie Zhao, Wenxiang Hu, Qiufeng Yin
WaveCoder is the first SOTA open-source Code LLM with the closest capabilities to GPT-4 on mutiple tasks.

Preprint

[Arxiv 2024] HumanEval Pro and MBPP Pro: Evaluating Large Language Models on Self-invoking Code Generation
Zhaojian Yu, Yilun Zhao, Arman Cohan, Xiao-Ping Zhang
We present HumanEval Pro and MBPP Pro, two expanded versions of the traditional HumanEval and MBPP benchmarks to evaluate LLMs on self-invoking code generation task.
[Arxiv 2024] HumanEval Pro and MBPP Pro: Evaluating Large Language Models on Self-invoking Code Generation
Zhaojian Yu, Yilun Zhao, Arman Cohan, Xiao-Ping Zhang
We present HumanEval Pro and MBPP Pro, two expanded versions of the traditional HumanEval and MBPP benchmarks to evaluate LLMs on self-invoking code generation task.
[Arxiv 2024] OpenCarbonEval: A Unified Carbon Emission Estimation Framework in Large-Scale AI Models
Zhaojian Yu*, Yinghao Wu*, Zhuotao Deng, Yansong Tang, Xiao-Ping Zhang
OpenCarbonEval is a unified framework for integrating large-scale models across diverse modalities to predict carbon emissions, which could provide AI service providers and users with a means to estimate emissions beforehand and help mitigate the environmental pressure associated with these models.

Experience

(May. 2023 - May. 2024) Reserch Intern, Microsoft Research Asia, Beijing, China.
Working on code large language models, focusing on its alignment.

Community Service

Reviewer:

ICLR, IEEE Transactions on Computers, etc.