About me
Yuxuan is a student in the Department of Computer Science and Technology at Tsinghua University and a core contributor to the verl project. As a member of the Seed team at ByteDance, Yuxuan led the infrastructure and contributed to the algorithm of DAPO, an open-source, advanced recipe for LLM reinforcement learning at scale.