Loading…
PyTorch Day China 2025
In-person | 2025 June 7
Learn more on our website

The Sched app allows you to build your schedule but is not a substitute for your event registration. You must be registered to participate in the sessions. If you have not registered but would like to join us, please visit the BAAI Conference webpage.

Please note: This schedule is automatically displayed in China Standard Time (UTC+08:00)To see the schedule in your preferred timezone, please select from the drop-down located at the bottom of the menu to the right.

IMPORTANT NOTE: Timing of sessions and room locations are subject to change.
Saturday June 7, 2025 10:00 - 10:20 CST
Recent advances in reinforcement learning significantly boosts the reasoning capabilities of LLMs. Models such as OpenAI o3, DeepSeek r1, etc,. demonstrates magnificent performance in STEM and coding tasks. Yet, training such models requires complex infrastructures.
In this talk, we present verl (https://github.com/volcengine/verl), a comprehensive framework that utilizes HybridFlow programming abstraction to achieve both flexibility to implement various algorithms and high performance. verl has been adopted by various universities and companies for RL training, and is contributed by 100+ contributors from the community.
Through this talk, audiences will gain i) a basic understanding of various RL algorithms including GRPO; ii) best practices to implement tool calling and multi-turn rollout for agentic tasks, as well vision language model reasoning; iii) latest large scale performance optimization techniques for RL with MOE models such as DeepSeek v3.
Speakers
avatar for Yuxuan Tong

Yuxuan Tong

Researcher, Bytedance
Yuxuan is a student at Department of Computer Science and Technology, Tsinghua University, and a core contributor of verl project. Yuxuan led the infrastruture and contributed to the algorithm of DAPO: an open source advanced LLM reinforcement learning recipe at scale as a member... Read More →
Saturday June 7, 2025 10:00 - 10:20 CST
TBA

Sign up or log in to save this to your schedule, view media, leave feedback and see who's attending!

Share Modal

Share this link via

Or copy link