Loading…
PyTorch Day China 2025
In-person | 2025 June 7
Learn more on our website

The Sched app allows you to build your schedule but is not a substitute for your event registration. You must be registered to participate in the sessions. If you have not registered but would like to join us, please visit the BAAI Conference webpage.

Please note: This schedule is automatically displayed in China Standard Time (UTC+08:00)To see the schedule in your preferred timezone, please select from the drop-down located at the bottom of the menu to the right.

IMPORTANT NOTE: Timing of sessions and room locations are subject to change.
Saturday June 7, 2025 16:40 - 17:00 CST
SGLang is an open-source Large Language Model (LLM) inference system that is highly efficient and widely adopted by many companies like xAI, Nvidia and AMD. In this session, I will introduce some key features of SGLang, including the design and implementation of PD disaggregation, large-scale expert parallelism and data parallelism for DeepSeek models, hierarchical KV cache offloading, and highly efficient speculative decoding. I will also share some insights into the future development of the SGLang community.
Speakers
avatar for Liangsheng Yin

Liangsheng Yin

Student / Developer, Shanghai Jiao Tong University / LMSYS
He is an undergraduate student at Shanghai Jiao Tong University and one of the earliest core developers of SGLang, a popular open-source inference engine with 15K+ GitHub stars and 20K+ monthly downloads. SGLang is used by xAI (Grok 3), Microsoft Azure (DeepSeek R1), NVIDIA, AMD... Read More →
Saturday June 7, 2025 16:40 - 17:00 CST
TBA

Sign up or log in to save this to your schedule, view media, leave feedback and see who's attending!

Share Modal

Share this link via

Or copy link