Loading…
Attending this event?
September 18-19, 2024
San Francisco, California
View More Details & Registration
Note: The schedule is subject to change.

The Sched app allows you to build your schedule but is not a substitute for your event registration. You must be registered for PyTorch Conference 2024 to participate in the sessions. If you have not registered but would like to join us, please go to the event registration page to purchase a registration.

This schedule is automatically displayed in Pacific Daylight Time (UTC-7). To see the schedule in your preferred timezone, please select from the drop-down located at the bottom of the menu to the right.

IMPORTANT NOTE: Timing of sessions and room locations are subject to change.

Wednesday September 18, 2024 11:10am - 11:35am PDT
Aligning AI models with human values and preferences is essential for making them safe and helpful. However, building an efficient and scalable toolkit for alignment can be challenging, especially when applied to state of the art foundation models with billions or trillions of parameters. NeMo-Aligner is an open-source, optimized and scalable toolkit that implements alignment algorithms such as Reinforcement Learning from Human Feedback (RLHF), Direct Preference Optimization (DPO), SteerLM and Self-Play Fine Tuning (SPIN). This talk will introduce NeMo-Aligner and show the steps we took to design and optimize the toolkit around various alignment algorithms. In particular, we discuss the RLHF implementation where we observe close to 7x speedup and excellent scaling performance by adding TRT-LLM integration, carefully orchestrating communication and utilizing fast training kernels. We’re able to align state-of-the-art open source models with NeMo-Aligner and hope our framework can enable the community to performantly customize, fine-tune and align foundational models at any scale.
Speakers
avatar for Gerald Shen

Gerald Shen

Engineer, NVIDIA
Gerald Shen is a member of the NVIDIA NeMo NLP Team specializing in model alignment. He leads the development of the NeMo-Aligner toolkit, a scalable toolkit to align large language models. This toolkit has been used to align models at NVIDIA with algorithms such as reinforcement... Read More →
avatar for Jimmy Zhang

Jimmy Zhang

Machine Learning Engineer, NVIDIA
Jimmy Zhang is a Senior Deep Learning Architect at NVIDIA. His work focuses on researching and developing the performance of deep learning frameworks, including NeMo and Megatron-LM. He completed his M.S. at UIUC where he was mentored under Professor Rakesh Kumar.
Wednesday September 18, 2024 11:10am - 11:35am PDT
Festival Pavilion - Breakout Room B
Log in to leave feedback.

Sign up or log in to save this to your schedule, view media, leave feedback and see who's attending!