
Efficient Large-Scale Language Model Training on GPU Clusters Using Megatron-LM | Jared Casper 

@Scale

In this talk we present how we trained a 530B parameter language model on a DGX SuperPOD with over 3,000 A100 GPUs and a high-speed InfiniBand interconnect, and how we can scale to even larger models. We explore three types of parallelism (data, tensor, and pipeline) and show how they can be composed to achieve maximum efficiency. Our approach allows us to perform training iterations on a model with 1 trillion parameters at 502 petaFLOP/s on 3072 GPUs (per-GPU throughput of 52% of theoretical peak). We discuss challenges we faced when training the 530B Megatron-Turing NLG model and give practical advice on how to successfully train very large language models.
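As a rough sanity check on the figures in the abstract, 502 petaFLOP/s spread over 3072 GPUs is about 163 TFLOP/s per GPU, which is roughly 52% of an A100's BF16 tensor-core peak of about 312 TFLOP/s. The sketch below works through that arithmetic and also shows how the three parallelism degrees multiply up to the GPU count; the 312 TFLOP/s peak and the 8 x 64 x 6 tensor/pipeline/data split are assumptions for illustration, not taken from the talk itself.

# Back-of-the-envelope check of the reported throughput.
# Figures from the abstract: 502 PFLOP/s aggregate on 3072 GPUs, ~52% of peak.
# The A100 BF16 peak (312 TFLOP/s) and the parallelism degrees below are
# illustrative assumptions, not confirmed by the talk.

AGGREGATE_PFLOPS = 502          # petaFLOP/s across the whole cluster
NUM_GPUS = 3072
A100_BF16_PEAK_TFLOPS = 312     # assumed per-GPU theoretical peak

per_gpu_tflops = AGGREGATE_PFLOPS * 1000 / NUM_GPUS
fraction_of_peak = per_gpu_tflops / A100_BF16_PEAK_TFLOPS

print(f"per-GPU throughput: {per_gpu_tflops:.0f} TFLOP/s")   # ~163 TFLOP/s
print(f"fraction of peak:   {fraction_of_peak:.0%}")         # ~52%

# Hypothetical composition of the three parallelism types: the product of the
# tensor-, pipeline-, and data-parallel degrees must equal the GPU count.
tensor_parallel = 8      # GPUs within a node splitting each layer's matmuls
pipeline_parallel = 64   # consecutive groups of layers assigned to stages
data_parallel = 6        # replicas of the whole model-parallel configuration

assert tensor_parallel * pipeline_parallel * data_parallel == NUM_GPUS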

Published: 29 Sep 2024

Comments: 3
@prajyot2021 · 4 months ago
Need more such detailed content, Jared. Appreciate your work. Thanks, mate.
@voncolborn9437 · 8 months ago
Being an old-timer in computer ops (from back in the 80s), I find this whole new world of computer operations totally fascinating. It really is hard for me to wrap my head around the size and performance of these systems. My hat is off to you guys. I'm watching and learning a little, too.
@kazimejbaulislam9185 · 9 months ago
Amazing explanation! Thanks.