
MegaScale: Scaling Large Language Model Training to More Than 10,000 GPUs | Haibin Lin 

@Scale

In this presentation, I will discuss the design, implementation, and engineering experience of building and deploying MegaScale, a production system for training large language models (LLMs) at the scale of more than 10,000 GPUs. Training LLMs at this scale brings unprecedented challenges to training efficiency and stability. Maintaining high efficiency throughout the training process (i.e., stability) is an important consideration in production given the long extent of LLM training jobs. Many hard stability issues only emerge at large scale, and in-depth observability is the key to addressing them. We developed a set of diagnostic tools to monitor system components and events deep in the stack, identify root causes, and derive effective techniques to achieve fault tolerance and mitigate stragglers. We share our operational experience in identifying and fixing failures and stragglers.
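The talk covers the diagnostic tooling itself; as a rough illustration of the kind of straggler check such monitoring enables, the sketch below flags ranks whose per-step times run consistently slower than the fleet median. This is a minimal, hypothetical example under assumed data layout and threshold, not the MegaScale implementation.

```python
from statistics import median

def find_stragglers(step_times_by_rank, slowdown_threshold=1.1):
    """Flag ranks whose average step time exceeds the fleet median
    by more than `slowdown_threshold` (e.g. 1.1 = 10% slower).

    step_times_by_rank: dict mapping rank id -> list of per-step
    durations (seconds) collected by per-rank monitoring (assumed format).
    """
    avg_by_rank = {
        rank: sum(times) / len(times)
        for rank, times in step_times_by_rank.items()
        if times
    }
    fleet_median = median(avg_by_rank.values())
    # Report each straggler with its slowdown ratio relative to the median.
    return {
        rank: avg / fleet_median
        for rank, avg in avg_by_rank.items()
        if avg > slowdown_threshold * fleet_median
    }

# Hypothetical per-rank timings: rank 2 is ~20% slower than its peers.
timings = {
    0: [1.00, 1.01, 0.99],
    1: [1.02, 1.00, 1.01],
    2: [1.21, 1.19, 1.22],
    3: [0.98, 1.00, 1.02],
}
print(find_stragglers(timings))  # {2: ~1.2}
```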

Published: Sep 15, 2024
