Тёмный

LLMs in the Enterprise: Tips from Netflix, Nvidia, & Meta | TransformX 2022 

Scale AI
Подписаться 24 тыс.
Просмотров 6 тыс.
50% 1

Join this enterprise-focused, spirited discussion on how best to train, use, and fine-tune foundation models in the enterprise. Elliot Branson, Director of Machine Learning & Engineering, Scale AI, will moderate the panel with industry experts from AWS, NVIDIA, Netflix, and Meta.
Erhan Bas, formerly Applied Scientist at Amazon Web Services and now at Scale, will share his perspective on training large language models (LLMs). Bryan Catanzaro, Vice President of Applied Deep Learning Research at NVIDIA, will share how the GPU manufacturer is targeting foundation models as a core workflow for enterprise customers. Faisal Siddiqi, Director of Machine Learning Platform at Netflix, will share how his company is using foundation models to analyze highly produced video content. Susan Zhang, Researcher at Facebook AI Research (FAIR), a division of Meta, will share insights from training and fine-tuning Meta’s OPT model.
Members of the panel will share how they scale their training across multiple nodes, attempt to avoid overfitting by mitigating data quality issues early on, and address bias in models trained on a large internet-based text corpus. The panelists will discuss the compute cost inherent in training an LLM from scratch, how to avoid costly and tedious hyperparameter optimization, the need to mitigate training failure risk in clusters with thousands of GPUs, including sticking to synchronous gradient descent, and the need for extremely fast storage devices to save and load training checkpoints.
👉 Check out more here: scl.ai/3sjd5rY

Опубликовано:

 

29 сен 2024

Поделиться:

Ссылка:

Скачать:

Готовим ссылку...

Добавить в:

Мой плейлист
Посмотреть позже
Комментарии : 5   
@herp_derpingson
@herp_derpingson 11 месяцев назад
Sounds like a bunch of MBAs saying MBA stuff. Where are the tips?
@infotalk12
@infotalk12 Год назад
Great panel. @Faisal, which areas in general you think will continue to use smaller models and wont benefit so much with generative AI ?
@user-wr4yl7tx3w
@user-wr4yl7tx3w Год назад
I wish some of the microphones were better
@WSBWallstreetBets
@WSBWallstreetBets Год назад
Time stamps?
@bleacherz7503
@bleacherz7503 Год назад
Great panel thx
Далее
🦊🎀
00:16
Просмотров 235 тыс.
pumpkins #shorts
00:39
Просмотров 14 млн
Exploring the Future of AI at NVIDIA GTC
55:49
[1hr Talk] Intro to Large Language Models
59:48
Просмотров 2,2 млн
What Makes Large Language Models Expensive?
19:20
Просмотров 70 тыс.