Stanford CS25: V4 I Aligning Open Language Models

April 18, 2024
Speaker: Nathan Lambert, Allen Institute for AI (AI2)
Aligning Open Language Models
Since the emergence of ChatGPT there has been an explosion of methods and models attempting to make open language models easier to use. This talk retells the major chapters in the evolution of open chat, instruct, and aligned models, covering the most important techniques, datasets, and models. Alpaca, QLoRA, DPO, PPO, and everything in between will be covered. The talk will conclude with predictions and expectations for the future of aligning open language models. Slides posted here: docs.google.co...
All the models in the figures are in this HuggingFace collection: huggingface.co...
About the speaker:
Nathan Lambert is a Research Scientist at the Allen Institute for AI focusing on RLHF and the author of Interconnects.ai. Previously, he helped build an RLHF research team at HuggingFace. He received his PhD from the University of California, Berkeley, working at the intersection of machine learning and robotics. He was advised by Professor Kristofer Pister in the Berkeley Autonomous Microsystems Lab and Roberto Calandra at Meta AI Research.
More about the course can be found here: web.stanford.e...
View the entire CS25 Transformers United playlist: • Stanford CS25 - Transf...

Published: Sep 29, 2024

Comments: 11

@SebastianRaschka 4 months ago

This talk and the slides are gold! Love the whirlwind tour of the BC (Before ChatGPT) and AD (After Deployment of ChatGPT). How fast time went by ... I barely remember Vicuna, and it was just a year ago :D

@maydayay 6 days ago

One Stanford lecture is better than some semester-long courses in some universities

@nafikhan13-4-23 4 months ago

I love 💓💓💓💓Stanford Online💓💓💓💓

@user-wr4yl7tx3w 4 months ago

Shouldn’t there be a different YouTube channel for AI from Stanford.

@raul36 4 months ago

If AI is aligned then it is not AGI.

@bamh1re318 2 months ago

Nemotron 4 - 340b is released too recent to make this list. What chips do Qwen and China use to train their LLM?

@felipe741 1 month ago

Nathan is a great explainer 👍 Up there with raschka and molnar

@mahirturjo7509 4 months ago

❤❤❤❤❤❤

@TheAlphaWavePodcast 3 months ago

Stanford has thirst traps? Who knew?

@CannabinatedFantasy 4 months ago

omg his forehead

@TheAlphaWavePodcast 3 months ago

I have a PhD in law and I have gone through the arduous process of establishing a very viable patent, trademark and copyright case that I am sure can help Nathan trademark his forehead so we can sue the creators of the film Megamind for billions. Please Nathan, contact me.