
Meta Announces Llama 3 at Weights & Biases’ conference 

Weights & Biases
81K views

In an engaging presentation at Weights & Biases’ Fully Connected conference, Joe Spisak, Product Director of GenAI at Meta, unveiled the latest family of Llama models, Llama 3.
Marking a significant milestone in AI development, the Llama 3 family includes the 8 billion and 70 billion parameter models released during the conference, and Joe offered a glimpse of a 400 billion parameter model still in the works.
Joe shared insights into the training and alignment of Llama 3, which now ranks as the top-performing model in the open-weights category on the MMLU, GSM8K, and HumanEval benchmarks.
Weights & Biases is proud to support customers such as Meta as they push the boundaries of AI. To learn how to fine-tune your LLMs using torchtune and Weights & Biases, start here: wandb.me/torchtune
Timestamps:
00:00 Introduction
03:05 Overview of Llama at Meta
05:59 Introducing Meta Llama 3
07:04 Advancements in Llama 3: Training and Data Scale
10:02 Benchmarking Llama 3 Performance
14:01 Enhancing Model Safety and Red Teaming
16:23 Expanding the Ecosystem and Future Directions
23:00 Closing remarks: Future plans for Llama models, and an invitation to use Meta's Llama 3
#MetaLlama #ArtificialIntelligence #AITrends #TechInnovation

Science

Published: 16 Jun 2024

Comments: 35
@thenoblerot 1 month ago
Thanks for this W&B
@WeightsBiases 1 month ago
Our pleasure!
@ihesiulo 1 month ago
There's a universe where Joseph Spisak is Mark Zuckerberg's brother. Oh, and nice presentation. Wonderful work they are doing at Meta AI.
@Crux69 1 month ago
My favorite fact from this is that the smarter the model, the more it violates rules. Just like us :)
@utuberay007 1 month ago
Very true! People who are way smarter on tax laws are the ones who violate the most, innocent people pay more than what they are supposed to, etc. Same goes for many other laws.
@techpiller2558 1 month ago
Or, the rules it uses instead of the rules we assumed are different.
@why.do.I.even.try. 1 month ago
That's a great way to justify corruption and awful people.
@Crux69 1 month ago
@why.do.I.even.try. awful people are still human, best to understand how good people become awful
@why.do.I.even.try. 1 month ago
@Crux69 Yes, but we shouldn't repeat their actions just because they work. We should work towards more ethical means to advance, technologically and societally.
@RakeshMurria 1 month ago
I really enjoyed this. Thanks
@WeightsBiases 1 month ago
Glad you enjoyed it!
@naninano8813 1 month ago
so all those supervisor/safeguard models are only utilized during training? i mean, once the weights of llama3 are out, there is no safeguard network between user and inference engine right?
@Crux69 1 month ago
I'm sure they have a safety model that tries to review every request and catch some negative responses.
@siloquant 1 month ago
Congratulations!
@ayyanarj7449 11 days ago
Thanks Joe Spisak
@PeterLappo 1 month ago
How much did it cost to build, including hardware and engineering costs?
@techpiller2558 1 month ago
What will be the SQLite of LLMs, with capability for local use? Llama?
@thegreatgustby 1 month ago
I think he could have said "ridiculous" a bit more often
@gubatron 1 month ago
vin diesel!
@RichReportcom 1 month ago
Summary: Safety and size. The end.
@HoD999x 7 days ago
why is mmlu still being used? it's broken
@ericadar 1 month ago
a few hours go by...llama 3 no longer SOTA
@adinsoftic 1 month ago
That's why they open source it. They let the community figure things out and iterate. For Meta, an LLM is just a tool and not a product in itself.
@SkepticButOptimist 1 month ago
Wait what is sota now?
@adinsoftic 1 month ago
@SkepticButOptimist "state of the art"
@JeiShian 1 month ago
Which model is sota?
@MiraPloy 1 month ago
I think it's supposed to be either phi or sensenova, neither of which are released @JeiShian
@GerardSans 1 month ago
How silly is it to red-team a model whose training data you control to check for bioweapons capabilities? How stupid would you have to be? Isn't it easier to run a search on the data 😂😅
@matbeedotcom 1 month ago
I’m glad they saw how useless they made codellama 😂, it was waaaay overly aligned