The paper for Mistral AI's second Large Language Model (LLM), Mixtral 8x7B, has been released. In this video we explore the code of Mixtral 8x7B, learn how it works, and see why it is called "Mixtral". We also cover some interesting history of Mixture of Experts (MoE), the key architectural component of Mixtral 8x7B.
Paper: arxiv.org/abs/2401.04088
Hugging Face: huggingface.co/mistralai/Mixt...
Chapters:
0:00 Intro
1:34 Architecture (Mixture of Experts)
3:08 Code Walkthrough
7:51 Papers
10:10 Routing Analysis
11:03 Instruct Model
11:27 Outro
Mixtral 8x7B GitHub: github.com/mistralai/mistral-src
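For a quick intuition of the MoE layer covered in the video, here is a minimal PyTorch sketch of sparse top-2 expert routing. This is not Mistral's actual implementation: the class name, dimensions, and the simple feed-forward experts are illustrative (Mixtral's real experts are SwiGLU blocks, but it does route each token to 2 of 8 experts as shown here).

# Minimal sketch of a sparse Mixture of Experts layer with top-2 routing,
# in the spirit of Mixtral 8x7B. Names and sizes are illustrative.
import torch
import torch.nn as nn
import torch.nn.functional as F

class MoELayer(nn.Module):
    def __init__(self, dim=32, hidden=64, num_experts=8, top_k=2):
        super().__init__()
        self.top_k = top_k
        # Gating network: one logit per expert for every token.
        self.gate = nn.Linear(dim, num_experts, bias=False)
        # Each expert is a small feed-forward network.
        self.experts = nn.ModuleList(
            nn.Sequential(nn.Linear(dim, hidden), nn.SiLU(), nn.Linear(hidden, dim))
            for _ in range(num_experts)
        )

    def forward(self, x):  # x: (tokens, dim)
        logits = self.gate(x)                                     # (tokens, experts)
        weights, chosen = torch.topk(logits, self.top_k, dim=-1)  # top-2 per token
        weights = F.softmax(weights, dim=-1)                      # renormalise over the chosen 2
        out = torch.zeros_like(x)
        for i, expert in enumerate(self.experts):
            token_idx, slot = torch.where(chosen == i)            # tokens routed to expert i
            if token_idx.numel():                                 # run expert only on its tokens
                out[token_idx] += weights[token_idx, slot, None] * expert(x[token_idx])
        return out

tokens = torch.randn(5, 32)      # 5 tokens, model dim 32
print(MoELayer()(tokens).shape)  # torch.Size([5, 32])

The key point is sparsity: although the layer holds 8 experts' worth of parameters, each token only pays the compute cost of 2 of them.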
Check out our socials:
Website: jarvislabs.ai/
X: jarvislabsai
LinkedIn: jarvislabsai
Instagram: jarvislabs.ai
Medium: jarvislabs
Connect with Vishnu:
X: vishnuvig
LinkedIn: vishnusubramanian
31 May 2024