Stable Diffusion Deep Dive Notebook Run-through

Will Merrill: The Illusion of State in State-Space Models

СКУФ ИЛИ НЕ СКУФ? #иванабрамов #натальнаякарта #юмор #shorts

Я не буду дышать, ради тебя😉❤️

бедный дед на ламборгини- меняет вейп на секретные шкатулки - выиграл айфон, но отказался от приза

Дж. Роулинг и Джина Карано разносят Олимпиаду 2024 после того как мужчина уничтожил женщину в ринге!

Paper deep dive: Evolutionary Optimization of Model Merging Recipes

DataScienceCastnet

Подписаться 4,6 тыс.

Просмотров 3,3 тыс.

50% 1

Видео Поделиться Скачать Добавить в

Sakana AI has a great new paper exploring evolutionary approaches to model merging, showing how to find ways of combining existing models into new ones with impressive new skills. In this video, we dive into the paper and along the way spend some time learning about model merging in general, evolutionary algorithms, and more.

Опубликовано:

20 мар 2024

Поделиться:

Ссылка:

Скачать:

Готовим ссылку...

Добавить в:

Мой плейлист

Посмотреть позже

Комментарии : 6

@gedankenthesis 4 месяца назад

That was an excellent overview of not just Sakana's evolutionary methods to identify good merge candidates, but also the popular techniques TIES, DARE and Passthrough/Frankenmerge. Appreciate it as usual, Johno!

@jonatan01i 4 месяца назад

Oh my god, man you don't understand how happy I am for your storytelling about how things went in the timeline of developing on the idea of model merging up to this point, where it started how it went, that and how they were thinking about reasons why it works, etc.etc.. I want to get into this so that I understand the main ideas and be able to start working on these as well, but it's so hard to get to the root of things, it requires a huge amount of time to read and digest everything and slowly being able to put the pieces together, so boy do I mean it when I say thank you!

@UmerHA 4 месяца назад

Hi Johno, at the beginning you said you're somewhat skeptical of model merging. Iiuc, your criticism is only about iterative merging for a given goal, which leads to overfitting. Or are you skeptical of the general concept of model merging? Thanks!

@abse-mj8pw 4 месяца назад

very great introduction! I can see a lot of efforts have been put into this video! It helps a lot to understand the paper! thank you for sharing!!

@abse-mj8pw 4 месяца назад

However I have one small question about the overfit part at the end of this video. Is it about that the test set translated into Japanese might be learned or finetuned by the math 7B llm?

Далее

Stable Diffusion Deep Dive Notebook Run-through

41:09

Stable Diffusion Deep Dive Notebook Run-through

Просмотров 10 тыс.

Will Merrill: The Illusion of State in State-Space Models

45:43

Will Merrill: The Illusion of State in State-Space Models

Просмотров 1,1 тыс.

СКУФ ИЛИ НЕ СКУФ? #иванабрамов #натальнаякарта #юмор #shorts

00:58

СКУФ ИЛИ НЕ СКУФ? #иванабрамов #натальнаякарта #юмор #shorts

Просмотров 334 тыс.

Я не буду дышать, ради тебя😉❤️

00:27

Я не буду дышать, ради тебя😉❤️

Просмотров 372 тыс.

бедный дед на ламборгини- меняет вейп на секретные шкатулки - выиграл айфон, но отказался от приза

01:00

бедный дед на ламборгини- меняет вейп на секретные шкатулки - выиграл айфон, но отказался от приза

Просмотров 2,7 млн

Дж. Роулинг и Джина Карано разносят Олимпиаду 2024 после того как мужчина уничтожил женщину в ринге!

09:17

Дж. Роулинг и Джина Карано разносят Олимпиаду 2024 после того как мужчина уничтожил женщину в ринге!

Просмотров 1,1 млн

Gaussian Splatting explorations

32:45

Gaussian Splatting explorations

Просмотров 25 тыс.

What is Speculative Sampling?

15:21

What is Speculative Sampling?

Просмотров 2,3 тыс.

[Paper Review] The detail algorithm of 3D Gaussian Splatting

18:58

[Paper Review] The detail algorithm of 3D Gaussian Splatting

Просмотров 7 тыс.

Why Fine Tuning is Dead w/Emmanuel Ameisen

50:07

Why Fine Tuning is Dead w/Emmanuel Ameisen

Просмотров 29 тыс.

Sakana AI's Latest Release: Evolutionary Optimization of Model Merging Recipes

40:22

Sakana AI's Latest Release: Evolutionary Optimization of Model Merging Recipes

Просмотров 1,5 тыс.

Evaluating Diffusion Models with PickScore

14:32

Evaluating Diffusion Models with PickScore

Просмотров 843

Evolutionary Model Merge: Sakana AI's LLM Solution Ep.169

35:49

Evolutionary Model Merge: Sakana AI's LLM Solution Ep.169

Просмотров 155

ZipLoRA: Any Subject in Any Style (deep dive and paper explanation)

26:20

ZipLoRA: Any Subject in Any Style (deep dive and paper explanation)

Просмотров 1,4 тыс.

Supercharge Multi-LLM Intelligence w/ CALM

26:19

Supercharge Multi-LLM Intelligence w/ CALM

Просмотров 3 тыс.

Deep dive into how Mamba works - Linear-Time Sequence Modeling with SSMs - Arxiv Dives

44:23

Deep dive into how Mamba works - Linear-Time Sequence Modeling with SSMs - Arxiv Dives

Просмотров 15 тыс.

КАКОЙ SAMSUNG КУПИТЬ В 2024 ГОДУ

14:59

КАКОЙ SAMSUNG КУПИТЬ В 2024 ГОДУ

Просмотров 45 тыс.

КУПИЛ ИНДИЙСКИЙ IPHONE 15 ЗА 66000 РУБЛЕЙ!

9:47

КУПИЛ ИНДИЙСКИЙ IPHONE 15 ЗА 66000 РУБЛЕЙ!

Просмотров 27 тыс.

iPhone socket cleaning #Fixit

0:30

iPhone socket cleaning #Fixit

Просмотров 18 млн

КАК ИСПРАВИТЬ ЗАМЕДЛЕНИЕ ЮТУБА УСКОРЯЕМ YOUTUBE за 10 Секунд УСКОРИЛ ЮТУБ в ТЕЛЕФОНЕ и ПК ИНСТРУКЦИЯ

5:52

КАК ИСПРАВИТЬ ЗАМЕДЛЕНИЕ ЮТУБА УСКОРЯЕМ YOUTUBE за 10 Секунд УСКОРИЛ ЮТУБ в ТЕЛЕФОНЕ и ПК ИНСТРУКЦИЯ

Просмотров 55 тыс.

БЕЗОПАСНОСТЬ!! Apple выпустила iOS 17.6 Релиз для Айфона! Стоит ставить? Что Нового?

5:15

БЕЗОПАСНОСТЬ!! Apple выпустила iOS 17.6 Релиз для Айфона! Стоит ставить? Что Нового?

Просмотров 38 тыс.

Собрал ПК на ОЗОН, чтобы продать на АВИТО дороже! Сколько заработал на перепродаже компьютеров?

41:10

Собрал ПК на ОЗОН, чтобы продать на АВИТО дороже! Сколько заработал на перепродаже компьютеров?

Просмотров 480 тыс.

⚠️ ЧТО ДЕЛАТЬ, ЕСЛИ СМАРТФОН УПАЛ В ВОДУ?! Самый НЕОБЫЧНЫЙ способ от Xiaomi!

0:51

⚠️ ЧТО ДЕЛАТЬ, ЕСЛИ СМАРТФОН УПАЛ В ВОДУ?! Самый НЕОБЫЧНЫЙ способ от Xiaomi!

Просмотров 360 тыс.