Lesson 9A 2022 - Stable Diffusion deep dive

Jeremy Howard

Подписаться 122 тыс.

Просмотров 32 тыс.

50% 1

Видео Поделиться Скачать Добавить в

Опубликовано:

7 сен 2024

Ссылка:

Скачать:

Готовим ссылку...

Добавить в:

Мой плейлист

Посмотреть позже

Комментарии : 15

@markhopkins8731 Год назад

Love your simple explanation of a manifold Jonathan. It's the first time it's made sense to me. Looking forward to the coming lectures.

@al3030 Год назад

Thank you for this deep dive. The sampling explanation especially was helpful to try to get an intuition for what the model does.

@timandersen8030 Год назад

Appreciate this supplemental deep dive into code of stable diffusion!

@saidmoglu Год назад

pretty good video to further understand SD!

@spider853 Год назад

I finally understand the schedulers! Thank you!

@adityagupta-hm2vs 2 месяца назад

Also, are we using latent space as gradients here, as we are subtracting gradients from the latent, which we typically do from weights in conventional NN ?

@alexrichmonkey7845 Год назад

Please explain the ancestral samplers.

@adityagupta-hm2vs 2 месяца назад

How do we decide the scaling factor in VAE part i.e. 0.18215, any hint on how to decide it ? I did try changing and could see the different output, but what's a good way to choose ?

@climez Год назад

This is useful but I wish you went into more detail here and there. Is some CLIP or similar model included in the stable diffusion implementation? If so, are precomputed weights of the CLIP model used to calculate noise_prediction in each step? I.e. we pass the current noisy image (in a latent space) and the text embedding to CLIP and then calculate the gradient for each voxel of the image so that something (semantic similarity?) is maximized? I wish you would say what happens during training of the mode and what then happens during inference :).

@AM-yk5yd Год назад

I'm surprised how... complexity(?) raised up. It's second day and I only on 4th minute, spent 30 minutes debugging my coding-along session (I wrote rand_like instead of randn_like and my parrot photo went green instead of grambled)