K-shot learning is a hot topic in research. Let's understand one of the first core algorithms introduced to train meta-models: Model-Agnostic Meta-Learning (MAML).
Can anyone say whether MAML can be applied to binary classification or not? If we have a dataset that contains only Dog vs. Cat images, why would we need to apply MAML...
What I don't understand about MAML is that some methods have a parameter phi representing the inner-loop training. Where in your diagram does the update of phi happen? Thank you very much.
At 6:10, how can I backpropagate? The backpropagation algorithm for a layer's weights needs the input and the output loss that went through that layer, but the query set has never gone through the original model. How can I calculate the gradient? I can't understand...
Hi. Each support set is passed to its respective copy of the model, and then an overall loss is calculated; this I find is easiest to see as a single function for the overall loss at 4:38... Now we can calculate the gradient of the overall loss with respect to the original parameters, because each model copy is initialised with the original model parameters (which are tied/shared across the copies). In other words, there is a differentiable function from the original model parameters to the output loss, so we can backpropagate through it to calculate this gradient (and then update the original parameters)... hope this is helpful?
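To make the inner/outer loop concrete, here is a minimal sketch of a MAML-style update on a toy linear-regression problem. It uses the first-order approximation (FOMAML), where the outer gradient is taken at the adapted parameters rather than differentiating through the inner update; the task setup and hyperparameters are illustrative assumptions, not from the video.

```python
import numpy as np

rng = np.random.default_rng(0)

def loss_and_grad(w, X, y):
    # Mean-squared error of a linear model y = X @ w, plus its gradient wrt w.
    err = X @ w - y
    return np.mean(err ** 2), 2 * X.T @ err / len(y)

def maml_step(w, tasks, inner_lr=0.05, outer_lr=0.01):
    # One first-order MAML meta-update over a batch of tasks.
    meta_grad = np.zeros_like(w)
    for (X_s, y_s), (X_q, y_q) in tasks:
        # Inner loop: one gradient step on the support set from the shared init.
        _, g = loss_and_grad(w, X_s, y_s)
        w_task = w - inner_lr * g            # the per-task adapted parameters ("phi")
        # Outer loss: evaluate the adapted copy on the query set.
        _, g_q = loss_and_grad(w_task, X_q, y_q)
        meta_grad += g_q
    # Update the original (shared) parameters with the averaged meta-gradient.
    return w - outer_lr * meta_grad / len(tasks)

def make_task(a, n=20):
    # Hypothetical task family: regress y = a * x for a task-specific slope a.
    def draw():
        X = rng.normal(size=(n, 1))
        return X, a * X[:, 0]
    return draw(), draw()   # (support set, query set)

w = np.zeros(1)
tasks = [make_task(a) for a in (0.5, 1.5)]
for _ in range(200):
    w = maml_step(w, tasks)
```

Note how the query set is only ever fed to the adapted copy `w_task`, yet the update still lands on the shared `w`: in full MAML that flows through the chain rule of the inner step, while this first-order version simply applies the query-set gradient to the original parameters.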
Hi! Transfer learning is where a model trained on one task is applied to another (usually similar) task, while meta-learning is generally about learning things like which parts of the data are most valuable, or which approach generates the best predictions on a given dataset (using machine learning techniques).
Could you clarify what you mean by ensemble 'learning'? Do you mean training independent models with different seed initialisations and averaging their predictions?
@TwinEdProductions Yes, I thought about it more and read more about it, and I realized that MAML creates a generalized model that can later be used to learn more specific things, whereas in ensemble learning (of all kinds) we only train on the specific task.