Тёмный
No video :(

How to use SMOTE, Borderline SMOTE, ADASYN to handle class imbalance 

Saptarsi Goswami
Подписаться 3,3 тыс.
Просмотров 11 тыс.
50% 1

Опубликовано:

 

21 авг 2024

Поделиться:

Ссылка:

Скачать:

Готовим ссылку...

Добавить в:

Мой плейлист
Посмотреть позже
Комментарии : 23   
@pallaviroy6631
@pallaviroy6631 3 года назад
This session creates a clear concept on class imbalance... Well explained... Thank You
@SaptarsiGoswami
@SaptarsiGoswami 3 года назад
Thanks a lot pallavi
@wazibansar1683
@wazibansar1683 3 года назад
Excellent video Sir. It has been systematically presented. The video starts from the discussion about imbalanced class problem to solutions using undersampling, oversampling along with its limitations. After that SMOTE is explained and how it has been improved using Borderline SMOTE. Finally, the concept of ADASYN is presented with an example. Based on the video, I have one question- while selecting the nearest neighbours of minority class in SMOTE, do we check which class the neighbours belong to?
@SaptarsiGoswami
@SaptarsiGoswami 3 года назад
The neighbors to be taken needs to be from minority observation s
@aritrabrahma2117
@aritrabrahma2117 3 года назад
Well explained sir. Just few questions to ask. 1. How many synthetic points to select between the points of original minority class? 2. Why point bridge creates problem in smote?
@SaptarsiGoswami
@SaptarsiGoswami 3 года назад
Answer to Question 1 is how much balance you want to create. In the example given if you have 10 such minority points and you want to create 20 synthetic points, you choose two neighbors. 2. Bridge is a problem because, these points are more similar to majority classes.
@Theroadrunner2002
@Theroadrunner2002 3 года назад
Just used SMOTE for a project recently. Nice session and thanks for the share🙏🏻
@SaptarsiGoswami
@SaptarsiGoswami 3 года назад
Thanks, sci kit by default allows to use borderline too
@suyashspeaks97
@suyashspeaks97 2 года назад
Great video sir! Thanks a ton. Just had a very fundamental doubt. SMOTE algorithm seems to be creating a linearly dependant set of new data points. Shouldn't this mean that these points are adding no new information for the ML algorithm to learn from? Thanks in advance for your response.
@SaptarsiGoswami
@SaptarsiGoswami 2 года назад
Intersting question, it is surely not possible to add completely new information we can only enhance it, smote is trying to just increase some more points often around the boundary
@pcooi7811
@pcooi7811 Год назад
Thank you sir.
@SaptarsiGoswami
@SaptarsiGoswami Год назад
Thanks a lot
@memoonashehzadi9660
@memoonashehzadi9660 Год назад
In SMOTE, on which bases do we identify a point from minority class, in step 1?
@sajadms4121
@sajadms4121 2 года назад
thank you so much for the video but i have a question in adasyn we choose must far instance to have higher chance of being sampled to avoid over fitting ? if yes but what if was a noisy one ?
@debjit08
@debjit08 3 года назад
Nice session sir❤️, can you share some more information (via link or videos) about adasyn concept ??
@SaptarsiGoswami
@SaptarsiGoswami 3 года назад
Yes Debjit, I went there little bit quickly please towardsdatascience.com/class-imbalance-smote-borderline-smote-adasyn-6e36c78d804
@sinan_islam
@sinan_islam Год назад
Did anyone had a case where SMOTE made ML models performance even worse?
@rebeenali4317
@rebeenali4317 3 года назад
regarding to ADASYN: why more samples will be created for the sample that has a high ri? if we create more sample for this then we will kindly reduce the gap between classes and it is risky to create samples in this area because its nearest neighbours are majority class
@SaptarsiGoswami
@SaptarsiGoswami 3 года назад
Hi Rebeen, thanks for watching the video, till the end. You are right, but that's a risk you will have to take. Because if you create samples from a dense region of the minority class, it will not add any knowledge there.
@solwanmohamed9400
@solwanmohamed9400 2 года назад
i need the material
Далее
A Tutorial on Semi Supervised Learning
13:37
Просмотров 6 тыс.
Smote with Python
16:34
Просмотров 11 тыс.
Handling Imbalanced Datasets   SMOTE Technique
24:32
Просмотров 49 тыс.
How to handle imbalanced datasets in Python
11:48
Просмотров 49 тыс.