Тёмный

Imbalanced Data in Machine Learning | Undersampling | Oversampling | SMOTE 

CampusX
Подписаться 221 тыс.
Просмотров 12 тыс.
50% 1

Imbalanced data refers to datasets where the distribution of classes is heavily skewed, with one class significantly outnumbering the others. Dealing with imbalanced data is crucial as it can lead to biased models that perform poorly on minority classes. Addressing Class Imbalance with Undersampling, Oversampling, SMOTE, and Ensemble Methods. Imbalanced datasets pose challenges for machine learning models, but techniques like undersampling (reducing majority class samples), oversampling (increasing minority class samples), SMOTE (Synthetic Minority Over-sampling Technique), and ensemble methods (combining multiple models) help mitigate bias and improve predictive performance on minority classes.
Code - colab.research.google.com/dri...
============================
Did you like my teaching style?
Check my affordable mentorship program at : learnwith.campusx.in
DSMP FAQ: docs.google.com/document/d/1O...
============================
📱 Grow with us:
CampusX' LinkedIn: / campusx-official
CampusX on Instagram for daily tips: / campusx.official
My LinkedIn: / nitish-singh-03412789
Discord: / discord
E-mail us at support@campusx.in
✨ Hashtags✨
#Datascience #Machinelearning #Imbalanceddata #CampusX
⌚Time Stamps⌚
00:00 - Intro
00:54 - What is Imbalanced Data?
04:10 - Problems with Imbalanced Data
08:00 - Imbalanced Data Demo
11:13 - Why studying imbalanced data is important?
16:58 - Undersampling
25:56 - Oversampling
31:06 - SMOTE
42:43 - Ensemble Learning
47:06 - Cost Sensitive Learning
51:30 - Other techniques

Опубликовано:

 

30 июл 2024

Поделиться:

Ссылка:

Скачать:

Готовим ссылку...

Добавить в:

Мой плейлист
Посмотреть позже
Комментарии : 45   
@campusx-official
@campusx-official 2 месяца назад
I had to reupload this video because I forgot to include the part on ensemble techniques due to an editing error in the previous upload. Check timestamps.
@Ashishkumar-id1nn
@Ashishkumar-id1nn 2 месяца назад
Sir, please make a video on the difference between encoding and embedding
@mohitjoshi8984
@mohitjoshi8984 2 месяца назад
Sir please make a video on AB testing
@AMANRAJ-dt8gu
@AMANRAJ-dt8gu 2 месяца назад
I am writing to request your assistance in creating videos that delve into metaheuristic approaches, such as genetic algorithms, ant colony optimization, and others. It has come to my attention that there is a noticeable scarcity of resources covering these topics on platforms like RU-vid.
@sagarbp-2854
@sagarbp-2854 2 месяца назад
Sir make video about AB testing
@advaitdanade7538
@advaitdanade7538 2 месяца назад
Thank you sir for the best series on RU-vid, I just completed it in 2 months by watching 4 hr daily at 1.5x speed
@shripaddeshpande5766
@shripaddeshpande5766 2 месяца назад
Another fantastic video by Nitish! Wonderful!!!
@ansh-t8e
@ansh-t8e 11 дней назад
Thanks sir for this beautiful playlist. Never have i ever thought i would be able to understand all concepts of ml so easily. i am really grateful to you. i just completed this playlist after grinding for around 2 months. I have been going with some problems in my life that i am fighting but i will come out stronger now that I have completed this, soon I will start the DL playlist too and complete it too. Thank you for everthing sir.
@manikarnikatiwari199
@manikarnikatiwari199 2 месяца назад
THANK you so much Nitish 😊u are the best in everything.🎉 Thanks for being my teacher 😊🙏
@user-mg5fk7mf5c
@user-mg5fk7mf5c 2 месяца назад
I understood everything sir Thank you so much You are the best
@divyakarlapudi
@divyakarlapudi 2 месяца назад
Thankyou so much for this video, very helpful sir 🤌
@balrajprajesh6473
@balrajprajesh6473 2 месяца назад
Thank you very much sir
@vinayakvijay108
@vinayakvijay108 2 месяца назад
Awesome Content
@ParthivShah
@ParthivShah 21 день назад
Thank You Sir.
@AMANRAJ-dt8gu
@AMANRAJ-dt8gu 2 месяца назад
I am writing to request your assistance in creating videos that delve into metaheuristic approaches, such as genetic algorithms, ant colony optimization, and others. It has come to my attention that there is a noticeable scarcity of resources covering these topics on platforms like RU-vid.
@uditbhandari5791
@uditbhandari5791 2 месяца назад
Sir, when will you start a new batch for DSMP?
@Sulehri226
@Sulehri226 2 месяца назад
Thanks Sir
@mukeshrajpurohit5593
@mukeshrajpurohit5593 2 месяца назад
Hi Sir, Big Fan!! I was searching for class imbalance video and you have uploaded it on right time. I am training an ANN model for customer churn prediction where my dataset has class imbalance issues 96:4. I have used Upsampling, Downsampling, SMOTE, SMOTE-ENN, Class Weight but neither of them gave promising results and fail to predict well on minority class the recall value is very low. What should be done in such case where the model is not predicting well on minority class. I have also trained XGBoost classifier but that model also did not perform well.
@nsbipritam9682
@nsbipritam9682 Месяц назад
very helpful video
@wamiqmushtaq2825
@wamiqmushtaq2825 2 месяца назад
Sir pls do a session on cross validation.... There's no sperate video on cross validation in the ml playlist
@bhushansonawane5915
@bhushansonawane5915 Месяц назад
Hello sir, how can i connect with you ? Need urgent help please
@not_amanullah
@not_amanullah 2 месяца назад
Thanks
@soumyaranjandas7394
@soumyaranjandas7394 2 месяца назад
Dear Nitish sir, plz make video on how to fine tune our custom data using LLama llm.
@souvik5560
@souvik5560 2 месяца назад
Nitish :- At 7:00 It will be "Testing data" for determining the accuracy. Am I correct ?
@himanshurathod4086
@himanshurathod4086 2 месяца назад
please continue your llm transformers series.and also please upload nlp ner and topic modeling
@tusharshukla9361
@tusharshukla9361 2 месяца назад
Nitishi Sir please update your Machine Learning Roadmap and add links of your new videos (We want more and more videos of yours)
@muhammadikram375
@muhammadikram375 2 месяца назад
Sir please do some working on MLOps playlist
@pujarameet9699
@pujarameet9699 Месяц назад
Is this series complete or anything remaining sirm
@haroonmalik2195
@haroonmalik2195 2 месяца назад
Sir Also make video on multi label classification problem.
@shlok7580
@shlok7580 6 дней назад
i had this in an interview recently and i fkd up a bit :( They gave me a dataset and were expecting how to handle such type of data and make it soo that the predictions that we make are reliable
@chandrimapramanick1111
@chandrimapramanick1111 2 месяца назад
Sir, I truly admire your work and love all of your videos, learning so much from them. Thank you!!! I have one question: at the end of the video you said that in spam filtering false positive is the critical one but if one msg is spam and classified as not spam(false negative) that will be the critical case isn't it? false negatives are generally considered to be more dangerous in this case because they can expose the recipient to potential harm.
@RajatTomar-r7i
@RajatTomar-r7i Месяц назад
I think false positive is more critical because it may send your important mail in spam which is more harmful rather than showing some spam mails as important mail.
@parth.mandaliya
@parth.mandaliya 2 месяца назад
Please make a new video on transformers 🙏
@not_amanullah
@not_amanullah 2 месяца назад
🖤
@anandshaw-ie3qk
@anandshaw-ie3qk 2 месяца назад
it's better
@user-vj3nx7sh8r
@user-vj3nx7sh8r Месяц назад
Playlist ke end tak aate aate aisa lag rha ki aap jawan se budhe ho gye.
@mohitnemade5320
@mohitnemade5320 2 месяца назад
Nitesh bhai aapka knowledge perfect hai but video itne long hote h ki chahke bhi pura nahi dekh pate.. please try to make video in short way🙏🤝👍
@Awm_king-y9i
@Awm_king-y9i 2 месяца назад
Sir app bahut peeche hai
@Awm_king-y9i
@Awm_king-y9i 2 месяца назад
Sab LLM ki bat Kar Rahe hai app Machine learning par ruke hai
@abhinavkale4632
@abhinavkale4632 2 месяца назад
Bhai LLM ke bhi videos cover kar Rahe hai nitesh sir. To us, these concepts are still gold and they are used everywhere.
@Awm_king-y9i
@Awm_king-y9i 2 месяца назад
@@abhinavkale4632 bhai sir ke sare video mere laptop me hai all total video LLM ka history padhe hai abhi tak
@omsaikommawar
@omsaikommawar 2 месяца назад
From an interviewer's perspective, an imbalanced dataset is a common topic in interviews. Focusing on simple topics can increase your chances of success in cracking the interview.
@samarmohanty6109
@samarmohanty6109 13 дней назад
ML hogya kya apka
@Awm_king-y9i
@Awm_king-y9i 13 дней назад
@@samarmohanty6109 ha Mera generative Ai bhi ho gaya
Далее
Это конец... Ютуб закрывают?
01:09
Imbalanced Data Handling
1:21:45
Просмотров 284
This might be better than pizza
0:57
Просмотров 16 млн
How to handle imbalanced datasets in Python
11:48
Просмотров 48 тыс.