Тёмный

Standardization Vs Normalization- Feature Scaling 

Krish Naik
Подписаться 997 тыс.
Просмотров 294 тыс.
50% 1

Please join as a member in my channel to get additional benefits like materials in Data Science, live streaming for Members and many more
/ @krishnaik06 If you are looking for Career Tansition Advice and Real Life Data Scientist Journey. Please check the below link
Spring board India RU-vid url: / channel
Connect with me here:
Twitter: / krishnaik06
Facebook: / krishnaik06
instagram: / krishnaik06

Опубликовано:

 

6 ноя 2019

Поделиться:

Ссылка:

Скачать:

Готовим ссылку...

Добавить в:

Мой плейлист
Посмотреть позже
Комментарии : 278   
@tymothylim6550
@tymothylim6550 3 года назад
Thank you Krish for this video! It was fantastic in helping me understand the difference between these 2 things and some additional advice regarding how it helps with some other things (e.g. helping some kinds of models optimize faster!)
@srikrithibhat6220
@srikrithibhat6220 2 года назад
One of the best and detailed explanation on Scaling Techniques. Thank You so much Krish ji.
@sagaryadav3473
@sagaryadav3473 4 года назад
I'm really in love with the way you explain. So nice :)
@rohitkamra1628
@rohitkamra1628 4 года назад
I have completed Statistics Playlist. You explained in a very good way. Thanks for this. :)
@raziekhairy5799
@raziekhairy5799 3 года назад
Thank you. I wish this world will be fulled of people like you!
@gc-0377
@gc-0377 3 года назад
I love you dude, thanks for the explaining you saved me, greetings from México wey you have a new sub
@GuitarreroDaniel
@GuitarreroDaniel 3 года назад
Incredible explanation, thank you very much!
@baharehghanbarikondori1965
@baharehghanbarikondori1965 3 года назад
the best video on Standardization & Normalization
@vaibhavnakrani2983
@vaibhavnakrani2983 8 месяцев назад
You explain it very simply. I love it. I even even recommend your videos to other guyz in ML.
@anujashinde5717
@anujashinde5717 4 года назад
Hi sir. I have seen lots of videos on machine learning but I personally feel like u r d only one who’s making the videos in very fantastic way. u explains all the things in such a way that even the person who is from non technical background can understand it. Just a small req for you. Can u pls make video on all the techniques that can be apply on single data set. Like when to scale the data & apply PCA, clusters, algorithms, when to do label encoding instead of one hot. Can u pls apply all these things on any dataset so that i can have clear insight on model building. Can u pls make video on this for end to end model building
@nakul469
@nakul469 2 месяца назад
Hi, I just read your comment and I wanted to know how's your data science career going? I just completed ML and going to create an ML project for resume. Can you please give me any kind of suggestion if you are reading this comment.
@momaalim3086
@momaalim3086 4 года назад
Brilliant explanation. Thank you sir!
@donaldngwira
@donaldngwira 2 года назад
One of the best teachings on this subject. Thanks Krish
@aashishsahni90
@aashishsahni90 3 года назад
Great way of teaching...really helpful!! :)
@varshapatil
@varshapatil 4 года назад
Great explanation. Very well conveyed with proper examples
@vgaurav3011
@vgaurav3011 4 года назад
Finally completed your statistics playlist and can definitely say learned much more than other online courses
@amanpatra8092
@amanpatra8092 2 года назад
hello , i want to learn statistics for data science i don't have prior knowledge, will this cover the basics as i want to start from scratch
@najmeh5707
@najmeh5707 4 года назад
Thanks for the intuitive explaining.
@chinmaybhat9636
@chinmaybhat9636 4 года назад
@Krish Naik Sir Github Link is not there in the description link, for the Jupyter Notebook shown in this video, Can you Share the Same ?? Thanks & Regards, CHINMAY N BHAT
@mapytech
@mapytech 4 года назад
Great!...very good explanation...plz keep posting...thanks
@amoldumrewal
@amoldumrewal 3 года назад
Hi Krish, one quick question. I was going through some tutorials for batch normalization and got confused with which technique is used there. It seem like they first do min-max followed by standardization. Can you please help me here?
@santoshandawarapu340
@santoshandawarapu340 5 месяцев назад
I have gone through other speakers videos but they are hard to follow. I really liked the way of explanation in a very simple way with great examples. Thank you brother.
@nonamenoname1942
@nonamenoname1942 3 года назад
Thank you! Perfectly explained.
@Kim-bn4ub
@Kim-bn4ub 3 года назад
HI, can you please add the github link in the description? the github address is missing.
@user-uz5ld4oi6r
@user-uz5ld4oi6r 2 года назад
Clear message, clear structure, easy to understand, thank you
@lolwhatyesme651
@lolwhatyesme651 2 года назад
You're a great teacher. Thank you.
@aditisrivastava7079
@aditisrivastava7079 4 года назад
Thanks for your suggestion 🙏
@murmupk
@murmupk 3 года назад
Sir, you should also mention the dataset link in the description. This will help us to follow you.
@ahmeddhiael-euch8105
@ahmeddhiael-euch8105 Год назад
Very informative and helpful, thanks a lot Krish
@akshatabm4491
@akshatabm4491 9 месяцев назад
Great content. Thank you for explaining in the best way possible. However a small suggestion, please include the links of dataset your are using in the description box. It will be helpful to practice along while watching the video. Thanks again, cheers!!
@anshukaurav2896
@anshukaurav2896 6 месяцев назад
Awesome sir, you are explaining very easy way .
@AbcAbc-kx3xm
@AbcAbc-kx3xm 3 года назад
So clear explanation, thanks Krish
@owoeyebabatope2425
@owoeyebabatope2425 2 года назад
This short video has helped me understand a great deal of feature engineering. God bless you. I wish to learn more from you. I recommend you do a video on a full data science project and focus more on the thought process. While you also do a soft touch on various alternatives to whatever method you have used. This is Great!
@aryanchauhan9086
@aryanchauhan9086 3 года назад
I hope one day I will become data scientist like you , you are really helpful for aspiring data scientist like me
@deeptipancholi8814
@deeptipancholi8814 5 месяцев назад
Hey Have you become data scientist ? If yes please suggest me something
@monicasharma4344
@monicasharma4344 4 года назад
Hello Sir, Generally the range for normaization is 0 to 1. But I have read papers where different ranges are used Say 1 to 7. I want to know the logic/criteria behind selecting the target range?
@Truthhurt419
@Truthhurt419 4 года назад
Hi Krish.. I recently use standardization in classifier model and it improve my accury. I am aware that for classification method standardization is not recommended but in my case its improving accury. What would u suggest?
@cool70523
@cool70523 Год назад
thanks for your excellent explanation, but it confuses me when I try to filter lower-variance features, standardization isn't suitable because it scales all features' variances to 1, so in this case I should try normalization, but then is it right to perform PCR or PLSR next?
@rasha9462
@rasha9462 2 года назад
wow! great explanation .. Thank you 🙏
@hasibullahaman50
@hasibullahaman50 2 года назад
Thanks to you Chanel... it's so helpful for my UNI Lesson
@idrissjairi
@idrissjairi 2 года назад
Great Explanation, Thank you!
@utkarshpandya3155
@utkarshpandya3155 2 года назад
Hi Krishna.You're saviour.Apologies in advance if it is already asked question.(please advise if you have already answered and will find out the video). 1.Do you have any usecase where you do standardisation (with mean & std ) followed by min-max normalization so that you can compare same scalled features and then fit them into 0 to 1 or let's say 0 to 100 or -50 to +50 etc ? 2. any pros and cons of standardisation followed by min-max normalization ? 3. am i missing any logic by asking ? is there any solution for a scanario where you have more than 5 + features and user want it to scale in a single number so that instead of viewing the movement or change of 5 features,you will only focus on final score by means of min-max norm....hope it's clear out my question looking forward to see your answer.Regards & TIA + Thanks for this video.
@sayantansinha4545
@sayantansinha4545 4 года назад
Thanks for the awesome explanation
@AlbertRyanstein
@AlbertRyanstein 3 года назад
Hi, I really enjoyed the video. I was wondering is this the same as normalisation on keras.
@user-qz1hd4xp1p
@user-qz1hd4xp1p 4 года назад
thank you so much you are the man !!
@MrJaga121
@MrJaga121 4 года назад
Hi Krish, What will happen to notmalization if outliers are present in the data? Outlier treatment is necessary before applying notmalization? There a method in sklearn normalize, will it same as minmaxnormalizer.
@alexanderdushenin7035
@alexanderdushenin7035 3 года назад
Hi, Krish. For instance, I train my model on normalized dataset and I need to use this model. I have to classify ONLY ONE test object. How should I normalize this object?
@andresherrera4023
@andresherrera4023 4 года назад
Great explanation !! Thanks
@uchchwasdas2675
@uchchwasdas2675 4 года назад
best explanation, keep it up
@ashimmaity64
@ashimmaity64 4 года назад
sir, please make video on difference between GD,SGD,SGD (mini batch),SGD with momentum.
@user-es3wr6uf2l
@user-es3wr6uf2l Год назад
Thank you. Great explanation.
@rathnakumarv3956
@rathnakumarv3956 Год назад
after fitting model and predicting values using normalised/ standardised data, how to get back the original values for predicted results. no where reversing of normalisation is shown??? have had any video on this?
@dungtran-vk3ed
@dungtran-vk3ed 4 года назад
Here you go. Hope it can help you guys df = pd.read_csv('raw.githubusercontent.com/rasbt/pattern_classification/master/data/wine_data.csv', header= None, usecols=[0,1,2])
@maruthiprasad8184
@maruthiprasad8184 2 года назад
Thanks
@Sarasara-dg8gb
@Sarasara-dg8gb 3 года назад
What is the most adequate way of features scaling for ANFIS algorithm, normalization or standardization?
@imamamansoor5174
@imamamansoor5174 10 месяцев назад
Krish whenever i get confused for any Data Science topic, i search it on YT, if your video pops up for it, i definitely select your explanation for that topic.
@ashabhumza3394
@ashabhumza3394 2 года назад
I had been watching all your previous statistics videos and understood each concept well. Since I am not from mathematics background, In this video I couldn't understand what you explained in the part while telling what process to use when. Will this be a matter to bother in my data science learning journey?
@belimmohsin
@belimmohsin 3 года назад
Thank you sir..nice explanation :)
@123anandik
@123anandik 4 года назад
Good one 🤘🤘🤘 Actually z score is much widely used for most of the algorithms as i have seen. And I do practice the same all the time. The reason is the affect of the outliers. Outliers can be easily detected by z score. Normalistion between 0 to 1 just shrinks curves.
@crackthecode1372
@crackthecode1372 4 года назад
can u please explain ur outliers point
@lars1597
@lars1597 3 года назад
@@crackthecode1372 outliers are just noise
@TheMaverickanupam
@TheMaverickanupam 3 года назад
@@lars1597 Sometimes outliers are important noise. Outliers can tell a lot about data. They can't simply be dropped.
@bhaskartripathi
@bhaskartripathi 3 года назад
Minmax scaler is the most widely used in forecasting research papers. Z-score is not very good in time series forecasting
@purvanyatyagi2494
@purvanyatyagi2494 3 года назад
sir if the features are not normally distributed and we apply standard scalar then does it become distributed according to a standard normal???
@AugustNocturne
@AugustNocturne 2 года назад
Thank you this was very helpful.
@maximilianovazquez59
@maximilianovazquez59 3 года назад
I love you mate, thx Cheers!
@maheshvangala8472
@maheshvangala8472 4 года назад
I am trying to build a digit recognizer using SVM. So I should use Min max normalization right sir ?
@nutsom
@nutsom 2 года назад
@krish - i didn't quite understand when to use Normalisation and when to use Standard Scaler. Can u share with an example why standard scaler was used and another one where normalizer or min max scaler was used, and why.
@ShahnawazKhan-xy1ll
@ShahnawazKhan-xy1ll Год назад
Great Job very well explained
@arhantjain1703
@arhantjain1703 4 года назад
Hi...Please clear my doubt that data normalisation or standardisation should be done along the column i.e along each dimension of data. Right?. I'm a little bit confused, whether to do row-wise or column-wise data normalisation in excel. Please reply in this thread.
@vikeshgiri2369
@vikeshgiri2369 3 года назад
Today only started this playlist and today only completed, it is possible because the way sir❤️ explain is just amazing..❤️ Now I move to next part.
@keerthanpu808
@keerthanpu808 4 месяца назад
super guru! U made it a cake walk
@sulemanmasood1382
@sulemanmasood1382 2 года назад
sir my db is in mysql using MYISAM.... No foreign keys implemented with some columns repeating in tables as well... i have approximatelly 40 tables and all containns approx 500 rows.... but two or 3 tables contains above one million rows....db is not normalized.....but it is still working fine...on LAN and on cloud as well ... what will be fuiture of my db... can u help me...
@kamaladey2442
@kamaladey2442 2 года назад
Thanks Sir for sharing all wonderful videos, kindly provide the github link to download the dataset ,not getting from description box
@nirmalpatil5370
@nirmalpatil5370 2 года назад
Thank you so much !!!
@lokeshgaikwad4337
@lokeshgaikwad4337 Месяц назад
thanks for the great explanation sir .the link of channel you are talking about is not working pls help with that
@josealjndro
@josealjndro 4 года назад
In most of the cases I reproduce this kind of videos at 1.25x velocity, this one 0.75x haha nice videos Krish!
@saltanatkhalyk3397
@saltanatkhalyk3397 3 года назад
always the best explanation!
@alexeivodopianov5440
@alexeivodopianov5440 2 года назад
Absolutely excellent explanation
@antonyamalrajmorais160
@antonyamalrajmorais160 3 года назад
Krish Naik, what if the data has outliers. When we do normalization or standardisation, then the extreme value will probably get a value close to 1, due to this, most of the data gets assigned close to 0, how can we handle this ?
@adhipathis12
@adhipathis12 3 года назад
Thanks a lot Krish :)
@marcoreiter2795
@marcoreiter2795 4 года назад
Thank you for your video Krish, it was really helpful!
@purnimaps9819
@purnimaps9819 11 месяцев назад
Very informative thank u
@surafelm.w4058
@surafelm.w4058 3 года назад
How to normalize gridded datasets (X1 and X2) for neural network in python?
@abhinavmahajan448
@abhinavmahajan448 3 года назад
Paji tusi great ho :)
@visualvalidator5384
@visualvalidator5384 2 года назад
Commenting exactly on the same date, such a coincidence though, Thank you Krish for this video!
@tosinlitics949
@tosinlitics949 3 года назад
Excellent explanation!
@bhabanisankardash7387
@bhabanisankardash7387 3 года назад
Nice and useful information 👍
@jalendarch89
@jalendarch89 2 года назад
Is scaling required when we are doing regression with Decision trees (Including RF, GBM. XGBM etc.,)?. I got this doubt because of variance is key to split the node in Regression case. Please correct me In case I am wrong. Thanks.
@shindepratibha31
@shindepratibha31 4 года назад
Thank you for the video. It was useful. Can you please provide the github link?
@georgekokkinakis7288
@georgekokkinakis7288 5 месяцев назад
Have to say that your presentations really stand out , basically because of the distilled informations and to the point suggestions you make. One question though. At some point you talk about CNNs and that we have to use MinMax scaler. I am using CNN but on non image data, basically I see my data as an image of point values. Should I go with MinMax scaler or I could also use Standard scaler? And in order to be more specific lets say that I have an image of 7x7 where I want to keep the relative value differences between a pixel and its neighbours. Which scaling should I use in your opinion? Can we use standarization on the dataset in order to train a CNN or the values should be in [0,1] so we have to use minmax scaler. I am really interested to hear your opinion based on you experience.
@sandipansarkar9211
@sandipansarkar9211 3 года назад
hello krish .awesome video .But lease provide the GitHub link for practice.I have searched over in GitHub profile of yours but could not find it
@princemensah1866
@princemensah1866 3 года назад
Great explanation!
@winyourself553
@winyourself553 3 года назад
what are the algorithms in which gradient descent is not used?
@bhavanasaraswat7808
@bhavanasaraswat7808 3 года назад
Thanks sir for explanation
@swm7f137
@swm7f137 3 года назад
I like your why of explaination
@dalwindersingh5902
@dalwindersingh5902 4 года назад
Sir my test data and training data has different distribution ...what to do sir
@GaviniLok
@GaviniLok Год назад
Can you send the Github link for this code
@akshaykrishnan7985
@akshaykrishnan7985 4 года назад
Good evening sir.. I have a doubt.. When you say collecting data, how exactly is it done? Is it done through market research methods or any other methods are there for collecting it? Please do elaborate..
@RitwikDandriyal
@RitwikDandriyal 4 года назад
Hey. Data collection or Data Acquisition is one of the very first steps in designing a data science pipeline. Often Data Scientists working in tech companies get easy access to this data but if you want to collect data for yourself (on a small scale) then stuff like web scraping comes into play. Web scraping is nothing but collecting data from a website. Web Scraping is a popular means of collecting data. Often companies survey people to collect data. It can be an online survey or an offline one. The means of collecting data are endless.
@akshaykrishnan7985
@akshaykrishnan7985 4 года назад
@@RitwikDandriyal thank you for the reply.. 😊
@manojkumarpalaparti397
@manojkumarpalaparti397 4 года назад
Bro video about difference between fit, transform and fit_transform !!
@latabisht3591
@latabisht3591 3 года назад
Great explanation Sir
@vks1700
@vks1700 3 года назад
does transformstion required affter standardising the data to make it gaussian?
@shadiyapp5552
@shadiyapp5552 Год назад
Thank you sir ♥️
@KrishnaChaitanyakosaraju
@KrishnaChaitanyakosaraju Год назад
In linear regression, one common assumption is that all the features have 0 mean same variance. Which is similar to standardization. Hence it works.
@danielagbaniyaka6196
@danielagbaniyaka6196 3 года назад
Clean explanation Thanks
@ashabisht8051
@ashabisht8051 3 года назад
Pls tell difference in machine learning , AI , Deep learning n NLP
@ahmedelsabagh6990
@ahmedelsabagh6990 2 года назад
Great explanation
@dude5697
@dude5697 2 года назад
Should the features are related to each other if we want to normalize or standardize them?
@mayurgupta4004
@mayurgupta4004 3 года назад
can somebody tell me that it is possible that standard normal distribution cannot be normal that is mean is not equals to median for standard normal distribution
Далее
What Is P Value In Statistics In Simple Language?
11:18
I gave 127 interviews. Top 5 Algorithms they asked me.
8:36
Feature Scaling  (How it really works?) Explained !!
6:57
Normalization & Standardization
15:36
Просмотров 9 тыс.