Тёмный

What are Skewness and Kurtosis? (Read info below for more intuition) 

BurkeyAcademy
Подписаться 33 тыс.
Просмотров 172 тыс.
50% 1

Link to Data Set: www.stat.berkeley.edu/~statlab...
Here I show you how to understand numbers for skewness and kurtosis, with some example data and histograms on newborns. With Excel formulas for skewness and kurtosis. Roughly speaking, Skewness measures whether data stretch out farther in one tail than another, and Kurtosis measures whether the data has heavy tails (higher probability of outliers), or whether data is more concentrated in the center.
Definition: Deviation = xi-mean
As for intuition WHY it makes sense that the 3rd power indicates which side outliers might be on: An odd power preserves the sign. Numbers below the mean will have a negative deviation. LARGE negative deviations will become VERY LARGE negative numbers when cubed. If there are a preponderance of numbers farther away below the mean than above, you'll get a negative skewness measure. The opposite if they are on the positive side. If symmetric, they will effectively cancel each other out.
As for intuition why it makes sense that the 4th power indicates the presence of many "outliers" on both sides, or the lack of them (here by "outliers" I just mean numbers pretty far away from average, think more than one standard deviation)... By "many", we mean compared to a normal distribution. Picture if you will, two distributions. Distribution A is a perfect semicircle lying with its flat side down (See blue curve here: upload.wikimedia.org/wikipedi...) Distribution A will have negative kurtosis. Distribution B is shaped something like the Eiffel Tower, but with skinny tails (yet fatter farther out than a normal would have) stretching off on both sides (see Red image). It will have positive Kurtosis.
When raising things to the 4th power, numbers more than 1 will get pretty big, numbers less than 1 will get pretty small. If all of the numbers in a distribution are within one standard deviation, kurtosis will be a very small number. The more numbers with z scores of 2, 3, and 4 that there are, the more 2^4 and 4^4 we have to add in. This makes a very large kurtosis number. At the end of calculating kurtosis we normally subtract 3 since that is what a normal distribution's kurtosis is. So, negative sort of indicates "more closely packed" than a normal, negative means more with z scores farther away.

Опубликовано:

 

23 июл 2024

Поделиться:

Ссылка:

Скачать:

Готовим ссылку...

Добавить в:

Мой плейлист
Посмотреть позже
Комментарии : 113   
@rahim655
@rahim655 6 лет назад
Did I just learn a concept in 11 mins which took my class teacher 4 hours to teach? Pinch me. Thanks a ton.
@BurkeyAcademy
@BurkeyAcademy 6 лет назад
Cool man! Thanks for letting me know!
@sanjuktaghosh8802
@sanjuktaghosh8802 4 года назад
Thanks for explaining so clearly. I think you are the best Stat tutor on youtube.
@XT69xt69XT69
@XT69xt69XT69 4 года назад
love this basic, down to earth explanation … now I get it … 100 thumbs UP for you sir.
@terryliu3635
@terryliu3635 3 года назад
Great video. The explanations with examples are so clear and easy to understand! Thanks!
@minxxdia1132
@minxxdia1132 4 года назад
now I can finally go back to completing my data science assignment, thankyou so much!
@MrBitviper
@MrBitviper 2 года назад
damn this is one of the best explanations I've seen for skewness and kurtosis this is kinda like activating cheatcodes to learning.. thank you so much for this explanation... much appreciated
@Mereo110
@Mereo110 6 лет назад
My god, I perfectly understood your explanation, spoken like a human... unlike my statistics prof...
@alanhuang4193
@alanhuang4193 7 лет назад
great explanation. making sense for me even without any background knowledge.
@leeuct7683
@leeuct7683 3 года назад
Thank you so much. This helped me to understand how to approach numerous variables in my data set.
@ishtiaquekhandker1720
@ishtiaquekhandker1720 2 года назад
Thank you so much for this video. Love your teaching style!
@TimSter15
@TimSter15 4 года назад
Thanks for this! I couldn't find any other videos or much information on what the kurtosis values ACTUALLY MEANT!
@FlashStarGaming
@FlashStarGaming 4 года назад
Tony Stark teaching me statistics haha!
@markanderson5922
@markanderson5922 2 года назад
Awesome video! Super helpful for a grad student trying to learn how to analyze data
@5464654135756
@5464654135756 3 года назад
One of the best videos ever! Thanks a lot!
@echoecho5244
@echoecho5244 Год назад
better than my uni days, a million times better
@makalugiabdulnasser6622
@makalugiabdulnasser6622 7 лет назад
so thankful sir... very easy to learn from your presentation
@elpiopro
@elpiopro 4 года назад
Amazing and very simple explanation 👍🧐 thank you!
@nitadyola1200
@nitadyola1200 5 лет назад
Now I can understand my data distribution. Thanks a lot
@zk9339
@zk9339 Год назад
Thank you! The examples were clear and easy to understand.
@poulamitheshutterbug
@poulamitheshutterbug 5 лет назад
After 5 long years.... i finally understood. thanks SirJi ♥
@XT69xt69XT69
@XT69xt69XT69 4 года назад
I'm agree whole heartedly. This is the BEST teacher of math and statistics out there! Plus, the gal who posted this comment is good looking!!!
@fabiangonto
@fabiangonto 4 года назад
Thanks for this video. It was really helpful.
@eel3238
@eel3238 4 года назад
This helped so much! Thank you!
@7479zm
@7479zm 6 лет назад
Very well explained, thank you very much
@figdonpat
@figdonpat 7 лет назад
Fantastic video. Great communication.
@jamesdavis6564
@jamesdavis6564 4 года назад
Thank you, thank you, thank you!!
@rewiredMohit
@rewiredMohit 6 лет назад
So, if a histogram has, say, 4 bars and all have same frequency, it will be symmetric graph, not left/right skewed? Also what will be it's kurtosis?
@grandlong5462
@grandlong5462 4 года назад
Great explanation! Thank you
@anweshadutta8782
@anweshadutta8782 4 года назад
Thank you this has been very helpful
@debmalyasur2879
@debmalyasur2879 4 года назад
Thanks for the great information sir... It clears all my doubt🤘... Just one thing I want to ask you, you have said that if the absolute value of skewness is less than 0.5 then the distribution is quite symmetric... But what about kurtosis??? You have told that if kurtosis is closer to zero then it's awesome but can you tell a significant range, by which I can easily decide whether I have to work on outliers or not... Again thanks for the information 🤘
@lavakumarreddy2899
@lavakumarreddy2899 2 года назад
Perfect Explanation. Thanks Burkey :)
@ABC2691
@ABC2691 6 лет назад
Such an amazing way of explanation! I learnt so much from this video :) Thanks so much !! Looking forward to more content from you, sir. Respect to you from India !! Liked and Subscribed !!
@BurkeyAcademy
@BurkeyAcademy 6 лет назад
Thanks niceguy! Glad you liked it.
@ABC2691
@ABC2691 6 лет назад
Hello !! Thank you so much, sir! It is my privilege :)
@danhlawrence
@danhlawrence 2 года назад
Made it easy to understand. Thank you!
@chrisdaniels4446
@chrisdaniels4446 2 года назад
Thanks a lot for the explanation.
@vinodkumarp8978
@vinodkumarp8978 5 лет назад
way of explanation is super.give us more subject if possible.
@Pk-fe1rz
@Pk-fe1rz 2 года назад
Ohh so great this video. Thanks so much!
@suprateekat5338
@suprateekat5338 3 года назад
Thank you very much ! This was very useful !
@aymanzein7
@aymanzein7 6 лет назад
Nice and easy to understand.Thanks
@ALPA85
@ALPA85 2 года назад
THanks for the video. I really like it.
@jadelag
@jadelag 5 лет назад
Very informative. Thanks
@Viral_Vio
@Viral_Vio 4 года назад
very well explained...thank you :)
@ajiths1689
@ajiths1689 3 года назад
very great explanation
@newbie8051
@newbie8051 Год назад
Amazing explainatin thanks !!!
@jukkaniittymaa
@jukkaniittymaa 4 года назад
Very cool video, thank you!
@elpiopro
@elpiopro 4 года назад
Love it!
@mathiasmaximilianoamarillo2418
Thank you so much. !
@eugenesheely5288
@eugenesheely5288 4 года назад
Great vid. I've subscribed, will check out the rest of your content later.
@dipankarrahuldey6249
@dipankarrahuldey6249 3 года назад
Excellent!! Impressive!! I just wonder why do we need shapiro wilk test when we easily can determine normal distribution looking at the skewness and Kurtosis?
@BurkeyAcademy
@BurkeyAcademy 3 года назад
Because no observed sample from a normal distribution will have exactly the properties is was drawn from.
@freechatu
@freechatu 7 лет назад
Thank you for nice presentation... I just want to ask about how to calculate standard error for skewness and kurtosis ether manually or by excel functions?
@BurkeyAcademy
@BurkeyAcademy 7 лет назад
It is easy to Google this, but here is a Google result that explains it: estatistics.eu/what-is-statistics-standard-error-of-skewness-standard-error-of-kurtosis/
@freechatu
@freechatu 7 лет назад
Thank you for fast response.. Actually I tried that by google but for some reason, any link with that issue did not opened, also yous !! Thanks again.
@BurkeyAcademy
@BurkeyAcademy 7 лет назад
Happy to help!
@sumittripathi9694
@sumittripathi9694 4 года назад
Really, It was awesome.
@joseignacioporfirioordonez9403
@joseignacioporfirioordonez9403 4 года назад
Great video
@TheRogueRockhound
@TheRogueRockhound 4 года назад
Thanks Buddy: Subscribed
@THUYBui-dx2uc
@THUYBui-dx2uc 5 лет назад
Dear sir, from an another website I have read, kurtosis has to approach 3? It's different from what you have told in 5:48.
@BurkeyAcademy
@BurkeyAcademy 5 лет назад
What does "has to" mean? You probably mean "to have similar kurtosis to a univariate Normal distribution? The "basic" formula for kurtosis will give a kurtosis of 3 for a normal, but MOST of the time formulas go ahead and subtract 3 to give the "excess kurtosis" compared to a normal distribution. Since that is the way 99% of stats formulas in programs calculate it, I am referring to "excess kurtosis" here.
@sukursukur3617
@sukursukur3617 3 года назад
Why do we calculate standard deviation by using mean? Namely, why dont we use mode instead of mean?
@versace1589
@versace1589 6 лет назад
how did you get the numbers in age and etc...? please reply ASAP
@BurkeyAcademy
@BurkeyAcademy 6 лет назад
The link is in the description of the video.
@bonakelesarahndzinisa903
@bonakelesarahndzinisa903 5 лет назад
I'm a first time statistic student, I hope your videos will help me a lot. but if a Mean=1.373 833, Median=1452141, Skewness=0.0544 and Kuitosis=1.434581, what do they mean in the graph
@aashishpandey6047
@aashishpandey6047 5 лет назад
almost normally distributed data , i think median value is not right
@srinivassri2596
@srinivassri2596 4 года назад
Is there any video for 'Bassel's correction'?
@BurkeyAcademy
@BurkeyAcademy 4 года назад
No, but I could do that...
@desmondnji8178
@desmondnji8178 5 лет назад
Excellent
@ifrah3224
@ifrah3224 6 лет назад
I have a doubt..... some people say kurtosis is measure of 'tailedness' , but you mentioned it measures the peak? So confused!😅
@BurkeyAcademy
@BurkeyAcademy 6 лет назад
You are correct- in the video I probably did emphasize the "peakness" too much because I was emphasizing what the words "platy" and "lepto" mean. In the video description I tried to add more on the "tailedness" idea to balance it out, and added (Read info below...) to the title to try to point people to that additional information. Maybe I should re-record this one so as to make the video better/more complete. I apologize if I added to your confusion.
@ifrah3224
@ifrah3224 6 лет назад
BurkeyAcademy Thank you so much for clearing my doubt😊....no need to apologize, it's a big help in itself that you take time to share your knowledge with us😊❤
@musahamba2538
@musahamba2538 6 лет назад
thx very much
@Tom-rs7nd
@Tom-rs7nd 6 лет назад
Question! I have an exam tomorrow and there is one question in the mock exam that is driving me nuts! A researcher created a regression model and decided to use a logarithm on the "y-variable". What would be the reason to do so? A. The variable Y must have been positively skewed. B. The variable Y must have been negatively skewed. C. The variable Y must have been platykurtic. D. The variable Y must have been leptokurtic. Can anyone help me?
@BurkeyAcademy
@BurkeyAcademy 6 лет назад
Taking the log can take some kinds of data that are positively skewed and make them more symmetric- e.g., there is a distribution called lognormal, that is skewed, but if you take the log it becomes a normal distribution. en.wikipedia.org/wiki/Log-normal_distribution
@Tom-rs7nd
@Tom-rs7nd 6 лет назад
You are a hero among men, thank you so much!
@slckaz1183
@slckaz1183 Год назад
superb👌
@kevingepulle4371
@kevingepulle4371 6 лет назад
Is it just me or he sounds like Mr Stark? (Have I watched too much Marvel Movies? :o )
@BurkeyAcademy
@BurkeyAcademy 6 лет назад
Now this is new... I might like this suggestion! :) People usually say "Tom Hanks".
@rockbike
@rockbike 2 года назад
Which playlist does this video belong to?
@BurkeyAcademy
@BurkeyAcademy 2 года назад
It wasn't in a playlist, so I added it to this one on "Numerical Descriptive Statistics: ru-vid.com/group/PLlnEW8MeJ4z4YdizTw_wV4HThhHJ2zp0F
@rockbike
@rockbike 2 года назад
@@BurkeyAcademy thank you :)
@rockbike
@rockbike 2 года назад
@@BurkeyAcademy thank you :)
@MrNabiwishes
@MrNabiwishes 4 года назад
Skewness : In Between -0.5 to 0.5 : Symmetry >0.5 to 1 : Positive skewed >1 : too Much skewed to positive side -1 to -0.5 : Negative skewed
@BurkeyAcademy
@BurkeyAcademy 4 года назад
Just because data has 0 skew and low excess kurtosis does not mean it has a normal distribution. There are an uncountable infinity of distributions and mixtures of them.
@MrNabiwishes
@MrNabiwishes 4 года назад
So what should be ideal number for skewness and kurtosis to follow normal distribution.
@pankajkumarjalodia3565
@pankajkumarjalodia3565 7 лет назад
sir cannot find the raw data file for child health and development study and no link in the description.
@BurkeyAcademy
@BurkeyAcademy 7 лет назад
Thanks for letting me know, and sorry about that! I added the link now!
@pankajkumarjalodia3565
@pankajkumarjalodia3565 7 лет назад
thank you... so much sir.
@godiusrweyongeza3532
@godiusrweyongeza3532 7 лет назад
BurkeyAcademy
@henrypeterson8497
@henrypeterson8497 4 года назад
Kurtosis has to do with relative frequency of outliers not "pointiness."
@villwang8005
@villwang8005 3 года назад
can curtosis be exactly 0?
@BurkeyAcademy
@BurkeyAcademy 3 года назад
Yes, it is possible (software usually reports excess kurtosis which is zero for a normal distribution)
@srinivassri2596
@srinivassri2596 4 года назад
Much better
@sandeeppatel420
@sandeeppatel420 2 года назад
data set link is not working kindly upload new link
@BurkeyAcademy
@BurkeyAcademy 2 года назад
I just tested the link, and it is working fine.
@faustopf-.
@faustopf-. Год назад
So when an age distribution has a positive kurtosis, then it means that the distribution is pointy, okay.
@BurkeyAcademy
@BurkeyAcademy Год назад
I might be pointy, or could have other shapes with more outliers than a normal distribution.
@edhinman9276
@edhinman9276 3 года назад
I thought normal distribution had a kurtosis of 3?
@BurkeyAcademy
@BurkeyAcademy 3 года назад
It does, but what every computer program actualy calculates if "Standardized" or "Excess" kurtosis compared to a normal distribution. Since it is extremely rare for people to actually calculate kurtosis manually, I am explaining what the numbers that Excel, R, SAS, Stata, SPSS, etc. will tell you. So, if you fed normally distributed data into one of these programs, you would get a value of zero.
@i_mnaftali2743
@i_mnaftali2743 2 года назад
4:00
@allenjeremyvillanueva8530
@allenjeremyvillanueva8530 3 года назад
Mr. Stark is that you
@ravirajshinde465
@ravirajshinde465 4 года назад
its not pointy what kurtosis tells ,its the outlier in the datasets which makes high kurtosis
@BurkeyAcademy
@BurkeyAcademy 4 года назад
Of course I agree, which is why I say "Kurtosis measures whether the data has heavy tails (higher probability of outliers), or whether data is more concentrated in the center." This video is a brief, practical look, and I would be happy to make a more theoretical video if you like. However in practice, most of the time (in my experience) data with heavy tails also become more pointy looking, which is why the common terms "platykurtosis" (like a block) and "leptokurtosis" (narrow) came into being. For example, visually compare the shapes of the normal, t, and cauchy with excess kurtosis of 0, 6/(df-4) for df>4, and undefined (though thinking of it as infinite is justified). As you "squeeze" data from the middle out into the tails, it gets pointier. OF COURSE, the calculation of kurtosis does not directly measure pointyness, but it is a very common side effect. If you know of some common empirical cases where high kurtosis is coupled with LESS pointyness, I would love to learn about these cases! Thanks for the comment!
@ravirajshinde465
@ravirajshinde465 4 года назад
@@BurkeyAcademy i understand the now more about the pointy ness as we are trying to squeeze the shoulders and spread the data on tails.
@macfhlannchadharonan4668
@macfhlannchadharonan4668 5 лет назад
I was of the belief that kurtosis tells you nothing about the peak and only the tails? The tails in your explanation all seems quite similar. ”Kurtosis tells you virtually nothing about the shape of the peak- its only unambiguous interpretation is in terms of tail extremity, that is,either existing outliers (for the sample kurtosis) or propensity to produce outliers (for the kurtosis of a probability distribution)” (Westfall 2014)
@BurkeyAcademy
@BurkeyAcademy 5 лет назад
If you read the extra information in the video description I clarify this a bit- "extremity" is the key word in your quote. I apologize for being a bit misleading in the video.
@pika2253
@pika2253 5 лет назад
You sound like Tony Stark... Holy
@manindersingh3247
@manindersingh3247 2 года назад
You sound like Iron Man :D
@albertmichelson1615
@albertmichelson1615 4 года назад
I am Iron Man!
@vamsikhatri
@vamsikhatri 4 года назад
It's never peakedness but tailedness
@carloguerrero6583
@carloguerrero6583 2 года назад
Just sayin'. Your voice sounds like Iron man's 7u7
Далее
🤔
00:28
Просмотров 953 тыс.
What is Kurtosis? (+ the "peakedness" controversy!)
16:00
What are "moments" in statistics? An intuitive video!
15:44
Kurtosis (for the @CFA Level 1 exam)
7:38
Просмотров 4 тыс.
Normal Distribution EXPLAINED with Examples
10:59
Просмотров 804 тыс.
Statistics 101: One-way ANOVA, A Visual Tutorial
24:14
Просмотров 577 тыс.
🤔
00:28
Просмотров 953 тыс.