Тёмный

Top 5 Statistics Concepts in Data Science Interviews: P-value, Confidence Interval, Power, Errors 

Emma Ding
Подписаться 56 тыс.
Просмотров 58 тыс.
50% 1

Top 5 Statistics Concepts in Data Science Interviews
In this video, we will talk about the top 5 statistics concepts in Data Science interviews. I will show you how to explain those concept to both technical and non-technical audiences.
Typos
10:09 "hull" hypothesis should be "null" hypothesis
🟢Get all my free data science interview resources
www.emmading.com/resources
🟡 Product Case Interview Cheatsheet www.emmading.com/product-case...
🟠 Statistics Interview Cheatsheet www.emmading.com/statistics-i...
🟣 Behavioral Interview Cheatsheet www.emmading.com/behavioral-i...
🔵 Data Science Resume Checklist www.emmading.com/data-science...
✅ We work with Experienced Data Scientists to help them land their next dream jobs. Apply now: www.emmading.com/coaching
// Comment
Got any questions? Something to add?
Write a comment below to chat.
// Let's connect on LinkedIn:
/ emmading001
====================
Contents of this video:
====================
0:00 Intro
1:27 Structure your answer for technical audience
2:08 Structure your answer for non-technical audience
3:04 Power, Type I error, Type II error (for technical audience)
5:15 Power, Type I error, Type II error (for non-technical audience)
6:17 Confidence interval (for technical audience)
8:33 Confidence interval (for non-technical audience)
9:20 P value (for technical audience)
11:29 P value (for non-technical audience)

Опубликовано:

 

4 авг 2024

Поделиться:

Ссылка:

Скачать:

Готовим ссылку...

Добавить в:

Мой плейлист
Посмотреть позже
Комментарии : 84   
@songxiyou2347
@songxiyou2347 3 года назад
自己复习才发现,Emma真是将这些内容完全吃透,整理成自己的体系。不管是product sense还是stat,全部是干货并且非常organized。多余的废话一句没有(对比我自己的录音回答发现了一堆废话hhh)。非常感谢行业内有这样的领路人。继续期待product sense实例分析/stat & probablity 考点/take home & presentation思路总结和其他DS相关内容!Emma 新年快乐!新的一年身体健康,工作顺利,万事如意!
@zihenglin5294
@zihenglin5294 3 года назад
Thought I already known those stats concepts but still learned a lot from your video. The tips for technical and non-technical audience are very helpful! Thanks Emma. Love your content!
@katekatebangbang2435
@katekatebangbang2435 3 года назад
作为一个在面试的人,来回来去看了好多次emma的视频了,常看常新。谢谢Emma
@goodjuju2132
@goodjuju2132 3 года назад
Emma thank you so much for all of your quality content!! You're doing so much for the community
@insigh01
@insigh01 3 года назад
The way you structure your response is concise, and it makes it easy to understand these concepts. Thank you Emma!
@josephjoestar995
@josephjoestar995 2 года назад
So glad I came across this goldmine of a channel, honestly such great relevant topics with the most useful explanations - I trust you 100% to help with my interviews haha
@weiyang2116
@weiyang2116 3 года назад
Yay! Exactly what I was looking for! Thanks Emma
@ishitasadhukhan1
@ishitasadhukhan1 2 года назад
Amazing videos Emma ! I am preparing for data science interviews and feel so lucky and grateful that I found your channel ! I am making it a point to follow your advice to the words ! Thank you so much for what you are sharing with us!
@thudang2597
@thudang2597 2 года назад
This is amazing Emma! Thank you so much for such great content. I'm prepping for DS intern interview and your videos literally save me
@jingyou3481
@jingyou3481 3 года назад
This is really great. I've been thinking about how to explain p value to non-technical person and find a great example for a while. This is definitely very clear! Hope you can continue to make some videos for stats concept like Simpson Paradox etc
@sitongchen6688
@sitongchen6688 3 года назад
This is super clear, and now I have a good sense or expectation from the interviewer! Thanks Emma!
@281019641
@281019641 3 года назад
Thanks Emma. Very clear description and helpful to see the categorization accordingly for technical and non-technical audience.
@yuanliu2496
@yuanliu2496 3 года назад
I came across your video and it turns out to be super helpful! Thank you! subscribed.
@yinqiu6780
@yinqiu6780 3 года назад
So well explained! Thank you Emma!
@Nancy-wr7zb
@Nancy-wr7zb 3 года назад
Great video Emma !! Technical vs non technical explanations were very impressive !!
@fengzhoupan771
@fengzhoupan771 Год назад
Love the video! Thank you so much for the tips!
@user-kq3qv5mv4y
@user-kq3qv5mv4y Год назад
Super useful. One of the best DS videos I have ever seen !
@mihirbosemj
@mihirbosemj 3 года назад
The content you publish is so helpful for us to learn data science and prepare for interviews. Keep up the great work, and all the best :-)
@jeoffleonora4612
@jeoffleonora4612 3 года назад
Well explained. Thank you!
@taozhang7696
@taozhang7696 3 года назад
thank you. it's really helpful!
@Sethsm1
@Sethsm1 2 года назад
Extremely helpful. Thank you.
@alifiaz7792
@alifiaz7792 3 года назад
Very intuitive video. Please also consider making a video explaining the metrics for regression, classification and clustering machine learning models from both technical and business perspective.
@DataProfessor
@DataProfessor 3 года назад
Thanks Emma! Awesome video also for practicing data scientists, it’s a great video to brush up on our stats knowledge 😆
@emma_ding
@emma_ding 3 года назад
Thank you Data Professor!
@mussdroid
@mussdroid 3 года назад
We are going to moon on Data Science 🚀🚀🚀🚀 🌜🌜🌜 ! Thanks Emma
@shauniktaneja4733
@shauniktaneja4733 3 года назад
Thank you so much!
@hameddadgour
@hameddadgour 2 года назад
Great content!
@guimaraesalysson
@guimaraesalysson 2 года назад
Great video, helps a lot
@spotting_experiment
@spotting_experiment 2 года назад
Landed here preparing for my upcoming interview and this is very useful as a revision material as well.
@nisithaukkarapattanakul8860
@nisithaukkarapattanakul8860 2 года назад
Very clear explanation, thanks
@jayzune1752
@jayzune1752 Год назад
Wooo, smart and elegant lady! Thanks for your video, helped me a lot!
@wongkitlongmarcus9310
@wongkitlongmarcus9310 4 месяца назад
thank you Emma
@chengqian5737
@chengqian5737 2 года назад
给你一个大大的赞!
@jaden2582
@jaden2582 2 года назад
NO one word of bullshit. Appreciate it, Emma.
@hehuang3536
@hehuang3536 2 года назад
Hi Emma, I have watched a lot of videos you made and they are super clear and helpful for preparing my DS interviews. Thank you so much!
@emma_ding
@emma_ding 2 года назад
Hey, I'm so happy to hear that my videos have been helpful. Best of luck with your interviews!
@yenliknurasheva6322
@yenliknurasheva6322 2 года назад
I am very grateful for your useful videos! Great content! You are so smart and beautiful! 😇 Also preparing for DS interview, these videos help a lot!!!
@michellewww8036
@michellewww8036 2 года назад
Like it!!!!!
@liumx31
@liumx31 2 года назад
Hi Emma, thanks for the great explanation, one question though -- how is power used to determine the sample size? I thought the sample size determined the power, i.e. the larger the sample size the higher the statistical power.
@aliciama1745
@aliciama1745 3 года назад
really helpful! Thank you very much for do this! Emma, can you introduce * how to do a project* for the people who want to transfer to data science from other unrelated fields? Appreciate ahead of time!
@emma_ding
@emma_ding 3 года назад
For learning purpose, Kaggle is a really place to start. For "real-life" projects, you have to look for opportunities of side projects or in your current position.
@qingchuanlyu4605
@qingchuanlyu4605 3 года назад
This is really helpful. Now I know where my mistakes were!
@jiayiwu4101
@jiayiwu4101 3 года назад
Wow, super cool summary! Really practical! Thanks Emma. Would you mind sharing slides or text then?
@emma_ding
@emma_ding 3 года назад
Sorry there are no slides. It's part of the video editing.
@niveditakumari701
@niveditakumari701 2 года назад
Thank you for the video, can you please share another example for p-value in the layman's term?
@yogiHalim
@yogiHalim Год назад
Significance (p-value 80%) is the probability of correctly [rejecting the null hypothesis while it is false.]. (probability of not testing positive pregnancy for male) for 3 or more outcome, [testing negative] >< [not testing positive]. Significance is thus the probability of Type I error, whereas 1−power is the probability of Type II error.
@anathemaconscience5666
@anathemaconscience5666 2 года назад
hi emma, i am kind of confused to the p value. At 10:33 you mentioned small p, more convinced of difference. But at 11:22, you said p value represents there is a diff given null hypo is true, meaning higher p, more convinced of difference. But given the height example, i believe small p larger difference, so at 11:22, why would you say p means there is a diff given null hypo is true?
@amitkhandelwal2999
@amitkhandelwal2999 3 года назад
Great Video. It would be great if you can also provide the info on how to deal with these concepts in practical scenario. I mean to say, how to increase power of test. How to decrease FP / FN / countereffects. That will give a complete end to end picture while dealing with them when someone encountered in such problems while implementing these things in practice. Loved all other videos which I have seen till today in your channel.
@kuifeiliu3203
@kuifeiliu3203 3 года назад
good explanation! better to put non-technical part first
@Mackymon
@Mackymon 3 года назад
Great Vid! Follow up question: how do you get a feel for how technical your audience actually is?
@emma_ding
@emma_ding 3 года назад
Look at their public profile like LinkedIn :)
@hehuang3536
@hehuang3536 2 года назад
In one of my technical interviews, the interviewer asked me how do you explain the concept to your grandma?
@waliatv
@waliatv Год назад
Very informative and helpful ❤
@emma_ding
@emma_ding Год назад
So happy to be of assistance, Mrinal! 😊
@waliatv
@waliatv Год назад
@@emma_ding just ended up with my data scientist internship interview and it was very very good. Thankyou for such amazing content. It was very helpful for last minute brushup of key skills and i am hoping for positive results from my interviewer 🤞✨
@emma_ding
@emma_ding Год назад
That's fantastic to hear, Mrinal! Feel free to keep me posted with how your results go. Fingers crossed, and sending you good luck! 💛
@waliatv
@waliatv Год назад
​@@emma_dingThankyou so much for the good wishes and all your hard work in videos was worth it because we benefited from them a lot. Also, I would like to share that I have accepted the Data Scientist Internship with Loblaws Companies in Toronto, Canada, for the coming Winter of 2023. I am so excited and obliged to start my new journey in Data Science. It was difficult but with consistent hard work and good resources such as your channel, I am now going to follow my dream career. Thankyou once again for all good work and keep posting such insights and helpful resources on DS, as it will still help me during my professional career.
@emma_ding
@emma_ding Год назад
Mrinal! This is fantastic news! Thank you for sharing this huge win with me, and congratulations on your new role. I can't wait to hear what else is in store for you in the future. Sending you all the best! 🥳
@yogiHalim
@yogiHalim Год назад
95% confidence interval shows 95% from the center of a normal distribution population is represented. ie: 5% outliers are not represented by the equation
@muse3324
@muse3324 3 месяца назад
1:41 "It should not be obscure like what you see in Wikipedia" 😅😁😁
@LouisChiaki
@LouisChiaki 3 года назад
A comment on the confidence interval, I think your interpretation (and a lot of data analyst) is from Frequentist's point of views. For Bayesian, there is no fixed true value.
@jennywu799
@jennywu799 3 года назад
Emma, 可以不可以出一个视频总结一下常用的distribution,有的时候面试的时候被问到sales data是什么样的distribution,我每次都答normal。。。
@jaden2582
@jaden2582 2 года назад
poisson distribution
@bhageerathbogi4951
@bhageerathbogi4951 3 года назад
Hi Emma, Can you please share a link to the slides.
@emma_ding
@emma_ding 3 года назад
Sorry, there's no slides, it's all part of the video editing. But I'll definitely consider providing it in the future if it helps!
@mussdroid
@mussdroid 3 года назад
#datascience
@plttji2615
@plttji2615 2 года назад
What if N increase, does it affect P-value?
@InoHimeYa
@InoHimeYa 2 года назад
13 mins saves me at least 3 hours
@mahdimerced
@mahdimerced 3 года назад
Why did you delete most of the previous movies?
@emma_ding
@emma_ding 3 года назад
You can find all my videos under the VIDEOS tab on my channel page. I changed the thumbnails of some videos a few weeks ago. :)
@nikhilmuthukrishnan7222
@nikhilmuthukrishnan7222 2 года назад
You think your thumbnails are so cute!!! Well they are
@LouisChiaki
@LouisChiaki 3 года назад
10:09 some typo on the slides. Should be "null" not "hull" hypothesis :D
@emma_ding
@emma_ding 3 года назад
Thanks for catching the typos!
@robertwilsoniii2048
@robertwilsoniii2048 Год назад
This is really basic... how do jobs require multiple years of experience when these interview questions are just basic thing you learn in an intro stats class... ???
@jaden2582
@jaden2582 2 года назад
could you explain the "AT LEAST as extreme as the data is actually observed" in the definition of the p value?
@emma_ding
@emma_ding 2 года назад
Hey so an example would be when you are doing a test - if the means of two populations are the same, your null hypothesis is that those two are the same. Now you have observed data that shows that the difference is 1. “AT LEAST as extreme as the data is actually observed" means the difference is 1 or larger. 1 is the observed data and AT LEAST as extreme means that is the minimum difference. I hope this helps!
@jaden2582
@jaden2582 2 года назад
@@emma_ding Thank you for this clear explanation!
@brothermalcolm
@brothermalcolm 3 года назад
Non-technical audience!
@djjiang3718
@djjiang3718 3 года назад
What a beautiful lady with high-quality content!
@Galax224
@Galax224 3 года назад
Hi Emma I suggest you name your channel so every time you introduce you can say welcome to !@$!@#$!@#~!# instead of my channel and it's unique to impress people.
@poopah4497
@poopah4497 2 года назад
the higher CL -> wider c.I? Is that a typo? I thought the opposite
@emma_ding
@emma_ding 2 года назад
Hey Ruiruo! It's not a typo, the higher CL, the wider the CI, because increasing the confidence will increase the margin of error resulting in a wider interval.
@davidwarner1248
@davidwarner1248 4 месяца назад
Such a poor pronunciation
Далее
🤡Украли У ВСЕХ🤪
00:37
Просмотров 179 тыс.
Bootstrapping Main Ideas!!!
9:27
Просмотров 443 тыс.
The most important skill in statistics
13:35
Просмотров 313 тыс.
A/B Testing Interview with a Google Data Scientist
13:06
Entropy (for data science) Clearly Explained!!!
16:35
Просмотров 592 тыс.