A Type I error is worrying too much, while a Type II error is being too carefree. :) I watched 12 videos on hypothesis testing in one day, and now I understand them. THANK YOU SO MUCH. I guess if I watch them another 10 times, I could teach this course. Xie xie (thank you)!
Brandon! ERROR @ 25:58, 3rd line. The sign for mu_0 should be negative and mu_a should be positive, so at the end it should be (mu_a - mu_0)^2, not (mu_0 - mu_a)^2! Please add an erratum! Also, please mention effect size, because sigma/(mu_0 - mu_a) is 1/(effect size). Thank you!
Hi Brandon! I'm a med student from Naples, Italy. On 25/02 I'll have my statistics exam, and I have studied entirely from your lessons. You've been such an amazing discovery! You're talented and smart, and people from all over the world believe in you! Thank you for everything :)
Thank you very much for your great effort. I think there might be a typo when solving for n while controlling beta. At 26:00, look at the difference between µ0 and µa: when you multiplied the equation by -1, I think they should have reversed their positions. Anyway, I look forward to your response for clarity.
Hail lord Brandon Foltz! The King of Stat-enborough!! You have given a humble peasant like myself immense previously unattainable knowledge and I cannot ask thy majesty for more. LONG LIVE THE KING! but seriously, you have helped me a lot and you are an amazing teacher, so thanks :)
Thank you so much for sharing this, Brandon. You can never imagine how much a group of Chinese students is benefiting from your videos; this is a lot more helpful than our textbook here!!
Hi Brandon, I was always afraid of statistics. I have never met a single teacher in my life who taught so efficiently and clearly. The clarity of the topics you present is awesome. I wish I had come across this channel 2-3 years back; life would have been easy. Thank you so much :)
This is yet another example of Heisenberg's Uncertainty Principle in action. When the two means, the hypothesized one and the true one, are close enough, they will be almost indistinguishable by any test, because the Type II error will be huge (meaning small test power), and the sample size would have to be enormous to make the two distributions narrow enough to tell apart. But do we really care whether the mean is 1 or 1.01? Sometimes yes, sometimes no; it depends on the phenomenon we're trying to investigate.

The standard practice should be (alas, it's not!) to first settle on the minimal difference we want to be able to detect, and then look for the minimum sample size that gives us enough statistical power to detect it (the resolution of the test, I'd call it). So, for instance, if we want to be able to detect a difference of at least 1 between means, and we hypothesize that the true mean is 1, then we calculate the sample size N so that the power is at least, say, 95% when the mean really is 2 (or 0, for two-sided tests). This power only increases as the true mean moves further and further away from 1. This is the way to do statistical tests PROPERLY.

The current practice, even in well-known journals like Nature, is flawed, inadequate, and even misleading, largely due to the lack of statistical knowledge of the scientists who carry out the tests... Well, mostly, because the truth also is that if you torture your data long enough, it will confess to ANYTHING, and people with a hidden agenda (sadly, this is especially true for pharma companies) use this principle to convince others of their "miraculous panacea for everything" and make easy money... Long story, though.
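The sample-size calculation described in the comment above can be sketched in a few lines of Python. This is a sketch under stated assumptions, not the video's own method: it assumes a one-sided z-test with known sigma, and the function name `min_sample_size` is mine.

```python
from math import ceil
from statistics import NormalDist

def min_sample_size(sigma, delta, alpha=0.05, power=0.95):
    """Smallest n so that a one-sided z-test at level alpha detects
    a true mean shift of delta with at least the requested power."""
    z = NormalDist().inv_cdf
    z_alpha = z(1 - alpha)  # critical value of the test
    z_beta = z(power)       # quantile matching the desired power (1 - beta)
    return ceil(((z_alpha + z_beta) * sigma / delta) ** 2)

# The comment's example: detect a shift of at least 1 with 95% power.
print(min_sample_size(sigma=1, delta=1))  # -> 11
```

Any true difference larger than `delta` only makes the test more powerful, so this n is a guaranteed floor for the whole range of alternatives you care about.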
In India Parents and Guru (teacher) are worshipped as God, many Indians going through this should be worshipping you! You are our "Revered Global Guru"!!!
Thanks a lot, Brandon; clearly explained. One suggestion: if you could point out (in the description) which video to watch next, it would greatly help us follow the full statistics course. Thanks a lot again.
This is the best series I have come across on hypothesis testing. Thanks a lot for all your efforts in making these videos. I have become your fan! I will surely watch all the other playlists you have uploaded on statistics.
Best video of the series, Brandon! I especially like your use of animations to explain the effect of standard error on the shape of the distribution. Great job!
I liked all your videos, and am in the process of giving thumbs up to all... the probability of me understanding any Statistics was very very low.. and these videos have changed the game for me!! Best wishes... hope many many more people benefit from your videos!!
Nice explanation, good videos. However, I would simplify things: set alpha = beta and calculate µ-alternative on that condition. That gives you a population mean, with the given sigma, sufficiently far away from mu-zero to make the test powerful against both errors. If you already know the population mean and variance in advance, there's nothing more to learn, given the normal distribution.
Type I and Type II errors have a tradeoff, like the bias-variance tradeoff in linear modeling, which leads to the optimal or "right" fit in regression models.
I feel proud that I not only know when to use the phrase "accept the null hypothesis", but also understand why! Tremendous video, thanks Brandon! One question: is the calculation performed here the same as a "power calculation"? I.e., if you performed these calculations before the study, would it be considered a power calculation? Since power is simply 1 - beta, I would guess yes.
Hi Brandon! Thanks for the excellent video list. I noticed that you used "accept H0" on the last slide. Is that a typo? Looking forward to hearing from you!
Thank you for the video. But I am not sure whether controlling Type II error is feasible in practice: if we redo the experiment with 36 samples, the observed mean changes, so the two sides of the equation used to solve for n are unbalanced again, and we get a different n. Do I understand correctly?
Awesome video! Just one question: the "mainstream" sample size formula is n = (1.96)^2 * sigma^2 / E^2. The denominator matches perfectly; the only difference I see from the formula you derived is the (1.96)^2 factor. I'm trying to see how this relates to your (z_alpha + z_beta)^2 factor. Could you help me see the relationship?
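One way to see the relationship asked about above: the margin-of-error formula controls only alpha, so it is the special case beta = 0.5 of the two-error formula, because z_beta = inv_cdf(0.5) = 0, with E playing the role of |mu_0 - mu_a|. A quick Python check (the numbers here are illustrative, not taken from the video):

```python
from statistics import NormalDist

z = NormalDist().inv_cdf
sigma, E, alpha = 2.0, 0.5, 0.05

# Margin-of-error formula: only alpha is controlled.
n_ci = (z(1 - alpha / 2) * sigma / E) ** 2

# Two-error formula with beta = 0.5, i.e. z_beta = z(0.5) = 0:
n_both = ((z(1 - alpha / 2) + z(0.5)) * sigma / E) ** 2

print(n_ci == n_both)  # -> True
```

So (z_alpha + z_beta)^2 generalizes (1.96)^2: the extra z_beta term is what buys you a Type II error below 50%.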
@ 15:17 you have an alpha level of .01 written to the left of the curve, and then to the right of the curve you write that, with n = 25, any sample mean above 3.699 would lead to rejecting the null hypothesis if sigma and alpha remain the same; but there you have written alpha = .05. This confused me a bit. Shouldn't it be 0.01?
Dear Sir: First of all, thank you very much for the great video! I have a question: your video does not include an example with proportions. In your example, the standard deviation (sigma) of the null distribution and the alternative distribution is the same. What about proportions? Say the null distribution mean is 0.5 (i.e. 50%) and the alternative distribution mean is 0.7 (i.e. 70%). Then the standard deviation of the null distribution is sqrt(p*q) = sqrt(0.5*0.5) = 0.5, while for the alternative it is sqrt(0.7*0.3) ≈ 0.458 (the standard errors divide these by sqrt(n)). From there, we can still figure out the appropriate sample size n for a specific beta. Am I correct?
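Yes, the same derivation goes through when the two distributions have different spreads: each z term carries its own sqrt(p*q). A hedged Python sketch for a one-sided, one-sample proportion test using the standard normal-approximation formula (my sketch, not something shown in the video; `n_for_proportion` is a name I made up):

```python
from math import ceil, sqrt
from statistics import NormalDist

def n_for_proportion(p0, pa, alpha=0.05, power=0.95):
    """n for a one-sided one-sample proportion z-test, letting the
    null and alternative distributions have different spreads."""
    z = NormalDist().inv_cdf
    z_a, z_b = z(1 - alpha), z(power)
    # Each side of the rejection cutoff uses its own sqrt(p*q):
    numerator = z_a * sqrt(p0 * (1 - p0)) + z_b * sqrt(pa * (1 - pa))
    return ceil((numerator / (pa - p0)) ** 2)

# The comment's numbers: null p = 0.5, alternative p = 0.7.
print(n_for_proportion(0.5, 0.7))  # -> 63
```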
Does this mean we cannot control the Type II error rate via sample size if we don't know the population standard deviation (sigma)? Or do we use the sample standard deviation (s) from a first study to estimate the "optimal" sample size, and then redo the whole experiment with that sample size?
Hi everyone, how would this work with proportions like 50% and 60%, where the population is equally divided into two independent categories? Say we want to calculate n and power for an increase from 50% to 60% in one of the categories, with alpha = 0.05. How would one find a common alpha/beta/n/power? For example, 1000 split equally into 500 and 500 for the two categories. In this case we don't exactly have a standard deviation; we can get the standard error with sqrt(p(1-p)/n), but how does one go about this problem?
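For the two-group question above, the usual normal-approximation formula gives the sample size per group. This is my sketch of the standard two-proportion approach, not the video's method: it pools the proportions under the null and uses the unpooled spread under the alternative, and all parameter defaults are assumptions.

```python
from math import ceil, sqrt
from statistics import NormalDist

def n_per_group(p1, p2, alpha=0.05, power=0.80):
    """Approximate n per group for a two-sided two-proportion z-test:
    pooled spread under H0, unpooled spread under the alternative."""
    z = NormalDist().inv_cdf
    z_a, z_b = z(1 - alpha / 2), z(power)
    p_bar = (p1 + p2) / 2
    se0 = sqrt(2 * p_bar * (1 - p_bar))        # pooled, under H0
    se1 = sqrt(p1 * (1 - p1) + p2 * (1 - p2))  # unpooled, under Ha
    return ceil(((z_a * se0 + z_b * se1) / (p2 - p1)) ** 2)

# Detecting 50% vs 60% at alpha = 0.05 with 80% power:
print(n_per_group(0.5, 0.6))  # -> 388 per group
```

So the 500-per-group example in the comment would be more than enough at 80% power, but not at very high power levels.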
Just a quick question... So would we only control the Type II error if we were testing from the perspective of the alternative hypothesis? What I'm actually asking is: in practical situations, when would we want to control the Type II error?
Hi there. When would we want to control the Type-II error? The short answer is: ALWAYS. This is how professional hypothesis testing should be carried out. And to preempt your question: Yes, it's possible to control the Type-II error but not in the way most people would like to think about it. To control this error for a continuum of alternative hypotheses, you have to first decide on the resolution of the test, that is, what kind of differences you want to be able to distinguish between... Do you really care if there is a difference of .01 between what you think is true and what really is true? This is the resolution of the test. The smaller the resolution, the more data you'll have to gather to be able to distinguish between what's true and what you think is true. But it's POSSIBLE. And you can construct tests with any power you desire for all alternatives you care about... and I mean "all alternatives you care about AT THE SAME TIME.", to be absolutely clear. It's not that hard if you notice that the power function is increasing as the means are getting further away from each other...
Hi Brandon. Thanks for the great explanation. I don't understand why having a sample mean at point B would result in an incorrect rejection (Type I error). Doesn't it fall in the rejection region?
Taking alpha = 0.01 means that 99 percent of the sample means fall inside our interval, but 1 percent of samples have means that extreme even when they come from the null population. Point B is one of those extreme samples: it belongs to the null distribution, so rejecting because of it is an incorrect rejection (a Type I error), even though it lands in the rejection region. If the two populations had been closer together, you would also see that the Type II error decreases as alpha increases, and vice versa. Let me know if that makes sense. Thank you.
I think if we round up to the next integer, the Type II error is controlled at (at most) 5%, and if we choose a sample size greater than 36, the Type II error rate decreases further as the alternative population mean moves away from the hypothesized mean.
I think controlling Type II error is impossible, since we cannot single out one value of the alternative mean. This would mean we can NEVER say that we "accept" the null hypothesis. Am I right?
Controlling Type II error is possible, but not in the way most people would think about it. Since the error is different for every particular alternative, what you have to do is first settle on the resolution of the test (the smallest difference in means you want to be able to detect with a given power) and then calculate the minimum sample size to achieve the power you want. Say, for instance, that you think the mean is 0 and want to be able to detect a difference of at least 1 with a power of 99%. What you then do is calculate N for your alternative of 1 so that your test has the stated power. If the true difference is even greater than 1, your test will have a guaranteed power of at least 99% anyway... This is how you control it.
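The monotonicity claim above (power only grows as the true mean moves away from the hypothesized one) can be checked numerically. A small Python sketch, assuming a one-sided upper-tail z-test with known sigma; the numbers mu0 = 0, sigma = 1, n = 36 are illustrative, not from the video:

```python
from math import sqrt
from statistics import NormalDist

def power(mu0, mua, sigma, n, alpha=0.05):
    """Power of a one-sided upper-tail z-test when the true mean is mua."""
    se = sigma / sqrt(n)
    crit = mu0 + NormalDist().inv_cdf(1 - alpha) * se  # rejection cutoff
    return 1 - NormalDist(mua, se).cdf(crit)  # P(reject | true mean = mua)

# Power only grows as the true mean moves further from mu0 = 0:
for mua in (0.5, 0.75, 1.0):
    print(round(power(0, mua, sigma=1, n=36), 4))
```

So once n guarantees the required power at the smallest difference you care about, every larger difference is covered automatically.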
Thanks for the video. I have a small correction. At time 19:48 you have z_beta = -1.645 but at at 28:02 you have z_beta = +1.645. I think the latter is correct.
I'd recommend you consider updating/improving/remaking this video. It's long, dense, and difficult to understand. Some different approaches may help certain folks understand the concepts.