Тёмный

Understanding missing data and missing values. 5 ways to deal with missing data using R programming 

Global Health with Greg Martin
Подписаться 170 тыс.
Просмотров 74 тыс.
50% 1

In this video I talk about how to understand missing data and missing values. I also provide 5 strategies to deal with missing data using R programming. If you're doing quantitative analysis or statistical analysis, your dataset will almost certainly contain missing values. Dealing with missing data using R programming is easy and I provide a step by step approach. This is an R programming for beginners video.
SUBSCRIBE:
--------------------
Click here: ru-vid.com_...
LETS CONNECT:
---------------------------
Twitter: @drgregmartin
Linkedin: / drgregmartin
Facebook: / thisweekinglobalhealth
SUPPORT THIS CHANNEL
-----------------------------------------
Patreon: / drgregmartin

Опубликовано:

 

13 июл 2024

Поделиться:

Ссылка:

Скачать:

Готовим ссылку...

Добавить в:

Мой плейлист
Посмотреть позже
Комментарии : 57   
@gregmartin
@gregmartin 11 месяцев назад
Get my FREE cheat sheets for Public Health, Epidemiology, Research Methods and Statistics (including transcripts of these lessons) here: www.learnmore365.com/courses/public-health-epidemiology-research-methods-and-statistics-resource-library
@fernleaf1
@fernleaf1 3 года назад
Great video. Looking forward to your videos about imputation and the MICE package. Keep’em coming!
@asiathogmartin7725
@asiathogmartin7725 3 года назад
You said, "Boom Shakalaka" LOL! Most awesome video ever.
@danmungai555
@danmungai555 4 года назад
Hello sir, this is amazing. You're a wonderful teacher. Please do more. Very many thanks from me here in Kenya
@gregmartin
@gregmartin 4 года назад
Thank you very much for the feedback. I’ve been to Kenya. Lovely country.
@danmungai555
@danmungai555 3 года назад
I have been having problems with functions, can you help? I would appreciate so much
@arifmemovic3383
@arifmemovic3383 2 года назад
You have saved hundreds if not thousands of hours of beginning analysts time. Thanks!
@gregmartin
@gregmartin 2 года назад
You're welcome!
@tuanlong9238
@tuanlong9238 3 года назад
11:48 - " Take care, stay well, don't do drugs, always do best, speak to you soon. Bye! " - that's a cool outro
@xprownz
@xprownz 3 года назад
Great video, helped me a lot cleaning some datasets in an easy way.
@chertify
@chertify 2 года назад
I'm watching halfway. I just hit subscribe. The content you put here in this video is just so well-explained! You translate codes into layman's term and have "tidily" edited your video! I love the zoom in and out effect of it and the sound effect. Not too much. Just right. Not annoying, rather impressive. Thank you for sharing your knowledge to us, Greg!
@chertify
@chertify 2 года назад
I just finished watching and taking down notes. Huge applause to you, Greg!!!
@gregmartin
@gregmartin 2 года назад
What awesome feedback, thank you! I really appreciate it!
@Junecode
@Junecode 2 месяца назад
Greg, thanks for ALL your elaborate videos and the structure of the lessons. In addition, the way you explain the code methodically! Love it. I was so stressed about replacing NA with none for the variable, gender (Pt 3 of handling missing values), turns out the variable is sex. Phew
@rezzyraptor
@rezzyraptor 2 года назад
I know this video is old, but still very helpful! I love your channel, you make stats and R fun :D Thanks for making these, keep up the great work.
@gregmartin
@gregmartin 2 года назад
Glad you like them!
@sakanablesakanable
@sakanablesakanable Год назад
Thank you so much Greg, can you please tell me what software you use for video editing? Thanks in advance
@ostione
@ostione 3 года назад
Best r tutorial , visuals, pace, delivery....so good!
@gregmartin
@gregmartin 3 года назад
Wow, thanks!
@adrianfletcher8963
@adrianfletcher8963 3 года назад
Please do a video on imputation in R! I was working on something and I was confused as to whether my data was "missing at random" or another option so I wasn't sure how to handle imputation.
@ruhafza4719
@ruhafza4719 3 года назад
Waoo...another great video
@eridianestrada8923
@eridianestrada8923 3 года назад
Hello, thank you for these videos. They are very helpful. Is there a video on what program evaluation is and how that looks in the global health context?
@dineshlakshitha1259
@dineshlakshitha1259 2 года назад
supper video clear, thank you soo much
@lilikoimahalo
@lilikoimahalo Месяц назад
This is a very insightful explanation:) thank you!
@gregmartin
@gregmartin Месяц назад
Glad you find it insightful. Thank you
@ousmanelom6274
@ousmanelom6274 3 года назад
You are a good teacher i like your video
@topabschalala9900
@topabschalala9900 Год назад
Impressed!
@gregmartin
@gregmartin Год назад
Thank you. Cheers
@jamesparker7700
@jamesparker7700 4 года назад
Hi Greg love your videos! Im a medical student who is going to intercalate next year in public health which im very excited about. Ive got a choice however between MSc International public health (with a focused stream on humanitarian studies) or MSc Humanitarian studies. Im interested in the working humanitarian relief space, but im wondering if I should I keep my studies a bit broader at the moment and study the MPH. Would be interested to know what you think in terms of if one would be more advantageous in my career. thanks James
@hazemshahin4166
@hazemshahin4166 2 года назад
great way of yours to finally simplify stats ...thank you
@gregmartin
@gregmartin 2 года назад
Glad it was helpful!
@nour_hisham
@nour_hisham 2 года назад
Thanks for the help, really appreciate, I have exam tomorrow, and you really helped Sir.😃❤
@gregmartin
@gregmartin 2 года назад
Glad it was helpful! Thank you :)
@markelov
@markelov 9 месяцев назад
Could you please make a video on testing MCAR and, given its assumption of multivariate normality, talk specifically about what to do with factor variables or logicals?
@haraldurkarlsson1147
@haraldurkarlsson1147 2 года назад
Greg, I am having trouble seeing the difference between changing missing data to value vs imputation. Are they not the same? Can you explain the difference. Thanks! Great lessions by the way.
@AndrzejFLena
@AndrzejFLena 3 года назад
Great introductory video! Thanks! :D I have a question for everyone: I'm imputing missing values for Gender in a dataframe. Out of the complete rows (no NAs) Male=61.89% and Female=the rest obviously. Is there a way I can impute the values randomly but in these proportions? It feels like there must be but I am new to R... Thanks!!
@bradvanbakel5781
@bradvanbakel5781 2 года назад
im a bit late but i guess if you do a conditional on a random number generator. So 0-0.3811 is Female and 0.3812+ is male
@sydbyd5040
@sydbyd5040 Год назад
🖐 great video, thanks. But didn't work for my case. There is a char format column, in my table (14 columns * 50000 rows) with up to 7000 missing values, but na.omit() can't find them. Is it possible it's due to invisible typed "space" that na.omit() can't find them? I hope I was clear.
@heartheart5543
@heartheart5543 3 года назад
Dear Greg, I've been watching all you R video in your other channel " R Programming 101". Why didn't you put this R video in that channel?
@mugambwajonah8352
@mugambwajonah8352 5 месяцев назад
I like the audio quality
@gregmartin
@gregmartin 5 месяцев назад
Thank you!
@deadlyderp5856
@deadlyderp5856 Год назад
Hello Greg, I have a question. I ran the following code and i want to run a regression on the adjusted dataset now. However, it takes the unadjested dataset instead of the adjusted one. Also, it creates a new dataset called ''.'' (so just a dot). This dataset is the correct adjusted one, but I cannot even use it. I am confused. library(dplyr) library(ggplot2) library(tidyverse) iabbd_8010_v1%>% select(Destination, Year, Origin, Mstock_Total, Mstock_Low, Mstock_Med, Mstock_High, Distance, Democracy_origin, Democracy_destination, GDPpc_origin, GDPpc_destination, Language, Population_origin, Population_destination, Border)%>% mutate(Mstock_Total = replace(Mstock_Total, Mstock_Total == 0, NA))%>% drop_na(Mstock_Total)%>% mutate(Mstock_Total = log(Mstock_Total))%>% mutate(Population_origin = log(Population_origin))%>% mutate(Population_destination = log(Population_destination))%>% mutate(Distance = log(Distance))%>% mutate(GDPpc_destination = log(GDPpc_destination))%>% mutate(GDPpc_origin = log(GDPpc_origin))%>% View() reg1 = lm(Mstock_Total ~ Distance + Language + Border, data = iabbd_8010_v1) --> so actually i should do data = . but it says ''.'' doesn't exist summary(reg1)
@michaelegbujua9369
@michaelegbujua9369 11 месяцев назад
I want videos on text manipulation
@tsehayenegash8394
@tsehayenegash8394 2 года назад
if you know please upload a video for the matlab code of Multivariate imputation by Chained Equation(MICE)
@CanDoSo_org
@CanDoSo_org 2 года назад
Na_if( ), it is just what I am looking for.
@gregmartin
@gregmartin 2 года назад
Thank you for the feedback, Reddy. Hope all is well
@rajiahdynsley2356
@rajiahdynsley2356 3 года назад
Great vid but instead of using the "%>%" function, how could we have done it? Since we are not able to save these changes made to the original dataset using "%>%" function.
@WORDHOUSEUS
@WORDHOUSEUS 3 года назад
create a new dataframe
@mayankvermashubhamkaran4373
@mayankvermashubhamkaran4373 2 года назад
Drop_na, complete.cases worked perfectly on R studio . But when I write the same code in kaggle new data frame doesn't have any value ?? Any suggestions ??
@16kush
@16kush Год назад
can we replace the NA without using library?
@navicto
@navicto 3 года назад
This video has useful information. However, it didn't help me understand missing data. It helped me understand how to filter out or replace missing values with a constant. Not the same.
@violaz1141
@violaz1141 4 месяца назад
Why do you use pipe at the end of each command?
@yininggao9990
@yininggao9990 3 года назад
Why my latest R version shows that no tidyverse package 😫
@miscelleneoustubes
@miscelleneoustubes 2 года назад
Sound system is very poor
@edisonwang1765
@edisonwang1765 2 года назад
wa la
@CheeseCakes11944
@CheeseCakes11944 Год назад
*what! "Don't do drugs"?? , youtube is one of the most addictive drugs in the world.
Далее
Hypothesis testing
10:15
Просмотров 23 тыс.
Clean your data with R.   R programming for beginners.
27:31
Dealing With Missing Data - Multiple Imputation
11:02
Missing Data Assumptions (MCAR, MAR, MNAR)
11:01
Просмотров 28 тыс.
Handling Missing Values using R
16:07
Просмотров 44 тыс.
Don't Replace Missing Values In Your Dataset.
6:10
Просмотров 8 тыс.