Get my FREE cheat sheets for R programming and statistics (including transcripts of these lessons) here: www.learnmore365.com/pages/membership-r-programming-data-visualization-and-research-methods
Your videos are simply great! I feel I've become a bit more proficient with R also thanks to you. I recently started to do some PCA on my datasets and I'm wondering if you could give an introduction to multivariate analysis with R in your next videos. Keep up the good work 👍
Please, please keep making these videos. It is crucial for us data analyst who are just getting started and would like to continue expanding our knowledge. I would even donate if you had a patreon
Did anybody by any chance in the very beginning instead of having as the format 87 rows and 13 variables they have A tibble with 87 rows and 14 variables?? If so how could I change the variable count to match his? Any help is appreciated thank you!
> select(hair_color) Error in UseMethod("select") : no applicable method for 'select' applied to an object of class "character" This is the error system is throwing.
I have installed the package dplyr, still the problem persists. I am still not able to use the code - > starwars %>% + select(hair_color) %>% + arrange(desc(n)) %>% + View()
I've checked but can't find it. Where is the playlist for explore-clean-manipulate you mention in these videos? There are four playlists but none on that cycle exactly :)
This is so helpful, thank you so much! I am on the capstone project for Coursera analytics 8 part course and I am SO LOST lol. I know exactly what I want to see, but throughout the course there was only 1 module on "R" and many bookmarks were noted. But the commands are not organized so it takes hours to see what code is necessary to clean something or narrow the data that you are looking for. I think that I chose a project that has an overwhelming amount of data... and it has my Kaggle freaking out lol... I WAS going to use RStudio but I was having trouble importing the datasets that I needed so I figured that if I could just code in Kaggle; and that was the platform that I was going to showcase the project on (AND that is where I found the datasets that I am using) I might as well just run the code there.. but it is taking months lol.
Thanks for great videos, they are very helpful! Im just wondering, the sort command only shows the first 1000 rows.. is it possible to show/sort all of the rows? I have a daraser with 47k observation and 6k variables. Thanks!
Hello, I have been using R for 10 years and the Tidyverse for about a year or two. This channel is fantastic. What software do you use to do the coloured, highlighting, annotations ? Thanks
Wow thank you so much for this! You have taken away my fear of using this program, with this video I broke through a WALL that was in between me and my data analysis. Very clear way of explaining things as compared to other videos.
Hi, just a quick question when I use this line of codes sort(table(starwars$hair_color)) it is now showing the NA values , is it because of R does not know what the NA values are and can not sort it ? Do I think right ?
Honestly, I've never done anything in R before, just started randomly watching this video. I understood EVERYTHING! Just amazing, definitely subscribed.
Please HELP! Why are the functions "arrange" and "filter" not working on my data even after I have made sure and confirmed that the columns are numeric types of data?
Great tutorials and huge thank you for your content! I always use a split screen with my data set on the left and your videos on the right. Please keep providing! Greetings from Berlin
Thank you! I needed to catch up to all things tidyverse and your videos have been an absolute life saver. It's like you know exactly how my brain works. Are you going to get to spatial data and the 'sf' package at some point?
Amazing! Thx a lot, it really helped me to get my first steps in this new language, if you dont mind, I typed all the codes you showed and put some explanations on portuguese for those who might find it useful... ```{r} # Explorar # Limpar # Manipular # Descrever # Visualizar # Analisar install.packages("tidyverse") library(tidyverse) # Para ver quais sao as bases de dados que estao dentro do tidyverse data() # Qunado colocamos uma interrogacao antes da base de dados, o sistema retorna algumas informacoes sobre importantes sobre a base ?starwars # O comando abaixo retorna o tamanho da base de dados, ou seja, quantas observacoes e quantas variaveis. Esta e a dimensao da base de dados, linhas e colunas respectivamente. dim(starwars) # A funcao abaixo mostra a estrutura dos dados. Nela nos vemos o nome das variaveis, o tipo dos dados e o valor de algumas primeiras observacoes no formato de lista, pode ficar um pouco confuso... str(starwars) # O comando abaixo, glimpse, faz a mesma coisa, no entanto e um comando especifico do tidyverse. Ele e menos poluido visualmente glimpse(starwars) # O comando View, com V maiusculo, abaixo os traz um visao muito mais moderna da base de dados, como se tivessemos vendo no excel mesmo. No entanto esta visualizacao e feita em uma nova aba. View(starwars) # Usado para visualizar as primeiras observacoes da base, como padrao 6. head(starwars) # Mesma coisa do comando head, so que para as ultimas observacoes tail(starwars) # Se voce quiser ver especificamente uma unica variavel starwars$name # O comando attach faz com que o R entenda que estamos trabalhando somente com essa base, e, entao nao precisamos mais indicar que se trata da base de dados starwars por exemplo attach(starwars) detach(starwars) # O comando names mostra os nomes das variaveis names(starwars) # O comando length traz pra gente o numero de colunas dessa base length(starwars) # O comando class retorna o tipo de dados de que se trata a variavel em questao class(hair_color) # Quando colocamos length no nome de uma variavel o que obtemos e o numero de observacoes dessa variavel length(hair_color) # O comando unique nos traz os valores unicos presentes nessa variavel unique(hair_color) # Traz a frequencia com que a variavel assumiu determinado valor table(hair_color) # Sort ira sortir os valores abaixo do comando table do menor para o maior sort(table(hair_color)) sort(table(hair_color), decreasing = TRUE) # Do maior para o menor # Abre uma nova aba onde podemos ver o resultado do comando sort dentro do layout do View View(sort(table(hair_color), decreasing = TRUE)) # O comando abaixo apresenta um grafico de barras sobre as observacoes sortidas em ordem decrescente barplot(sort(table(hair_color), decreasing = TRUE)) # O comando %>% significa para o R executar a partir de %>% ate antes do proximo %>% starwars %>% select(hair_color) %>% # Apresenta a variavel com o valor de suas observacoes count(hair_color) %>% # Conta quantas vezes a variavel assumiu determinado valor arrange(desc(n)) %>% # Mostra os primeiros valores num layout tipo excel # Vamos agora ver como isolar os valores NA, aqueles que nao foram coletados, para olhar mais de perto # Seleciona todas as linhas e colunas da base starwars[ , ] # Responde se cada observacao e ou nao e um NA is.na(hair_color) # Seleciona somenta as que estao com NA com todas colunas View(starwars[is.na(hair_color) , ]) # O comando abaixo serve para vermos o tipo da variavel height, lembre-se tem que ter rodado o comando attach antes para funcionar class(height) length(height) # numero de observacoes para cada varaivel summary(height) # resumo estatistico boxplot(height) # diagrama de caixa hist(height) # histograma ```