Тёмный
StatistikinDD
StatistikinDD
StatistikinDD
Подписаться
Love the R language for statistical computing and data analysis. Former SPSS and Stata user.

Wolf Riepl
Social Science Research
Hepkestr. 179
01277 Dresden
statistik-dresden.de
R Programming: My 10 Favorite Quotes
15:14
2 года назад
How to Include Data in Your R Package
16:00
2 года назад
Комментарии
@dominicacri5395
@dominicacri5395 29 дней назад
Very helpful. thank you!
@68chanakya18
@68chanakya18 Месяц назад
Thankyou for this I've completed my bachelor's in stats now thinking of doing my masters, this video really put my love for stats in words. Thank you from India.
@mikiallen7733
@mikiallen7733 2 месяца назад
I dont understand , how then one can update such programm in a dynmaic or.live setting, i.e. fetching data on a regular basis while embedding the whole thing within ui and server separate shiny app functions? Your input is highly appreciated Merci
@StatistikinDD
@StatistikinDD 2 месяца назад
@@mikiallen7733 I'd say esquisse is not meant for such settings, e.g. regular data updates. I'd use it to get started writing ggplot2 code. In shiny apps I wouldn't embed esquisse.
@mikiallen7733
@mikiallen7733 2 месяца назад
@@StatistikinDD thanks for the prompt response sir
@albinonhabanga8066
@albinonhabanga8066 2 месяца назад
I always get sane issue package ggstatsplot was built under R version 4.3.3 Doesn't work
@StatistikinDD
@StatistikinDD 2 месяца назад
You could try to get it from GitHub. Do you have the latest R Version installed?
@albinonhabanga8066
@albinonhabanga8066 2 месяца назад
@@StatistikinDD no. I think Now a have R version 4.3.2 I would like to update, do I need to Remove old version or I can try it: install.packages ("installr") library(installr) updateR()
@albinonhabanga8066
@albinonhabanga8066 2 месяца назад
@@StatistikinDD after that I Would like to Join two columns become unique column in a dataframe. So the new column might be character and have two categories corresponding to the old two columns names. While the old two columns had number of test observed and another the number of tests expected in the same health facility. These numbers is grouped in week (15 weeks). I want to keep this new column and another new column that have number of tests, and another the week identifications (1,2,3...15) If you have some video about basic r skills with dataframe manipulation, please suggest me
@StatistikinDD
@StatistikinDD 2 месяца назад
​@@albinonhabanga8066 Yes, installr is a good option. However, going over the 4.4 threshold you might need to re-install your packages. No need to remove the old R version.
@StatistikinDD
@StatistikinDD 2 месяца назад
​@@albinonhabanga8066 Other channels offer more on data manipulation, e. g. Statistics Globe (Joachim Schork). We have collaborated in the past.
@JessiJ-zz7rp
@JessiJ-zz7rp 2 месяца назад
Hallo wie Interpretiere ich denn die restlichen Effekte dann? D.h. Lernstunden alleine nicht signifikant (0,199 > 0,05) und nur in der Interaktion zwischen Lernstunden und Entspannung kann ich signifikanten Lernerfolg nachweisen?
@StatistikinDD
@StatistikinDD 2 месяца назад
Im Modell mit Interaktionsterm sind die Einzeleffekte nicht separat zu beurteilen. Ein Teil des Effekts steckt im einzelnen Koeffizienten, ein weiterer Teil im Interaktionsterm. Anschaulicher ist es, konkrete Werte für beide Prädiktoren einzusetzen und Vorhersagen dafür zu berechnen anhand der Regressionsformel.
@JessiJ-zz7rp
@JessiJ-zz7rp 2 месяца назад
@@StatistikinDD Danke! Mit p-Wert von 0,199 betrachte und interpretiere ich also auch nicht dessen Regressionskoeffizienten (-0,457), sondern nur den des Interaktionsterms? Gibt es eine Quelle dafür, dass man sich nur Variable des Interaktionsterms ansieht? Wie kann das zustande kommen, dass Lernstunden (x) auf Lernerfolg (y) in diesem Moderationsmodell nicht signifikant ist, wenn angenommen in einem linearen Modell eine Signifikanz zwischen Lernstunden und Lernerfolg (gleicher Datensatz) vorhanden wäre?
@StatistikinDD
@StatistikinDD 2 месяца назад
@@JessiJ-zz7rp Die Effekte verändern sich, wenn die Moderation mit berücksichtigt wird. Es ist schließlich ein anderes Modell! Alle Kombinationen sind möglich: einzeln signifikant, im Moderationsmodell nicht, oder umgekehrt, oder beide Mal sig. oder nicht sig. Die grafische Interpretation ist oft einfacher als das Grübeln über die (nicht mehr von einander unabhängigen!) Koeffizienten. Das Modell mit Interaktionsterm testet in erster Linie, ob eine (statistisch signifikante) Wechselwirkung vorliegt.
@saimajahan8317
@saimajahan8317 2 месяца назад
I like that video. I have a question is it possible to add data from a package (which is not mine) to t? If possible, then how?
@StatistikinDD
@StatistikinDD 2 месяца назад
You can add any data that you can get in the Global Environment. If it is already in a package, you may also add that package to Suggests or Imports.
@StatistikinDD
@StatistikinDD 2 месяца назад
So how: load data into Global Environment. Then proceed as in video. I'm just not sure if I'd like the same data in different packages. But there may be situations where it makes sense. E. g. to avoid dependency on that other package, maybe because it contains a lot more than the data, or because it is difficult to access in your environment.
@michael6154
@michael6154 3 месяца назад
You mentioned your GitHub profile, can you share the link for that? Thanks
@Epic-Poetry
@Epic-Poetry 3 месяца назад
That was beautifully explained.
@YEHE-ju7wm
@YEHE-ju7wm 5 месяцев назад
Thank you very much. Do you know how to use test_fun for R package?
@StatistikinDD
@StatistikinDD 5 месяцев назад
Can you give more details? Where do you get stuck?
@YEHE-ju7wm
@YEHE-ju7wm 5 месяцев назад
@@StatistikinDD I previously wrote an article introducing my r package, submitted it to the journal of statistical software, and received the following comment: 2 sample tests are run using TEST_FUN(group1, group2) and no formula version is available (it might be less natural in the multidimensional setting but e.g., Hotelling::hotelling.test implements one which is very clear and easy to use). test_fun is mentioned here, but I am not clearly how to use it
@StatistikinDD
@StatistikinDD 5 месяцев назад
@@YEHE-ju7wm Is TEST_FUN an actual function or maybe just a placeholder for more specific functions?
@mrtnzlmn1546
@mrtnzlmn1546 5 месяцев назад
Da ist was falsch! Wir "sortieren die Vermögen aller in aufsteigender Reihenfolge" und stellen fest, dass die untere Hälfte weniger zur Verfügung hat? Für den Median spielt das keine Rolle, aber die Aussage ist nicht richtig...
@StatistikinDD
@StatistikinDD 5 месяцев назад
Gut aufgepasst, danke für den Hinweis!
@deyvismejia7529
@deyvismejia7529 5 месяцев назад
i'm learning sql but want to do everything in R for convenience. this was very informative as i've seen many other videos show different approaches and packages
@Laura_Hiati
@Laura_Hiati 6 месяцев назад
Thank you.Well explained!
@haraldurkarlsson1147
@haraldurkarlsson1147 6 месяцев назад
Some Excel files are a real challenge to deal with. I found the unheadr package to be helpful in combination with some of the tidyverse packages (tidyr). It also helps to know a little bit about regular expressions (regex).
@xeniachomiak857
@xeniachomiak857 6 месяцев назад
Vielen Dank! Das hat mir sehr weitergeholfen. Super verständlich erklärt, in einem sehr angenehmen Tempo! :)
@shaileshchaskar6093
@shaileshchaskar6093 7 месяцев назад
Indeed very Informative
@ahmed007Jaber
@ahmed007Jaber 7 месяцев назад
thank you for this. I have come across the two packages but never heard about unpivotr::rectify. Thank you for sharing. This is really helpful
@tranle5614
@tranle5614 8 месяцев назад
Thank you so much. I knew about Quarto when it came out, but I needed convincing to learn it. I read some posts about Quarto to see if I need to use it, but I'm completely convinced after watching your video. Your video is short, but it covers all valuable points.
@StatistikinDD
@StatistikinDD 8 месяцев назад
Thanks, happy to read that!
@Mizantrop.4
@Mizantrop.4 8 месяцев назад
Hello friend, in statistics tool in R commander option summaries is blank, and i can't open it. Please give me the solution from this problem.
@StatistikinDD
@StatistikinDD 8 месяцев назад
Have you specified a dataset? Is the data suitable? Sometimes there may be conflicts with other packages. In such cases, closing Rcmdr, starting a new R session and reopening Rcmdr worked for me.
@StatisticsGlobe
@StatisticsGlobe 8 месяцев назад
Thanks for this presentation, Wolf! I haven't used quarto yet, so this was a great overview for me.
@StatistikinDD
@StatistikinDD 8 месяцев назад
Thanks, Joachim! On the one hand, it's fun to dive into new technology. On the other hand, it was another case of "Well, I guess I can throw away my training material on Reporting" when Quarto appeared on the scene ... 🙂
@StatisticsGlobe
@StatisticsGlobe 8 месяцев назад
@@StatistikinDD haha I know what you mean! :D
@langkorn707
@langkorn707 9 месяцев назад
Sie erinnern mich an meinen Lieblingsdozenten aus der Uni! Ich habe alle Funktionen schon in GitHub muss jetzt irgendwie richtig verstehen, wie ich diese in das Projekt bringe
@pipertripp
@pipertripp 9 месяцев назад
I'm pre-liking this. Gotta get some other stuff done first, but will return forthwith.
@StatistikinDD
@StatistikinDD 9 месяцев назад
Thanks!
@StatisticsGlobe
@StatisticsGlobe 9 месяцев назад
Thanks for the kind comment, glad you find the topic interesting! :)
@raould2590
@raould2590 9 месяцев назад
Thank you for this! The both of you are incredibly helpful with all things R! I enjoy your videos and look forward to more.
@StatisticsGlobe
@StatisticsGlobe 9 месяцев назад
Thank you so much for the kind words Raoul, glad you find our content helpful! :)
@tibaredha
@tibaredha 9 месяцев назад
Thank you for this series goooooooood job
@farnooshsheikhi
@farnooshsheikhi 10 месяцев назад
Hi this is really helpful, I was able to follow everything but when documenting the functions I got an error on "sew" in package knir . the man file wasn't created. I have 3 functions in one script
@StatistikinDD
@StatistikinDD 10 месяцев назад
Can you post the full error message? At what exact moment does it appear?
@user-sy9hj3zf7d
@user-sy9hj3zf7d 11 месяцев назад
Hi, mein kommt der Fehler (X11) ist für meine Version nicht verfügbar was kann ich machen?
@StatistikinDD
@StatistikinDD 11 месяцев назад
Hört sich nach Apple an. Habe keine Erfahrung mit Rcmdr auf iOS. Vielleicht helfen die Hinweise hier: www.john-fox.ca/RCommander/installation-notes.html
@huiqiao5251
@huiqiao5251 11 месяцев назад
Thank you so much for loading the videos, they have been super helpful! I tried Build -> More -> Load All and got this error message. The folder did have the processx and processx.rdb files, so I'm not sure what the error means. Could you please advise? Error in gzfile(file, "rb") : cannot open the connection In addition: Warning message: In gzfile(file, "rb") : cannot open compressed file '\\mypackages/processx/R/processx.rdx', probable reason 'No such file or directory' Error: unable to load R code in package ‘processx’meaasge
@StatistikinDD
@StatistikinDD 11 месяцев назад
Hi, I'm on holidays, just a quick idea: maybe an invalid file path? I think I got similar problems when my file path contained spaces.
@user-ve3no9pm9w
@user-ve3no9pm9w Год назад
wow, glad I dont use Stata, how inconvenient. great vid as always
@taitai645
@taitai645 Год назад
Crystal clear
@Ramira1407
@Ramira1407 Год назад
Vielen Dank für die Erklärung 🎉👍👍
@user-mh2pu1pe2n
@user-mh2pu1pe2n Год назад
Thanks, these videos are very good and I have created a package by following your instructions. My package is in a private github repo now. May I ask how do other users now access this library and use the functions? Do you have a video for that? Thank you very much!
@StatistikinDD
@StatistikinDD Год назад
To install from a private repo, you (i. e. potential users of your package) need to provide authentication. I wouldn't spread authentication information, so if I wanted others to access my package, I'd make the repo public. Maybe it helps if you have a private repo where you try out things, and a public repo containing a version you feel ok about being public?
@CanDoSo_org
@CanDoSo_org Год назад
Hi, guys. I got a strange problem. Some shortcuts work in Rmarkdown, but not in Quarto. For example, Insert Assignment Operator, Insert Pipe Operator. Anyone runs into the same problem?
@StatistikinDD
@StatistikinDD Год назад
Hi, haven't noticed that. What is your setup? Operating system, R version, quarto version, RStudio (?) version. For the quarto version, you can install the quarto R package. (Not a replacement for the quarto software, just a small interface.) Then use quarto::quarto_version().
@CanDoSo_org
@CanDoSo_org Год назад
@@StatistikinDD Thanks a lot. Problem solved by updating Quarto. Thanks man. But one more question: is there a way to automatically update the Quarto? I don't even know my Quarto is out of date if you did not mention it.
@StatistikinDD
@StatistikinDD Год назад
​@@CanDoSo_org Not aware of an automatic update, not from within R anyway. Quarto gets bundled with RStudio, so when you update that, you may get a new quarto version as well. You can install daily builds of RStudio to get quicker updates. (I don't do that.) Apart from that, I sometimes check the quarto website and manually download quarto.
@CanDoSo_org
@CanDoSo_org Год назад
@@StatistikinDD Thanks a lot.
@santiagosotelo6881
@santiagosotelo6881 Год назад
good info the argument device = agg_png instead of calling dev.on() - dev.off()
@user-mh2pu1pe2n
@user-mh2pu1pe2n Год назад
Thanks this is great! Would you know what the problem is with Error in .get_package_metadata(pkgdir) : Files 'DESCRIPTION' and 'DESCRIPTION.in' are missing
@StatistikinDD
@StatistikinDD Год назад
Do you have a DESCRIPTION text file at the top level of the package? Did you create the package using RStudio - New Project - Package?
@user-mh2pu1pe2n
@user-mh2pu1pe2n Год назад
@@StatistikinDD Thanks for your prompt reply! yes i do have a DESCRIPTION text file at the top level and I am creating the package using RStudio - New Project - Package.
@StatistikinDD
@StatistikinDD Год назад
@@user-mh2pu1pe2n Looks like a known issue. github.com/rstudio/rstudio/issues/12945 github.com/r-lib/devtools/issues/2512 Maybe try updating as much as possible, e. g. RStudio, devtools, pkgbuild
@user-mh2pu1pe2n
@user-mh2pu1pe2n Год назад
@@StatistikinDD Thank you!
@AestheticWaif
@AestheticWaif Год назад
March 💂 First lmao
@ricardito777
@ricardito777 Год назад
Thank you very much for this video! I am at the starting stage of my career and have only done data analysis from within a large corporation. However, there were many aspects of your experience that I could relate to like inheriting code and having to deal with Office format deliverables. I will follow you on twitter.
@Feeyful
@Feeyful Год назад
Dieses Video hat mir so sehr geholfen, vielen Dank!
@StatistikinDD
@StatistikinDD Год назад
Neu und besser: Adapter %dofuture% statt %dopar% Siehe www.jottr.org/2023/06/26/dofuture/ Dank an Henrik Bengtsson
@Laura-lh7xg
@Laura-lh7xg Год назад
Danke!
@MrBsir390
@MrBsir390 Год назад
what's funnier a Robot Telling jokes to Humans or Trump getting arrested by his own Rules concerning classified docs when he was president?
@anmolpardeshi3138
@anmolpardeshi3138 Год назад
how is the function accessing the DV even without mentioning it while calling the function?
@StatistikinDD
@StatistikinDD Год назад
Good question. The DV is hard-coded in the function and so not passed on as a function parameter. Line 14: model <- as.formula(paste("result ~", IV)) DV = result for all the models calculated here. Only the IV varies.
@anmolpardeshi3138
@anmolpardeshi3138 Год назад
@@StatistikinDD so the functions have access to local memory meaning they "know" where the result variable is stored?
@StatistikinDD
@StatistikinDD Год назад
@@anmolpardeshi3138 Functions can access R's Global Environment. The result variable (DV) is part of the dataset which is specifiied in the call to lm(), which accesses Global Environment. Note that, however, the workers for running the loops in parallel do not automatically have access to the Global Environment in this approach, at least on Windows. Each worker starts in an empty environment. So the data (and the reg function) are exported to the workers using the clusterExport function in this line of code: clusterExport(cl, c("reg", "data")) cl is the object resulting from the makeCluster call. I'd now recommend this approach: cl <- makeCluster(parallelly::availableCores(omit = 1)) See comment below and a later video on why to avoid detectCores(). ru-vid.com/video/%D0%B2%D0%B8%D0%B4%D0%B5%D0%BE-EYpuUKsGqPQ.html
@robRobblah
@robRobblah Год назад
Thanks for the video. very helpful. Looks promising, but for use cases where controlled page layout is critical (posters, "one pagers", brochures, etc), it seems maybe we'll have to wait for paged HTML support ala pagedown?
@StatistikinDD
@StatistikinDD Год назад
Currently I'm lacking experience in that. Read that fixed width and toggling with self-contained / embed-resources may help in PDF output stability. Thanks to Nicola Rennie.
@alvaroignaciotapia
@alvaroignaciotapia Год назад
Thanks for this brillian tutotial, I have a quesstion though: is it ok that I save the generated .tar file in my R package directory on my Desktop? Before ubmitting to CRAN. Because I assume that if I save the .tar file elsewhere, and submit my directory to CRAN, the vignette won't come up on the help tab, right? THANKS!
@StatistikinDD
@StatistikinDD Год назад
Hi, thanks for your feedback! For CRAN submission, you upload a tar.gz file to CRAN, which contains the vignette. From the R Packages book at r-pkgs.org/vignettes.html#sec-vignettes-how-built-checked Prepare built vignettes for a CRAN submission: Don’t try to do this by hand or in advance. Allow vignette (re-)building to happen as part of devtools::submit_cran() or devtools::release(), both of which build the package.
@markvenn230
@markvenn230 Год назад
So helpful. Many thanks, subscribed to see what else you can help with
@HDBT_
@HDBT_ Год назад
It works really great, thank you sharing this!
@robertzimbardo8227
@robertzimbardo8227 Год назад
Why do you waste so much time on explaining what the function inhouse_calc does? No one cares it scales, no one cares it's not exactly between 0 and 100, etc. The video title says this is about including data in your R package so it would be better to focus on that and leave everything distracting aside.
@StatistikinDD
@StatistikinDD Год назад
Thanks for your comment, Robert. I'll try to stick more to the point next time.
@marwanotrok4311
@marwanotrok4311 Год назад
@@StatistikinDD Clearly explained and straight to point. Saved me a lot of time just by watching this video. Thank you!
@sametayaz4891
@sametayaz4891 Год назад
You have a very logical point but some information about other stuff comes with main topic's wind. You don't just go from point a to b. You also see where the supermarket on your way to point b. This kind of informations is crucial for some people, and showing it with a realistic and alive example is also important. You can simply skip it. Anyways. Great tutorial @StatistikinDD keep it rooling!
@soumyatarafder7656
@soumyatarafder7656 Год назад
Great!
@uttammitra5683
@uttammitra5683 Год назад
great explanation!
@blaisepascal3905
@blaisepascal3905 Год назад
Thank you very much for your videos, I subscribed to the channel! Lately, I've been trying to convince my friends and colleagues in a subliminal way to go from Rmarkdown to Quarto. The arguments you presented are exactly what I was missing, thank you! I also have 2 questions: 1. Xarigan presentations are what prevent me from transferring all my remaining rmd to qmd. You said that reveal.js works pretty well on Quarto, is there a feature that is missing compared to Xarigan? 2. You said that Quarto could support languages that do not yet exist yet. I am also a Nim user, is it possible to use Nim in it and what would be the missing functionalities since the language has not been implemented yet? Again thank you for your video!
@StatistikinDD
@StatistikinDD Год назад
Thank you for your comment! Much appreciated. I still use xaringan when I need a pdf version. Had difficulties printing quarto/ reveal.js to pdf. Heard fixed width and toggling self-contained true/false could help. Haven't tried yet. For HTML, I really like reveal.js. Can't say about Nim, sorry. Maybe ask Posit forum.
@mikiallen7733
@mikiallen7733 Год назад
Where I can find a library in r which does very good visuals for the results of quantile regression ? More specifically for plotting which factors affect which quantile / sub group out of the whole sample ?
@StatistikinDD
@StatistikinDD Год назад
Check out the quantreg package. It has a plot function for quantile regression objects. Haven't used it myself though.
@pjoshi_15
@pjoshi_15 Год назад
Great video! Any idea on how to customize an individual page's URL in quarto?
@StatistikinDD
@StatistikinDD Год назад
Sorry, don't know. So far I've only used it for presentations.