For further learning, here are the links from the next to last slide: Arrow cheatsheet: raw.githubusercontent.com/rstudio/cheatsheets/master/arrow.pdf video intro: ru-vid.com/video/%D0%B2%D0%B8%D0%B4%D0%B5%D0%BE-O42LUmJZPx0.html full workshop from useR!: arrow-user2022.netlify.app DuckDB website: duckdb.org R package: cran.r-project.org/web/packages/duckdb/index.html data.table website: rdatatable.gitlab.io/data.table dtplyr (a data.table translator): dtplyr.tidyverse.org
A neat question to answer. I'm using the duckplyr library and it's nice to not have to think about anything. It does make a strong argument for having a fast hard drive (an SSD is an order of magnitude faster than a traditional HDD, an M2 is an order of magnitude faster than that, and modern nvme drives are even faster).