Тёмный

‪@duckdb‬ 

The Geek Narrator
Подписаться 38 тыс.
Просмотров 2,9 тыс.
50% 1

Deep Dive into DuckDB with CTO Mark Raasveldt
Decode the insights of databases with Geek Narrator podcast. In this episode, host Kaivalya Apte converses with Mark Raasveldt, the CTO of DuckDB labs, discussing his journey from being a database enthusiast to creating DuckDB. They delve into how DuckDB, an analytical database, differs from other databases, the design decisions, its internal mechanisms, and much more. The episode also highlights the advantages of DuckDB in analytics, the motivation behind its ACID compliance, and how DuckDB handles ingestion, transaction isolation, mutations, and queries. Join in to learn how your data workloads can benefit from DuckDB.
00:00 Introduction and Guest Introduction
00:44 Guest's Journey into Databases
03:40 The Birth of DuckDB
04:30 Challenges with Existing Databases
05:15 Technical Difficulties
05:16 Why Existing Databases Fall Short for Data Scientists
09:16 The Role of SQLite and Its Limitations
13:59 Defining DuckDB
16:48 Comparing DuckDB with Other Analytical Databases
19:50 Deployment Models for DuckDB
22:47 Data ingestion into DuckDB
22:51 Data Ingestion in DuckDB
30:24 How DuckDB Handles Updates and Mutations
35:35 Understanding Column Granularity and Rewrites
35:58 Implications of Compression on Data Updates
36:38 Trade-offs in Row Group Size
37:32 Benefits of Column Storage Model
38:15 Row Groups and Parallelism
39:02 Choosing Row Group Size: An Experimental Approach
40:00 Handling Data Type Changes in Columns
41:00 Internal Data Structures in DuckDB
42:21 Reading Data: Point Lookups, Aggregations, and Joins
47:22 Optimization for Full Table Scans
53:49 Understanding ACID Compliance in DuckDB
55:49 Multi-Version Concurrency Control (MVCC) in DuckDB
59:50 Use Cases and Applications of DuckDB
01:01:42 The Story Behind DuckDB's Name
01:02:34 Future Vision for DuckDB
References:
DuckDB: duckdb.org/
Mark's blog: mytherin.github.io/
===============================================================================
For discount on the below courses:
Appsync: appsyncmasterclass.com/?affil...
Testing serverless: testserverlessapps.com/?affil...
Production-Ready Serverless: productionreadyserverless.com...
Use the button, Add Discount and enter "geeknarrator" discount code to get 20% discount.
===============================================================================
Follow me on Linkedin and Twitter: / kaivalyaapte and / thegeeknarrator
If you like this episode, please hit the like button and share it with your network.
Also please subscribe if you haven't yet.
Database internals series: • Write-ahead-logging
Popular playlists:
Realtime streaming systems: • Realtime Streaming Sys...
Software Engineering: • Software Engineering
Distributed systems and databases: • Distributed Systems an...
Modern databases: • Modern Databases
Stay Curios! Keep Learning!
Cheers,
The GeekNarrator

Опубликовано:

 

28 июн 2024

Поделиться:

Ссылка:

Скачать:

Готовим ссылку...

Добавить в:

Мой плейлист
Посмотреть позже
Комментарии : 16   
@GabrielMagno
@GabrielMagno 6 месяцев назад
I've just found out recently about DuckDB and I'm each day more impressed with it. This talk with Mark was great to better understand the context, motivations and internals of DuckDB. Thanks!
@TheGeekNarrator
@TheGeekNarrator 6 месяцев назад
Thank you 🙏🏻 I am glad it helped.
@GabrielMagno
@GabrielMagno 6 месяцев назад
@@TheGeekNarrator It definetly did! Thank you very much. Your questions were very "on point" and you counducted the interview very well. Just found out about your channel and subscribed :-)
@nosh3019
@nosh3019 5 месяцев назад
Another good one. Thanks! 😊
@TheGeekNarrator
@TheGeekNarrator 5 месяцев назад
Thanks a lot 🙏🏻😀
@listarmingstar
@listarmingstar 6 месяцев назад
Good interview! Thanks.
@TheGeekNarrator
@TheGeekNarrator 6 месяцев назад
Thanks 🤩
@sohansingh2022
@sohansingh2022 6 месяцев назад
Beautiful content ❤
@TheGeekNarrator
@TheGeekNarrator 6 месяцев назад
Thank you so much 😀
@matveyshishov
@matveyshishov 5 месяцев назад
Thank you very much for GREAT guests and interviews! If I may ask for a favor, would appreciate if you had less of "hmm" and "yeah", no offense, it's just a bit distracting from what the guests are saying, when they are making a complex point and you are trying to focus on their words.
@TheGeekNarrator
@TheGeekNarrator 5 месяцев назад
Thank you for the feedback. Really appreciate it.
@michaelsimons2560
@michaelsimons2560 7 месяцев назад
Best party ice-break, really.
@TheGeekNarrator
@TheGeekNarrator 7 месяцев назад
Haha Absolutely. I am gonna try that soon.
@smyrnian_
@smyrnian_ 5 месяцев назад
A non-distributed database in a "distributed databases" playlist? Let's do CSV files next.. :)
@TheGeekNarrator
@TheGeekNarrator 5 месяцев назад
CSV files are great. Btw I don’t have a “Distributed Databases” playlist. I do have a “Distributed Systems and Databases”playlist, which is what you are referring to I believe. Having said that, DuckDb is very much relevant in the Distributed systems world as it can be embedded by another system.
Далее
Distributed SQLite with Litestream and LiteFS
54:37
Просмотров 7 тыс.
Big Data is Dead | MotherDuck
25:58
Просмотров 11 тыс.
Writing My Own Database From Scratch
42:00
Просмотров 129 тыс.
Production SQLite with Turso and libSQL
1:02:53
Просмотров 6 тыс.
Microservices with Databases can be challenging...
20:52
DHH discusses SQLite (and Stoicism)
54:00
Просмотров 52 тыс.