
Build a poor man’s data lake from scratch with DuckDB 

Dagster
25K views

DuckDB is so hot right now. Could it replace our cloud data warehouses or data lakes?
Pete Hunt builds a data lake from scratch with DuckDB and Dagster. Follow the tutorial here: dagster.io/blog/duckdb-data-lake
Combined with Dagster, S3, and Apache Parquet, DuckDB can become a powerful, multiplayer data lake that can serve the needs of many organizations with very little effort. Think of it as SQLite for analytics: free, simple, and embedded in your process.
We can see the rise of DuckDB for subsets of workloads that don’t have massive scale and can take advantage of its simplicity and fast performance.
Give it a go!
Try Dagster for free for 30 days: dagster.io/lp/dagster-cloud-t...

Category: Entertainment

Published: 27 Jun 2024

Comments: 10
@vikramtatke5930 1 month ago
As a person with just 2 years of experience my mind was blown watching this. I am a single person writing code in my department so I don't have any seniors to learn from but I'm leading a data engineering project that deals with terabytes of data and each request is multiple times larger than the server's RAM and multiple such requests need to be processed in parallel to complete stuff in time. Also, we have the tiniest possible budget to aggregate 25 to 30 columns and billions of rows every day. Also, we need to cut down on costs. This was super helpful.
@tobiaspucher9597 19 days ago
Awesome!!! Please more!
@michaelayoub2211 1 year ago
Great video, thanks!
@marcosoliveira8731 7 months ago
Really good stuff! A lot of good ideas.
@ImperialTerrain 1 year ago
thank you pete
@gauravlotekar660 1 year ago
aawwwseome.
@gw1284 1 year ago
Thanks for this demo. Can you comment on what role polars may play in this?
@hwy9nightkid 1 year ago
polars is akin to pandas or spark dataframes.. a way to organize your tables of data , if im not mistaken
@marcosoliveira8731 7 months ago
As pandas alternative.
@kalidsherefuddin 7 months ago
Thanks for