Тёмный
No video :(

Achieving Lakehouse Models with Spark 3.0 

Databricks
Подписаться 115 тыс.
Просмотров 7 тыс.
50% 1

It’s very easy to be distracted by the latest and greatest approaches with technology, but sometimes there’s a reason old approaches stand the test of time. Star Schemas & Kimball is one of those things that isn’t going anywhere, but as we move towards the “Data Lakehouse” paradigm - how appropriate is this modelling technique, and how can we harness the Delta Engine & Spark 3.0 to maximise it’s performance?
This session looks through the historical problems of attempting to build star-schemas in a lake and steps through a series of technical examples using features such as Delta file formats, Dynamic Partition Pruning and Adaptive Query Execution to tackle these problems.
About:
Databricks provides a unified data analytics platform, powered by Apache Spark™, that accelerates innovation by unifying data science, engineering and business.
Read more here: databricks.com...
See all the previous Summit sessions:
Connect with us:
Website: databricks.com
Facebook: / databricksinc
Twitter: / databricks
LinkedIn: / databricks
Instagram: / databricksinc Databricks is proud to announce that Gartner has named us a Leader in both the 2021 Magic Quadrant for Cloud Database Management Systems and the 2021 Magic Quadrant for Data Science and Machine Learning Platforms. Download the reports here. databricks.com...

Опубликовано:

 

29 авг 2024

Поделиться:

Ссылка:

Скачать:

Готовим ссылку...

Добавить в:

Мой плейлист
Посмотреть позже
Комментарии : 5   
@Sangeethsasidharanak
@Sangeethsasidharanak 3 года назад
Good to see u in Databricks summit 😀
@ashseth7885
@ashseth7885 2 года назад
Thanks for SCD demo, nicely explained
@findangoanalytics8938
@findangoanalytics8938 3 года назад
Great talk - would love to hear your thoughts on surrogate keys...
@mzhukovs
@mzhukovs 3 года назад
Do you make these slides available anywhere sir?
@SultanKhan-fq5jq
@SultanKhan-fq5jq 2 года назад
Create SCHEMA is recommended
Далее
Data Warehousing on the Lakehouse
39:15
Просмотров 6 тыс.
مسبح السرير #قصير
00:19
Просмотров 2,1 млн
журавли в пятницу
00:14
Просмотров 83 тыс.
Advancing Spark - Understanding the Spark UI
30:19
Просмотров 51 тыс.
Accelerating Data Ingestion with Databricks Autoloader
59:25
Common mistakes in big data models
1:13:23
Просмотров 64 тыс.
GEOMETRIC DEEP LEARNING BLUEPRINT
3:33:23
Просмотров 178 тыс.