Тёмный
No video :(

Data Lakehouse: An Introduction 

Bryan Cafferky
Подписаться 41 тыс.
Просмотров 20 тыс.
50% 1

Опубликовано:

 

22 авг 2024

Поделиться:

Ссылка:

Скачать:

Готовим ссылку...

Добавить в:

Мой плейлист
Посмотреть позже
Комментарии : 40   
@ayandapeter1681
@ayandapeter1681 2 месяца назад
Sir, I just want to say thank you so much, I've gone through many videos but was still confused, u made this crystal clear with all your conceptual approach.
@BryanCafferky
@BryanCafferky 2 месяца назад
Thank you for kind words. I'm so glad my videos are helping you. That's why I do them. I know this technology is not easy to learn so kudos to you for sticking with it.
@DenisGorev-xj5hl
@DenisGorev-xj5hl Год назад
It is amazing how concisely you put so much information in one video! Great!
@sujithravindran7082
@sujithravindran7082 Год назад
I really enjoyed the perspective you brought into the evolution. Great work. Please keep bringing in these great videos. Thank you very much.
@BryanCafferky
@BryanCafferky Год назад
Thank You! and you're welcome.
@HasanCatalgol
@HasanCatalgol Месяц назад
Underrated channel, really quality information.
@ioannisnikolaospappas6703
@ioannisnikolaospappas6703 Месяц назад
Life saver 🫡 Thank you sir!
@janni9789
@janni9789 Год назад
Again, perfectly explained. Thank you
@wennie2939
@wennie2939 Год назад
Best video on this topic ever!
@brokejohnnylive1530
@brokejohnnylive1530 2 месяца назад
Dude you are on the money!! Agree all 100%.
@gardnmi
@gardnmi Год назад
I'd love to see a non-bias comparison between delta lake, hudi, and iceberg.
@BryanCafferky
@BryanCafferky Год назад
So would I. lol. Iceberg seems to be Snowflake's version of Lakehouse. Not sure about hudi.
@BryanCafferky
@BryanCafferky Год назад
Looks like Amazon is promoting hudi.
@kamalesht5942
@kamalesht5942 Год назад
Your videos are really helping me improve the core knowledge on Data Engineering concepts. Thankyou!
@BryanCafferky
@BryanCafferky Год назад
Great to hear! You're welcome.
@GILLOS21
@GILLOS21 Год назад
Amazing lecture! Thank you!
@BryanCafferky
@BryanCafferky Год назад
You're Welcome!
@jayashreetheagarajan2708
@jayashreetheagarajan2708 Год назад
Amazing contents.. Thank you Bryan
@BryanCafferky
@BryanCafferky Год назад
You're Welcome! Glad it is helpful!
@WeAreTeamNovus
@WeAreTeamNovus Год назад
Amazing stuff, as always!
@BryanCafferky
@BryanCafferky Год назад
Thank you!
@BhaveshKumar-dz8hq
@BhaveshKumar-dz8hq 5 месяцев назад
you are a hidden gem
@stu8924
@stu8924 Год назад
Thank you Bryan.
@BryanCafferky
@BryanCafferky Год назад
You're welcome Stu.
@ChristianWDegn
@ChristianWDegn Год назад
Good presentation Thank!
@BryanCafferky
@BryanCafferky Год назад
YW!
@maheshthati1320
@maheshthati1320 8 месяцев назад
Best explanation
@potnuruavinash
@potnuruavinash 3 месяца назад
Can we implement data lakehouse with open source tools like spark, presto & hive metastore ? is there any alternative for unity catalog in open source eco system
@BryanCafferky
@BryanCafferky 3 месяца назад
Lakehouse is just Delta Lake, i.e., delta tables which are available in open source Spark so yes. Unity Catalog is really just a catalog of catalogs so you could build your own central catalog by extracting the meta data from local Hive metastores. I believe Spark tends to work one cluster at a time unlike Databricks which spins any number of clusters up as needed so not sure if UC could be implemented on open source Spark but perhaps?
@avishaysebban1515
@avishaysebban1515 Год назад
you're the best thank you.
@BryanCafferky
@BryanCafferky Год назад
You're welcome! Thanks for watching.
@rich111296
@rich111296 Год назад
do you have an example in any of your videos connecting to an s3 bucket specifying an endpoint within databricks? basically how to connect to an s3 bucket from a service other than aws? Thanks
@BryanCafferky
@BryanCafferky Год назад
Hmmmm.... No have not tried that. Have you googled it?
@rich111296
@rich111296 Год назад
@@BryanCafferky yeah ha, i did find a solution eventually, i think somewhere from stack overflow, searched around several places so i don't have the exact source "sc
@rich111296
@rich111296 Год назад
and run the function obvi
@prarthananeesh
@prarthananeesh 4 месяца назад
Can we use the lakehouse to replace a transactional system ?
@BryanCafferky
@BryanCafferky 4 месяца назад
See my reply to your question about OLTP.
@prarthananeesh
@prarthananeesh 4 месяца назад
Is it mainly used for OLAP or can this be used for OLTP also ?
@BryanCafferky
@BryanCafferky 4 месяца назад
It's meant for data warehousing, i.e., warehouse = lake + house, so warehouse on a data lake. OLTP has stringent requirements like high data transactions concurrency, referential integrity, etc. Delta logging is done at a file level whereas SQL databases log at a row level. See my video on Delta logs to get an understanding of what I mean.
@BryanCafferky
@BryanCafferky 4 месяца назад
Delta Logs 1: ru-vid.com/video/%D0%B2%D0%B8%D0%B4%D0%B5%D0%BE-pCH_qNqnms0.html Delta Logs 2: ru-vid.com/video/%D0%B2%D0%B8%D0%B4%D0%B5%D0%BE-ZSTJLfZy_Hs.html
Далее
Understanding Data Lakehouse
11:46
Просмотров 7 тыс.
Construction site video BEST.99
01:00
Просмотров 349 тыс.
Data Mesh, Data Fabric, Data Lakehouse - SQLBits 2022
56:26
Data Lakehouses Explained
8:51
Просмотров 86 тыс.
Core Databricks: Understand the Hive Metastore
22:12
Просмотров 15 тыс.
Data Warehouse vs Data Lake vs Data Lakehouse
9:32
Просмотров 43 тыс.
Why Databricks Delta Live Tables?
16:43
Просмотров 16 тыс.
Which Database Model to Choose?
24:38
Просмотров 51 тыс.
Advancing Fabric - Lakehouse vs Warehouse
14:22
Просмотров 24 тыс.