Тёмный
No video :(

Big Data Engineer Mock Interview | AWS | Kafka Streaming | SQL | PySpark Optimization  

Sumit Mittal
Подписаться 118 тыс.
Просмотров 13 тыс.
50% 1

Опубликовано:

 

21 авг 2024

Поделиться:

Ссылка:

Скачать:

Готовим ссылку...

Добавить в:

Мой плейлист
Посмотреть позже
Комментарии : 6   
@arunsundar3739
@arunsundar3739 4 месяца назад
very insightful on sql, aws, data modeling concepts & applications of those concepts, helps to recall & understand better the concepts learnt in big data master course & sql leetcode playlist :)
@sonuparmar5836
@sonuparmar5836 4 месяца назад
@sumitmittal07 The SQL aggregate question in which we need to calculate cumulative profit won't use ROWS Between as that will be used for rolling profit between a range, instead it should be simply: CUMULATIVE_PROFIT = SUM(profit) OVER(ORDER BY transaction_id, transaction_date). Let me know if I understood the question correctly or not. Also, in the partitioning and bucketing question interviewee have explained vice-versa.
@aniruths9900
@aniruths9900 2 месяца назад
You are right - Buckets are stored as files. Partitions are stored as directories.
@ankandatta4352
@ankandatta4352 4 месяца назад
In the case of creating a primary key in case unavailable, we can select any attribute and check if that attribute has 1 to 1 relationship with other composite values (in excel using a pivot table, check distinct values) and then use sha2 or md5 in adf to form the surrogate key. Correct me if I'm wrong
@rajeshvijayakumar
@rajeshvijayakumar 4 месяца назад
Yes, I was also thinking about md5
@dattabandi9226
@dattabandi9226 4 месяца назад
👌👌
Далее
Zombie Boy Saved My Life 💚
00:29
Просмотров 8 млн
GCP Data Engineer Mock  interview
15:22
Просмотров 1,8 тыс.
Google Data Engineer Interview Experience
16:46
Просмотров 37 тыс.
Top AWS Services A Data Engineer Should Know
13:11
Просмотров 160 тыс.