#spark #interviewquestions #dataengineers #pyspark #sparksql
Question: write Spark (or SQL) code to find the employee count under each manager.
input:
data = [('4529', 'Nancy', 'Young', '4125'),
('4238','John', 'Simon', '4329'),
('4329', 'Martina', 'Candreva', '4125'),
('4009', 'Klaus', 'Koch', '4329'),
('4125', 'Mafalda', 'Ranieri', 'NULL'),
('4500', 'Jakub', 'Hrabal', '4529'),
('4118', 'Moira', 'Areas', '4952'),
('4012', 'Jon', 'Nilssen', '4952'),
('4952', 'Sandra', 'Rajkovic', '4529'),
('4444', 'Seamus', 'Quinn', '4329')]
schema = ['employee_id' ,'first_name', 'last_name', 'manager_id']
output:
manager_id | manager_name | no_of_emp
4125       | Mafalda      | 2
4329       | Martina      | 3
4529       | Nancy        | 2
4952       | Sandra       | 2
Spark SQL solution:
df = spark.createDataFrame(data=data, schema=schema)
df.createOrReplaceTempView('EMP')
df.show()
# Self-join EMP to itself: alias e is the employee row, alias m is that
# employee's manager. The INNER JOIN drops rows whose manager_id has no
# matching employee_id (e.g. Mafalda's manager_id is the string 'NULL').
query = '''
SELECT e.manager_id,
       m.first_name AS manager_name,
       COUNT(e.employee_id) AS no_of_emp
FROM emp e
INNER JOIN emp m
  ON m.employee_id = e.manager_id
GROUP BY e.manager_id, m.first_name
'''
result = spark.sql(query)
result.show()
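If you want to sanity-check the expected output without spinning up a Spark session, the same self-join-and-count logic can be sketched in plain Python (a minimal sketch over the post's sample data, not part of the original solution):

```python
from collections import Counter

# Same rows as in the Spark example above
data = [('4529', 'Nancy', 'Young', '4125'),
        ('4238', 'John', 'Simon', '4329'),
        ('4329', 'Martina', 'Candreva', '4125'),
        ('4009', 'Klaus', 'Koch', '4329'),
        ('4125', 'Mafalda', 'Ranieri', 'NULL'),
        ('4500', 'Jakub', 'Hrabal', '4529'),
        ('4118', 'Moira', 'Areas', '4952'),
        ('4012', 'Jon', 'Nilssen', '4952'),
        ('4952', 'Sandra', 'Rajkovic', '4529'),
        ('4444', 'Seamus', 'Quinn', '4329')]

# employee_id -> first_name: plays the role of the "m" side of the join
names = {emp_id: first for emp_id, first, _, _ in data}

# Count employees per manager, keeping only manager_ids that exist as
# employees themselves (this mirrors the INNER JOIN, so 'NULL' is dropped)
counts = Counter(mgr for *_, mgr in data if mgr in names)

for mgr_id, n in sorted(counts.items()):
    print(mgr_id, names[mgr_id], n)
# → 4125 Mafalda 2
#   4329 Martina 3
#   4529 Nancy 2
#   4952 Sandra 2
```

This makes it easy to see why Mafalda never appears on the employee side of the result: her manager_id is the literal string 'NULL', which matches no employee_id, so the inner join filters her row out.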
Credit: poojatripathi0697, 4 Jun 2024