
Data Engineer Interview Question - 6/10 [ Count the number of movies in each genre? ] 

Pooja Tripathi

#spark #interviewquestions #dataengineers #pyspark #sparksql
Question: Count the number of movies in each genre.
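from pyspark.sql import SparkSession

spark = SparkSession.builder.getOrCreate()  # assumption: reuse or create a session
df = spark.createDataFrame([
    ('The Shawshank Redemption', ['Drama', 'Crime']),
    ('The Godfather', ['Drama', 'Crime']),
    ('Pulp Fiction', ['Drama', 'Crime', 'Thriller']),
    ('The Dark Knight', ['Drama', 'Crime', 'Thriller', 'Action']),
], ["name", "genres"])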
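The question comes down to two steps: turn each genres list into one row per genre, then group by genre and count. The sketch below is a plain-Python illustration of those two steps on the same sample data, so the intermediate result of each step is easy to inspect. It is not taken from the video; the names `movies`, `exploded` and `genre_counts` are made up for this example, and the PySpark calls named in the comments (`explode`, `groupBy(...).count()`) are the assumed counterparts, not executed here.

```python
from collections import Counter

# Same sample data as the DataFrame above, as plain (name, genres) tuples.
movies = [
    ("The Shawshank Redemption", ["Drama", "Crime"]),
    ("The Godfather", ["Drama", "Crime"]),
    ("Pulp Fiction", ["Drama", "Crime", "Thriller"]),
    ("The Dark Knight", ["Drama", "Crime", "Thriller", "Action"]),
]

# Step 1: "explode" each genres list into one (name, genre) row per genre.
# Assumed PySpark counterpart:
#   df.select("name", explode("genres").alias("genre"))
exploded = [(name, genre) for name, genres in movies for genre in genres]

# Step 2: group the exploded rows by genre and count them.
# Assumed PySpark counterpart:
#   exploded_df.groupBy("genre").count()
genre_counts = Counter(genre for _, genre in exploded)

print(len(exploded), "exploded rows")
for genre, count in genre_counts.items():
    print(genre, count)
```

With this data the exploded step yields 11 rows (2 + 2 + 3 + 4), and the counts come out as Drama 4, Crime 4, Thriller 2, Action 1. Counting rows equals counting movies here only because no movie lists the same genre twice.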
Check out:
/ poojatripathi0697

Science

Published: 9 Jun 2024

Comments: 5
@vigneshkumar8808 13 days ago
Try to show the output of each step. That would make it clear to everyone. For example, show the output after exploding the rows, and then after the groupby and count.
@dataengineerscorner 13 days ago
Ok, sure.
@AnuragYadav-vi1ol 14 days ago
Hey Pooja didi, I really want to get into this. Can you please mentor me? What more do I need to learn, and how can I get an internship?
@dataengineerscorner 14 days ago
Hi Anurag, the basic things you will need to become a data engineer are SQL, Python, and knowledge of Spark. You can take Udemy courses on Spark or PySpark if you are new to big data technology. For internships, you can apply to analytics companies through their career websites, and you can also create an account on naukari.com. Companies that hire freshers in analytics: Tiger Analytics, Fractal, Mu Sigma, Tredence. You can check these out.
@AnuragYadav-vi1ol 14 days ago
@dataengineerscorner Initially I was aiming for data scientist. I am good at Python and C++ (and DSA in both), matplotlib, etc. I completed SQL last year and only need to revise it, and I'm currently learning ML. But I'm not confident enough. Is all this worth it? Am I going in the right direction? How would I get an internship?