How to Run a Spark Cluster with Multiple Workers Locally Using Docker 

The Data Guy

In this video, I'll show you how you can use Docker Desktop to run a Spark cluster that mimics a production environment on your local machine!
/ setting-up-a-spark-sta...

Published: 26 Feb 2024

Comments: 23
@rasmusandreasson1548 4 months ago
The king! Thank u for good content!
@thedataguygeorge 4 months ago
Thanks so much Rasmus!
@not_saboor 4 months ago
Thanks for this!
@thedataguygeorge 4 months ago
No problem Saboor!
@Levy957 4 months ago
I love your videos
@thedataguygeorge 4 months ago
Thanks Levy! Love your support!
@early-riser18 1 month ago
Thank you for the rundown - very helpful. Could you add a link to the code written please? Some code in the Dockerfile is hidden by the right-side screen fold and has to be guessed. Thanks :)
@ccc_ccc789 4 months ago
Thanks
@thedataguygeorge 4 months ago
No problem!
@mayowaoludoyi5425 2 months ago
Thank you for this walkthrough video. How can I establish a connection to a relational database like Oracle from a "dockerised" Spark setup like this? I understand there is a different setup that requires JDBC. Where does it fit in your setup?
@thedataguygeorge 2 months ago
Hey, you would add it similarly to how I connect to Snowflake in other scripts, where you use the Python ODBC drivers to establish connections to relational databases like Oracle.
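For the JDBC route the question mentions (not the ODBC approach described in the reply), a minimal sketch might look like the following. It assumes Oracle's ojdbc jar is already on the Spark classpath inside the image; the host, service name, table, and credentials are hypothetical placeholders, and the helper names are made up for illustration.

```python
from typing import Any, Dict


def oracle_jdbc_options(host: str, port: int, service: str,
                        table: str, user: str, password: str) -> Dict[str, str]:
    """Build the option map for Spark's built-in JDBC data source."""
    return {
        # Oracle thin-driver URL in service-name form.
        "url": f"jdbc:oracle:thin:@//{host}:{port}/{service}",
        "dbtable": table,
        "user": user,
        "password": password,
        # Driver class from Oracle's ojdbc jar (assumed to be on the classpath).
        "driver": "oracle.jdbc.OracleDriver",
    }


def read_oracle_table(spark: Any, options: Dict[str, str]) -> Any:
    """Load the table as a DataFrame; `spark` is an existing SparkSession."""
    return spark.read.format("jdbc").options(**options).load()


# Hypothetical connection details, for illustration only.
example_options = oracle_jdbc_options(
    "oracle-db", 1521, "ORCLPDB1", "HR.EMPLOYEES", "hr", "change-me"
)
```

With a running session, `read_oracle_table(spark, example_options)` would return a DataFrame. In practice the password would come from an environment variable or secret rather than a literal.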
@imanitrecruiterineurope4142 1 month ago
Hi! It seems that the applications aren't taking any resources and are stuck in a loop on my end. What could be the cause?
@not_saboor 1 month ago
Can you explain the part on Jinja templating you mentioned at 3:40?
@thedataguygeorge 1 month ago
Sure! What specifically about it are you interested in learning more about?
@AraviDen 1 month ago
You don't have a directory like requirement, so how will req/req.txt work?
@thedataguygeorge 1 month ago
With Spark, you'll typically initiate a Spark session and provide a list of the requirements you need for that particular session.
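One way to read "provide a list of requirements for the session" is Spark's `spark.jars.packages` setting, which takes comma-separated Maven coordinates of JVM dependencies. The sketch below is an assumption about what is meant, not the setup from the video: the master hostname `spark-master`, the app name, and the package coordinate are placeholders, and pure-Python dependencies would still need to be installed in the image.

```python
from typing import Any, Dict, List


def session_conf(master_url: str, packages: List[str]) -> Dict[str, str]:
    """Collect per-session settings as a plain dict of Spark config keys."""
    return {
        "spark.master": master_url,
        # Comma-separated Maven coordinates resolved when the session starts.
        "spark.jars.packages": ",".join(packages),
    }


def build_session(app_name: str, conf: Dict[str, str]) -> Any:
    """Create (or reuse) a SparkSession with the given settings applied."""
    # Imported lazily so the config helper works without pyspark installed.
    from pyspark.sql import SparkSession

    builder = SparkSession.builder.appName(app_name)
    for key, value in conf.items():
        builder = builder.config(key, value)
    return builder.getOrCreate()


# Hypothetical values: 7077 is the default standalone master port,
# and the coordinate below is only an example dependency.
example_conf = session_conf(
    "spark://spark-master:7077",
    ["org.postgresql:postgresql:42.7.3"],
)
```

Calling `build_session("demo", example_conf)` from a container on the same Docker network would then attach the session to the standalone master.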
@csmithDevCove 4 months ago
What about connecting spark-nlp to this?
@thedataguygeorge 4 months ago
You would just want to add it to be installed within the Docker image!
@rafaellourenco4599 2 months ago
Bro, you skipped all the bugs stuff
@thedataguygeorge 2 months ago
Sorry, I was solving them off camera, but I will make sure to show more of the troubleshooting process next time!
@rafaellourenco4599 2 months ago
@thedataguygeorge can you share a repo with this project?
@AraviDen 1 month ago
I am always getting "entrypoint.sh not found". Has anyone figured it out?