Тёмный
No video :(

03 Spark Streaming Local Environment Setup - Docker, Jupyter, PySpark and Kafka 

Ease With Data
Подписаться 4,6 тыс.
Просмотров 3,7 тыс.
50% 1

Опубликовано:

 

22 авг 2024

Поделиться:

Ссылка:

Скачать:

Готовим ссылку...

Добавить в:

Мой плейлист
Посмотреть позже
Комментарии : 18   
@mohdkhalidsiddiqui5317
@mohdkhalidsiddiqui5317 6 месяцев назад
You're doing an excellent job! Learned PySpark from here. I'm eager to learn Spark Streaming from this channel too.
@easewithdata
@easewithdata 6 месяцев назад
Awesome! Thank you! Please share your thoughts over linkedin and dont forget to tag us.
@Bijuthtt
@Bijuthtt 2 месяца назад
You are awesome man. I was trying to setup spark and kafka in docker for long time. done today. thank you very much
@easewithdata
@easewithdata 2 месяца назад
Glad I could help. Please make to share with your network over LinkedIn ❤️
@babuganesh2000
@babuganesh2000 6 месяцев назад
Waiting for more videos on this play list, you are the perfect teacher for fast learners.thank you so much for taking the time to post videos in this topic.
@easewithdata
@easewithdata 6 месяцев назад
Please make sure to share with your network
@Cinepixelx
@Cinepixelx 12 дней назад
I am from from spark Scala background will it be fine to use that? if yes then how can I change programming to Scala in jupyter notebook ?
@easewithdata
@easewithdata 12 дней назад
Never actually tried setting up Jupyter lab with Scala, but you can definitely try. The core concept for Spark and Spark Streaming would remain same with some light code changes. Please checkout Databricks Community Edition, that supports working with Scala by default.
@krishnakanthmacherla4431
@krishnakanthmacherla4431 5 месяцев назад
in search of gold , i found a diamons
@easewithdata
@easewithdata 5 месяцев назад
I believe you mean Diamonds 💎
@s-sd2re
@s-sd2re 5 месяцев назад
HI, While creating the spark session, I am getting this error: RuntimeError: Java gateway process exited before sending its port number I encountered few others had the same issue in your pyspark -zero to hero series but couldn't find any solution .Please help. I followed the same instructions from this video to download the docker image and the container- 03 Data Lakehouse | Data Warehousing with PySpark | Setup Docker PySpark Jupyter Lab Environment
@easewithdata
@easewithdata 5 месяцев назад
Please ignore that installation, that need to be fixed. Follow the instructions from Spark Streaming Env Setup.
@abc_cba
@abc_cba 3 месяца назад
Hi, I just made changes in the yml file of your github repo, removing the version line which was giving an error, also with many new featured added to Spark 3.5.0 with addition of Apache Arrow which will make querying many times faster , pandas package 2.0x upgrade, autocompletion, Plenty of new features to PySpark, I'll run the whole and give you an update, would that be okay?
@easewithdata
@easewithdata 3 месяца назад
Spark 3.5.0 is launched recently and is not accepted for all production use case yet. But you can go ahead and use it if you want get a feel of it.
@somyaranjankar5804
@somyaranjankar5804 4 месяца назад
Hello , where did you continue with kafka in which video ?
@easewithdata
@easewithdata 4 месяца назад
Please follow the playlist: ru-vid.com/group/PL2IsFZBGM_IEtp2fF5xxZCS9CYBSHV2WW&si=74YndU2KZO53gWQJ
@abc_cba
@abc_cba 4 месяца назад
Hi, thank you for your videos, can you be my mentor? i am willing to purchase your course on Real-time processing frameworks/engines.
@easewithdata
@easewithdata 4 месяца назад
Sorry, I am not offering any paid services yet. Will let you know in case I plan anything in future.
Далее
PEDRO PEDRO INSIDEOUT
00:10
Просмотров 2,9 млн
What is Apache Kafka®?
11:42
Просмотров 350 тыс.
07 Spark Streaming Read from Files | Flatten JSON data
14:26
Spark Streaming with Python under 12 minutes
12:08
Просмотров 12 тыс.
The cloud is over-engineered and overpriced (no music)
14:39