Тёмный

how to install spark in windows 10 | spark local setup 

MANISH KUMAR
Подписаться 19 тыс.
Просмотров 11 тыс.
50% 1

In this video I have talked about spark local setup. Please follow all the steps carefully otherwise you will get an error. Mostly error will occur due to environment variable and software version difference.
software location download & install.
JAVA_HOME=C:\java\jdk
HADOOP_HOME=C:\hadoop
SPARK_HOME=C:\spark\spark-3.3.2-bin-hadoop2
PYSPARK_HOME= C:\Users
ikita\AppData\Roaming\Microsoft\Windows\Start Menu\Programs\Python 3.6\python.exe
Environment variable:-
%JAVA_HOME%\bin
%HADOOP_HOME%\bin
%SPARK_HOME%\bin
Download software:-
Java JDK:- download.oracle.com/java/20/l... (sha256)
Python Software:- www.python.org/downloads/rele...
Spark software:- www.apache.org/dyn/closer.lua...
Winutils Software:- github.com/steveloughran/winu...
Directly connect with me on:- topmate.io/manish_kumar25
For more queries reach out to me on my below social media handle.
Follow me on LinkedIn:- / manish-kumar-373b86176
Follow Me On Instagram:- / competitive_gyan1
Follow me on Facebook:- / manish12340
My Second Channel -- / @competitivegyan1
Interview series Playlist:- • Interview Questions an...
My Gear:-
Rode Mic:-- amzn.to/3RekC7a
Boya M1 Mic-- amzn.to/3uW0nnn
Wireless Mic:-- amzn.to/3TqLRhE
Tripod1 -- amzn.to/4avjyF4
Tripod2:-- amzn.to/46Y3QPu
camera1:-- amzn.to/3GIQlsE
camera2:-- amzn.to/46X190P
Pentab (Medium size):-- amzn.to/3RgMszQ (Recommended)
Pentab (Small size):-- amzn.to/3RpmIS0
Mobile:-- amzn.to/47Y8oa4 ( Aapko ye bilkul nahi lena hai)
Laptop -- amzn.to/3Ns5Okj
Mouse+keyboard combo -- amzn.to/3Ro6GYl
21 inch Monitor-- amzn.to/3TvCE7E
27 inch Monitor-- amzn.to/47QzXlA
iPad Pencil:-- amzn.to/4aiJxiG
iPad 9th Generation:-- amzn.to/470I11X
Boom Arm/Swing Arm:-- amzn.to/48eH2we
My PC Components:-
intel i7 Processor:-- amzn.to/47Svdfe
G.Skill RAM:-- amzn.to/47VFffI
Samsung SSD:-- amzn.to/3uVSE8W
WD blue HDD:-- amzn.to/47Y91QY
RTX 3060Ti Graphic card:- amzn.to/3tdLDjn
Gigabyte Motherboard:-- amzn.to/3RFUTGl
O11 Dynamic Cabinet:-- amzn.to/4avkgSK
Liquid cooler:-- amzn.to/472S8mS
Antec Prizm FAN:-- amzn.to/48ey4Pj

Опубликовано:

 

3 авг 2023

Поделиться:

Ссылка:

Скачать:

Готовим ссылку...

Добавить в:

Мой плейлист
Посмотреть позже
Комментарии : 86   
@yogesh9992008
@yogesh9992008 9 месяцев назад
Every one dont install python 3.6.7. this will again give error in futher video. instead of this 3.7.7 you can install. this will solved future error
@manish_kumar_1
@manish_kumar_1 9 месяцев назад
Thanks Yogesh. I have pinned the comment so it will be helpful for all whoever will be downloading in future
@praveenkumarrai101
@praveenkumarrai101 8 месяцев назад
Hi yogesh, which version spark you have downloaded spark and java? I am getting error while running code in pycharm that no python.
@sinhadeepak4101
@sinhadeepak4101 6 месяцев назад
@@praveenkumarrai101 same hear i also have error like that
@rakeshverma6867
@rakeshverma6867 9 месяцев назад
Hi Manish, Thank you very much for uploading videos on Real Time Project, this video is very helpful for practicing everything locally with the help of this, able to set up Spark locally. I am using the suggested Python version 3.6.7. Every thing is working fine except the SQL connections. However, the recommended version is creating issues so I upgraded my Python version to 3.7.0 and it is working successfully. 💯
@shubhamchavan9438
@shubhamchavan9438 10 месяцев назад
Thank you manish, for pyspark practice purpose I was using google colab. But this video help to install pyspark in this machine.
@rakeshverma6867
@rakeshverma6867 9 месяцев назад
great job Manish. You are really awesome regarding all the content that is provided in your channel. It is unique and complete depth, genuine as much as possible 💯❤
@rekhasingh4945
@rekhasingh4945 11 месяцев назад
Thank you Manish for posting this video 😊
@aryanvik5614
@aryanvik5614 9 месяцев назад
This is one of the best project video I have come across. Thanks a lot Manish for all your hard work. God Bless you keep doing the great work !!!!!!!!
@nitilpoddar
@nitilpoddar 5 месяцев назад
Did you manage to get a job using this project??
@Cherry29-no9pb
@Cherry29-no9pb 11 месяцев назад
Thanks Manish for this video. 😀
@user-pz1ko6iu1s
@user-pz1ko6iu1s 3 месяца назад
Thank you for the tutorial Manish.
@GajendraAhirkar
@GajendraAhirkar 8 месяцев назад
Hi Manish and all , Firstly Manish thank you for all your efforts and for giving all details, I would like to add on few points here ; 1) python 3.6.7 version good to go but for that we need to install the Java8 (i.e JDK8) and by doing this we will not face any issue of the py4j error 2) also the spark version -- spark-3.3.2-bin-hadoop2 will work fine and wont throw any error 3) winutil 2.7 version is good to go with above config.
@anirudhakhandagale1722
@anirudhakhandagale1722 2 месяца назад
A big thanks man !!
@nikhilhimanshu9758
@nikhilhimanshu9758 7 месяцев назад
3.3.3(Aug 21 2023 ) and Pre - Built for apache hadoop 2.7 -> The requested file or directory is not on the mirrors. shall i go for 3.4.2(NOV 30 2023 ) and Pre - Built for apache hadoop 3.3 and later? or what do you suggest ?
@anketsonawane6651
@anketsonawane6651 10 месяцев назад
pre built for apache hadoop 2.7 is not available now which version should i download?
@praveenkumarrai101
@praveenkumarrai101 9 месяцев назад
Hi bro, I am completed 45% of your pyspark videos. SO, just wanted to know should i start to work on your DE project or just complete full pyspark and then DE project. Want your opinion
@RajanKumarYadavIndia
@RajanKumarYadavIndia 10 месяцев назад
Macbook mai winutils ke jagah kya download karna hai. Please suggest ??
@nayanjyotibhagawati939
@nayanjyotibhagawati939 11 месяцев назад
Thank you ❤
@user-lx1rq1yu1s
@user-lx1rq1yu1s 9 месяцев назад
Hi Manish, Prebuilt Hadoop 2.7 is not able to download ,asking to verify signatures. Please check !
@hritikapal683
@hritikapal683 9 месяцев назад
Will it work in win11 too? Getting an error system cannot find the path specified although the environment variables are set properly followed all steps
@rohitmali1587
@rohitmali1587 10 месяцев назад
Hi Manish, thankyou for uploading videos on Real time project,this video was very good and i am able to setup spark in my local. But Could you please explain how lcal variable work or its flow, how spark session created after enter pyspark command in cmd , because we didn't created variable with pyspark. thankyou in Advance
@manish_kumar_1
@manish_kumar_1 10 месяцев назад
bin directory ka paath aapne set kiya hai. Usme agar bin folder open karenege to aapko pyspark dikh jayega. Wahi se ye refence kar rha hota hai
@nikhilhimanshu9758
@nikhilhimanshu9758 7 месяцев назад
hi Manish bro can you please help me with with spark version to install. latest is 3.5.0 and which python, as you have pinned that 3.7.7 will work shall i go with 3.7.7 and 3.5.0 combination.? help pls
@RajatKumar-px3qg
@RajatKumar-px3qg 9 месяцев назад
Hi Manish, Thankyou for such an amazing project !! I have two years experience in automation testing , i want change my field to Data Engineering. Please share your thoughts on how can i showcase this project on my resume? Please help me draft this project on my resume.
@manish_kumar_1
@manish_kumar_1 9 месяцев назад
I have uploaded one project on youtube. Please watch that playlist
@praveenkumarrai101
@praveenkumarrai101 11 месяцев назад
great setup bro, ek room tour v
@manish_kumar_1
@manish_kumar_1 11 месяцев назад
Bas ek wall par kuch kuch kar diya hai. Bana dunga kv Agar aap log chah rhe hai to
@aasthagupta9381
@aasthagupta9381 5 дней назад
Python 3.10.7 and Spark 3.5.1 worked in my case
@ApParashar
@ApParashar 6 месяцев назад
Hi Manish, Thank you very much for the detailed video. But I am unable to download spark of any version. Any lead would be appreciated
@prabhatsingh7391
@prabhatsingh7391 10 месяцев назад
Hi Manish Bhaiya , getting this error while trying to execute pyspark : Unable to load native-hadoop library for your platform... using builtin-java classes where applicable 23/09/10 00:39:32 WARN SparkContext: Another SparkContext is being constructed (or threw an exception in its constructor). This may indicate an error, since only one SparkContext should be running in this JVM (see SPARK-2243). only spark is executed succesfully.
@prabhatsingh7391
@prabhatsingh7391 10 месяцев назад
solved , after installing java 11.
@tahirsurfer
@tahirsurfer 9 месяцев назад
I too faced the same problem Solved after installing jdk 17 Thanks bro
@Karansingh-xw2ss
@Karansingh-xw2ss 9 месяцев назад
@@prabhatsingh7391 Thanks it works, I was also facing this same issue but when i installed java version 11 it worked succesfully.
@abhigyapranshu4791
@abhigyapranshu4791 6 месяцев назад
Getting below error when typing pyspark in cmd: Traceback (most recent call last): File "C:\Spark\spark-3.3.2-bin-hadoop2\python\pyspark\shell.py", line 29, in from pyspark.context import SparkContext File "C:\Spark\spark-3.3.2-bin-hadoop2\python\pyspark\__init__.py", line 54, in from pyspark.rdd import RDD, RDDBarrier File "C:\Spark\spark-3.3.2-bin-hadoop2\python\pyspark dd.py", line 33, in from typing import ( ImportError: cannot import name 'NoReturn'
@vinodkumarpal2697
@vinodkumarpal2697 9 месяцев назад
followed all steps still unable to run spark what could be the reason for that? im using windows 11
@mantukumar-qn9pv
@mantukumar-qn9pv 6 месяцев назад
Hi Manish, Mene video dekha do bar or achhe se same version sabkuchh install kiya but phir bhi mere ko ye below error aar ha hai jab cmd me df.show() kar rha hu Python worker failed to connect back.
@anamikachaudhary1605
@anamikachaudhary1605 5 месяцев назад
Spark is not getting installed. System is unable to read. tgz file. Iske liye kya karen??
@riptideking
@riptideking 3 месяца назад
i think downloading and environment management is not happening.better latest version should have been used.not able to use pyspark in command prompt stopped there
@sahilsharma8600
@sahilsharma8600 11 месяцев назад
😄💗💗finally
@yishnavisangramsingh3325
@yishnavisangramsingh3325 11 месяцев назад
I am regularly following ur videos. I am having confusion on Spark submit, scheduling job Can u provide some knowledge on this
@manish_kumar_1
@manish_kumar_1 11 месяцев назад
Sure
@yishnavisangramsingh3325
@yishnavisangramsingh3325 11 месяцев назад
@@manish_kumar_1 thank you ❤️
@user-yd1fu7hl6j
@user-yd1fu7hl6j 10 месяцев назад
@@manish_kumar_1yes you forgot to make video on Spark app and sprak submit i think its a request Manish plzz make a video on that .thanks
@manish_kumar_1
@manish_kumar_1 10 месяцев назад
@@user-yd1fu7hl6j I haven't forgot it's just I need some time to shoot videos on that.
@deepakpradhan9743
@deepakpradhan9743 9 месяцев назад
Not able to download Winutils for windows from the link which you provided in desc of video.
@hritikapal683
@hritikapal683 9 месяцев назад
Hey I've faced a similar issue I tried it fixing by right clicking over winutils.exe later opted 'Save link as' afterwards it saved and downloaded as well
@ramanaiahseerla8248
@ramanaiahseerla8248 10 месяцев назад
Hi Manish Thanks for the video, But I am not able to download winutils. Even i tried the process you explained in the video. can you please provide proper link for winutils.
@AYANCHATTERJEEabir
@AYANCHATTERJEEabir 8 месяцев назад
Not able to download the winutils.exe file
@lazycool3611
@lazycool3611 3 месяца назад
Hello Manish Please update the correct working version for projects as per now April-2024. Thank you
@patil8302
@patil8302 4 месяца назад
Hey manish....i have oracle virtual machine box ....still do i need to install your steps in machine to start practicing or else oracle vm is enough
@vamsikrishna7329
@vamsikrishna7329 8 месяцев назад
Unable to download winutils from the link provided. Any other way
@manish_kumar_1
@manish_kumar_1 8 месяцев назад
I will have to check
@pogoclub8495
@pogoclub8495 8 месяцев назад
Bro jdk 20 wala link is not working. Jdk 21 aa gya h. Sare steps follow kiya but pyspark is not starting
@rey619ashkash
@rey619ashkash 5 месяцев назад
Same here..did u resolve it ?
@user-xx3rk1bf7g
@user-xx3rk1bf7g 10 месяцев назад
Hi Manish, Spark 3.3.2 version is no more available to download, which alternate version should I choose to download, please help
@manish_kumar_1
@manish_kumar_1 10 месяцев назад
You can download 3.4.1 or 3.4.2 I think
@anketsonawane6651
@anketsonawane6651 10 месяцев назад
did it worked?
@chiragmaniyath
@chiragmaniyath 10 месяцев назад
I downloaded tgz from archives
@adityatomar9820
@adityatomar9820 9 месяцев назад
@@chiragmaniyath how did u download it bro..im latest versions dont have hadoop 2.7
@chiragmaniyath
@chiragmaniyath 9 месяцев назад
@@adityatomar9820 there's an archive section
@ankitsaxena565
@ankitsaxena565 11 месяцев назад
Sir,how much required learn python for gain the knowledge of PYSPARK
@manish_kumar_1
@manish_kumar_1 11 месяцев назад
Aap 12 offer wali video dekhiye
@praveenkumarrai101
@praveenkumarrai101 11 месяцев назад
@@manish_kumar_1 bhai python libraries kon kon se padhni hai
@ankitsaxena565
@ankitsaxena565 11 месяцев назад
@@manish_kumar_1 link share kr dijiye sir
@ankitsaxena565
@ankitsaxena565 11 месяцев назад
Different between PYSPARK and spark please let me 🙏
@manish_kumar_1
@manish_kumar_1 11 месяцев назад
Spark used with python language is called pyspark. spark used with scala language is called spark scala.
@prachideokar7639
@prachideokar7639 2 дня назад
Ye pure project ke liye prerequisites kya hai plz rply
@manish_kumar_1
@manish_kumar_1 2 дня назад
Yes
@varungupta1047
@varungupta1047 11 месяцев назад
getting error - System cannot find the specified path .
@manish_kumar_1
@manish_kumar_1 11 месяцев назад
Bas itna hi error hai ya kuch aur v?
@riptideking
@riptideking 3 месяца назад
throwing pyspark is not defined
@mantukumar-qn9pv
@mantukumar-qn9pv 6 месяцев назад
Please help
@user-sw9cc6dz9d
@user-sw9cc6dz9d 10 месяцев назад
winutils download nhi ho raha hai
@manish_kumar_1
@manish_kumar_1 10 месяцев назад
download hi nahi ho rha?
@anketsonawane6651
@anketsonawane6651 10 месяцев назад
ha nahi ho raha mera bhi@@manish_kumar_1
@user-sw9cc6dz9d
@user-sw9cc6dz9d 10 месяцев назад
tried but unable to download, is this laptop issue or version issue. @@manish_kumar_1
@deepakpradhan9743
@deepakpradhan9743 9 месяцев назад
winutils download hua kya
@AGRIMTYAGI14
@AGRIMTYAGI14 8 месяцев назад
@@manish_kumar_1 Hi Manish, yes not able to download winutils. Nothing happens after we click on the download button. Can you share file using google drive ?
@pankajrao6895
@pankajrao6895 6 месяцев назад
mainsh bhai i am not able to run spark-shell in the cmd, it throws this error To adjust logging level use sc.setLogLevel(newLevel). For SparkR, use setLogLevel(newLevel). ReplGlobal.abort: bad constant pool index: 0 at pos: 48445 [init] error: bad constant pool index: 0 at pos: 48445 while compiling: during phase: globalPhase=, enteringPhase= library version: version 2.12.15 compiler version: version 2.12.15 reconstructed args: -classpath -Yrepl-class-based -Yrepl-outdir C:\Users\panka\AppData\Local\Temp\spark-98350a89-6b34-496c-9fe9-e5c79ab4e42a epl-63ab3c32-f74d-4af8-b4f3-a762634d986e
@PrateekGoel-vb5es
@PrateekGoel-vb5es Месяц назад
Hi how did u resolve the issue
Далее
spark local setup | how to install spark in windows 10
19:20
ЛУЧШАЯ ПОКУПКА ЗА 180 000 РУБЛЕЙ
28:28
Apache Spark Installation on Anaconda video(PySpark)
17:58
Apache Spark Introduction
48:54
Просмотров 55 тыс.