Тёмный

Python Data Science Tutorial: Analyzing the 2019 Stack Overflow Developer Survey 

Corey Schafer
Подписаться 1,4 млн
Просмотров 62 тыс.
50% 1

Опубликовано:

 

3 окт 2024

Поделиться:

Ссылка:

Скачать:

Готовим ссылку...

Добавить в:

Мой плейлист
Посмотреть позже
Комментарии : 145   
@coreyms
@coreyms 5 лет назад
I hope everyone finds this useful. Going out and practicing with real-world data like this is one of the best (if not the best) way to practice your skills. If any of you do your own analysis that's interesting then be sure to share it with us in the comment section below! PS I forgot to mention that as usual, the code for this video can be found in the description section.
@shpazhist
@shpazhist 5 лет назад
Corey, can you please do a video where you explain how to get to that level of intermediate-advanced developer where one can play with code really-really easily? In essence, say if one wants to learn about new library or framework or a new challenge - - can you please explain your process how do you go about actually learning that field rather than doing videos on standalone topics as IMO that would really help a lot of people to sorta move to the next level a lot. Thx
@bodekolton5961
@bodekolton5961 3 года назад
I know it's quite off topic but do anybody know a good website to stream new tv shows online?
@troylorenzo6430
@troylorenzo6430 3 года назад
@Bode Kolton Lately I have been using Flixzone. Just google for it :)
@maddoxronin9643
@maddoxronin9643 3 года назад
@Troy Lorenzo yea, have been watching on Flixzone for since april myself =)
@bodekolton5961
@bodekolton5961 3 года назад
@Troy Lorenzo thank you, I went there and it seems like a nice service :D I really appreciate it!
@mrityunjayahiremath3951
@mrityunjayahiremath3951 5 лет назад
Python + Data Science + Corey = ❤️
@Vasko10100
@Vasko10100 4 года назад
Finally, i understand analyzing Data in Python! I went from Coursera via EDX, UDEMY etc. But this.... this is a pure gold sir! Please, do some course or learning path for Data Science with Machine Learning in Python in depth( let it be if needed a paid learning path). You are one of the best! If you somehow manage to come in Macedonia, tell me, i want to bow before you!
@coreyms
@coreyms 4 года назад
Thanks! Im still working on a Pandas series. 4 videos are complete but I'm working to finish the rest soon. Glad to have you as a subscriber and member. You all make it possible for me to release these for free. Thanks again! media.giphy.com/media/WvUyMX1gB1gnC/giphy.gif
@Ghasakable
@Ghasakable 5 лет назад
Man, you are awesome, you are the reason that I open RU-vid, please if you come to Japan one day, I will do my best to show you good places here in Tokyo.
@coreyms
@coreyms 5 лет назад
That'd be awesome!
@shivamkumraa
@shivamkumraa 5 лет назад
Who else liked before watching this. Just 1000% sure this guy can't make any shit.
@Jmzz542
@Jmzz542 4 года назад
what makes you say that?
@afternoontiger9319
@afternoontiger9319 5 лет назад
I'm definitely interested in a future tutorial of using pandas with this data! Otherwise a fantastic job
@muralitechtimes6487
@muralitechtimes6487 Год назад
You are true MASTER of coding. We are now a days not finding your other videos. Do it for us which re very help us to grow.
@mylesmontclair
@mylesmontclair 5 лет назад
Great to have you back producing great content for us. Thanks, Corey!
@michaelworkspace
@michaelworkspace 4 года назад
In case anybody ran into an error like "UnicodeDecodeError: 'charmap' codec can't decode byte 0x81" while reading the csv file like I had, the fix is to add a encoding='utf-8' parameter to the with open() function. e.g. with open('survey_results_public.csv', encoding='utf-8') as f:
@Mayur7Garg
@Mayur7Garg 4 года назад
Thx man! You are a savior.
@shivstn98
@shivstn98 4 года назад
It can be also fixed by using raw string I.e. use r before first quote Eg. r"hello world"
@jemand4036
@jemand4036 4 года назад
thank u so much, i had no idea what was going on!
@irfandy4hmad
@irfandy4hmad 4 года назад
Thanks bro. U saved our time
@michaelworkspace
@michaelworkspace 4 года назад
Irfandy Ahmad No problem bro. Have a good one!
@Jo-id9zm
@Jo-id9zm 5 лет назад
You're doing God's work, Corey!
@akhilrapalli4118
@akhilrapalli4118 5 лет назад
wow what a video from an expert on python like u verymuch corey
@dadoll1660
@dadoll1660 5 лет назад
Wow, this is Amazing! Please make more videos for this DS Tutorial series. Thank you Corey...
@jackleone4150
@jackleone4150 5 лет назад
If anyone else is facing the UnicodeDecodeError try adding this second argument to the open function => with open('data/survey_results_public.csv', encoding="utf8")
@sylvanmasha
@sylvanmasha 5 лет назад
wow, thank you so so much. i was facing the same problem
@GuilhermePalazzoRodrigues
@GuilhermePalazzoRodrigues 5 лет назад
That was incredible, really easy to follow and replicate, thank you very much.
@bishantadhikari3440
@bishantadhikari3440 5 лет назад
Your videos are great and is probably the most easily understandable in all of youtube. Also, can you make videos about implementing AI in python cause there are not any good ones that meet your standard
@HunniiChii
@HunniiChii 5 лет назад
Hope to see more of these videos from you in the future!
@AvantjerTube
@AvantjerTube 5 лет назад
Thanks Corey! I’ve been following your excellent tutorials for quite awhile. Just wanted to say that I’m excited and looking forward to your Pandas course(s).
@nikhillingam4630
@nikhillingam4630 5 лет назад
absolutely brilliant work well done
@gtosXD
@gtosXD 5 лет назад
Thank you!!!! Really great video!
@arnopisspot5115
@arnopisspot5115 5 лет назад
Looking forward the panda series boss. Thank you for all your hard work
@theDrewDag
@theDrewDag 4 года назад
Please make this the first of a Data Science playlist. PLEASE.
@ajb4964
@ajb4964 5 лет назад
Wow, please do more like this.
@sandeepgaur2407
@sandeepgaur2407 5 лет назад
Awesome tutorial like always. Please make video on asyncio in python.. please please.
@DistortedV12
@DistortedV12 5 лет назад
From this, looks like you can't go wrong with javascript, python, SQL and a C type language. Wonder how long these will dominate.
@skandrskon7091
@skandrskon7091 4 года назад
for line in csv_reader : counts[line['hobbyist'] ] += 1 really great idea for increment on counts dict thanks
@yosefco3
@yosefco3 4 года назад
Hi great Corey ! Thank you very much on your channel , your flask tutorial is the best i ever seen ! Please could you do a flask-api tutorial too ?
@Medhusalem
@Medhusalem 4 года назад
Hey Corey, I have seen the previous videos on DataFrames with Pandas where you used Jupyter as well. You mentioned in this video that it is nice that you can run code brackets or just certain lines of code. My question would be then, what is the difference between Jupyter and Spyder? Since Spyder can also execute certain code lines, without loading everything in again and has a super awesome functionality where you have a variable manager in which you can view every variable you declared.
@Medhusalem
@Medhusalem 4 года назад
It didn't work somehow and I had to do a google search on my error: 'charmap' codec can't decode byte 0x81 in position 5739. So I had to change the with open part and specify the encoding part, after that everything worked fine. *with open('data/survey_results_public.csv', encoding='utf-8') as f:*
@rahulpatel4701
@rahulpatel4701 4 года назад
thanks for help
@sanukurien2752
@sanukurien2752 4 года назад
@Corey, I love watching your videos when I have my morning coffee. It's elegant and super informative. Thank you!
@borgestheborg
@borgestheborg 5 лет назад
Oh baby! Never clicked on a video so fast in my life! :D
@BrendanMetcalfe
@BrendanMetcalfe 5 лет назад
nice vid! thanks
@yomajo
@yomajo 4 года назад
It seems like Counter() could be a topic of its own. I need to dig more into that. Awesome code-along experience!
@anaesthesiadolorosa7100
@anaesthesiadolorosa7100 5 лет назад
It's awesome that you've started making data science videos. Please keep them coming! ;)
@iangitonga2811
@iangitonga2811 5 лет назад
Great content.
@_boris
@_boris 5 лет назад
Your videos are so brilliant as I want to keep them to myself only... 💎 ...however, way too good not to share them with others. Shared with my classmates - hopefully, they find it useful :)
@arkhamian7586
@arkhamian7586 5 лет назад
Don't stop doing this.
@SagarSapkota-eo4ei
@SagarSapkota-eo4ei 5 лет назад
Hi Corey great and amazing content ..thanks a lot buddy
@백영래-u3x
@백영래-u3x 5 лет назад
Thank you for Python Data Science Tutorial! Keep it up, please!
@morganfree100
@morganfree100 4 года назад
More please sir..(:-
@SagarSapkota-eo4ei
@SagarSapkota-eo4ei 5 лет назад
Hi Corey great and amazing content ..thanks a lot buddy
@jayjay6804
@jayjay6804 5 лет назад
As hobbiest my first inclination would be to extract the data using pivot tables in libreoffice. Thank you for demonstrating the use of python for this purpose.
@adildeshmukh4140
@adildeshmukh4140 5 лет назад
yes, please pandas
@rangavembar
@rangavembar 5 лет назад
Brilliant as always!! Looking forward to the Pandas series!!
@hserdarkuyuk
@hserdarkuyuk 5 лет назад
Please do similar live coding, these are great to improve our perspective
@BoukariYameogo
@BoukariYameogo 5 лет назад
This is what I was waiting for! Thank you!
@syedabdulsalam4659
@syedabdulsalam4659 5 лет назад
Finally you are back :p
@javierrivasseau8956
@javierrivasseau8956 Год назад
Hi Corey, how are you? i have been watching your series and videos, and think they are the very best out there, i really like to learn with you, I would like to ask you some things if you could solve them, when you make the matplotlib series, you separate the convertedcomp for all the devs and for the python dev and javascript dev, can you upload a video to solve it? I tried anyway and could not. ?
@strange_man
@strange_man 5 лет назад
The video that I need :D
@rahulsharma-st5yg
@rahulsharma-st5yg 4 года назад
@corey: when to expect some data science real life projects?
@alexanderten5497
@alexanderten5497 5 лет назад
Notification from Corey and Iam here to like the video
@kizzavincent
@kizzavincent 5 лет назад
Very nice to have you back. Many thanks.
@DistortedV12
@DistortedV12 5 лет назад
Wow Corey you do it all; thanks for the helpful tutorials!
@mitos95176
@mitos95176 5 лет назад
Now I want to perform this task using pandas.
@Strider1066
@Strider1066 3 года назад
I liked the incremental approach of coding by examples .
@gaunterodimm3569
@gaunterodimm3569 4 года назад
Great video! This video helped me soooooooo much!
@ramil17998
@ramil17998 5 лет назад
Thanks a lot for the video! I love it cause its as real as it can get!
@darrenlefcoe
@darrenlefcoe 5 лет назад
On inspection of the data. The csv.DictReader(f) throws 594 errors in the for loop >> line in csvReader: Not sure what causes the error, but this exists. Needs to be handled with try: except UnicodeDecodeError: who else sees this ?
@christiaan3315
@christiaan3315 4 года назад
Sorry. I am getting error messages... 'for line in csv.reader: row = next(self.reader) etc
@MAli-os2yj
@MAli-os2yj 5 лет назад
Hi Corey! Love your videos man! I have finished your Flask series and currently doing the Django series. I have a few suggestions for future videos around the Blog Website that involve some ML and Data Science: 1. Introduce 'like and comment' functionality to the application which allows a user to comment and like posts by other users. 2. Then, integrate a recommendation system engine to the Blog application, the system will recommend content to the individual user based on his own posts and the posts that he has liked. 3. Also, a user can look into the profile of other users and see a word cloud of all posts that the other has submitted, this way they can get a rough idea of what the other uses posts. Keep Doing the good work man!
@boolloop
@boolloop 5 лет назад
I love your videos, but I wondered why you didn't use pandas for this?
@coreyms
@coreyms 5 лет назад
I mentioned the reason in the video. I wanted to show how to do this without Pandas first, and once I release my Pandas series then I will redo this video using Pandas as well. That way people who are unfamiliar with Pandas can do this as well
@boolloop
@boolloop 5 лет назад
@@coreyms neat, looking forward to it!
@Lfppfs
@Lfppfs 4 года назад
Awesome tutorial, thanks Corey!
@sprk2009
@sprk2009 4 года назад
Your videos are awesome. I am new to Python and preparing for Python certification. Can you please let me know which one of the following programs are good to proceed? Thanks in advance. 1. PCAP: Programming Essentials in Python from Netacad 2. Microsoft 98-381 Introduction to Programming Using Python
@ashissahoo4158
@ashissahoo4158 3 года назад
Is the pandas version of the last question has been uploaded?
@kingslypaul2999
@kingslypaul2999 5 лет назад
Can u make a video on pyttxs3 ...or something better .....thanks for all your God skilled contents that makes ppl understand anything u teach
@rosyndapee8129
@rosyndapee8129 5 лет назад
Nice! What about a future video with some visualization of geospatial data? I.e. using some GPS tracks converted into CSV file with all points lat and long. Would you suggest to use postrgres or postgis to then make queries to the geodata?
@marinbeslo7841
@marinbeslo7841 4 года назад
Is the video with pandas-way out? :)
@yuliu1105
@yuliu1105 5 лет назад
Please update pandas version of the 2nd half of the problem, my forloops don't work, bummer. Thank you
@nitishchhabra_me
@nitishchhabra_me 5 лет назад
Hey @corey please make videos for Airflow DAG's also. They are basically python codes so it would fit nicely within your python series
@johnnya590
@johnnya590 5 лет назад
I don't know if you realize this, but at the start of EVERY video you say "Hey there how's it going everybody?" :)
@rangabharath4253
@rangabharath4253 5 лет назад
Awesome as always
@shaguftakhan302
@shaguftakhan302 3 года назад
I clicked on this video just to see what data scientists do.
@rolandgerard6064
@rolandgerard6064 5 лет назад
man, you are good 👍
@amankothiyal7074
@amankothiyal7074 4 года назад
Hey can you analyse it using numpy library
@opst1704
@opst1704 5 лет назад
If one day I decide to learn Python, then do I waste my time by learning C language now? I am learning C, primarily because I want to understand how memory works and DSA is easier to understand with C. Will this knowledge be helpful if I decide to learn Python? Seems like C and Python have a lot of differences, in Python there are so many things that I don't need to worry about. One more question, is it difficult to find out what is happening behind scenes in Python, for example: implementation of input() function? Great videos, the main reason why I am considering to learn Python.
@ЛеонидКлыков
@ЛеонидКлыков 5 лет назад
Hello, Corey! Please append your matplotlib tutorial with a lesson about Legend Picking
@someonesomebody716
@someonesomebody716 5 лет назад
Data Science + Corey Sir = YAY!!!
@nowyouknow2249
@nowyouknow2249 5 лет назад
Nailed it
@darrenlefcoe
@darrenlefcoe 5 лет назад
30 minutes: requires the following to pass bad data... try: # put the code here except AttributeError: pass
@philinnc
@philinnc 4 года назад
Looking forward to seeing a future video where pandas is used to analyze this same data. Really appreciated your stepping through the thought process, something I haven't seen in many other data science tutorials. Approaches to data cleaning is another interesting topic that would be nice to see covered in a future video. Recently there were these FEC reports... :-)
@KevinOMalleyisonlysmallreally
@KevinOMalleyisonlysmallreally 5 лет назад
You produce some of the best tutorials on python. Itd really help if you did one on advancing jupyter notebook use, and seeing up sublime to work with pandas.
@anandinamdar4054
@anandinamdar4054 5 лет назад
Legend 🙌🙌
@amnah.-.4708
@amnah.-.4708 5 лет назад
Hi Corey, thank you for the content you put out, it is beyond exceptional. Also, it would be great if you could make an in-depth course on SpaCy, and its application in Data Science.
@gnuter3000
@gnuter3000 5 лет назад
greatgreatgreat
@aryanvikash1967
@aryanvikash1967 4 года назад
Corey please create a tutorial on python coroutine.. async await its so confusing .. please
@mancalos
@mancalos 3 года назад
Finally a project not using either the Titanic or Iris datasets. Corey, you are a gem. Thanks for all the effort you put into this and every other tutorial.
@daltonmarhufu9549
@daltonmarhufu9549 4 года назад
Great vid! Really helpful, could you possibly do a Pandas version of this?
@barungh
@barungh 5 лет назад
Badass !👍👌
@Sauhros
@Sauhros 4 года назад
Just what I needed to start off my winter break. I can't wait for your pandas series! Keep up the great work Corey.
@ebrahimm.roshdy4487
@ebrahimm.roshdy4487 5 лет назад
Please do it using pandas since it's way easier and so common for data scientists
@pingojr4048
@pingojr4048 5 лет назад
Can you do something for home bill management?
@aydinjalilov2328
@aydinjalilov2328 4 года назад
it blew my mind.
@Ajithkumarsekar96
@Ajithkumarsekar96 5 лет назад
Almost everyone has used Javascript and html/css? From front-end developer to system-administrator
@736939
@736939 5 лет назад
Thank you Corey! Continue please: ScikitLearn, TensorFlow
@chinmayabarik557
@chinmayabarik557 5 лет назад
Please do a pandas tutorial with this survey results sir..
@hemanth6951
@hemanth6951 5 лет назад
i saw you in another RU-vid channels
@coreyms
@coreyms 5 лет назад
Which one was that?
@hemanth6951
@hemanth6951 5 лет назад
@@coreyms some thing pyconf channel with some one interviewing with you last year
@coreyms
@coreyms 5 лет назад
@@hemanth6951 Oh yeah! I think that was Dan Bader. He's a great guy.
@hemanth6951
@hemanth6951 5 лет назад
@@coreyms yes yes yes finally I saw you and this second time I'm chatting with you I'm so happy and luky 😊😀😀
@samishabli5191
@samishabli5191 5 лет назад
i got this error: Yes Traceback (most recent call last): and it just shows me the first yes and no and in the end it says: return codecs.charmap_decode(input,self.errors,decoding_table)[0] UnicodeDecodeError: 'charmap' codec can't decode byte 0x81 in position 5739: character maps to
@atishayshukla1117
@atishayshukla1117 5 лет назад
Its an encoding issue with the file, open a file by specifying proper encoding something like: with open('data/survey_results_public.csv', encoding='utf8')
@ThePasupathi
@ThePasupathi 5 лет назад
@@atishayshukla1117 thank you..it works
@darrenlefcoe
@darrenlefcoe 5 лет назад
for line in csvReader: failed on line 16. UnicodeDecodeError: 'charmap' codec can't decode byte 0x81 in position 5739: character maps to problem with the data-set or other reason ?
@sylvanmasha
@sylvanmasha 5 лет назад
f anyone else is facing the UnicodeDecodeError try adding this second argument to the open function => with open('data/survey_results_public.csv', encoding="utf8") this is a comment i found by Jack Leone. you're welcome
@graybits4111
@graybits4111 5 лет назад
thanks for this great video the total % for each dev type should be 100%, which is not the case
@coreyms
@coreyms 5 лет назад
Not necessarily. They can choose multiple languages. So for example, for front-end developers, most said they know JavaScript, but they also knew other languages as well. So each language is a percentage of how many developers said they know a particular language. Hope that clears it up
@graybits4111
@graybits4111 5 лет назад
@@coreyms it is clear now, thanks
@natesh1
@natesh1 5 лет назад
Please do a pandas tutorial series with this survey results too.
@bluekeybo
@bluekeybo 5 лет назад
You didn't watch the video till the end did you
@malmigona1734
@malmigona1734 5 лет назад
Dear Corey. I am finding your tutorials quite interesting but with respect to this one I feel disappointed. What you are showing is not a proper way to analyze any data because with significant effort comes quite basic results. I have expected you to show first of all the process of recoding the "verbal" data into numeric one (transforming the answer into numerical representation) than adding description layer (assigning labels to questions and answers) and only then star the proper data analysis including but not limited to: frequencies, crosstabulations, statistical tests, correlations and maybe even data clustering. I fully understand it is a long shot but I guess the impact on small series "How to proper analyze data in Python" would be much better that just one clip that shows something...rather not very usefull from the perspective of data science.
Далее
10 Python Tips and Tricks For Writing Better Code
39:21
Python NumPy Tutorial for Beginners
58:10
Просмотров 1,6 млн
5 Common Python Mistakes and How to Fix Them
29:19
Просмотров 289 тыс.
The Value of Source Code
17:46
Просмотров 49 тыс.