Тёмный

Exploratory Data Analysis with Pandas Python 

Rob Mulla
Подписаться 171 тыс.
Просмотров 430 тыс.
50% 1

In this video about exploratory data analysis with pandas and python, Kaggle grandmaster Rob Mulla will teach you the basics of how to explore data using python and pandas. Exploratory Data Analysis it a necessary tool for any data scientist. Pandas is a MUST for anyone getting into data science with python. Python is the #1 coding language for data science and has been growing over the years as an essential tool, with Pandas being the main data wrangling module. Kaggle Grandmaster Rob goes over it all in this video. In this video we discuss the basics of how to use explore data including...
Timestamps:
00:00 Introduction
01:00 Imports and reading data
03:35 Data Understanding
06:40 Data Preparation
20:57 Feature Understanding
27:35 Feature Relationships
35:30 Asking a Question about the Data
40:00 Final Thoughts
Follow me on twitch for live coding streams: / medallionstallion_
Intro to Pandas video: • A Gentle Introduction ...
Link to kaggle notebook used in the tutorial: www.kaggle.com/robikscube/int...
* RU-vid: / @robmulla
* Twitch: / medallionstallion_
* Twitter: / medalliondata
* Kaggle: www.kaggle.com/robikscube
#Python #Coding #DataScience #Kaggle

Опубликовано:

 

19 июн 2024

Поделиться:

Ссылка:

Скачать:

Готовим ссылку...

Добавить в:

Мой плейлист
Посмотреть позже
Комментарии : 442   
@robmulla
@robmulla 2 года назад
Chapters don't appear to be working for my videos for some reason. Here are the timestamps for the video: 00:00 Introduction 01:00 Imports and reading data 03:35 Data Understanding 06:40 Data Preparation 20:57 Feature Understanding 27:35 Feature Relationships 35:30 Asking a Question about the Data 40:00 Final Thoughts
@oshogweikekhai5499
@oshogweikekhai5499 Год назад
amazing video, please can you make available the dataset used? thank you
@Pluvo2for1
@Pluvo2for1 Год назад
Hi Rob. I'm new to notebooks. Could you plese explain why you don't need an explicit print statement to view cell output?
@cocgamingstar6990
@cocgamingstar6990 8 месяцев назад
This was one of the bad video and non industry knowledge
@ABYHYDROS
@ABYHYDROS 7 месяцев назад
😮
@saintsaens3517
@saintsaens3517 Год назад
As a begginer in data this really opend my eyes as to how things works. Your explanations are very clear and I can feel how passionate you are. Great video
@robmulla
@robmulla Год назад
Glad it was helpful! I am passionate about it, and excited when I hear people are learning from my videos.
@arunkumark6351
@arunkumark6351 Год назад
So true,, though there are many projects and training videos outside. The way you think, step by step approach and the reason for doing so, is so relatable and feels very natural. Awesome video, Thank you so much
@saptarshidey7672
@saptarshidey7672 10 месяцев назад
There are a ton of EDA videos on RU-vid. This is one of the best I have ever come across. You just nailed it, Rob.
@robmulla
@robmulla 10 месяцев назад
Thanks so much!
@nigelkiernan1321
@nigelkiernan1321 3 месяца назад
This is a great refresher guide! Very nice coding style and I appreciate you using a simple Kaggle dataset to follow along. Great stuff - thanks!
@romanrodin5669
@romanrodin5669 Год назад
Great channel! Very helpful for beginners and for those who' re digging deeper and moving forward into DS industry as myself! Thanks Rob!
@lindyhopwithliz
@lindyhopwithliz 7 месяцев назад
Great video: informative and fun; easy to follow along. Helped me feel motivated to tackle more Python Pandas. Thanks so much!
@adamvoltemar420
@adamvoltemar420 3 месяца назад
This is one of the best content related to Data Analysis and Python/Pandas, I am really glad I found it! Thanks!
@user-et7zv5rs3q
@user-et7zv5rs3q 11 месяцев назад
this is absolutely amazing! Follow your video step by step actually make me more confident of my coding!
@Eysh2009
@Eysh2009 9 месяцев назад
This video makes me feel glad to be alive. Great explanation, amazingly fast and on point. Thank you!
@silver_soul98
@silver_soul98 11 месяцев назад
I have tried plenty of tutorials by now. This is the most precise and to-the-point tutorial so far. Well done.
@jcgdt94
@jcgdt94 7 месяцев назад
Thank you for sharing this, Rob! This is wonderful content. Keep up the good work. Cheers!
@Aarron-io3pm
@Aarron-io3pm Год назад
Hi Rob, this was super useful to me as a tired Excel veteran and python beginner. You explain and demonstrate everything so clearly, thank you
@alikakavand3165
@alikakavand3165 7 месяцев назад
It was fantastic. Every step you took was kind of amazing, specially the last bit where you visualized average coaster speed by location. Thanks.
@itm1996
@itm1996 4 месяца назад
Hands down one of the best tutorial I ever saw. Basic enough to follow as a newbie but demanding enough to be useful. ❤
@jmmj5018
@jmmj5018 6 месяцев назад
thank you so much, you have made my EDA analysis easier and faster. :) also, it's easy to digest as I go along with the data you are working. thanks a lot. you are helping a lot of analysts or people who wants to study in data analytics. Great video, keep them coming.
@sheshankjoshi
@sheshankjoshi Год назад
This is perfect for my interview tomorrow. I just needed a refresher on how to approach the problem, ask right questions and then come up with exploratory options. Thank You so much for this video
@chrisosomo2856
@chrisosomo2856 Год назад
I can’t get enough of your videos, especially the very hands-on practical approach to learning. Your explanations are clear and easy to follow along with. Please make more of these types of videos. You are definitely makes a change and contributing to the RU-vid knowledge pool. Thank you so much.
@SearchingforScraps
@SearchingforScraps Год назад
Wow ! this is such a clean run through. You make it look so easy and easy to learn ! Thank you so much. This is giving me the confidence to finally start something on my own.
@9jorge
@9jorge 11 месяцев назад
Thanks a lot, you explain concepts like no one, subscribed!
@user-ei9jd7pw4s
@user-ei9jd7pw4s 4 месяца назад
Thank you for the video. You have combined all my knowledge into one comprehensible picture.
@TrendingUpdateCentral
@TrendingUpdateCentral 20 дней назад
It's really hard to find good videos on this topic. This was fantastic. Thank you.
@vietndk5437
@vietndk5437 Месяц назад
I did not practise pandas usually then I almost forgot the syntax or its application. Now I find your video with very clear instructions, it helps me remember better. Thanks alot
@jackgarn8392
@jackgarn8392 Год назад
This is such an amazing guide! I’m new to data analysis and had limited python exposure and have taught myself most of these things so far by googling or just reading the pandas documentation. Watching someone familiar with the process do it all together was really helpful and gave me a lot of insights as to how I can improve my skills and workflow. Thank you so so much!
@Rantalytics
@Rantalytics 6 месяцев назад
wow... thank you so much rob. I come from a frontend background but just began a data analytics bachelor at SJSU. I was trying to grasp at a high level what DA might look like as it pertains to conducting an explorative project. This tutorial completely cleared up those questions!
@Thorne2610
@Thorne2610 Месяц назад
I have watched more than 5 times its really eye opener and step by step teaching. Well done Boss
@iiN1GH7M4R3ii
@iiN1GH7M4R3ii Год назад
Great stuff man, thanks so much for this. Youre great at teaching beginners!!!!
@darshantawte7435
@darshantawte7435 Месяц назад
Lucidly explained. One thing i have learned that in order to be a great Data scientist what matters is your problem solving skills, understanding the business requirements and curiosity to dive deep into data (true to the name data scientist) . There is no need in remembering these codes as long as you know what to look for.
@bradleyfrueh2761
@bradleyfrueh2761 Год назад
Thank you so much. I appreciate the work you put into your videos. It shows.
@robmulla
@robmulla Год назад
I really appreciate the feedback! Please share with anyone you think might also learn from it.
@Anarky35
@Anarky35 Год назад
Great! Thanks a lot for this tuorial, so helpful for me as a beginner!
@wahabamin6946
@wahabamin6946 Год назад
Thanks Rob, you’re doing a great job for the data science community. Your videos here and on TikTok is helping me a lot in this journey. Thank you.
@robmulla
@robmulla Год назад
Love to hear that Wahab! Glad you learned something, and thanks for posting the feedback.
@Dongnanjie
@Dongnanjie 5 месяцев назад
Definitely amazing. Thank you so much, Rob!
@fabiolasilva6623
@fabiolasilva6623 6 месяцев назад
Amazing, Thank you so much, the best tutorial!! :)
@aydanlopresti2879
@aydanlopresti2879 8 месяцев назад
Clear and applicable to any type of analysis. Thank you
@wesleyweel8007
@wesleyweel8007 Год назад
The quality of your content is only surpassed by the ease at which it is to assimilate it, keep up the great content Rob, cheers!
@robmulla
@robmulla Год назад
Wow. Thanks for the positive feedback!
@yjgg5882
@yjgg5882 Год назад
​@@robmulla bi L😅
@yjgg5882
@yjgg5882 Год назад
🎉😊
@tomparatube6506
@tomparatube6506 7 месяцев назад
I dabbled in this 4 years ago at EDX. This is a wonderful refresher. Thanks Rob!
@MCMMADDOGXCV
@MCMMADDOGXCV 9 месяцев назад
This is GOLD! Thank you
@ZeuSonRed
@ZeuSonRed 9 месяцев назад
This was the greatest Tutorial I ever had. Thank you. Here I get to cnow about the corelation and some panda functions and ploting. But for Power counting and Corelation between Variables was very pleasend and satisfied my expectations. From Bulgaria Volga Sauvete! Thaank you ! 👑👑👑👑👑👑
@TheMonieray
@TheMonieray Месяц назад
Really loved seeing the pairplot. Will definitely try this out this week
@walterpark8824
@walterpark8824 Год назад
Terrific introductory survey that answered so many of my questions, moving from SQL. Looks extremely efficient. Now, to plug into my data! Thanks.
@robmulla
@robmulla Год назад
Glad you liked it. Sql still has a place but when working with the data for EDA pandas can’t be beat.
@aayaanhasnain5143
@aayaanhasnain5143 Год назад
Hey Rob, really admired the way you explained complicated topics with ease!! Looking forward to learning from you more :)
@robmulla
@robmulla Год назад
Thanks so much for that feedback. I really apprecaite it.
@sa-pt3kf
@sa-pt3kf Год назад
By far one of the most clear and concise ways of teaching in a computer science related field I've come across in a while. I'll be binging all your tutorials for sure!
@robmulla
@robmulla Год назад
Whoa. I love this feedback. I'll try my best to keep them coming.
@muhammadfadliaktsar7172
@muhammadfadliaktsar7172 11 месяцев назад
Thank you Rob for your explanation, before this it was hard for me to study and my mind just start pressured me of how to do EDA with Python language. And this video just open my mind to study it!
@kmvkmv3433
@kmvkmv3433 10 месяцев назад
Great stuff - thank you, Rob!
@ricardorockthem3339
@ricardorockthem3339 6 месяцев назад
Very well explained and quite nice difficulty level! Brilliant!
@deneskalnoky7939
@deneskalnoky7939 6 месяцев назад
This is a really good tutorial. I am new to Python and data analysis, and was completely lost. It was so hard to find a good, reliable source about it. This source just clarifies the basics for beginners so that I can start off with my own project.
@sergiopellitero4136
@sergiopellitero4136 10 месяцев назад
This video is.... PERFECT! Thanks ^^
@siddhant0701
@siddhant0701 Год назад
This was a really nice tutorial, Rob. Had fun coding along, thanks for doing it :)
@robmulla
@robmulla Год назад
Thanks for watching and providing feedback. Feel free to share with anyone else you think might also learn from it.
@walterpark8824
@walterpark8824 Год назад
well organized, concise, very helpful to get grounded in Pandas. my explorations will continue. Thanks!
@robmulla
@robmulla Год назад
Glad it helped! Thanks for watching. Share with anyone else you think might also learn from it.
@rdatta
@rdatta 10 месяцев назад
Excellent work and introduction. Very well done!
@linux2350
@linux2350 Год назад
Thank you! that was very informative!
@shihaosun6861
@shihaosun6861 Год назад
Thank you very much Rob for this wonderful walkthrough and explanation! Really Appreciate it!!!!
@robmulla
@robmulla Год назад
Thanks for the feedback! Glad to hear you learned something from it.
@jsplayground241
@jsplayground241 Год назад
You are the best coach. Thank you, sir!
@panneerselvamposangu9929
@panneerselvamposangu9929 9 месяцев назад
This is very helpful. Thank you, Rob!
@pdrcouto
@pdrcouto Год назад
It has been great to refresh some topics and learn new ones. Thanks a lot :)
@robmulla
@robmulla Год назад
Thanks Pedro. So glad you’ve found these as a good refresher.
@edusheffer
@edusheffer 9 месяцев назад
I'm impressed! your videos are excellent. Thanks, Rob
@olusolafatoye9691
@olusolafatoye9691 Месяц назад
Great video. Clear explanation! You just earned a new subscriber
@NeerajKumardeaf
@NeerajKumardeaf 5 месяцев назад
Thank you so much. I am happy for your teaching about EDA with data analysis for pandas. I am clearly explaining to you. I can continue my hands-on experience for EDA
@chrismagee5845
@chrismagee5845 Год назад
This is the best reference guide. I always find myself rewatching this whenever I'm cleaning a dataset.
@robmulla
@robmulla Год назад
So glad you find it helpful.
@guilhermedesanctis
@guilhermedesanctis Год назад
Thanks for this lesson. It’s much valuable.
@vishwathapa6626
@vishwathapa6626 10 месяцев назад
Pair-plot looked absolutely beautiful!
@thaanathaana4522
@thaanathaana4522 5 месяцев назад
Clear explanation for beginners.. will follow you more for tutorials
@JHornsby89
@JHornsby89 Год назад
This is the second one of these I have now watched and coded along with! Genuinely awesome content, so precise and simple to follow. You make daunting tasks (for beginners getting into data) really accessible which is a sign of a great teacher!
@robmulla
@robmulla Год назад
Comments like this make me really happy that I made this video. So happy it helped you in your coding journey. Did you use the Kaggle notebook when you followed along?
@ilmankhairusidqi9146
@ilmankhairusidqi9146 7 месяцев назад
Your explanation is easy to understand and also show how the things work, ThankYou please make more videos about EDA in python Rob!!
@grandselenium296
@grandselenium296 5 месяцев назад
- import data Data understanding - filter columns by need - convert dtype of certain columns - rename columns - check isna in columns and dropna on row or column accordingly - locate duplicated rows in single or multiple columns - drop duplicated rows from dataset and reset index Data prep -univariate analysis of features - kde, histogram, box plot - use value counts to determine duplicates and unique values in feature - he creates bar plot for top 10 years introduced to highest # of coasters - he creates histogram to bin speeds of roller coaster and view their frequency distribution Feature understanding - scatterplot, pairplot, correlation, groupby - he creates scatterplot for speed and height with year based hue of points - he create pairplot to compare correlation between features, alongside hue from material type - creates a correlation heatmap for selected features Ask question - he uses groupby and query to create bar plot with sorted descending data on mean speed of roller coasters by location.
@kapamagicman
@kapamagicman 2 месяца назад
Awesome! Love the method chaining
@leonrobinson2053
@leonrobinson2053 9 месяцев назад
Late to the party but this is really really good. Helps you dig in to the detail (rather than you thinking, how do I do what I'm thinking I need to do). This should be a template to use as it general enough for you to pick it up but specific enough with examples to be used elsewhere
@mschuer100
@mschuer100 Год назад
Awesome video! Thanks for putting the time into this. Very helpful
@robmulla
@robmulla Год назад
Glad it was helpful! Share with a friend!
@mschuer100
@mschuer100 Год назад
@@robmulla I certainly will. Thanks
@hardikacharya2664
@hardikacharya2664 2 года назад
Great video. Look forward to your twitch streams!!
@robmulla
@robmulla 2 года назад
Thanks so much. Hope to see you during one of the twitch streams soon.
@ranahuzaifa147
@ranahuzaifa147 Год назад
I was smiling at 39:23 . How easily you answered the question. Thanks for this amazing video tutorial.
@robmulla
@robmulla Год назад
My pleasure 😊 Glad you liked seeing it all come together at the end.
@Jack-bs2mx
@Jack-bs2mx Год назад
you are the best teacher ever, I'm not good at English but I try to write down this sentence to show my appreciate to you. Im still waiting for new lesson using pandas for Data Analyst
@anuragarunedlabadkar8889
@anuragarunedlabadkar8889 Год назад
Excellent explanation Rob. Learned a lot from this video. Keep it up.
@mario1ua
@mario1ua 5 месяцев назад
Really cool and looks so easy now, thank you Rob
@Dean-nz9ld
@Dean-nz9ld 2 месяца назад
This is a brilliant video, helped alot thankyou!!
@chess6802
@chess6802 Год назад
Perfect stuff what I love about this video is the simplicity and the clearness of the way you talk
@robmulla
@robmulla Год назад
I appreciate that! Thats how I learn best so it's also how I try to explain things.
@chess6802
@chess6802 Год назад
@@robmulla your response reflect your both knowldge and wisdom please keep on 💞
@Al-Ahdal
@Al-Ahdal Месяц назад
@Rob Mulla: Excellent video on EDA
@rrestituti
@rrestituti Год назад
#1 Data science youtuber!!! You made easy to understand the basic commands e sintaxes. Thank you a lot, Rob. 😉
@robmulla
@robmulla Год назад
Tell all your friends. 😆
@lashlarue7924
@lashlarue7924 Год назад
Agreed, this is the best Python content on the entire internet, hands down. I'm going to be carefully watching these videos over and over for a long time.
@nunolopes3910
@nunolopes3910 Год назад
This type of videos are amazing to follow, i am starting to use python for data analysis and i could not happier! Your channel is helping me alot, thank you!
@robmulla
@robmulla Год назад
So happy to hear this. Let me know what you would like to see in future videos.
@nunolopes3910
@nunolopes3910 Год назад
​@@robmulla One question i have is about the safety of using jupyter while working with company data. I am just starting to use jupyter and that is a big question that i'm sure other begginers would like to know to! Can you give your opinion on it? Thanks in advance
@soumyadrip
@soumyadrip 2 года назад
Wow so many things are covered, its a great tutorial for getting started with EDA.
@robmulla
@robmulla 2 года назад
Thanks @somuSan. Glad you liked the tutorial. It took me waaaaay longer to film than I expected but I'm happy with the result. I hope more people in the future find it helpful.
@zhaozheng7704
@zhaozheng7704 Год назад
Excellent tutorial and immediately useful. Thank you!
@robmulla
@robmulla Год назад
Glad it was helpful! Thanks for watching Zhao.
@hotbit7327
@hotbit7327 Год назад
Some of your other videos I found too fast paced, like about pandas mistakes, this one here has great content and also fantastic presentation. Thank you.
@robmulla
@robmulla Год назад
Thanks for the feedback. I’ve been experimenting with different editing styles so it’s nice to hear you prefer slower paced like this video. I’ll keep that in mind in the future.
@ThePablo505
@ThePablo505 10 месяцев назад
amazing video, appreciate it a lot!
@Davlet
@Davlet Год назад
your content is pure gold! thank you
@robmulla
@robmulla Год назад
Glad you enjoy it! This comment is gold. 😎
@alisonhenley2551
@alisonhenley2551 Год назад
Wow, what an informative fun tutorial. Thanks Rob!
@robmulla
@robmulla Год назад
Glad you learned from it and I appreciate the comment.
@alasdairmunro1953
@alasdairmunro1953 Год назад
Really great video. Clear and well presented. Thank you.
@robmulla
@robmulla Год назад
Glad you enjoyed it!
@rr2b
@rr2b 2 месяца назад
Very nice explanations , thank you so much!
@282OJK
@282OJK 2 года назад
This is a great video. So helpful and informative. Thank you.
@robmulla
@robmulla 2 года назад
Glad it was helpful! Thanks for watching and tell your friends!
@shoaibahmed5616
@shoaibahmed5616 4 месяца назад
I completed my certification in data science but could not figure out where to start. Thanks for such a detailed and easy to understand instructions with very useful commands. It helps a lot. Keep it up
@lucasoliveirapaes
@lucasoliveirapaes 9 месяцев назад
Very good content, Rob. It helped me a lot! Thanks for sharing. You gained a subscriber!
@robmulla
@robmulla 9 месяцев назад
Thanks for the feedback and subscribing!
@kushagra54
@kushagra54 23 часа назад
Amazing video, you made it so simple
@xiaoyangshawnhuang1251
@xiaoyangshawnhuang1251 9 месяцев назад
it's a super great video, just enjoy the way you explained, it's long video but every part is so useful and informative, thanks a lot for sharing it, well done.
@codecode6419
@codecode6419 Год назад
I am spending my weekend with your videos and I would like to say that I have learned several tricks to use python's libraries efficiently. Thanks for your explanation and your time to provide the videos
@100themagician
@100themagician Год назад
Excellent video Rob, thank you!
@robmulla
@robmulla Год назад
Thanks for watching!
@zulucharlie5244
@zulucharlie5244 Год назад
Fantastic video, extremely informative, and very useful. Thank you.
@robmulla
@robmulla Год назад
So thankful that you found this helpful. Share with anyone else you think might also learn from it!
@onyinyeobijiofor7075
@onyinyeobijiofor7075 7 дней назад
I loved it. Thank you.
@irfanshaikh262
@irfanshaikh262 8 месяцев назад
This is really an eye opener stuff for rookies like me. Thank you Rob
@robmulla
@robmulla 8 месяцев назад
Thanks for watching!
@nyadokuamponsah04
@nyadokuamponsah04 Год назад
This is the best video I have watched so far. Thanksss!
@robmulla
@robmulla Год назад
Thanks so much!
@anishkumaranjan
@anishkumaranjan 6 месяцев назад
Just completed it along with coding it all!
@bingolio
@bingolio Год назад
Excellent and succint. THANKS!
@robmulla
@robmulla Год назад
Glad it was helpful! If you can, share it somewhere you think others might also learn from it too!
@sowjanyakake5720
@sowjanyakake5720 Год назад
Very detailed explanation for each step you perform during the analysis, helpful for beginners like me.
@robmulla
@robmulla Год назад
Glad to hear you found the pace was helpful!
@adarshtiwari7395
@adarshtiwari7395 Год назад
I have been grinding through your videos lately in preperation for my data science job and you have been an absolute blessing! Thanks a bunch!!
@robmulla
@robmulla Год назад
Wonderful! Glad I could help.
@hmx21
@hmx21 Год назад
Hey Adarsh! Other than this what else are you learning that will help you in the data science job, I'm also preparing for the same but kinda new to data science so any guidance would be appreciated. Cheers!
@adarshtiwari7395
@adarshtiwari7395 Год назад
@@hmx21 Hi Hemang. I'm a fresher in data science as well. I started with Python and statistics. Then moved on to EDA followed by Machine Learning algorithms. I then made a few projects on ML. Also tools like SQL, Power BI, Excel are preferred
@hmx21
@hmx21 Год назад
@@adarshtiwari7395 Hey Adarsh! Thnaks for the reply, I'm done with EDA and made a dashboard using Power BI, and don't know how much machine learning or SQL is required for the role as I've studied SQL in college and know how to work with joins,etc. Any tips or resources you'd like to share would be a great help. Also from where did you learn stats for ds, whenver I try to learn stats online I get overwhelmed with the magnitude of tutorials.
@adarshtiwari7395
@adarshtiwari7395 Год назад
@@hmx21 depends on what you're going for. If you are interested in a data analyst position, EDA through Power BI is great but if you want to go done the data scientist or machine learning route you need to be hands on with Python. EDA using python is much more nuanced as compared to visualisation tools like Power BI. SQL is essential in all contexts so it's a must. But whether you should study machine learning depends on your career goal.
Далее
Learning Pandas for Data Analysis? Start Here.
22:50
Просмотров 79 тыс.
Python 101: Learn the 5 Must-Know Concepts
20:00
Просмотров 1 млн
25 Nooby Pandas Coding Mistakes You Should NEVER make.
11:30
3 PYTHON AUTOMATION PROJECTS FOR BEGINNERS
17:00
Просмотров 1,5 млн