Тёмный

Data science in Python: pandas, seaborn, scikit-learn 

Data School
Подписаться 245 тыс.
Просмотров 187 тыс.
50% 1

Опубликовано:

 

20 окт 2024

Поделиться:

Ссылка:

Скачать:

Готовим ссылку...

Добавить в:

Мой плейлист
Посмотреть позже
Комментарии : 572   
@dataschool
@dataschool 3 года назад
Having problems with the code? I just finished updating the notebooks to use *scikit-learn 0.23* and *Python 3.9* 🎉! You can download the updated notebooks here: github.com/justmarkham/scikit-learn-videos
@aryanterrance6092
@aryanterrance6092 3 года назад
I know im randomly asking but does any of you know of a method to log back into an instagram account? I was stupid forgot the account password. I appreciate any help you can give me!
@stetsondavian5756
@stetsondavian5756 3 года назад
@Aryan Terrance instablaster ;)
@aryanterrance6092
@aryanterrance6092 3 года назад
@Stetson Davian thanks for your reply. I found the site through google and im trying it out now. I see it takes quite some time so I will get back to you later when my account password hopefully is recovered.
@aryanterrance6092
@aryanterrance6092 3 года назад
@Stetson Davian It did the trick and I now got access to my account again. I'm so happy:D Thank you so much you really help me out :D
@stetsondavian5756
@stetsondavian5756 3 года назад
@Aryan Terrance no problem :)
@Emmaizam
@Emmaizam 5 лет назад
This is the best ML tutorials I have ever seen! Thank you very much Sir.
@dataschool
@dataschool 5 лет назад
Thank you!
@prachinainawa3055
@prachinainawa3055 3 года назад
I'm a beginner but your way of teaching makes me love machine learning, I feel it's so easy. Even you make me understand how the algo is working behind the scene. Love from India...
@dataschool
@dataschool 3 года назад
That's awesome to hear! 😊
@LekanMakanju
@LekanMakanju 2 года назад
This is unreal! I literally abandoned my datacamp machine learning course for this one and no regret at all. I especially like that you taught the underlying mathematical concept of how these codes come to be. You also speak clear and understandable English plus the sound system is top notch. I've taken your Data science course and your and prof Allen's remains my best to date with Hugo's coming in a distant 3rd. And to think you recorded this more than 7 years ago makes you conclude that this is way ahead of its time
@dataschool
@dataschool 2 года назад
Thank you so much for your kind words, Moruf! 🙏
@TheBurningofSolomon
@TheBurningofSolomon 7 лет назад
MANY THANKS!!! All other data science tutorials (for beginners) go by way to quickly. Some people may find you going slowly a nuisance, but I found it to be EXTREMELY HELPFUL. THANK YOU! Subbed ^__^
@dataschool
@dataschool 7 лет назад
Awesome! That's so great to hear... thanks very much for your comment!
@XaccountFr
@XaccountFr 5 лет назад
@@dataschool yes very good explanation for the beginner like me
@pratikdhumal3975
@pratikdhumal3975 7 лет назад
I was searching for appropriate videos on ML from long time. After following this series i can say that it is the best which i have ever seen.Each and every concept is covered with great detail. Same applies for study material and links. Thanks Data School .....!!!!
@dataschool
@dataschool 7 лет назад
That is great to hear, thanks so much for your very kind words!!
@tissues2441
@tissues2441 6 лет назад
You're a way better instructor than my college professors. The syntax is fairly simple and the explanation of the statistical intuition behind the metrics made this enjoyable.
@dataschool
@dataschool 6 лет назад
Thanks very much for your kind words! Really appreciate it!
@dataschool
@dataschool 8 лет назад
Want to learn more pandas? I have a new video series about it: ru-vid.com/group/PL5-da3qGB5ICCsgW1MxlZ0Hq8LL5U3u9y
@lakswin
@lakswin 5 лет назад
Kinda complete one, putting together all at-once! The best, I have watched until now!
@dataschool
@dataschool 4 года назад
Thank you!
@Superdooperhero
@Superdooperhero 7 лет назад
I watch way too much training videos and I would like to say that I wish you were the presenter in all of them. You rule at this training thing!
@dataschool
@dataschool 7 лет назад
Thanks so much! :)
@mukulkathpalia6924
@mukulkathpalia6924 7 лет назад
These are the best tutorial series on machine learning.
@dataschool
@dataschool 7 лет назад
Wow, thank you so much!
@dataschool
@dataschool 6 лет назад
*Note:* This video was recorded using Python 2.7 and scikit-learn 0.16. Recently, I updated the code to use Python 3.6 and scikit-learn 0.19.1. You can download the updated code here: github.com/justmarkham/scikit-learn-videos
@rael213rd
@rael213rd 5 лет назад
Can we please get a video about ensemble learning (bagging and boosting)
@BluntAmericanHistory
@BluntAmericanHistory 8 лет назад
Your videos are fantastic, for people with random gaps in their knowledge you explain things very clearly.
@BluntAmericanHistory
@BluntAmericanHistory 8 лет назад
+Siddharth Gupta For people who have random chunks of exposure to certain aspects of sklearn/pandas/etc: watch the video at 1.25 or 1.5x speed. You can get through the lesson faster, and the increased speed will actually have a counterintuitive effect of making you focus more. Also when you start losing focus or miss a concept, you will notice right away because you will suddenly be totally lost, so you will know to rewind.
@dataschool
@dataschool 8 лет назад
+Siddharth Gupta Thanks for your kind comments!
@faroukobafemi9496
@faroukobafemi9496 4 года назад
To be candid, this is the best video I've ever watched on scikit-learn. Thumbs up!!!
@dataschool
@dataschool 4 года назад
That's awesome to hear... thank you! 🙏
@JackSimpsonJBS
@JackSimpsonJBS 9 лет назад
Thank-you so much for your explanations of sk-learn, it finally makes sense to me! I'm already pretty familiar with Pandas so I'd love to learn more about sk-learn, because I feel there are so many other machine learning algorithms I'd love to get my head around.
@dataschool
@dataschool 9 лет назад
***** Nice! I love to hear that my explanations are helping things to "click" for people. Thanks for your comment!
@injypal
@injypal 5 лет назад
Please add more videos to the series. It is really helpful and amazing to watch your videos. You are a great teacher.
@dataschool
@dataschool 5 лет назад
Thanks for your suggestion, and for your kind words!
@kennyl7542
@kennyl7542 8 лет назад
wonderful videos! I would like you to focus on scikit-learn, and your style of teaching which combines hands-on with scikit-learnt, real examples, explanation of ML techniques are very helpful!
@dataschool
@dataschool 8 лет назад
+Kenny L Thanks for your kind comments and your feedback!
@nackyding
@nackyding 7 лет назад
Word! I agree with you!
@joancolon635
@joancolon635 6 лет назад
Kenny L i
@terryhenyo9216
@terryhenyo9216 5 лет назад
Your video tutorial is outstanding! You can simplify complex concepts in an elegant manner. And unlike other instructors you don't show-off on how smart you are. That's why we know that you're really a smart guy :)
@dataschool
@dataschool 4 года назад
Thank you SO MUCH for this kind comment! I truly appreciate it.
@andrewsanchez4349
@andrewsanchez4349 7 лет назад
Definitely one of the best tutorials I've ever watched. Can't wait to work through the 3 hour presentation at the end of this. Thank you!
@dataschool
@dataschool 7 лет назад
Thanks so much for your very nice comment! You're very welcome! :)
@aegystierone8505
@aegystierone8505 4 года назад
Really appreciate that you also explain the algorithms and how to find the coefficient governing the equations. Thank you so much!
@guptaachin
@guptaachin 8 лет назад
You are undeniably the best tutor i have ever had. Thank you for teaching DS precisely. :)
@dataschool
@dataschool 8 лет назад
Wow, thank you! I'm glad my teaching style works well for you :)
@AashishKumar1
@AashishKumar1 8 лет назад
This is the best video tutorial series on Machine learning I have seen. You have hooked me up! Thanks for creating the series and you are an amazing teacher. Keep it up!
@dataschool
@dataschool 8 лет назад
+Aashish Kumar You're very welcome, and thanks for your kind words!
@AntonioAugustoVianaS
@AntonioAugustoVianaS 9 лет назад
More pandas please! And more Seaborn! A large part of Machine Learning is "messing" with the data BEFORE you apply any of the algorithms on it, and pd and sns are really good at that. Also, I think it'd be interesting (maybe latter in the series) that you could go on an all out example, like working with the titanic dataset from Kaggle, and giving hints on how to visualize, understand the data and choose the best algorithm for it. As a final note, I'm already a bit familiar with the techniques you use, but your comments and clear explanations makes everything clearer and helps me fixate some of these techniques. Thank you for that! Excellent series, and keep on the good work.
@dataschool
@dataschool 9 лет назад
Antonio Augusto Santos Thanks for the feedback! I am planning to cover more examples later in the series, probably using a Kaggle competition. And, I appreciate your kind words! I was hoping to reach both users new to machine learning and those with some machine learning familiarity, so it's nice to hear that it's working :)
@lubojurciak2525
@lubojurciak2525 5 лет назад
I wish you were my data analysis lecturer... Thank you very much for this.
@dataschool
@dataschool 5 лет назад
Thanks very much for your kind words!
@umashankarverma3179
@umashankarverma3179 5 лет назад
Your teaching methodology is best,you step by step teaching method is very helpful for me to understand.You are the best.
@dataschool
@dataschool 5 лет назад
Thank you!
@priyaponnus8620
@priyaponnus8620 3 года назад
Thank you for the awesome videos. I am currently learning Machine Learning as part of a course. I don't have previous knowledge of Python (currently learning an introduction to Python as well), I am really struggling to understand; this is my midterm break; I found one of your videos while I was searching, I am one of the fortunate to found your videos. Thanks for your effort.
@dataschool
@dataschool 3 года назад
You're very welcome! Glad I could help!
@DenzilJoseph
@DenzilJoseph 6 лет назад
Excellent description of the end-to-end ML flow. Thank you.
@dataschool
@dataschool 6 лет назад
You're welcome!
@MrChristian331
@MrChristian331 5 лет назад
Say one thing....you are an excellent teacher. My teachers at engineering school and on Udemy don't explain things half as well as you do! That should tell you a lot! I wish I could hire you personally.
@dataschool
@dataschool 5 лет назад
Thanks so very much for your kind words! You might be interested in joining my membership community: www.patreon.com/dataschool
@igorfigueredo5040
@igorfigueredo5040 7 лет назад
Hi, im a begginer in data science and your videos are helping me a lot of, thanks.
@dataschool
@dataschool 7 лет назад
You're welcome!
@your_buddy_11
@your_buddy_11 5 лет назад
Thank you very much Your teaching methodology is awesome making things crystal clear.
@dataschool
@dataschool 4 года назад
Thanks!
@doupanpan7271
@doupanpan7271 6 лет назад
really thankful for your video series. it is straightforward and easy to understand, highly recommend to other guys who are interested in python, machine learning etc.
@dataschool
@dataschool 6 лет назад
Awesome! Thanks for sharing it with others :)
@danielandreasen2293
@danielandreasen2293 9 лет назад
As for an answer for your question: I would like to learn more about sklearn. Pandas is amazing, and I'm just starting to learn it, but there are already a lot of nice tutourials out there. Keep up the good job :)
@dataschool
@dataschool 9 лет назад
Daniel Andreasen Good point! There are lots of Pandas tutorials already out there.
@vamsikrishna1131
@vamsikrishna1131 5 лет назад
Lots of great information at the end and links in the description. Very valuable. Really appreciate it!
@dataschool
@dataschool 5 лет назад
Thanks!
@flamboyantperson5936
@flamboyantperson5936 6 лет назад
You are the best teacher in the world. I learned something very important to me in this video. Thank you so much. Please keep the good work going.
@dataschool
@dataschool 6 лет назад
Wow! Thank you so much for the very kind comment! Good luck to you :)
@robindong3802
@robindong3802 6 лет назад
you made it so easy to learn. you lead me to ML right here. Thank you so much.
@dataschool
@dataschool 6 лет назад
You're very welcome!
@The2002962
@The2002962 7 лет назад
Tutorial content is pretty cool. adding humor while explaining will add good experience for learners. :)
@dataschool
@dataschool 7 лет назад
Thanks!
@gauravmitra3683
@gauravmitra3683 8 лет назад
This is one of the best available online resource for introduction to data science. Thank you for these amazing videos. Its teachers like you who inspire students like me :)
@dataschool
@dataschool 8 лет назад
Wow, what a kind comment! Thank you so much!
@arjunbakshi810
@arjunbakshi810 4 года назад
Gaurav, Im having trouble reading advertisemets.csv Can you help ma?
@TheGautamj
@TheGautamj 3 года назад
The csv file does not load up. Has the url changed?
@v_b_r_1996
@v_b_r_1996 7 лет назад
Very good content. I have tried so many video series for data science and this is by far the best! Thanks!
@dataschool
@dataschool 7 лет назад
That's great to hear - thanks so much for your kind comment!
@samkumargupta2536
@samkumargupta2536 6 лет назад
Really Awesome tutorials sir... Its very easy to understand...Better that other ML tutorials I have watched...☺☺☺
@dataschool
@dataschool 6 лет назад
Thanks for your kind comment!
@sribastavrajguru304
@sribastavrajguru304 7 лет назад
Great work,please upload more tutorials lyk these,really helpful to get started. Before watching this tutorial i was not at al aware of ML,but now after watching 4/5 videos i've got a good overview ,thank you
@dataschool
@dataschool 7 лет назад
Great to hear! Thanks for your kind comment.
@raghug2073
@raghug2073 6 лет назад
Very very great way teaching. I really liked the speed and pronounce you do, the possible mistakes which you cover, also explanation. This is great series and you are a great tutor. Fan of you and subscribed. Please make a separate series on Machine Learning (Bit more detailed), Deeplearning, AI, Data Science. I am not sure which one should be learnt first and how. I decided you are the best guru for me to make me some good level in all these skills. Please help.
@dataschool
@dataschool 6 лет назад
Thanks for your suggestions! I'll consider them for the future :)
@serdarb8995
@serdarb8995 6 лет назад
Hi Kevin, First of all thank you very much for those great videos. If you have a chance to make tutorial regarding deep learning it would be great. You are the best instructor, I've ever seen in this field. You are the best
@dataschool
@dataschool 6 лет назад
Thanks so much for your kind words, and for your suggestion!
@Dexter01
@Dexter01 4 года назад
I am answering your question 5 years later but I would love to see more video tutorials from you about scikit-learn (e.g Neural network models (supervised)) or scikit-multilearn if you want!! :) Thnx a lot Kevin!
@dataschool
@dataschool 4 года назад
Thanks for your suggestions!
@julians.2597
@julians.2597 5 лет назад
Wow, one of the best YT tutorials about this topic, thank you!
@dataschool
@dataschool 5 лет назад
Thank you!
@lakshaynandwani9324
@lakshaynandwani9324 4 года назад
@Apocalypse-, Hey, I need some help!!
@RajeshSriMuthu
@RajeshSriMuthu 5 лет назад
தலைவரே - (tamil language) Thalaiva you are great.....
@dataschool
@dataschool 5 лет назад
Thank you! :)
@genaugenaugenau
@genaugenaugenau 7 лет назад
This guy is great at teaching. Much appreciated!
@dataschool
@dataschool 7 лет назад
Thanks for your kind comment!
@harveysummers3175
@harveysummers3175 9 лет назад
These videos are outstanding. Am new to data science and many of the videos are too simple or too hard. You have found the goldilocks zone of data science. I also like that they are on youtube where I can speed them up to 1.5x to match my comprehension rate.Vimeo can't do that. I would like you to focus on Scikit, but use Pandas as most of use will be using both. I think a single lesson on how to use Pandas, as well as how to customize Ipython/Jupyter, would also be useful. I'd also like to see a video focused on data sources and on how to approach complex problems (ala kaggle challenges) Improvement suggestions: 1. Focus on technnical quality. Use basic stage lighting (difussed above, side, front, w/ reflector) and a condensor mic to better pic up your voice w/o echo. 2) put a whiteboard or suchsimple background behind you - way to much background clutter. And I think you are missing an opportunity to end with marketing your courses at data school, your book, etc.Not that I love ads, but... marketing!
@dataschool
@dataschool 9 лет назад
Harvey Summers Thanks for all of the suggestions, and your kind comments! Very helpful. Building up to more complex problems is definitely on the list. And, it's nice to know that I'm hitting the "sweet spot" in terms of difficulty level.
@JCRMatos
@JCRMatos 9 лет назад
Another excellent video. Please continue to focus on ML and scikit-learn.
@dataschool
@dataschool 9 лет назад
João Matos Thanks for your feedback, much appreciated!
@brothermanbill7338
@brothermanbill7338 4 года назад
when I use seaborn to pairplot the data, it doesn't show data for first column i.e. 'TV'
@sofjakovalevskaya1446
@sofjakovalevskaya1446 6 лет назад
Really perfect explanation and walk through. Thanks a lot!
@dataschool
@dataschool 6 лет назад
You're very welcome!
@bogdanjcnd
@bogdanjcnd 7 лет назад
I totally agree, the excellent guide for data learning , visualisation and machine learning.Great work
@dataschool
@dataschool 7 лет назад
Thanks for your kind comment!
@siddhidhavale7329
@siddhidhavale7329 4 года назад
Hi, the file URL isn' valid. Can you please share it?
@JoannaChmielewska_uk
@JoannaChmielewska_uk 8 лет назад
Thank you for making the effort to produce these videos. It's a great resource and your delivery is superb.
@dataschool
@dataschool 8 лет назад
Wow, what a kind compliment, thank you so much!
@yffzju3405
@yffzju3405 7 лет назад
Cool video!I just finish your pandas video series, but I thought pandas should be learned before the sklearn, well, anyway thank you for making such great videos for us.
@dataschool
@dataschool 7 лет назад
Great! I also have a scikit-learn video series: ru-vid.com/group/PL5-da3qGB5ICeMbQuqbbCOQWcS6OYBr5A
@reassassinator
@reassassinator 6 лет назад
Your videos really helped me understand the sklearn basics easily. It would be great if you could do a similar video series on SVMs using scikit-learn and its applications. Your explanations and methods are great! Thanks a lot!
@dataschool
@dataschool 6 лет назад
Thanks for your suggestion as well as your kind words! I appreciate it :)
@eturkoz
@eturkoz 5 лет назад
Your explanations are wonderful. Thank you.
@dataschool
@dataschool 5 лет назад
Thanks!
@RicardoFerrazLeal
@RicardoFerrazLeal 9 лет назад
Pretty amazing video! +1 for sk-learn as next video in this series. I also think that plotting stuff helps a lot. Whenever possible it would be nice to show seaborn in action. Great job and looking forward to the next one.
@dataschool
@dataschool 9 лет назад
Ricardo Ferraz Leal Thanks for the feedback!
@AvivProg
@AvivProg 8 лет назад
Watched all your videos. Your teaching skills are amazing, thank you for compiling those videos. I'm looking forward to your next videos about machine learning using sklearn.
@dataschool
@dataschool 8 лет назад
+AvivProg Wow, thank you! You are very welcome -- I enjoyed creating the videos. Here is the playlist containing the entire video series: ru-vid.com/group/PL5-da3qGB5ICeMbQuqbbCOQWcS6OYBr5A
@dianawilliams9470
@dianawilliams9470 5 лет назад
Thank you! Your videos are helping to make the concepts click! This is the best resource I have found
@dataschool
@dataschool 5 лет назад
You're very welcome!
@rahulmanna5730
@rahulmanna5730 5 лет назад
Currently the url for the dataset is : faculty.marshall.usc.edu/gareth-james/ISL/Advertising.csv
@dataschool
@dataschool 4 года назад
Thanks for sharing! I also have it on GitHub: github.com/justmarkham/scikit-learn-videos/tree/master/data
@WillGoesMeta
@WillGoesMeta 7 лет назад
Thank you so much for having this series!
@dataschool
@dataschool 7 лет назад
You're welcome!
@_SoundByte_
@_SoundByte_ 7 лет назад
Thanks for your lessons :-) Clear, detailed and to the point.
@dataschool
@dataschool 7 лет назад
Thanks for your kind comments!
@unstatic_electronics
@unstatic_electronics 9 лет назад
Excellent and straight to the point content again. Thanks a lot for the videos and also the additional references you provide. It's always good to know where to go next :) And please continue on with scikit-learn rather than pandas/seaborn.
@dataschool
@dataschool 9 лет назад
Romain Lepert Thanks for the feedback! :)
@Tony770jr
@Tony770jr 9 лет назад
Cool stuff, would like to see more pandas integrated with scikit learn.
@dataschool
@dataschool 9 лет назад
Tony770jr Thanks for the suggestion!
@danielkazmi
@danielkazmi 6 лет назад
Absolutely amazing material, thank you Kevin! I just wanted to know how would you deal with non-numerical features (i.e Gender, Occupation, Education, etc.) when constructing your ML model? Would you assign them numerical values? If possible, I'd like some guidance or a push in the right direction. Again you explain this material much better than most channels do, please keep up the phenomenal work!
@dataschool
@dataschool 6 лет назад
Thanks very much for your kind words! This might be helpful to you: ru-vid.com/video/%D0%B2%D0%B8%D0%B4%D0%B5%D0%BE-0s_1IsROgDc.html
@transportation-talk
@transportation-talk 9 лет назад
Great video once again. I think the focus of this series should be on ML and Scikit learn. You can explain the relevant pandas code wherever required as you did in this video. One question: Is there any algorithm in ML which can select the most relevant / explanatory predictor variables (features) from the data set (instead of user using trial and error approach)? I think this is critical for the data sets with high number of features
@dataschool
@dataschool 9 лет назад
umair durrani Great question! There is no "silver bullet" for feature selection, meaning no single strategy that will always tell you which variables to keep in your model. Domain understanding, data exploration, and human intuition are key. That being said, the Random Forests model will give you a measure of "variable importance" (on a scale of 0 to 1), and you could use that to guide the selection. As well, regularized linear models will shrink coefficients down to zero as the "penalty term" increases, effectively performing feature selection. Just keep in mind that both need to be tuned to perform properly, and features need to be scaled when performing regularization. scikit-learn has some more guidance on feature selection here: scikit-learn.org/stable/modules/feature_selection.html Thanks again for your kind and helpful comments!
@darronfuller5297
@darronfuller5297 9 лет назад
umair durrani Umair, there are several useful techniques for feature selection that I recommend you look into. Statistical methods such as forward- and backward-elimination are perfectly suited for determining the most predictive variables in a regression model and easy to understand and implement. Decision Trees inherently perform feature selection in that the variable splits are deemed significant and automatically chosen by the algorithm. A bit more on the complex side are Principle Component Analysis (PCA) and Association Rules which I believe PCA is in sci-kit-learn. Good luck! Darron. www.linkedin.com/in/votefordata
@sabr9906
@sabr9906 8 лет назад
+Data School Could you please advise in another course more about Feature Selection? Which models are more suitable for several cases etc. Like for example, sorting features' scores from RandomizedLasso, or by ranking from RecursiveFeatureElimination, or by selecting K best?
@dataschool
@dataschool 8 лет назад
+Sabr Tasbolatov Thanks for the suggestion! I'll consider it for the future.
@dataschool
@dataschool 6 лет назад
I just released a video about feature selection which might be helpful to you! ru-vid.com/video/%D0%B2%D0%B8%D0%B4%D0%B5%D0%BE-YaKMeAlHgqQ.html
@edrissemussa371
@edrissemussa371 5 лет назад
Thanks a lot for this great material you've put together. Very very helpful!
@dataschool
@dataschool 5 лет назад
Great to hear!
@mirzaburgic
@mirzaburgic 2 года назад
Great content, you have an inspiring way of presenting, keep it up! I have one question though, why is the TV coefficient smaller than the Radio coefficient, even though from the plots and best fit line it looks like the sales go up faster with more TV ad spending?
@sebastianpinedaarango8239
@sebastianpinedaarango8239 9 лет назад
Great video!! Thanks for that. I'd like to keep learning about Scikit-learn. Although, Pandas is also definitely a powerful Python data analysis toolkit.
@dataschool
@dataschool 9 лет назад
Sebastian Pineda Arango Glad you liked it! Thanks for the feedback.
@fritz0199
@fritz0199 8 лет назад
This series is amazing, thank you!
@dataschool
@dataschool 8 лет назад
You're welcome! Thanks for your kind words!
@harshrajj9995
@harshrajj9995 5 лет назад
Such great content you provide sir! Thank you so much.
@dataschool
@dataschool 5 лет назад
You're very welcome!
@saranemohan
@saranemohan 7 лет назад
It's wonderful tutorial ever I seen regarding machine learning. I expect more videos related to machine learning. if you made some video regarding some optimization technique of linear regression, then it should be more beneficial. ( like bfgs etc )
@dataschool
@dataschool 7 лет назад
Thanks so much for your kind words! I'll take your suggestion under consideration.
@loaiabdallatif4947
@loaiabdallatif4947 7 лет назад
very useful video on liner regression thanks very much Mr. Kevin Markham
@dataschool
@dataschool 7 лет назад
You're very welcome! :)
@shivbalaji8286
@shivbalaji8286 7 лет назад
You are doing a great job !!!!!! Thank you very much for all your valuable videos !!! They are really helping me !!!! Thanks again :-)
@dataschool
@dataschool 7 лет назад
That's great to hear! I'm glad the videos are helpful to you!
@MrMmahesh007
@MrMmahesh007 7 лет назад
amazing videos. Very streamlined and easy to understand.
@dataschool
@dataschool 7 лет назад
Thanks!
@zymx2007
@zymx2007 4 года назад
Hi Kevin, I'm new to both Python and machine learning. Your tutorials are great learning materials. I understanding this is a 5-year old presentation and I'm wondering if you would still answer a question I have related to this tutorial. Specifically, when I was trying to get the pairplots you demonstrated, I got the following error: KeyError: "['Sales'] not in index" and I got three blank boxes. What was wrong? Many Thanks for your help. FYI, I also tried to find answers by Googling online and haven't been able to find any answers that work.
@siming07
@siming07 8 лет назад
Thank you so much for the video, really great introduction to Pandas and SKlearn, I hope you can focus more on the sklearn with pandas dataframe, again, thanks for the great video!
@dataschool
@dataschool 8 лет назад
+Siming Zhao You're very welcome, and thanks for your comment!
@gosha5198
@gosha5198 7 лет назад
The Advertising.csv file does not exist on that url any more. Is there any way to download it and practice?
@crigar001
@crigar001 2 года назад
If I am part of patreon, can I ask questions about your videos?
@dataschool
@dataschool 2 года назад
Sure! I see comments more quickly if you post them within a course: courses.dataschool.io
@elilavi7514
@elilavi7514 9 лет назад
Thanks for good video ! Will be great if you can in a future video take any data set from some kaggle competition any try to work with , feature engineering is an interesting issue too. Two technical notes : - for people who works with proxy , to install seaborn with anaconda have to define http/https proxy first , so on anaconda prompt execute following command : "set http_proxy=X.X.X.X:port_number" - for Python 3 users zip command looks like : "list(zip(feature_cols,linreg.coef_))"
@dataschool
@dataschool 9 лет назад
Eli Lavi Sounds good... thanks for the notes!
@iberar
@iberar 3 года назад
Hello! Would you happen to have a video on how to create a logistic regression model using scikit learn LogisticRegression() to solve 2 or more independent variables to predict a dependent variable?
@musabosman2843
@musabosman2843 2 года назад
Nicely presented and delivered. Thank you!. I have subscribed to your channel!
@dataschool
@dataschool 2 года назад
Thank you!
@troywalters6106
@troywalters6106 9 лет назад
Great tutorial!! After watching this and looking at the sklearn docs, it seems as if the LinearRegression() object has only coef_ and intercept_ attributes. Does sklearn not provide metrics such as standard errors, t-statistics, p-values, and R-squared? If not, what is the reasoning behind it ? Thanks.
@dataschool
@dataschool 9 лет назад
Troy Walters Thanks for your comment! You can indeed compute R-squared using the r2_score function in the sklearn.metrics module. Regarding the others, I think the scikit-learn contributors would argue that those metrics belong in a statistics library, not a machine learning library. Here is a relevant discussion from the scikit-learn mailing list: www.mail-archive.com/scikit-learn-general%40lists.sourceforge.net/msg13102.html
@aracelyssunico8116
@aracelyssunico8116 7 лет назад
Super Helpful! Your explanation are clear and clean :) thanks
@dataschool
@dataschool 6 лет назад
You're very welcome!
@HossainRabin
@HossainRabin 6 лет назад
Fantastic tutorial series for PYTHON beginners ...Can you please start teaching us deep learning and neural network? I learn PANDAS, Numpy from your tutorial.. Thanks a lot man
@dataschool
@dataschool 6 лет назад
Thanks for your suggestion!
@itpro4470
@itpro4470 5 лет назад
sklearn.cross_validation is now sklearn.model_selection if you're getting an error on that line try changing the name of the module to model_selection
@dataschool
@dataschool 5 лет назад
Thanks for sharing! I've got a full article on updating your scikit-learn code here: www.dataschool.io/how-to-update-your-scikit-learn-code-for-2018/
@21121990jay
@21121990jay 7 лет назад
Very helpful video !!! thanks for sharing your knowledge. looking forward for more !!
@dataschool
@dataschool 7 лет назад
You're very welcome! Glad to hear it was helpful to you!
@libardomm.trasimaco
@libardomm.trasimaco 7 лет назад
I absolutely love what you do!. Thank you very very much!
@dataschool
@dataschool 6 лет назад
You are very very welcome!
@alialsaady5
@alialsaady5 6 лет назад
Thank you for your explanation, it's very clear. But what I don't understand is that you say the algorithm you are working with is called linear regression. But if you predict the dependent variable(Y) from multiple independent variables (x1, x2 etc.), then we are dealing with multiple linear regression right? Can you please explain why that is not the case?
@dataschool
@dataschool 6 лет назад
The general term "linear regression" applies whether there is one or more predictor.
@nackyding
@nackyding 7 лет назад
Thanks. Awesome tutorials. I'm learning a lot. Thank you again.
@dataschool
@dataschool 7 лет назад
You're very welcome!
@ankitbiradar8599
@ankitbiradar8599 9 лет назад
Could you teach how to program Neural Networks and SVM using sckit-learn ?
@dataschool
@dataschool 8 лет назад
+ankit biradar Thanks for the suggestion! I'll consider it for a future video.
@subratkumarsahoo4849
@subratkumarsahoo4849 6 лет назад
Guys if any one is getting error on this line : sns.pairplot(data,x_vars=['TV','radio','newspaper'],y_vars='sales' ) you need to mention the exact same column names in x_var and y_var attributes.
@dataschool
@dataschool 6 лет назад
Thanks for sharing!
@ulascanzorer
@ulascanzorer 6 лет назад
First of all I would like to thank you for these amazing videos :D My question is, do you know why this is an issue now and why you don't have a problem with it in your video?
@dataschool
@dataschool 6 лет назад
Unfortunately, they updated the file that is posted online and changed the column names!
@ulascanzorer
@ulascanzorer 6 лет назад
Thanks a lot for your answer, that was what I guessed. Keep up the great videos!
@dataschool
@dataschool 6 лет назад
Thanks!
@basilbeltran7712
@basilbeltran7712 8 лет назад
Your material is second only to "Introduction to Statistical Learning" so far for me. You know your subject "to the core" and recommend resources that I have already collected (so I will value your judgement :). Do you have a recommendation for a cheat sheet for hacking around in python notebook? I'm keeping all my notes there, but only just learned how to add an image when you showed the iris. I really don't have time to read up on python properly. Thanks!
@dataschool
@dataschool 8 лет назад
+V Kandinski What a nice compliment, thank you! Are you asking for a cheat sheet about the notebook itself, or the Python language as a whole? For the notebook, I have a brief list of keyboard shortcuts here, plus links to some good resources: github.com/justmarkham/scikit-learn-videos/blob/master/02_machine_learning_setup.ipynb For the Python language, this is kind of like a cheat sheet: www.dataschool.io/python-quick-reference/ Hope that helps!
@suemareverton7756
@suemareverton7756 8 лет назад
These videos helped me a lot! Thank you so much!!
@dataschool
@dataschool 8 лет назад
Great, I'm glad the series is helpful to you!
@hsin-yuku4086
@hsin-yuku4086 3 года назад
Thank you for the awesome videos, clear and to the point. However, I have a question regarding the retraining for the feature selection part (starting 30:31) : Won't it introduce data snooping bias when retraining to pick for different features?
@sungdeukpark9299
@sungdeukpark9299 9 лет назад
I think scikit-learn should be the focus of your lecture, too. And, I want to ask you to recommend a book for self-studying while your absence.
@dataschool
@dataschool 9 лет назад
SungDeuk Park For self-study, this book is excellent if you want to go deeper into machine learning: www-bcf.usc.edu/~gareth/ISL/ For getting better at Python (especially Pandas), this book is very good: shop.oreilly.com/product/0636920023784.do
@xjosef82
@xjosef82 4 года назад
If I start from hundred of features, is there a way to automatically test combinations?
@elivazquez7582
@elivazquez7582 6 лет назад
Great videos - all of them! Thanks for doing this.
@dataschool
@dataschool 6 лет назад
Thanks for your kind comment!
@ebenezerpopoola7860
@ebenezerpopoola7860 8 лет назад
Wow! this is very clear. You are the best.
@dataschool
@dataschool 8 лет назад
Thanks very much for your kind comment!
Далее
Exploratory Data Analysis with Pandas Python
40:22
Просмотров 485 тыс.
Solving real world data science tasks with Python Pandas!
1:26:07
Seaborn Is The Easier Matplotlib
22:39
Просмотров 174 тыс.
Build your first machine learning model in Python
30:57
What Can You Do With Python? | GeeksforGeeks
9:26
Просмотров 79 тыс.
How do I use the MultiIndex in pandas?
25:01
Просмотров 175 тыс.