Тёмный

Beginner Data Science Portfolio Project Walkthrough (Kaggle Titanic) 

Ryan Nolan Data
Подписаться 11 тыс.
Просмотров 14 тыс.
50% 1

Welcome to my data science journey through the Kaggle Titanic - Machine Learning from Disaster Project!
In this video, we'll dive deep into the world of data analysis, feature engineering, and machine learning to predict passenger survival rates on the Titanic.
As Kaggle states: "The competition is simple: use machine learning to create a model that predicts which passengers survived the Titanic shipwreck."
Kaggle Code: www.kaggle.com/code/ryannolan...
Titanic Kaggle Part 2: • Beginner Data Science ...
Interested in discussing a Data or AI project? Feel free to reach out via email or simply complete the contact form on my website.
📧 Email: ryannolandata@gmail.com
🌐 Website & Blog: ryannolandata.com/
🍿 WATCH NEXT
Scikit-Learn and Machine Learning Playlist: • Scikit-Learn Tutorials...
Kaggle House Pricing Project: • Data Science Beginner ...
Voting Classifier: • Mastering Voting Class...
Column Transformer: • Simplify Data Preproce...
MY OTHER SOCIALS:
👨‍💻 LinkedIn: / ryan-p-nolan
🐦 Twitter: / ryannolan_
⚙️ GitHub: github.com/RyanNolanData
🖥️ Discord: / discord
📚 *Practice SQL & Python Interview Questions: stratascratch.com/?via=ryan
WHO AM I?
As a full-time data analyst/scientist at a fintech company specializing in combating fraud within underwriting and risk, I've transitioned from my background in Electrical Engineering to pursue my true passion: data. In this dynamic field, I've discovered a profound interest in leveraging data analytics to address complex challenges in the financial sector.
This RU-vid channel serves as both a platform for sharing knowledge and a personal journey of continuous learning. With a commitment to growth, I aim to expand my skill set by publishing 2 to 3 new videos each week, delving into various aspects of data analytics/science and Artificial Intelligence. Join me on this exciting journey as we explore the endless possibilities of data together.
*This is an affiliate program. I may receive a small portion of the final sale at no extra cost to you.

Наука

Опубликовано:

 

1 авг 2024

Поделиться:

Ссылка:

Скачать:

Готовим ссылку...

Добавить в:

Мой плейлист
Посмотреть позже
Комментарии : 70   
@user-nu9ek4pb8k
@user-nu9ek4pb8k 19 дней назад
this needs more views. was so in depth and perfect for a beginner!
@RyanNolanData
@RyanNolanData 10 месяцев назад
Hope you enjoyed this video, it took so long to produce. If you enjoyed it, please subscribe to the channel. I just uploaded the 2nd part of this video where I improve the model (linked down below) Below are a few links that you should check out: Part 2: ru-vid.com/video/%D0%B2%D0%B8%D0%B4%D0%B5%D0%BE-KzK1pifa2Vk.html&ab_channel=RyanNolanData Kaggle Code: www.kaggle.com/code/ryannolan1/titanic-wip-9-12 Twitter: twitter.com/RyanNolanData LinkedIn: www.linkedin.com/in/ryan-p-nolan/ SciKit-Learn Tutorials: ru-vid.com/video/%D0%B2%D0%B8%D0%B4%D0%B5%D0%BE-SjOfbbfI2qY.html&ab_channel=RyanNolanData Practice SQL & Python Interview Questions: stratascratch.com/?via=ryan
@yakosti
@yakosti 9 месяцев назад
Thank you for this, your videos are so helpful. Keep it up!
@RyanNolanData
@RyanNolanData 9 месяцев назад
Np new one tomorrow. New project this week
@ritamchatterjee8785
@ritamchatterjee8785 9 месяцев назад
yes man, much appriciated for your efforts
@RyanNolanData
@RyanNolanData 9 месяцев назад
Thanks for working on this project! My housing one will be out in November
@robertbenson8554
@robertbenson8554 Месяц назад
Excellent video. So much in it, thought process, code tips etc.
@RyanNolanData
@RyanNolanData Месяц назад
Thank you
@AmaRan31
@AmaRan31 5 месяцев назад
It was a super useful video and I am happy to have done my first Data Science project. Thank you very much.
@RyanNolanData
@RyanNolanData 5 месяцев назад
Congrats on completing your first project
@collingreens7
@collingreens7 26 дней назад
I loved the walkthrough, honestly the last about 35 mins I had no idea what was going on but it's really cool that people like you are giving free tutorials on such complex work. Thanks!
@RyanNolanData
@RyanNolanData 26 дней назад
No problem. Everything I go over in this vid is covered in my ML and Python playlists. Check them out!
@elfincredible9002
@elfincredible9002 3 месяца назад
Thanks... I really enjoyed and you explain so well.... Bless you.
@RyanNolanData
@RyanNolanData 3 месяца назад
Thank you
@alihajizadeh7749
@alihajizadeh7749 7 месяцев назад
It really helped a lot, thank you, keep going
@RyanNolanData
@RyanNolanData 7 месяцев назад
Thank you
@samallen598
@samallen598 4 месяца назад
Very cool video! Would love to see some of this type of content.
@RyanNolanData
@RyanNolanData 4 месяца назад
Have a lot of other vids to check out!
@onurbltc
@onurbltc 10 месяцев назад
Amazing content!
@RyanNolanData
@RyanNolanData 10 месяцев назад
Thanks man, you helped a ton with this vid
@jacksun8129
@jacksun8129 Месяц назад
Hi Ryan, I am new to data science. I am a bit lost on what the point of analyzing the ticket number and passenger name. What is the goal of doing that? Same with qcuts, are we doing them to help with a decision tree model? Do we need to do any of this if we just build a regression model?
@japyh4
@japyh4 9 месяцев назад
Thank you so much. Keep it up:)
@RyanNolanData
@RyanNolanData 9 месяцев назад
No problem
@umarmusisi8853
@umarmusisi8853 2 месяца назад
Awesomely awesome...i had to sub
@RyanNolanData
@RyanNolanData 2 месяца назад
Np
@AbelGriffen
@AbelGriffen 4 месяца назад
Hi @Ryan Thanks for making this amazing video. I just want to understand why did use "Plus one for yourself" @25:05? Thank you!
@codingcambodia
@codingcambodia 10 месяцев назад
keep up your good work
@RyanNolanData
@RyanNolanData 10 месяцев назад
Thank you! Working on more videos this weekend!
@Al-Ahdal
@Al-Ahdal 2 месяца назад
@Ryan Nolan Data: Excellent vdo.
@RyanNolanData
@RyanNolanData 2 месяца назад
Much appreciated
@sildistruttore
@sildistruttore 3 месяца назад
What's the point in splitting the dataset into train and validation if then at the end you are using only the training to do the grid search with cross validation? doesn't the grid search directly create the validation set on the training set you give it?
@ayushijainrkt
@ayushijainrkt 2 месяца назад
1:17:00 i don't understand the usage of .transform('count'). Can someone explain with an example?
@John-xi2im
@John-xi2im 3 месяца назад
Extremely interesting tutorial, learnt a lot of new functions in pandas and different ways to analyze the date. Thanks Ryan !!😀
@RyanNolanData
@RyanNolanData 3 месяца назад
No problem! Make sure to check out part 2
@John-xi2im
@John-xi2im 3 месяца назад
@@RyanNolanData Sure! I hope following these tutorials will land me a good job as data scientist, as the content is very informative and you inspire me tremendously. 🌄
@tosinwilliams9343
@tosinwilliams9343 5 месяцев назад
This was really helpful 🥳🥳🥳🥳
@RyanNolanData
@RyanNolanData 5 месяцев назад
Thanks
@BreadForBrain100
@BreadForBrain100 8 месяцев назад
at least you explain in detail what you are typing for after copy your line of code. Nice video btw
@RyanNolanData
@RyanNolanData 8 месяцев назад
Thank you!
@tosinwilliams9343
@tosinwilliams9343 5 месяцев назад
Just a suggestion your next video should be on using chatgbt for this project
@aviluminos8759
@aviluminos8759 3 месяца назад
I got 78% result using forest. Thanks for the brilliant explanation!
@RyanNolanData
@RyanNolanData 3 месяца назад
No problem, awesome job
@idontevenwanttomakea
@idontevenwanttomakea 8 месяцев назад
Hi, great video. One idea - instead of writing out so many loc statements, it might be easier to just use labels=False when using qcut.
@RyanNolanData
@RyanNolanData 7 месяцев назад
Thank you! And I’ll look into it for the next time I use it
@katorechaitanya
@katorechaitanya 10 месяцев назад
you explained it in fantastic way just one request will you please provide the valid link for notebook actually its not working
@RyanNolanData
@RyanNolanData 10 месяцев назад
Hey, just checked the link it's working? www.kaggle.com/code/ryannolan1/titanic-wip-9-12
@ixcel87
@ixcel87 8 месяцев назад
great tutorial, can't wait to check out part 2! question on correlation map; how did you use it to narrow down your parameters/features?
@RyanNolanData
@RyanNolanData 8 месяцев назад
Part 2 is out! And I did this project a long time ago will try to take a look at the code and see the reasoning
@ixcel87
@ixcel87 8 месяцев назад
@@RyanNolanData Thanks again! I will view part 2 today. Also, definitely let me know about the correlation map and how it was used!
@mn4769
@mn4769 2 месяца назад
Why on 40:57 my output of age is 0 every row
@BreadForBrain100
@BreadForBrain100 8 месяцев назад
nice video tutorial chief! What kind of extension do you use ?
@RyanNolanData
@RyanNolanData 8 месяцев назад
Wdym extension?
@BreadForBrain100
@BreadForBrain100 8 месяцев назад
for example as you are typing, pop up auto correct or something like that@@RyanNolanData
@user-dz7ut7zq4u
@user-dz7ut7zq4u 6 месяцев назад
hello! thank you for your video, i am trying to follow you and repeat all the steps. i have found a better way to assign labels for age groups : df['Age_Lebel'] = pd.qcut( df['Age'], 8, labels = np.arange(8) + 1 ) hope it can be helpful!
@RyanNolanData
@RyanNolanData 6 месяцев назад
Awesome! I may try to revisit this in the future.
@alanjohnstone8766
@alanjohnstone8766 3 месяца назад
The length of the name is dominated by married ladies who have their married name AND their maiden names in brackets. Here is the top few: 'Penasco y Castellana, Mrs. Victor de Satode (Maria Josefa Perez de Soto y Vallejo)', 'Phillips, Miss. Kate Florence ("Mrs Kate Louise Phillips Marshall")', 'Duff Gordon, Lady. (Lucille Christiana Sutherland) ("Mrs Morgan")', 'Brown, Mrs. Thomas William Solomon (Elizabeth Catherine Ford)', 'Andersson, Mrs. Anders Johan (Alfrida Konstantia Brogren)' So I think when noting that survival is related to name length you are actually picking up that name length is a predictor of being female who of course have a higher chance of surviving. Analysing this dataset is addictive - I must give it up!
@RyanNolanData
@RyanNolanData 3 месяца назад
theres so much to take a look at
@SYCS13YashGadhave
@SYCS13YashGadhave 25 дней назад
can we add this project to our resume ?
@RyanNolanData
@RyanNolanData 25 дней назад
Sure
@saadAhmed-co1si
@saadAhmed-co1si 3 месяца назад
can you make more project every month? i want learn about project more.
@RyanNolanData
@RyanNolanData 3 месяца назад
Eventually. I’m so busy though atm going through a backlog of videos to create
@alanjohnstone8766
@alanjohnstone8766 4 месяца назад
I do not think you meant to make the young French girls ‘noble’ which you did. I am just starting to learn pandas with your help but some of the complicated string editing you did would have been so much simpler and more understandable if done in an old fashioned ‘for loop’. I know it is frowned upon by ‘experts’ but the whole point of Python is that the code is readable.
@RyanNolanData
@RyanNolanData 4 месяца назад
Probably a small mistake. Curious if it did better with not marking them as noble
@J6rms
@J6rms 6 месяцев назад
Am I the only one that can't see anything he types or clicks in the beginning of "Starting the Project" (from 9:00 for approx a minute)? :/
@RyanNolanData
@RyanNolanData 6 месяцев назад
Hey there was a small editing error but all the code is in the description through the Kaggle link
@tamtam8420
@tamtam8420 23 дня назад
on 48:37 there is a shorter way of extracting Title: train_df['Title'] = train_df['Name'].str.extract(' ([A-Za-z]+)\.', expand=False)
@pasindugimhan5779
@pasindugimhan5779 8 месяцев назад
Great work brother! I have subscribed you and waiting for next Kaggle projects also
@RyanNolanData
@RyanNolanData 8 месяцев назад
Thanks and I just uploaded one this week!
@pasindugimhan5779
@pasindugimhan5779 8 месяцев назад
@@RyanNolanData Also at the final steps, I've faced to some errors. So, is there any way to contact you please..
Далее
How to do the Titanic Kaggle Competition
18:28
Просмотров 73 тыс.
ML Was Hard Until I Learned These 5 Secrets!
13:11
Просмотров 254 тыс.
iPhone socket cleaning #Fixit
0:30
Просмотров 17 млн
📱магазин техники в 2014 vs 2024
0:41