Тёмный

Titanic Dataset Analysis (Classification) | Kaggle | Machine Learning | Python 

Hackers Realm
Подписаться 24 тыс.
Просмотров 24 тыс.
50% 1

⭐️ Content Description ⭐️
In this video, I have explained about titanic survival prediction dataset analysis using python. This is one of the basic kaggle project every beginner should try. I have done various techniques to explore the data and submitted the results to kaggle. This will be a end to end project for any competition.
Text-based Tutorial: www.hackersrealm.net/post/tit...
GitHub Code Repo: bit.ly/mlcoderepo
Dataset link: www.kaggle.com/c/titanic/data
🌐 Website: www.hackersrealm.net
🔔 Subscribe: bit.ly/hackersrealm
🗓️ 1:1 Consultation with Me: calendly.com/hackersrealm/con...
📷 Instagram: / aswintechguy
🔣 Linkedin: / aswintechguy
🎯 GitHub: github.com/aswintechguy
🎬 Share: • Titanic Dataset Analys...
⚡️ Data Structures & Algorithms tutorial playlist: bit.ly/dsatutorial
😎 Hackerrank problem solving solutions playlist: bit.ly/hackerrankplaylist
🤖 ML projects tutorial playlist: bit.ly/mlprojectsplaylist
🐍 Python tutorial playlist: bit.ly/python3playlist
💻 Machine learning concepts playlist: bit.ly/mlconcepts
✍🏼 NLP concepts playlist: bit.ly/nlpconcepts
🕸️ Web scraping tutorial playlist: bit.ly/webscrapingplaylist
Make a small donation to support the channel 🙏🙏🙏:-
🆙 UPI ID: hackersrealm@apl
💲 PayPal: paypal.me/hackersrealm
🕒 Timeline
00:00 Introduction to Titanic Dataset Analysis
05:47 Import Modules
07:00 Loading the Titanic Dataset
11:05 Exploratory Data Analysis
25:43 Data Preprocessing
37:27 Correlation Matrix of Titanic Data
43:15 Drop unnecessary columns
45:00 Label Encoding
47:06 Train Test Split
50:30 Model Training
01:01:46 Get Predictions from Test Data
01:04:51 Submit Predictions to Kaggle
01:11:51 End
#titanicdatasetanalysis #machinelearning #hackersrealm #classification #titanic #deeplearning #datascience #model #project #artificialintelligence #beginner #analysis #python #tutorial #aswin #ai #dataanalytics #data #bigdata #programming #datascientist #technology #coding #datavisualization #computerscience #pythonprogramming #analytics #tech #dataanalysis #iot #programmer #statistics #developer #ml #business #innovation #coder #dataanalyst

Опубликовано:

 

30 июл 2024

Поделиться:

Ссылка:

Скачать:

Готовим ссылку...

Добавить в:

Мой плейлист
Посмотреть позже
Комментарии : 44   
@syedhaideralizaidi1828
@syedhaideralizaidi1828 2 года назад
waiting !!!
@anamaposadar
@anamaposadar Год назад
I really liked your explanation, everything was so clear!
@HackersRealm
@HackersRealm Год назад
Glad you liked it!!!😄
@Johan_S4
@Johan_S4 Год назад
55:56 i'm wondering why you delete the cv=5 on cross_val_score ? is it no longer neccesary? as few second before that it still exist. btw great video, i'm learning much from it, and oddly enough, when i tried your logistic regression it work just fine.
@nidhik5270
@nidhik5270 Год назад
Thankyou so much..... It was so helpful ❤️
@HackersRealm
@HackersRealm Год назад
Happy to h3lp!!!
@SaniaShaikh9202
@SaniaShaikh9202 11 месяцев назад
sns.countplot(train['Sex']) i get an error saying that it cannot convert string to float: 'male' and also during the exploratory data analysis i am only getting output graph for one value as in Axes: ylabel='count' it does not show the xlabel pls help
@taibaarain2000
@taibaarain2000 10 месяцев назад
Same happened with me in survived graph did you find any solution?
@ST-nu6ib
@ST-nu6ib 2 года назад
How to fill NaN in age with respect to Survived column. Suppose if I want to fill age of a survived human with mean age of all survived humans, and fill age of a not survived human with age of not survived human, then how to do it?
@HackersRealm
@HackersRealm 2 года назад
you can use groupby function or pivot table function to get the mean of the category. you can check bigmart sales or black friday sales project videos for the methods
@jitendratrivedi7889
@jitendratrivedi7889 2 года назад
Can you please explain why you use lower case (model x , y ) for CV whereas for LR ( model , X, y).
@HackersRealm
@HackersRealm 2 года назад
Both are in same case only, Please check!!!
@alexnieto5036
@alexnieto5036 Год назад
very good work !
@HackersRealm
@HackersRealm Год назад
Glad you liked it!!!
@anilsailakhinana94
@anilsailakhinana94 2 года назад
waiting
@tooba5611
@tooba5611 2 года назад
My 'Age' data is float too but when I am trying to solve it it cant execute. The error shows- TypeError: unsupported operand type(s) for +: 'float' and 'method' Can anyone help?
@HackersRealm
@HackersRealm 2 года назад
what method you are trying to use on age?
@jerryvictor3583
@jerryvictor3583 11 месяцев назад
Thank you for this. Pls I have a little challenge, when I get to LogisticRegression, I get error, it says ' X is not defined'.... Pls how do I solve this?
@HackersRealm
@HackersRealm 11 месяцев назад
try to run the cells in order to avoid any errors!!!
@abhishekprakash9803
@abhishekprakash9803 2 года назад
ks stats kab perfrom krte h
@digitalshaji2636
@digitalshaji2636 9 месяцев назад
when I am using a sns for survival visual graph it returns with and a graph filled with only blue color
@quotez4943
@quotez4943 4 месяца назад
Same Issus Did you understand what is the probles
@patrick.bateman.psy1
@patrick.bateman.psy1 2 месяца назад
Yes same issue
@Akashvardhan._
@Akashvardhan._ Месяц назад
Actually the problem is that the method he did is kind of old where it automatically detects 'X' axis to be 'Survived' column but when we try to do it now it won't work instead in the function call itself we should mention as sns.countplot(x=train['Survived']) ...('We are now manually mentioning that the X axis should be of the specified column')
@tejaswinimunumudi8440
@tejaswinimunumudi8440 2 года назад
Sir I am getting values of passengers in 800 's sir what can I do now
@HackersRealm
@HackersRealm 2 года назад
I didn't get you, value of passengers means?
@daniecookie505
@daniecookie505 Год назад
im getting an error i dont understand why TypeError: Singleton array array(891) cannot be considered a valid collection.
@HackersRealm
@HackersRealm Год назад
Are you using the same code?
@adithyabm9182
@adithyabm9182 11 месяцев назад
why are you dropping the survived column in the last
@HackersRealm
@HackersRealm 11 месяцев назад
before the model training part?
@shubhankarsharma4094
@shubhankarsharma4094 6 месяцев назад
Fare null should be replaced with median
@Sumer-zt5cu
@Sumer-zt5cu Год назад
Input Nan Contains NaN ... How to deal with it brother plzz explain
@HackersRealm
@HackersRealm Год назад
drop the null values or replace the null values with some stats or negatives
@HackersRealm
@HackersRealm Год назад
you can also check this video ru-vid.com/video/%D0%B2%D0%B8%D0%B4%D0%B5%D0%BE-FEQpdgoH_pM.html
@patrick.bateman.psy1
@patrick.bateman.psy1 2 месяца назад
When I run the sns.countplot(train['Survived']) then only the blue graph is showing with value of 800.... What's the solution of this or this is the bug from the dataset itself..
@HackersRealm
@HackersRealm 2 месяца назад
Could you double check the column values whether it has 2 classes
@patrick.bateman.psy1
@patrick.bateman.psy1 2 месяца назад
No it has only 1 class
@HackersRealm
@HackersRealm 2 месяца назад
@@patrick.bateman.psy1 it should be 2 classes, survived and not survived, I think some issue in the dataset
@patrick.bateman.psy1
@patrick.bateman.psy1 2 месяца назад
@@HackersRealm Yes it's the issue with the dataset, Is anyway to fix it)
@HackersRealm
@HackersRealm 2 месяца назад
@@patrick.bateman.psy1 try the dataset in my github or use the kaggle link to download it. You could also try from other websites as well
Далее
Exploratory Data Analysis with Pandas Python
40:22
Просмотров 448 тыс.
Stock Price Prediction Using Python & Machine Learning
49:48