Тёмный

Predict House Prices With Machine Learning And Python [Full Tutorial] 

Dataquest
Подписаться 59 тыс.
Просмотров 32 тыс.
50% 1

We'll predict future house prices by training a machine learning model to predict if prices will rise or fall. We'll write all of our code in Python using JupyterLab.
We'll use data from the US Federal Reserve, along with house price data from Zillow. We'll merge and combine this data, then use it to train a random forest model. We'll measure error using backtesting, then improve our model with new predictors.
This project can be customized to predict house prices in your metro area if you live in the US.
You can find the full code and README here - github.com/dataquestio/projec... .
You can download the data from the previous link, or from this link if it doesn't work - drive.google.com/uc?export=do... .
Chapters
00:00 Introduction
02:23 Loading federal reserve data
07:39 Loading Zillow house price data
14:10 Preparing data for machine learning
18:25 Setting up our machine learning target
25:08 Creating a machine learning model
28:11 Creating a backtesting engine
31:58 Measuring error
32:47 Improving our accuracy
35:44 Running diagnostics on our model
40:22 Next steps with the model
---------------------------------
Join 1M+ Dataquest learners today!
Master data skills and change your life.
Sign up for free: bit.ly/3O8MDef

Опубликовано:

 

10 июл 2024

Поделиться:

Ссылка:

Скачать:

Готовим ссылку...

Добавить в:

Мой плейлист
Посмотреть позже
Комментарии : 36   
@AlphaChinou
@AlphaChinou Год назад
Vik, these tutorials are amazing. I'm a Dataquest member and absolutely love the platform and learning with your team. Incredible stuff! Thanks.
@tianbowen721
@tianbowen721 11 месяцев назад
You speak so concise and clear !! So well organized ! Even better than our professor at the university!
@AntonioGondim-uf5eh
@AntonioGondim-uf5eh Год назад
Vik this is amazing, man. I really appreciate you having this free material. high quality stuff
@user-qj5qp3qz4w
@user-qj5qp3qz4w Год назад
This was awesome. Thank you for being so clear and thorough.
@Quantumvp
@Quantumvp Год назад
Thanks Vikas you always give us a real user friendly experience
@businessandmanagmentlesson8592
Thanks, we need more similar videos
@dpratte
@dpratte Год назад
Very fine job, Sir! Thank you.
@FauziNomad
@FauziNomad Год назад
Very valuable channel. Just love it! Subscribed..
@laczkos
@laczkos Год назад
Thanks a lot for this very helpful video!!
@maulanaajiw6184
@maulanaajiw6184 Год назад
Bravo Dataquest 👏 I hope in another video, you will teach how to calculate each algorithm manually.
@Onlinefreelancingacademy
@Onlinefreelancingacademy Год назад
Thanks very much especially for the data
@TheNenelly
@TheNenelly Год назад
Thank you! This was great! Would love to see on the same topic using LSTM :D
@idkidkidk3488
@idkidkidk3488 Год назад
Thanks! This video was very usefull!
@lantispillay5363
@lantispillay5363 Год назад
Hi Vik, Incredible stuff! Thanks. would you consider doing a video on predicting sales forecasts of different products
@tt4m
@tt4m Год назад
شكرا ❤️
@TheTurmoiljack
@TheTurmoiljack Год назад
Very useful video, well explained and really easy to follow along the entire thing for someone like myself that is still a beginner to python for data science, and was fun to follow along the machine learning even though the majority of it went over my head for the time being! One question i have is what is the reason that the 3 federal reserve data sets could be combined using .concat and then .ffill, however the 2 zillow files require the loop to_datetime, creating a new month column and then merging based on this column? is this simply because of the fact the data from the original csv was not in the correct format initially?
@agrawal1207
@agrawal1207 Год назад
Where is the predicted data?
@IbrahimOld
@IbrahimOld Год назад
Thank you very much But I have a question What is the method you did use of this project? ANN or RNN?
@realestatemarketreports-me8668
Very informative. Thanks for sharing. (I am sure this can be done using JS, too.)
@Dataquestio
@Dataquestio Год назад
Yes, you can do this in JS, but it would be harder. JS doesn't have the same data libraries (pandas, scikit-learn, etc) that Python does.
@prasanthi7173
@prasanthi7173 Год назад
Just asking how to deploy this model?? I mean to make a website for prediction
@sumitbarua9121
@sumitbarua9121 Год назад
what is prerequiste before doing project?
@mquannz5573
@mquannz5573 Год назад
How to predict future values for rows that have NaN values at 22:20 after building the model sir :( I don't know how to do the predictions phase after I build my model
@mquannz5573
@mquannz5573 Год назад
please anyone can help me with this one :'(
@jakhongirs
@jakhongirs Год назад
What other machine learning algorithms can we use with this data?
@Dataquestio
@Dataquestio Год назад
Pretty much any regression algorithm - SVM, random forests, xgboost, etc.
@rajeshmanjrekar3614
@rajeshmanjrekar3614 Год назад
I am getting an error, at the program step : price_data.index = dfs[0].index ........and the error in shows "ValueError: Length mismatch: Expected axis has 748 elements, new values have 754 elements" kindly help
@Dataquestio
@Dataquestio Год назад
It looks like price_data has a different number of rows from dfs[0]. This would happen if the data wasn't loaded/cleaned properly.
@rajeshmanjrekar3614
@rajeshmanjrekar3614 Год назад
@@Dataquestio your video also the exact number of records that i have ....kindly request you to please check, thanks a lot for replying
@rajeshmanjrekar3614
@rajeshmanjrekar3614 Год назад
@@Dataquestio my dfs[0] has 754 rows, and my dfs[1] has 319 rows exactly the way shown in your video, thanks again for your reply. regards, rajesh manjrekar
@jamesleleji6984
@jamesleleji6984 Год назад
Can this be done using R? Thanks
@Dataquestio
@Dataquestio Год назад
Hi James - you can definitely do this using R. R has packages that work similarly to pandas and scikit-learn.
@varuncharan9109
@varuncharan9109 4 месяца назад
This one is very complicated project
@aakashgohil859
@aakashgohil859 10 месяцев назад
Hi Vik, thank you for sharing the video it helped a lot. also would you mid sharing your email I have some questions to ask ?
@rajeshmanjrekar3614
@rajeshmanjrekar3614 Год назад
i am also getting a warning at the following step : for df in dfs: df.index = pd.to_datetime(df.index) df["month"] = df.index.to_period("M") the warning is as follows: C:\Users\HP\AppData\Local\Temp\ipykernel_12456\3620532488.py:2: UserWarning: Parsing '16-02-2008' in DD/MM/YYYY format. Provide format or specify infer_datetime_format=True for consistent parsing.
@Dataquestio
@Dataquestio Год назад
This warning is fine, this is related to how dates are written in the US vs some other countries.
Далее
УНИТАЗ В ЛЕСУ?? #shorts
00:24
Просмотров 803 тыс.
когда мучает жажда // EVA mash
00:58
Просмотров 1,4 млн
Alisha Lehmann joins Juventus Women 🤍🖤
00:16
Просмотров 3,3 млн
ML Was Hard Until I Learned These 5 Secrets!
13:11
Просмотров 233 тыс.
Predict NBA Games With Python And Machine Learning
58:33
УНИТАЗ В ЛЕСУ?? #shorts
00:24
Просмотров 803 тыс.