Тёмный

Full Python Tutorial: Customer Lifetime Value & RFM Analysis using Machine Learning 

Business Science
Подписаться 25 тыс.
Просмотров 38 тыс.
50% 1

Опубликовано:

 

30 сен 2024

Поделиться:

Ссылка:

Скачать:

Готовим ссылку...

Добавить в:

Мой плейлист
Посмотреть позже
Комментарии : 59   
@puppyfindchloe
@puppyfindchloe 27 дней назад
43:53 , for the monetary value, can someone explain why we use price alone, instead of revenue (price * quantity) for each customer? Am i missing something
@LostMakaveli
@LostMakaveli 6 дней назад
the video is good, but it really lack proper cross validation and over-fitting handling
@cevikyi
@cevikyi 3 года назад
Hi, I think I’m a bit confused about data splitting and CV process. When you build the model, you’re using “unseen” data as input, and try to predict again on seen and unseen data. From the business perspective and algorithmic way, isn’t it supposed be tested on the only targets df?
@kcenannn
@kcenannn 2 года назад
You are absolutely true, what you should do is while you are estimating all features, discard last 3m. Set last 3month as test timeline. After having train test split check your results. (test: last 3 month, train: except 3 month data) Lecture is perfect, thank you!
@nicolewallace6242
@nicolewallace6242 16 дней назад
Smith Elizabeth Miller Ronald Johnson Laura
@nicolascortinas6936
@nicolascortinas6936 Год назад
Hi! I think there is an issue with the logic. Correct me if Im wrong. You are splitting the data in most recent and oldest, and then you are training a model using the same old and new data. Your y in the model is what you are trying to predict later (90 most recent days). I dont see the point of creating the model here.
@Scootenfruity
@Scootenfruity Год назад
I love R. But at my new Job i "have to" work with python and I'm quite happy that your courses are using python as well. Really nicely done.
@BusinessScience
@BusinessScience Год назад
You’re welcome!
@BettyMartin-m5f
@BettyMartin-m5f 24 дня назад
Harris Sarah Martinez Thomas Young Melissa
@KathleenJacksonyu
@KathleenJacksonyu 25 дней назад
White Dorothy Martin Sandra Thomas Daniel
@SysknShall
@SysknShall 29 дней назад
Young Laura Allen Nancy Brown Amy
@brothermalcolm
@brothermalcolm 2 года назад
Ad a data scientist whose been working on customer analytics at startups - I feel like I've just discovered a gold mine
@BusinessScience
@BusinessScience 2 года назад
You have!
@chenjxing
@chenjxing 3 года назад
May I know if there is tutorial videos for CLV modeling using R?
@BusinessScience
@BusinessScience 3 года назад
Hi Chen, we have one in Learning Labs PRO membership. It's Learning Lab 58. Upon enrollment you'll gain access to all of our labs: university.business-science.io/p/learning-labs-pro
@travelsizearchitect
@travelsizearchitect 9 месяцев назад
Hi Matt, I'm planning to add this to my portfolio. Is it possible to follow along with jupyter notebook + pip? or having the same setup as you is crucially important?
@BusinessScience
@BusinessScience 9 месяцев назад
You can but support is only provided to Learning Labs Pro members. You can join LL PRO here: university.business-science.io/p/learning-labs-pro?el=youtube
@gauravmodi12
@gauravmodi12 2 года назад
can you add shapley model for interpretability ?
@djaadiabdellah9081
@djaadiabdellah9081 7 месяцев назад
Do you have any good sources to understand shapley values? Thanks
@gauravmodi12
@gauravmodi12 7 месяцев назад
@@djaadiabdellah9081 now SHAP package is available and it has various plots and graphs to understand each variable importance
@senolkurt7864
@senolkurt7864 2 года назад
Thanks for the tutorial but I think there is a problem with the logic. You trained the model using seen target values. Then you tried to predict that target values. You should have predicted the following 90 day period.
@RidingWithGerdas
@RidingWithGerdas 2 года назад
54:00 so what you did was you predicted on the data we already knew. What about the true future, fitting model on all known data and predicting next unkown 90 days ? BG/NBD and gamma-gamma models can do that.
@keiran110
@keiran110 Год назад
You can still do that. Any new customers, just apply the model to their data and predict their next 90 days.
@andresgogol5725
@andresgogol5725 14 дней назад
Exactly, he made a very basic mistake in training and testing.
@krgoutham8852
@krgoutham8852 2 года назад
Can we download the codes and input data if we take PRO ?
@BusinessScience
@BusinessScience 2 года назад
Yes, you get code and videos for all learning labs when you join.
@sohambasu660
@sohambasu660 10 месяцев назад
Is the github repo available for this project ?
@PratapO7O1
@PratapO7O1 2 года назад
53:16 actually deviation by $10 is pretty bad. for 25%tile it's off by 50% for 50%tile it is off by 25% for 75%tile it is off by 10%
@dsdjiitian
@dsdjiitian 2 года назад
thank you for such a nice usecase of CLV....
@BusinessScience
@BusinessScience 2 года назад
You are welcome
@НикитаБуров-ъ6р
@НикитаБуров-ъ6р 2 года назад
Nice tutorial, thanks
@BusinessScience
@BusinessScience 2 года назад
You got it!
@desarrolloroghur7372
@desarrolloroghur7372 2 года назад
What is quantity? is the numbers of invoice at the same date? or is it the quantity of products of the same purchase?
@BusinessScience
@BusinessScience 2 года назад
Quantity of products
@spicytuna08
@spicytuna08 Год назад
in your library, how many tutorials are in python? thanks
@BusinessScience
@BusinessScience Год назад
15 or so.
@oleksiiastakhov4192
@oleksiiastakhov4192 2 года назад
So... do you miss dplyr Matt? =) Other than that amazing work :D
@BusinessScience
@BusinessScience 2 года назад
I do. Some things in pandas take way too long. But everything is still possible. The big problem is I feel bad for you because my at labs are 2-3X more efficient so you get to results faster. In python the code always takes more. Like 300 lines vs 200 lines in R.
@mamadoucamara8989
@mamadoucamara8989 2 года назад
Thank you ! How can I predict LTV for new customers using this technique?
@keiran110
@keiran110 Год назад
Two ways you can use this model. 1. Retrospectively run this model on your database of existing customers daily. You will have an updated list of what your customers were expected to spend vs what they actually spent. 2. For new customers, allow them some time to generate data. Then use the model to predict their predicted 90 day spend in the future. In this case, you won't know their actual 90 day spend as it hasn't happened yet. But you can append the predictions back to your customer ID's and action the customers with the lowest predicted spend. I prefer method 1 as I want to action my customers now based on the reality of the situation. I know exactly what they spent vs what the model thought they would. This allows me to categorise them into 'high risk churners' or 'exceeded expectations' etc.
@mishralucky
@mishralucky 3 года назад
Thank you very much. this is very useful. I was wondering if we can use XAI to explain the xgboost model?
@BusinessScience
@BusinessScience 3 года назад
Yes, tools like LIME and SHAP can be used to explain the xgboost model.
@mishralucky
@mishralucky 3 года назад
@@BusinessScience Thanks, why you have not split the data into train and test?
@BusinessScience
@BusinessScience 3 года назад
@@mishralucky we implement 5-fold cross validation which does 5 repetitions of train test splitting.
@mishralucky
@mishralucky 3 года назад
@@BusinessScience ohh got it thank you
@mhh5002
@mhh5002 2 года назад
for this problem, don't we need to see whether the xgboost model is performing well by having training and testing (unseen data which does not influence the training of the model) sets?
@BusinessScience
@BusinessScience 2 года назад
I believe the model is cross validated. If not, then I didn’t have time to show more code.
@sirgorr2194
@sirgorr2194 3 года назад
Do you get access to the code in the labs when you become a Labs pro member?
@BusinessScience
@BusinessScience 3 года назад
Yes - you get access to the full code and videos.
@BrokenRecord-i7q
@BrokenRecord-i7q Год назад
can you share the github link please?
@BusinessScience
@BusinessScience Год назад
This is a learning lab and you’ll need to join the membership program.
@BrokenRecord-i7q
@BrokenRecord-i7q Год назад
@@BusinessScience oh thanks, it's a great approach 👍
@abdulahmed5610
@abdulahmed5610 2 года назад
you have predicted on X_train itself... Why not on X_test ????
@BusinessScience
@BusinessScience 2 года назад
Ahhhhhhhhh! It’s because we use a special technique to model the likelihood of purchase in next 90 days. We create multiple train/test sets in the lesson. The second train set is holdout.
@BusinessScience
@BusinessScience 2 года назад
Sorry I reviewed the lesson again. We perform 5 fold cross validation. The cv = 5. This gives us metrics to evaluate. We did not do parameter tuning. Not enough time and that’s what I teach in the courses. But predicting on test data is perfectly fine to assess their probability of purchase and their estimated number of days to purchase.
@Musaibaziz
@Musaibaziz 2 года назад
Amazing explanation. Definitely gonna subscribe pro service. What is your opinion about lifetimes library? How reliable is the CLV calculation made by lifetimes lib using gamma-gamma and BG/NBD model?
@BusinessScience
@BusinessScience 2 года назад
Lifetimes library uses traditional models. I don’t get good results with them for my projects at business science. Machine learning has been extremely beneficial for email subscriber modeling and customer targeting/segmentation. I use H2O which is available in both R & Python. It’s amazing.
@darnelb1912
@darnelb1912 3 года назад
Hi! Thanks for these amazing videos! I just started in Marketing Analysis and these labs help me a lot to understand the calculations and possibilities! One question: why in the minute 31:23 you sum the 'price' column? Should not be price * quantity? Btw, the labs pro include all the labs?
@HazemAzim
@HazemAzim 2 года назад
same Question why is Quantity dropped ?!!
Далее
КВН 2024 Встреча выпускников
2:00:41
ML Was Hard Until I Learned These 5 Secrets!
13:11
Просмотров 309 тыс.
Tableau in R for $0 (Introducing GWalkR)
11:20
Просмотров 5 тыс.