Very useful and clear, many thanks. I have a question. I usually come across assessments such as: "the correct order of steps for machine learning tasks is to first split the data into training and testing sets, and then perform any preprocessing steps, like creating dummy variables, on each set separately to avoid information leakage." But in the SVM classification and regression videos you created the dummies first and then split the data. Is there a specific reason for this choice? Or is it about using OneHotEncoder or OrdinalEncoder vs. get_dummies?
Great point Kadir. Yes, the best practice is to fit_transform(X_train) and then transform(X_test) to avoid data leakage (for any kind of data preprocessing). In this example, because there are no outliers or leverage points, I decided to do it all at once. I encourage you to try it your way as well and compare the results; a minimal sketch of the leak-free pattern is below.
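Here is a small sketch of that pattern, assuming scikit-learn's OneHotEncoder; the DataFrame and column names are made up for illustration, not the data from the videos:

```python
import pandas as pd
from sklearn.model_selection import train_test_split
from sklearn.preprocessing import OneHotEncoder

# Hypothetical data with one categorical column.
df = pd.DataFrame({
    "color": ["red", "blue", "green", "blue", "red", "green"],
    "target": [0, 1, 0, 1, 0, 1],
})

X_train, X_test, y_train, y_test = train_test_split(
    df[["color"]], df["target"], test_size=0.33, random_state=42
)

# handle_unknown="ignore" guards against categories that show up only in
# the test split, which pd.get_dummies applied after the split cannot do.
enc = OneHotEncoder(handle_unknown="ignore")
X_train_enc = enc.fit_transform(X_train)  # learn categories from train only
X_test_enc = enc.transform(X_test)        # reuse the same categories on test
```

This is one reason OneHotEncoder fits the split-first workflow more naturally than get_dummies: the fitted encoder remembers the training categories and applies them consistently to the test set.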
Hi, isn't it the case that when you convert your dates to categorical variables, you break the dependency in time? So this reduces your prediction power? We need to treat the data in a way that models seasonality and trend...
Good observation Amirali. Note that if there is a time-series variable in the data (like a date) and you want to add it to the model, you do NOT transform it into categories (as you pointed out, that would break the time dependency). However, when there is a variable like weekday, month, or some other time variable with a limited number of outcomes (12 months, 7 days, etc.), we do make them categories, and this does NOT break the seasonality effect; see the sketch below. Hope that helps! In general, for time-series prediction there are better models to use, and we rarely use SVM for time-series data.
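A small sketch of the distinction, assuming pandas and a hypothetical "date" column:

```python
import pandas as pd

df = pd.DataFrame({"date": pd.date_range("2023-01-01", periods=10, freq="D")})

# Bounded time variables (12 months, 7 weekdays) are safe to treat as
# categories; they capture seasonality without imposing a false ordering.
df["month"] = df["date"].dt.month.astype("category")
df["weekday"] = df["date"].dt.day_name().astype("category")

# The raw timestamp itself should NOT be binned into categories, since
# that would destroy the time ordering the model may rely on.
```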
So random search over C and gamma is both faster and more efficient? And that's with only three sampled values? If we increased that to 4 or 5, would that help find the best values, or would it potentially waste more time?
Good question Clayton. Usually we are able to make an educated guess by trying out the first couple of alternatives for C and gamma; if not, we keep guessing! There is no clear-cut answer for that. A small sketch of widening the search is below.
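For reference, here is a hedged sketch of what increasing the number of draws looks like with scikit-learn's RandomizedSearchCV; the dataset is a synthetic stand-in, not the one from the video:

```python
from scipy.stats import loguniform
from sklearn.datasets import make_classification
from sklearn.model_selection import RandomizedSearchCV
from sklearn.svm import SVC

X, y = make_classification(n_samples=200, random_state=0)

# Log-uniform distributions are a common choice for C and gamma because
# useful values span several orders of magnitude.
param_dist = {"C": loguniform(1e-2, 1e2), "gamma": loguniform(1e-4, 1e0)}

# Raising n_iter from 3 to, say, 10 samples more candidate pairs; each
# extra draw costs one more cross-validated fit, so it is a time/quality
# trade-off rather than a guaranteed improvement.
search = RandomizedSearchCV(SVC(), param_dist, n_iter=10, cv=5, random_state=0)
search.fit(X, y)
print(search.best_params_)
```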