How to Effectively Use the Data Science Lifecycle

Dave Ebbelaar


Published: 16 Oct 2024


Comments: 10

@vlahonator 1 year ago

The way you explain things is amazing, well done Ebbelaar!

@etornamtsyawo6407 1 year ago

Dave, this is really appreciated. Thanks!

@smart0758 1 year ago

This video is insanely helpful. I often feel lost when working on large projects, but now I can apply those steps. Thank you Dave!!

@daveebbelaar 1 year ago

Thanks! 🙌🏻 I made this video because of your answer to my question earlier this week. Glad I can help you out with your data science journey 🚀

@PeterPan-hs5tu 1 year ago

Damn ... this is more important than any bootcamp I took ... 😍 I'm gonna imprint this rule in my brain

@Xyrium 8 months ago

Very nicely done sir. As you've mentioned, this process becomes a loop, with drift analysis following the initial implementation. Do you cover drift in one of your presos? Thanks!

@christopherzanoli2029 1 year ago

Hi Dave, great video. Regarding data preparation, I believe the train/test split should come before missing-value imputation; otherwise information from the test set would leak into training. Do you agree?
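
Editor's note: the ordering raised in this comment can be sketched in a few lines. This is a minimal illustration in plain Python, not code from the video. The dataset, the fixed seed, the test size of 3, and the helper name `impute` are all made up for the example. The point it shows is that the fill value is computed from the training rows only and then reused for the test rows.

```python
import math
import random
from statistics import mean

# Made-up single-feature dataset; math.nan marks a missing value.
values = [1.0, 2.0, math.nan, 4.0, 100.0, math.nan, 7.0, 8.0, 9.0, 10.0]

# 1. Split first, so the held-out rows play no part in later statistics.
rng = random.Random(0)  # fixed seed, chosen arbitrarily for repeatability
indices = list(range(len(values)))
rng.shuffle(indices)
n_test = 3  # hypothetical test-set size
test_idx, train_idx = indices[:n_test], indices[n_test:]
train = [values[i] for i in train_idx]
test = [values[i] for i in test_idx]

# 2. Learn the fill value from the observed training rows only.
train_mean = mean(v for v in train if not math.isnan(v))


def impute(rows, fill):
    """Replace missing entries with a fill value computed elsewhere."""
    return [fill if math.isnan(v) else v for v in rows]


# 3. Apply the same training-derived value to both splits.
train_imputed = impute(train, train_mean)
test_imputed = impute(test, train_mean)
```

With a library such as scikit-learn, the same idea would typically be expressed by fitting an imputer (for example `SimpleImputer`) on the training split and only transforming the test split.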

@daveebbelaar 1 year ago

Hi Christopher, thanks for your comment. It depends on how you impute the data, but generally the train/test split is created later. It is also good practice to select the test set so that it contains no imputed values, or to drop any row with missing values, depending on how much of the data is missing. If you have enough data and only a small percentage of missing values, dropping those rows typically makes the most sense.
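
Editor's note: the "drop rows when little is missing" option from this reply could look roughly like the sketch below. The rows, the helper `has_missing`, and the 20% cut-off `MAX_DROP_FRACTION` are assumptions made for illustration; the reply does not give a specific threshold.

```python
import math

# Made-up rows of (feature_a, feature_b); math.nan marks a missing value.
rows = [
    (1.0, 10.0), (2.0, 11.0), (3.0, math.nan), (4.0, 13.0), (5.0, 14.0),
    (6.0, 15.0), (7.0, 16.0), (8.0, 17.0), (9.0, 18.0), (10.0, 19.0),
]


def has_missing(row):
    """True if any field in the row is missing."""
    return any(math.isnan(v) for v in row)


# Share of rows that contain at least one missing value.
missing_fraction = sum(has_missing(r) for r in rows) / len(rows)

# Hypothetical cut-off; what counts as "a small percentage" is a judgment call.
MAX_DROP_FRACTION = 0.2

if missing_fraction <= MAX_DROP_FRACTION:
    # Few rows affected: dropping them loses little data.
    cleaned = [r for r in rows if not has_missing(r)]
else:
    # Too much would be lost; keep the rows and consider imputation instead.
    cleaned = list(rows)
```

Because rows are dropped rather than filled, no statistic is computed across the dataset, so this step can run before or after the split without mixing training and test information.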