Тёмный
No video :(

#5: Scikit-learn 3: Preprocessing 3: Scaling a sparse matrix, CSR, CSC format 

learndataa
Подписаться 3,8 тыс.
Просмотров 5 тыс.
50% 1

Опубликовано:

 

22 авг 2024

Поделиться:

Ссылка:

Скачать:

Готовим ссылку...

Добавить в:

Мой плейлист
Посмотреть позже
Комментарии : 8   
@johnschultz4079
@johnschultz4079 11 месяцев назад
The previously defined ‘scaler’ (for MaxAbsScaler()) was used for standard scaler transformation. The scaler for StandardScaler() was defined as ‘scalar’ Note the typo
@learndataa
@learndataa 11 месяцев назад
Thank you. Updated video description for 11:53 timepoint.
@sinatorkzadeh7872
@sinatorkzadeh7872 Год назад
it seems like you use indices regarding column and index for each element but this not the way CSC concept is.
@learndataa
@learndataa Год назад
Thank you for the comment. Trying to understand the suggestion, below is description on Wikipedia: "CSC is similar to CSR except that values are read first by column, a row index is stored for each value, and column pointers are stored." Link: en.wikipedia.org/wiki/Sparse_matrix
@sridharans9400
@sridharans9400 10 месяцев назад
Hi Sir, can we say that StandardScalar() method is more suitable for dense matrix. As the average will be representative of the datapoints. Whereas in sparce data due to null values .. using mean to scale will be misleading... Is my thought process right ?
@learndataa
@learndataa 9 месяцев назад
While sklearn.preprocessing.StandardScaler() does offer options to use it on sparse matrices, choosing the right scaler would depend on your data and the requirements of your specific machine learning problem "scikit-learn: z = (x - u) / s This scaler can also be applied to sparse CSR or CSC matrices by passing with_mean=False to avoid breaking the sparsity structure of the data." Having said that, your thought process is on the right track! StandardScaler (not StandardScalar) can be more suitable for dense matrices as it assumes that the data is normally distributed and uses the mean and standard deviation for scaling. This works well when you have a significant amount of data and the null or missing values are minimal. However, in sparse data with many null values, using the mean for scaling could be misleading because the null values are treated as zeros, affecting the mean and potentially skewing the scaling process. Hope it helps! Thanks for watching.
@barunbodhak720
@barunbodhak720 2 года назад
How to get these results in excel with original dataframe columns.
@nabil101971
@nabil101971 Год назад
How can MaxAbsScaler() has the same result as StandardScaler()
Далее
How does Netflix recommend movies? Matrix Factorization
32:46
How This New Battery is Changing the Game
12:07
Просмотров 206 тыс.
Why The Sun is Bigger Than You Think
10:30
Просмотров 191 тыс.
Standardization Vs Normalization- Feature Scaling
12:52
Risking my life to save $5000
19:29
Просмотров 918 тыс.
Is Gravity RANDOM Not Quantum?
20:19
Просмотров 218 тыс.
Sparse Matrices - Intro to Parallel Programming
1:31