Тёмный

Building ML Models in Snowflake Using Python UDFs and Snowpark | DEMO 

Snowflake Developers
Подписаться 19 тыс.
Просмотров 3,7 тыс.
50% 1

Learn how to build machine-learning models in Snowflake in this demo by Sonny Rivera of Thoughtspot and Chris Hastie of InterWorks. During the demo, they show how to use Snowpark to clean your data and perform feature engineering, build and train sales forecast models using Python in Snowflake, use Python UDFs to expose your predictive models, and present and analyze your models in ThoughtSpot.
To access the code used in this demo, go to:
github.com/Chr...
To access the Quickstart guide for this topic, go to:
github.com/tho...
Learn more about Thoughtspot:
Website: www.thoughtspot.com
Twitter: @thoughtsport
LinkedIn: /www.linkedin.com/company/thoughtspot
Learn more about Interworks:
Website:interworks.com
Twitter: @interworks
LinkedIn: www.linkedin.com/company/interworks
To connect with the presenters:
Sonny Rivera, Senior Analytics Evangelist, ThoughtSpot
LinkedIn: / sonnyrivera
Chris Hastie, Data Engineering and Analytics Consultant, InterWorks
LinkedIn: / chris-hastie
Learn how to build your application on Snowflake:
developers.sno...
Continue the conversation by joining the Snowflake Community:
community.snow...
❄Join our RU-vid community❄ bit.ly/3lzfeeB
"

Опубликовано:

 

7 сен 2024

Поделиться:

Ссылка:

Скачать:

Готовим ссылку...

Добавить в:

Мой плейлист
Посмотреть позже
Комментарии : 11   
@octo3010
@octo3010 Год назад
How would you solve this with a vectorized UDF? Is there a demo on same.
@sonny.rivera
@sonny.rivera Год назад
Chris and I did not vectorize the UDF. That's a great idea. I'll sync with Chris and see want we do. Thanks for the great suggestion.
@octo3010
@octo3010 Год назад
@@sonny.rivera that would be very helpful. On a side note, is there any reference material on optimising costs for Snowflake compute resources.
@snowflakedevelopers
@snowflakedevelopers 11 месяцев назад
Here a few resources to get you started: medium.com/snowflake/best-practices-to-optimize-snowflake-spend-73b8f66d16c1 medium.com/snowflake/using-snowflakes-scale-to-zero-capabilities-for-fun-profit-f326a1d222d0 medium.com/snowflake/deep-dive-into-managing-latency-throughput-and-cost-in-snowflake-2fa658164fa8 medium.com/snowflake/improve-snowflake-price-performance-by-optimizing-storage-be9b5962decb medium.com/snowflake/compute-primitives-in-snowflake-and-best-practices-to-right-size-them-b3add53933a3
@octo3010
@octo3010 11 месяцев назад
Thank you !
@nagasai5029
@nagasai5029 Год назад
Where can i find the data set that is used in this video
@tahabekmez5072
@tahabekmez5072 Год назад
When you run the ml, does it run on local machine or within snowflake?
@sonny.rivera
@sonny.rivera Год назад
I often dev and test using VS Code/python on my local instance and then deploy the code to snowflake & snowpark that runs in the cloud.
@saeedrahman8362
@saeedrahman8362 Год назад
if we do the per category training and predictions in udf function generate_auto_arima_predictions via pandas dataframe we wouldn't get any parallelization benefit, right ? We would process all the categories sequenetially. Shouldn't we use UDTF for these kind of operations ?
@snowflakedevelopers
@snowflakedevelopers Год назад
Thanks for your comment! A UDTF would be a stronger option, as it could leverage parallel partitioning to perform these concurrently instead (as you mention). Check out the following two articles on training ARIMA models: interworks.com/blog/2022/11/22/a-definitive-guide-to-creating-python-udtfs-directly-within-the-snowflake-user-interface/ interworks.com/blog/2022/11/29/a-definitive-guide-to-creating-python-udtfs-in-snowflake-using-snowpark/ For some more information on UDTFs and how they work, see: interworks.com/blog/2022/11/15/an-introduction-to-python-udtfs-in-snowflake/ Thanks!
@sonny.rivera
@sonny.rivera Год назад
The models will run concurrently on the virtual warehouse. The UDTF is really just calling the 'predict' function. The model training is happening in the stored proc.
Далее
PySpark To Snowpark Migration
40:42
Просмотров 2,7 тыс.
Deep Dive: Operationalizing Snowpark For Python
45:34
Просмотров 1,5 тыс.
Financial Forecasting using Python & Snowflake
46:49
Hyperparameter Tuning in Snowpark
15:25
Просмотров 717
Snowpark for Python | Snowflake Tutorial
28:18
Просмотров 30 тыс.
Building and Deploying Python Code in Snowflake
1:02:21