Тёмный

Best MLOps Practices for Building End-to-End Machine Learning Computer Vision Projects with Alex Kim 

DVCorg
Подписаться 8 тыс.
Просмотров 8 тыс.
50% 1

In this workshop with DataTalks Club, we’ll build an end-to-end Computer Vision system using MLOps tools DVC, CML, the DVC extension for VS Code, and Iterative Studio along with Fast AI, nvtop and Docker.
We’ll explore an industrial use case of training an image segmentation model for the purposes of defect detection on a manufacturing conveyor belt.
The dataset and the use case are described in this [paper](www.researchgate.net/profile/....
Repo: github.com/iterative/magnetic...
*******
You’ll learn:
- How to quickly configure a remote development environment with [TPI](github.com/iterative/terrafor... write code locally while executing on a remote machine with a GPU
- How to version large datasets and models with [DVC](github.com/iterative/dvc)
- When it’s the right time to move from Jupyter notebooks to ML pipelines and how to do that with [DVC](github.com/iterative/dvc)
- Why it’s beneficial to integrate CI/CD workflows into your model development process and how to do that with [CML](github.com/iterative/cml)
- How to manage experiments and collaborate on ML projects using [Iterative Studio](studio.iterative.ai/)
********
Target Audience:
Technical folks (e.g. Software Engineers, ML Engineers, Data Scientists) who are familiar with general Machine Learning concepts, Python programming.
Knowledge of CI/CD processes and Cloud infrastructure will be helpful.
********
Prerequisites:
- AWS Account: [mlbookcamp.com/article/aws](mlbookcamp.com/article/aws)
- Familiarity with AWS S3 and AWS EC2
- Familiarity with GitHub Actions will be helpful: [github.com/features/actions](github.com/features/actions)
********
About the speaker:
Most of Alex’s work experience involved solving data science problems in various domains: physics, aerospace, telemetry/log analytics, image, and video processing.
In the last couple of years, he became increasingly interested in the engineering side of ML projects: processes and tools needed to go from an idea to a production solution. Currently, he works as an MLOps Solutions Engineer at [Iterative.ai](iterative.ai/), helping customers extract the most value from the Iterative ecosystem of tools.
Short video trailers of what will be covered in the talk
- [1 - Launch VSCode with TPI.mp4](drive.google.com/file/d/1yl8F...)
Shows how to achieve local development experience while running code on a Cloud machine with a powerful GPU.
- [2 - Create DVC pipeline.mp4](drive.google.com/file/d/1pF3I...)
Introduces DVC pipelines
- [3 - DVC and VSCode Extension.mp4](drive.google.com/file/d/1jdXo...)
Shows how to manage experiments with VSCode Extension for DVC
- [4 - CICD and CML.mp4](drive.google.com/file/d/1DPpG...)
Shows how to configure CI/CD jobs powered by CML: deploying cloud runner and automatic reporting to GitHub.
- [5 - Exp Management in Studio.mp4](drive.google.com/file/d/1nKJo...)
Experiment management in Studio, plots, running remote experiments via Studio UI
Try out the DVC Extension for VS Code here: marketplace.visualstudio.com/...
To learn more about Iterative's open-source and SaaS tools please visit:
🧑🏽‍💻 Our online course: learn.iterative.ai
✍🏼 Our docs: dvc.org/doc (Data Version Control, Pipelines, Experiments)
cml.dev/doc (CI/CD for Machine Learning)
mlem.ai/doc (Package and Serve your models)
studio.iterative.ai (Team Collaboration, Experiments, Model Registry)
Join our Discord server: / discord
#dvc #machinelearning #datascience

Наука

Опубликовано:

 

16 май 2023

Поделиться:

Ссылка:

Скачать:

Готовим ссылку...

Добавить в:

Мой плейлист
Посмотреть позже
Комментарии : 9   
@jainamdoshi7109
@jainamdoshi7109 9 месяцев назад
Can I build this with a local pc rather than AWS as I am a student and don't have an AWS account
@dvcorg8370
@dvcorg8370 9 месяцев назад
@jainamdoshi7109 Thanks for the question! Yes you can! Check out this doc to set up a local remote: dvc.org/doc/user-guide/data-management/remote-storage#file-systems-local-remotes
@curdyco
@curdyco 4 месяца назад
isn't there a way to perfom retraining in pipeline using google colab or kaggle?
@dvcorg8370
@dvcorg8370 3 месяца назад
You can use DVC in your notebooks to rerun pipelines but ultimately for production you will want to convert your code. Check out this video next:ru-vid.com/video/%D0%B2%D0%B8%D0%B4%D0%B5%D0%BE-6x6GwtNeYdI.html
@shortspeeches1455
@shortspeeches1455 10 месяцев назад
why is the code repo deleted from Git Hub?
@maximmanchenko6660
@maximmanchenko6660 10 месяцев назад
This is the correct link github.com/iterative/terraform-provider-iterative
@dvcorg8370
@dvcorg8370 9 месяцев назад
@shortspeeches1455 you can find the repo here: github.com/iterative/magnetic-tiles-defect
@mdabuzar9300
@mdabuzar9300 9 месяцев назад
why iterative studio is betterr than mlflow?
@dvcorg8370
@dvcorg8370 8 месяцев назад
@mdavuzar9300 Thank you for the question! Both tools indeed accomplish many of the same things, but the key differentiator is that DVC Studio (name has been changed) is Git-based. You are building your end-to-end MLOps process on infrastructure you already use (Git) instead of saving your ML workflows and processes in another server. This enables you and your team to be set up for success and reproducibility through every step of the process to production.
Далее
But What Is Cloud Native Really All About?
7:32
Просмотров 141 тыс.
MLOps on Databricks: A How-To Guide
1:27:43
Просмотров 54 тыс.
Ansible vs. Terraform: What's the difference?
9:32
Просмотров 186 тыс.
What is Apache Iceberg?
12:54
Просмотров 18 тыс.
How I’d learn ML in 2024 (if I could start over)
7:05
AI/ML Engineer path - The Harsh Truth
8:39
Просмотров 337 тыс.
How GitHub Actions 10x my productivity
8:18
Просмотров 396 тыс.
Colorful Vulcan w rtx 4070ti Super
13:30
Просмотров 54 тыс.