Тёмный

How to Deploy ML Solutions with FastAPI, Docker, & AWS 

Shaw Talebi
Подписаться 38 тыс.
Просмотров 14 тыс.
50% 1

Опубликовано:

 

21 окт 2024

Поделиться:

Ссылка:

Скачать:

Готовим ссылку...

Добавить в:

Мой плейлист
Посмотреть позже
Комментарии : 33   
@ShawhinTalebi
@ShawhinTalebi 5 месяцев назад
More on Full Stack Data Science👇 👉Series Playlist: ru-vid.com/group/PLz-ep5RbHosWmAt-AMK0MBgh3GeSvbCmL 💻Example Code: github.com/ShawhinT/RU-vid-Blog/tree/main/full-stack-data-science/ml-engineering
@FREAK-st6kk
@FREAK-st6kk 2 месяца назад
One of the best aspects of AWS Elastic Cloud is how seamlessly everything comes together, whether you're using FastAPI or Docker. It's all integrated beautifully.
@divyanshtripathi4867
@divyanshtripathi4867 4 месяца назад
This is such a great video, no nonsense straight to the point!
@Coret-with-c
@Coret-with-c 28 дней назад
Super detailed explanation, thank you!
@miraclechijioke1213
@miraclechijioke1213 Месяц назад
This was so simplified. thank you Shawhin
@pawe5560
@pawe5560 5 месяцев назад
Hey Shawn, videos on FastAPI and Docker from you would be great.
@ShawhinTalebi
@ShawhinTalebi 5 месяцев назад
Thanks for the suggestion! I'll add it to my queue :)
@nocomments_s
@nocomments_s Месяц назад
I like your video, it deserves much more views
@brianmorin5547
@brianmorin5547 5 месяцев назад
This is fantastic stuff as I’m pulling out my hair on this same step. You have the right idea for the next video, but I think the next one after that is making the chat interface publicly accessible
@ShawhinTalebi
@ShawhinTalebi 5 месяцев назад
Great suggestion Brian! There are several ways one can do this. The simplest and cheapest would be hosting it via HuggingFace Spaces: huggingface.co/spaces/launch However, for this specific use case the most practical option would be to embed it into my Squarespace website. I'll need to do some more digging to see the best way to do that.
@brianmorin5547
@brianmorin5547 4 месяца назад
@@ShawhinTalebi The streamlit cloud has been my go-to so far. I am toying with creating a react front end template but would like to see what others are doing
@dhirajkumarsahu999
@dhirajkumarsahu999 4 месяца назад
Thank you so much, such videos are really very helpful
@ShawhinTalebi
@ShawhinTalebi 4 месяца назад
Glad to hear :)
@RatherBeCancelledThanHandled
@RatherBeCancelledThanHandled 28 дней назад
well done , Thanks !
@fatimayousaf1644
@fatimayousaf1644 22 дня назад
Hey Shaw! what a beautiful content. I followed all the steps from ML APP from scratch till deployment but at last moment, I don;t have AWS free tier account as they still ask to enter the debit card details for having free access, so can you please tell me another way round to cope up with this ?
@ShawhinTalebi
@ShawhinTalebi 15 дней назад
Glad you liked it! You could try deploying to railway (railway.app/). I just used them for a project and don't think I needed to input credit card info.
@paulohss2
@paulohss2 21 день назад
Great tutorial... Im still not able to connect to the API unfortunately (This site can’t be reached,refused to connect.) even though I followed the network config steps you explained...
@ShawhinTalebi
@ShawhinTalebi 15 дней назад
To confirm, you added inbound rules in the VPC dashboard to allow all incoming traffic from your IP? Does the IP listed in the inbound rules match yours?
@Smrigankiitk
@Smrigankiitk 24 дня назад
how can we integrate streamlit to make the UI to get input and send to model and display the output here ?
@ShawhinTalebi
@ShawhinTalebi 22 дня назад
Good question. This should be similar to the Gradio example shown 25:54. This blog post might be helpful: blog.streamlit.io/create-a-search-engine-with-streamlit-and-google-sheets/
@angieyoon9900
@angieyoon9900 4 месяца назад
Thank you!
@rameshbabu2085
@rameshbabu2085 4 месяца назад
I have a DL model which takes about 5 mins and 3gb GPU to process the query and to return result. I need to handle 5 queries per minute and I have a GPU with 8gb in GCP. How can I deploy such a model without memory leakage and I should be able to use the GPU at its full potential?
@ShawhinTalebi
@ShawhinTalebi 4 месяца назад
How big is that model? Do you have GPU parallelization enabled? If it takes 5 min and 3GB to do one query with parallelization, the model may be too big to meet those technical constraints.
@ax5344
@ax5344 5 дней назад
where does K8S fit in in this pipeline?
@ShawhinTalebi
@ShawhinTalebi 2 дня назад
Good question. In my experience, Kubernetes is rarely used in DS/ML, so I wouldn't worry learning it if your just getting started.
@thonnatigopi
@thonnatigopi 4 месяца назад
Bro create a video for handling post and get request and multiple endpoints using fast api dockerize and ECR and aws lambda functions
@ShawhinTalebi
@ShawhinTalebi 3 месяца назад
Great suggestion. I'll add that to my list!
@R-ms9uo
@R-ms9uo Месяц назад
Can make tutorials on CI/CD pipeline
@ShawhinTalebi
@ShawhinTalebi 29 дней назад
I get into that in another video of this series: ru-vid.com/video/%D0%B2%D0%B8%D0%B4%D0%B5%D0%BE-wJ794jLP2Tw.html
@abbasrabbani7665
@abbasrabbani7665 5 месяцев назад
that was an awesome video, I have a task for one click ml model deployment on aws, azure and GCP, like one click on aws and other click on azure. CAn u please guide me shortly the roadmap...!
@ShawhinTalebi
@ShawhinTalebi 4 месяца назад
Thanks for your comment. Sorry I'm not sure what you mean by "one click ml model deployment". Could you share more details?
@vivekasthana12345
@vivekasthana12345 Месяц назад
Thank you Shaw for making so many amazing videos. quick question from this video.. where exactly are you making a connection between your dockerhub and AWS ECS? Is it where you mention the url of the image? what if someone has a similar image name (shawhint/yt-search-demo) or that's not possible? Sorry if its a dumb question 😐
@ShawhinTalebi
@ShawhinTalebi 29 дней назад
Good question! Yes, exactly. No one will have a similar image name because the first part will be your unique DockerHub username.
Далее
Борщ в стиле высокой кухни!
00:57
18 Weird and Wonderful ways I use Docker
26:18
Просмотров 320 тыс.
Deploy ML model in 10 minutes. Explained
12:41
Просмотров 27 тыс.
How to deploy a dockerized FastAPI to Cloud Run
22:22
Просмотров 3,3 тыс.
How To Deploy Your RAG/AI App On AWS (Step by Step)
55:07
The intro to Docker I wish I had when I started
18:27
Просмотров 173 тыс.
4 Tips for Building a Production-Ready FastAPI Backend
27:02