To serve multiple concurrent users accessing FastAPI endpoint running Hugging Face API, you must start the FastAPI app with several workers. It will ensure current user requests will not be blocked if another request is already running. I show and describe it in this video.
Sparrow - data extraction from documents with ML:
github.com/katanaml/sparrow
0:00 Introduction
0:30 Concurrency
2:50 Problem Example
4:10 Code and Solution
6:10 Summary
CONNECT:
- Subscribe to this RU-vid channel
- Twitter: / andrejusb
- LinkedIn: / andrej-baranovskij
- Medium: / andrejusb
#python #fastapi #machinelearning
27 июл 2024