Thanks for the talk. Just a few notes. TF.js can also be used in settings with no network access, by building browser-based desktop apps with frameworks like Electron. Also, while cloud options can be pricier in terms of infrastructure cost, they are usually more managed and require less maintenance overhead.
Why Flask and not FastAPI? FastAPI is better suited to POST calls, for example (say you want to send an observation as input and get back a prediction for it), because it also provides input schema validation through pydantic. Maybe I am missing something, but I am curious about this choice.