Florence-2 is a novel vision foundation model with a unified, prompt-based representation for a variety of computer vision and vision-language tasks.
GitHub: github.com/Aar...
Try out the Florence-2 model here: huggingface.co...
Paper: arxiv.org/pdf/...
Florence-2 is pre-trained on our FLD-5B dataset encompassing a total of 5.4B comprehensive annotations across 126M images.
#computervision #largelanguagemodels #languagemodels #microsoft #ai #artificialintelligence
Oct 3, 2024