Top Object Detection Models in 2023 | Model Selection Guide sponsored by Intel

Подписаться 36 тыс.

Просмотров 21 тыс.

50% 1

Description:
Discover the top object detection models in 2023 in this comprehensive video. We compare models like YOLOv8, YOLOv7, RTMDet, DETA, DINO, and GroundingDINO based on metrics like Mean Average Precision, community support, packaging, and licensing for you to decide which is best for your production AI applications. The video also details the challenges in comparing model speed and highlights important nuances within the realm of object detection, like choosing the right model for the right hardware and use case. It's an essential watch for anyone interested in computer vision and model selection. This research was sponsored by Intel.
#ObjectDetection #ComputerVision
Chapters:
- 00:00 Introduction
- 00:35 Object Detection
- 01:42 Mean Average Precision
- 02:28 Speed
- 03:40 Paper, Packaging, and License
- 04:35 YOLOv8
- 05:21 YOLOv7
- 06:06 YOLOv6-v3
- 07:01 RTMDet
- 07:46 RT-DETR
- 08:50 DETA
- 10:02 GroundingDINO
- 10:37 Model Community Comparison
- 11:46 Conclusion
Resources:
- 🌏 Roboflow: roboflow.com
- 📚 Roboflow Notebooks Repository: github.com/roboflow/notebooks
- 🌌 Roboflow Universe: universe.roboflow.com
- 📈GPU vs Intel HPU (new hardware options for AI): blog.roboflow.com/gpu-vs-hpu/
Stay updated with the projects I'm working on at github.com/roboflow and github.com/SkalskiP! ⭐
Remember to like, comment, and subscribe for more content on AI, computer vision, and the latest technological breakthroughs! 🚀

Наука

Опубликовано:

11 июл 2024

Ссылка:

Скачать:

Готовим ссылку...

Добавить в:

Мой плейлист

Посмотреть позже

Комментарии : 70

@hawkingradiation3774 9 месяцев назад

would also like to see a video for comparison for segmentation tasks as well.

@Roboflow 9 месяцев назад

Awesome idea! I'm curious if there are more people who would like to see that. It's a lot of work to create video like that.

@sozno4222 9 месяцев назад

@@RoboflowI would also be interested in a video like that

@visuality2541 9 месяцев назад

Indeed

@ferneutron 5 месяцев назад

Super great job, thanks!

@mpty2022 9 месяцев назад

thanks for sharing your research

@Roboflow 9 месяцев назад

My pleasure!

@rperezalejo 9 месяцев назад

Great video, right now I am working on a real time sport object detection and this video comes like a charm for me. I can test other posibilities I did not had in mind.

@Roboflow 9 месяцев назад

Awesome we came at the right one! I’d love to here about your results once you finish tests.

@rperezalejo 9 месяцев назад

@@Roboflow I have to retrain on 4k images first and then see the performance of the models in real time, I am not sure If i am going to be able to test the all, but the idea of having the golden cub gives a sign on where to start. It is a great guide anyways

@romroc627 9 месяцев назад

Excellent video, really useful.

@Roboflow 9 месяцев назад

I’m so happy to see such a positive feedback!

@AlainPilon 9 месяцев назад

Thanks for the work! I am wondering if the COCO metric is actually useful. It is an interesting comparison point but I am not sure how it translates in practice given that most people only train detector for a very few set of classes. Community support, documentation and how the model integrates in the current ecosystem is much more impactful and I am glad you added these to your chart.

@Roboflow 9 месяцев назад

I'm glad you agree with my methodology. I think the license is also very important. After all you must be able to use the model in your project. As for mAP, 100% agree. I'd love to have other metrics that I could use. We developed RF100 metric - paperswithcode.com/dataset/rf100, but I didn't have enough datapoints to compare all the models.

@juanolano2818 9 месяцев назад

Will go with Yolo8 for my current microbes identification project :) Thank you Piotr!

@Roboflow 9 месяцев назад

If it is open source or academic good choice! 👍🏻

@saurabhgupta7148 5 месяцев назад

I am also working with images of micro organisms. Did you get good results with YoloV8?

@mr_tpk 6 месяцев назад

Thank you ❤

@tryingtobeproductive 9 месяцев назад

This video is extremely useful 10/10

@Roboflow 9 месяцев назад

Thank you! Awesome to hear such a positive feedback 🔥

@satellite-image-deep-learning 9 месяцев назад

Fantastic summary, thank you for the effort that went into this! MMdetection has come up before and I would love an intro video on it 😊

@Roboflow 9 месяцев назад

We already have MMDetection. :) Take a look on our channel

@segheysens 9 месяцев назад

They already created a video here! 🙌 ru-vid.com/video/%D0%B2%D0%B8%D0%B4%D0%B5%D0%BE-5kgWyo6Sg4E.html

@johannesmokami5760 9 месяцев назад

Thanks for the info I will definitely try them as well.

@Roboflow 9 месяцев назад

Awesome! Which detector are you going to try?

@johannesmokami5760 9 месяцев назад

I'm currently trying out YOLOv8 but I'd like to try YOLOv7 and GroundingDINO @@Roboflow

@rafael.gildin 9 месяцев назад

Great Video 🎉

@Roboflow 9 месяцев назад

My pleasure! 🔥

@Seethis-HD 9 месяцев назад

Excellent video. Thanks for the efforts. I was wondering why you didn't consider YOLO-NAS in the list?

@Roboflow 9 месяцев назад

I considered it but ultimately decided not to include it. I’m pretty confident those models are better choices.

@PhilippBlum 4 месяца назад

@ow What was the reason not to include it? Accuracy?

@EliSpizzichino 9 месяцев назад

We need a platform to fully compare them on real datasets on real training on the same device. Is also important to keep track if a version change produce worsening quality. I've noticed for example that between one minor version and another of the ultralitics codebase the quality of the final trained model worsened by a lot.

@Roboflow 9 месяцев назад

That's super interesting! Could you share more insights on what the versions were? I'd love to do more investigation. As for the "platform to fully compare them on real datasets", have you seen RF100? paperswithcode.com/dataset/rf100

@EliSpizzichino 9 месяцев назад

@@Roboflow The last tested good version was 8.0.103. Unfortunately I had no time since then to do further investigation my self but I remember trying a couple of training out with some versions after that and got worse results I haven't tested the RF100 yet, it's a good effort and I like what you do as company (I never left such a good comment to anyone in my life :)

@user-fj1zz9mi5r 8 месяцев назад

I was looking for performance over the time inference for edge devices. I was trying to use Yolov8 for edge deployment into STM32 but at the end, i realized this model was too big for this card. What do you think is a good model for a good ratio between inference time / model size? Thanks for your response

@seanolivieri4829 7 месяцев назад

Do you have a video on licenses? I dont understand any of those and which one should I use if I want to be able to sell my program or call it my own

@user-rx4wd3xn2c 6 месяцев назад

can we use yolov8 pretrained weights for commercial use?

@zskater1234 9 месяцев назад

I’m currently porting GroundingDINO to the transformers library so buckle up

@Roboflow 9 месяцев назад

Uuu! Awesome! I can’t wait to see that happening. Being able to setup GroundingDINO with single pip install.

@zskater1234 9 месяцев назад

@@Roboflow the model is already ported, now I’m tackling the tests and documentation if everything goes well by next weekend I’ll finished it and will await the HF review

@pleison111 4 месяца назад

I have a question, I am working on a OCR project, I am using a fastrcnn with resnet50 as object detector, and then I need something like a conv + GRU or ViT to decode the text, do you have some suggestions regarding OCR?

@Roboflow 4 месяца назад

First of all why Fast RCNN? As for OCR did you try Tesseract?

@tyronetyrone2652 9 месяцев назад

Which framework is better to use in embedded chips?

@Roboflow 9 месяцев назад

which board are you using?

@sarathkumar-gq8be 4 месяца назад

Which model will perform better in raspberry pi

@shubh722 9 месяцев назад

I am doing an object detection task and get 97.4% accuracy on the dataset using yolov5 and will be running it on an edge device. Is yolov5 too old and Should I train a yolov8 model for faster inference? As I think accuracy will be almost similar as it’s already 97.4%. Or is it task specific. If yolov5 is performing good then is there any need to change. If anyone can suggest please

@Roboflow 9 месяцев назад

I don’t think you can expect better accuracy than that. The main issue here is that YOLOv5 did not have proper Python packaging so integrating it into larger projects was problematic.

@omigator 9 месяцев назад

We switched from Yv8 to Yv5 because it gave better performance without any loss in accuracy on our edge devices.

@shubh722 9 месяцев назад

@omigator What do you think about the inference times between v8 and v5? Is it real time? Also idk I found yolov5 easier to use as well. I was training it in Azure ml so was much easier to tweak the files for v5 to train there rather than v8. And the accuracy is pretty good as well.

@appliedml8665 9 месяцев назад

Yolo gold is now available.

@Roboflow 9 месяцев назад

Yeah I know... This video was recorded before YOLO GOLD was released. I didn't have time to play with it yet. Have you?

@satyajitpanigrahy7742 9 месяцев назад

Kindly, Update the ultralytics package for YOLOv4 model

@Roboflow 9 месяцев назад

Hi @satyajitpanigrahy7742 👋 Ultralytics is a separate team. Kindly, try to submit bug report in their repository: github.com/ultralytics/ultralytics

@titusfx 9 месяцев назад

In 2:28 why not just do asymptotic analysis (computational complexity analysis).

@Roboflow 9 месяцев назад

Hi 👋🏻! You mean use FLOPS to asses complexity?

@sanchaythalnerkar9736 9 месяцев назад

Can you show actual code and real time comparison of these?

@Roboflow 9 месяцев назад

You mean independent time benchmark comparing the speed of all of those models?

@sanchaythalnerkar9736 9 месяцев назад

Yes Exactly , a side by side comparison@@Roboflow

@Roboflow 9 месяцев назад

@@sanchaythalnerkar9736 not sure if it really can be side by side. To truly measure the model speed we need to make sure there is no other heavy process running on the machine. But sure we can try to make that happen. I'll add it to our TODO list.

@8eck 9 месяцев назад

Where is DETA video? Couldn't find DETA with 100k stars... Could you please add github link here.

@Roboflow 9 месяцев назад

For now we only have DETR. You can find it here: ru-vid.com/video/%D0%B2%D0%B8%D0%B4%D0%B5%D0%BE-AM8D4j9KoaU.html As for star count, DETA is distributed via transformers library and that's what I used to measure community size.

@8eck 9 месяцев назад

RT-DETR have 355 stars, 20k+

@Roboflow 9 месяцев назад

To be honest no one use implementation from original repository. RT-DETR is distributed via PaddlePaddle package. That's why we use 20k+ star count. I know it is not perfect... but like I said I decided to use the top repo that make the model accessible.

@8eck 9 месяцев назад

@@Roboflowcan you please drop some links? Thank you.

@Roboflow 9 месяцев назад

@@8eck take a look here: github.com/PaddlePaddle/PaddleDetection/tree/develop/configs/rtdetr and here: huggingface.co/docs/transformers/main/en/model_doc/deta

@8eck 9 месяцев назад

Found only DETA with 198 stars, not 100k like in your table...

@Roboflow 9 месяцев назад

I responded to that question under your other comment :)

@adurks4846 9 месяцев назад

Personally, I've found yolov8 to be disappointing in the real world. I work in aerial/satellite imaging and yolov8 performs ~10% worse than scaled-yolov4. Most of the others on that list perform similarly. Overall, it seems like once you leave the types of images/targets in the COCO dataset, the metrics mean less and less for what will do well on your project.

@Roboflow 9 месяцев назад

Absolutely agree! I even said that in the video. I'd love to have other metric to compare models, not just mAP on COCO. The moment to start to fine-tune the model on your dataset that number means nothing. Do you care about the speed when you process aerial/satellite imaging?

@adurks4846 9 месяцев назад

@@Roboflow We don't care that much about speed. However, we don't typically have much data which means that the larger models seem to do worse. Do you guys have in-house metrics for some of these models using the roboflow-100?