You Only Look Once - YOLO: Object Detection using Convolutional Neural Networks

Подписаться 1,1 тыс.

Просмотров 36 тыс.

50% 1

This video present one of the fastest object detection algorithms for videos that can be used for real time applications. The algorithm is made easy for beginners. This is part 1, and part 2 will also be uploaded soon.

Опубликовано:

26 сен 2024

Ссылка:

Скачать:

Готовим ссылку...

Добавить в:

Мой плейлист

Посмотреть позже

Комментарии : 22

@prashanthvalluru8600 4 года назад

Thank you for the video.

@aparajitaojha807 4 года назад

You're welcome

@sachinbahade9982 4 года назад

If I get an early reply, it would be very helpful, output feature size is 19x19. How they will create a label this downsampled size. How it mapped on to the original size.

@aparajitaojha807 4 года назад

19X19 gives information about each grid cell. So each cell corresponds to a pixel block, where one can find if there is an object present or not, using the confidence score. And if the confidence score is high, it predicts the class of the object, by multiplying confidence score with class score. for that pixel block [ cell]. Then for object of the predicted class, it takes into account the box estimation [anchor box], centre height and width. Finally for each grid cell, it gives you object class, and its bounding boxes, if the object is predicted in the box. Hope this is clear.

@shabanaoc9750 2 года назад

Useful for my project 👏

@umamaheswararaomeleti8878 4 года назад

Good explanation, clear enough with suitable example. it will be great if you can share the slides.

@aparajitaojha807 3 года назад

Thanks, appreciate it.

@ishanarora2325 4 года назад

Hello mam can u explain that if a image contains 4 objects what is the label vector for that?

@aparajitaojha807 3 года назад

For 4 objects, the label will be like this. [ 1, x1,y1, h 1,w1, this is for object 1, 1, x2, y2, h2, w2, ...., 1, x4,y4, h4, w4]

@ankitasontakke138 4 года назад

Simple and nice explanation

@aparajitaojha807 3 года назад

Thanks for liking

@mekaladharani7233 4 года назад

hlo mam......can u explain how to develop and execute a code for object detection using cnn

@aparajitaojha807 4 года назад

Yes, sure

@ssp3839 4 года назад

how to give threshold for for 7 x 7 grid ?

@aparajitaojha807 4 года назад

What threshold are you talking about ? If it is related to IOU, we generally give a threshold of 0.5 or 0.6. If the data is having very complex samples, then even a small threshold will do.

@ssp3839 4 года назад

@@aparajitaojha807 IOU mam thanks now it's clear

@jawaharali5568 4 года назад

loved it

@aparajitaojha807 4 года назад

Thanks, I appreciate.

@prashanthvalluru8600 4 года назад

it's one of the best explanations I have seen. Loved your explanation :) . can you tell me how we get 19X19 as output with a 304X304 input image and 16X16 grid size?

@aparajitaojha807 4 года назад

Sorry for the late reply, I was busy with a training programme with 1000+ participants on Machine Learning for computer vision. It is simple, 304/16 =19 on x and y direction.