Тёмный

You Only Look Once: Unified, Real-Time Object Detection 

ComputerVisionFoundation Videos
Подписаться 38 тыс.
Просмотров 128 тыс.
50% 1

This video is about You Only Look Once: Unified, Real-Time Object Detection

Кино

Опубликовано:

 

31 авг 2016

Поделиться:

Ссылка:

Скачать:

Готовим ссылку...

Добавить в:

Мой плейлист
Посмотреть позже
Комментарии : 64   
@lorinma
@lorinma 7 лет назад
amazing work!
@beteaberra631
@beteaberra631 2 года назад
Great presentation: clear, thoughtful and fun!
@engineer.alqupatimohammed6942
@engineer.alqupatimohammed6942 2 года назад
Great show and unbelievable explanation. Thank you for your tremendous effort.
@user-nl5hx3pu7y
@user-nl5hx3pu7y 5 лет назад
love the "toilet" regonition at 11:17😂
@zhiyongwang2849
@zhiyongwang2849 2 года назад
lol
@christianreiser779
@christianreiser779 7 лет назад
Great talk!
@christians6295
@christians6295 6 лет назад
Informative & entertaining!
@hristovrigazov3150
@hristovrigazov3150 5 лет назад
Industry defining talk!
@Bamboo_gong
@Bamboo_gong 3 года назад
wonderful presentation.
@user-ahmed-
@user-ahmed- Год назад
Incredible presentation
@friskybiscuits10
@friskybiscuits10 7 лет назад
AWSOME VIDEO!!!!
@quang-namvu407
@quang-namvu407 6 лет назад
you've reduced me a lot of time. Thank you!
@chesterkylescolita3393
@chesterkylescolita3393 5 лет назад
yeah reading the conference paper is quite cumbersome and i am having a hard time understanding it. this video made it simpler to understand
@srd6263
@srd6263 2 года назад
重温经典,膜拜大神
@user-ld3ke2yt8l
@user-ld3ke2yt8l 2 года назад
Awesome
@lyltencent
@lyltencent 6 лет назад
Greate work!
@zuam7645
@zuam7645 3 года назад
*THAT IT IS AN EXCELENT SOFTWARE FOR USING IN "DASH CAMERAS" FOR CAPTURING AND VIDEO RECORDING PEOPLE AROUND YOUR BELONGINGS OR CAR KEYING YOUR CAR*
@franklyvulgar1
@franklyvulgar1 10 месяцев назад
yes Joe, YOLO FTW!!!
@crabsynth3480
@crabsynth3480 6 лет назад
2:30 .... Wow... ! [Edited] 11:50 .... Awe-Inspiring !
@dompower500
@dompower500 4 года назад
excellent conference on YOLO.
@user-em1nh6xq2i
@user-em1nh6xq2i 4 года назад
cool!
@nguyenanhnguyen7658
@nguyenanhnguyen7658 3 года назад
Love Yolo ❤️❤️❤️👌👌👌
@sahil-7473
@sahil-7473 3 года назад
How it's works at Inference time. I am not able to get it. Each output with give range between -1 to 1. Now how can I bring it BB into original image.? Kindly tell me the mathematics, how to compute it's? This is where I stuck. Help me🙏
@apurbaroy8411
@apurbaroy8411 3 года назад
Is it possible to integrate the YOLO algorithm with arduino or raspberry pi using a webcam?
@alexdalton4535
@alexdalton4535 Год назад
how does a grid cell predict a box that is bigger than itself?
@sumod12
@sumod12 4 года назад
super
@hyunseokjeong7994
@hyunseokjeong7994 7 лет назад
Thank you for the video. I did not get "NMS and threshold detections" could you explain a bit more?
@elbouziadyabderrahim8086
@elbouziadyabderrahim8086 5 лет назад
NMS (Non Maximum Supression) : take the bounding box with the max condidence value
@sridharkashyap9603
@sridharkashyap9603 3 года назад
Nms will only keep the bounding box which has max intersection over union of overlapping bounding boxes
@huawei2091
@huawei2091 6 лет назад
When you say "dont adjust the class probabilities or coordinates" if there are no object centered in that grid cell, you mean simply pass on that cell and move to next, right? So you only backpropagate the NN when there is an object centered in that cell. Am I getting it right?
@migrantama
@migrantama 6 лет назад
Hello, I'm also looking for the answer to the same question, do you got the idea??
@anikethshetty992
@anikethshetty992 6 лет назад
Hey I'm new to the field of Convolutional Neural Network. I have a presentation in school on YOLO and I need some help. Can someone please explain how the output of the convolution layer works. The input to the first convolution network is a 448*448*3 tensor. And it's output is a 224*224*64 tensor on a filter of 7*7. I understand that the depth is 64 because of 64 different filters (features) Thank you!
@dhruba1992
@dhruba1992 2 года назад
Output shape= (𝑊 −𝐾+2𝑃)/𝑆 + 1 ; W = input volume, K = kernel size, P = padding, S = stride
@sherlockskey4131
@sherlockskey4131 2 года назад
sir , where can i get complete code....pls help i am working on this project
@manojguha2046
@manojguha2046 7 лет назад
This new method is going to be the future of object detection... So fast and accurate. Is he running on a windows or linux pc ??
@Nicolas-xr3id
@Nicolas-xr3id 4 года назад
linux
@anoushakhan7896
@anoushakhan7896 5 лет назад
which laptop do you have ?
@maheswaranparameswaran8532
@maheswaranparameswaran8532 4 года назад
Something with a titan x for sure....its on his github page
@Dennis-nn5tc
@Dennis-nn5tc 6 лет назад
why they use 2 bounding boxes for 1 cell? For localization 1 bounding box for each cell should be enough or? In OpenCv for example the Object Detection draws only 1 bounding box around an object.
@chesterkylescolita3393
@chesterkylescolita3393 5 лет назад
i think it is called anchor boxes.
@maheswaranparameswaran8532
@maheswaranparameswaran8532 4 года назад
i guess more than 2 anchor boxes being in a same grid cell if u use a large grid is relatively low...check out andrew ngs video on yolo on deeplearning.ai s channel
@shaz7163
@shaz7163 7 лет назад
How to calculate the p(class/object)
@Splish_Splash
@Splish_Splash Год назад
model itself produces confidence level by softmax(logits)
@vikrantchoudhary4411
@vikrantchoudhary4411 4 года назад
I just Had one question When we know where the ground-truth centre of the object is why can't we scan just that area or nearby area why do we scan the whole image??
@ogsconnect1312
@ogsconnect1312 4 года назад
Yes you're correct that when we know where the ground-truth center is, we can just scan that area. The problem is generalization i.e. our model will only be good at that specific instance, and when the object happens to be located in another region of the image as is often the case in the test set, the model fails completely and that defeats our training objective and learning wouldn't have taken place in that respective. Hope it makes some sense? Thanks for reading.
@vikrantchoudhary4411
@vikrantchoudhary4411 4 года назад
@@ogsconnect1312 oh my god ! that's the single point I was confused the whole time thanks a lot buddy.
@XiuyuYang
@XiuyuYang Год назад
A milstone in cv.
@dominiksulzer1338
@dominiksulzer1338 6 лет назад
2:43 more than 105 % sure that there is a person when there is not.
@phoneplaysguitar
@phoneplaysguitar 4 года назад
Video is still on and he is still in front of the camera. 105 is still weird, but okay.
@alexdalton4535
@alexdalton4535 Год назад
@@phoneplaysguitar it should be between 0 and 100 though lmao, how do you get 105%
@pooorman-diy1104
@pooorman-diy1104 4 года назад
I stil dont get it ..
@alchemication
@alchemication 4 года назад
This will make it easier (make sure you watch the previous videos as well to understand the building blocks): ru-vid.com/video/%D0%B2%D0%B8%D0%B4%D0%B5%D0%BE-9s_FpMpdYW8.html . Hope it helps!
@nano7586
@nano7586 4 года назад
YOLO is so fucking hilarious.. it's a big "fuck you" to all these kind of scientists who see things a bit too seriously. I love these kind of things and it gets me motivated in the science field, given that science for most part is very dry and it easily makes you depressed. Just thinking about the fact that "YOLO" will probably be mentioned in my masters thesis is so good :D 0:01 That picture is top notch.
@sandaminirathnayaka876
@sandaminirathnayaka876 6 лет назад
can u share the source code with me?
@Deadbeatdad666
@Deadbeatdad666 6 лет назад
github.com/pjreddie/darknet
@TheVasanthbuddy
@TheVasanthbuddy 7 лет назад
103% probability that its a person. Something fishy in your calculation
@xinyizhang3766
@xinyizhang3766 2 года назад
toilet lol
@user-uo4qi3ip8n
@user-uo4qi3ip8n 6 лет назад
poor presentation
@dudeking1000
@dudeking1000 6 лет назад
Hey I'm new to the field of Convolutional Neural Network. I have a presentation in school on YOLO and I need some help. Can someone please explain how the output of the convolution layer works. The input to the first convolution network is a 448*448*3 tensor. And it's output is a 224*224*64 tensor on a filter of 7*7. I understand that the depth is 64 because of 64 different filters (features) Thank you!
@weijiangxu3950
@weijiangxu3950 6 лет назад
Because the stride is 2?
Далее
How YOLO Object Detection Works
17:04
Просмотров 22 тыс.
🎙ПОЮ твои ЛЮБИМЫЕ ПЕСНИ💥
3:22:10
Legendary KNOCKOUT
00:44
Просмотров 1,9 млн
Eddie Hall VS Neffati Brothers
00:11
Просмотров 1,7 млн
Lecture 11 | Detection and Segmentation
1:14:26
Просмотров 616 тыс.
Build an Object Detector for Any Game Using YOLO
22:40
Introduction into YOLO v3
26:56
Просмотров 97 тыс.
Object Detection in 10 minutes with YOLOv5 & Python!
10:45
😾ПОКОРМИ уже кота, бабуля!
1:00
Kim bu Gollandskiy | Dizayn jamoasi
0:54
Просмотров 1,4 млн