Тёмный

CNN: Convolutional Neural Networks Explained - Computerphile 

Computerphile
Подписаться 2,4 млн
Просмотров 858 тыс.
50% 1

Years of work down the drain, the convolutional neural network is a step change in image classification accuracy. Image Analyst Dr Mike Pound explains what it does.
Kernel Convolutions: • How Blurs & Filters Wo...
Deep Learning: • Deep Learning - Comput...
Botnets: • Botnets - Computerphile
AI's Game Playing Challenge: • AI's Game Playing Chal...
Space Carving: • Space Carving - Comput...
/ computerphile
/ computer_phile
This video was filmed and edited by Sean Riley.
Computer Science at the University of Nottingham: bit.ly/nottscom...
Computerphile is a sister project to Brady Haran's Numberphile. More at www.bradyharan.com

Опубликовано:

 

28 сен 2024

Поделиться:

Ссылка:

Скачать:

Готовим ссылку...

Добавить в:

Мой плейлист
Посмотреть позже
Комментарии : 508   
@Intelligenz_Bestie
@Intelligenz_Bestie 8 лет назад
convolutional neural networks is one of those things that really needs some visuals, i find that it is really hard to 'grok' when you get it explained in a book or via speech but once you get a visual example it's kind of hilariously simple and scaringly plausible
@AlpineLifeProductions
@AlpineLifeProductions 7 лет назад
damn, I thought I was the only one who read that book
@hanneshertach8013
@hanneshertach8013 7 лет назад
Dog Barksley which book?
@jim_rev9335
@jim_rev9335 5 месяцев назад
forshadowing
@dickhamilton3517
@dickhamilton3517 8 лет назад
I like the 'extensive' library of books he has on that shelf above and behind him
@techsuvara
@techsuvara 3 месяца назад
Word Press Forms is all you need!
@bobiboulon
@bobiboulon 7 лет назад
I just discovered this channel, saw a bunch viideos and didn't come across a single boring one.
@alpardal
@alpardal 8 лет назад
Kudos to Mike, videos with him are always fun and well explained!
@unvergebeneid
@unvergebeneid 8 лет назад
Wow, interesting concept, nicely explained ... well done!
@michaelsidorov157
@michaelsidorov157 2 года назад
In minute 9:07 - the size of the image changes not because it computes only the middle pixel, but because it fits in the size of the image less times than the images' width and height.
@anwul4
@anwul4 8 лет назад
Now I know what to work on between handing in my Master thesis and defending it. Cause model my Artificial Neural network towards a Convolutional Neural Network. Might indeed bring up my accuracies in regards to recognizing game events based on Electroencephalogram and Eye-tracking data.
@nand3kudasai
@nand3kudasai 8 лет назад
I love the topics from this guy and he explained it very well. Though his accent is a little difficult for me. Awesome video and very cool how you get to reference and link all those other previous videos
@Rajeshgandh
@Rajeshgandh 2 года назад
Me too, I enabled the subtitles, Any way its a great video
@szeredaiakos
@szeredaiakos 5 лет назад
I like how he throws around "corners and edges" and the begining of DL corners and edges was actually a prediction but in reality, the slices of the most capable nets looks absolutely nothing like corners and edges and a whole lot more like noise.
@UberAlphaSirus
@UberAlphaSirus 8 лет назад
Would you mind putting links in the description for annotated link videos for us mobile users, thanks.
@Computerphile
@Computerphile 8 лет назад
+Sirus done >Sean
@UberAlphaSirus
@UberAlphaSirus 8 лет назад
Thanks
@baconology
@baconology 7 лет назад
Thanks. Annotated links are pretty much always off in the future. At least for nerds?
@tubeyoukonto
@tubeyoukonto 6 лет назад
Have to do a work on a paper about imagenet and deep convolutional neural networks. This video explained sooo much! Thank you!!!
@aligator381
@aligator381 8 лет назад
Another great video! I really like the fact that you create annotations for relevant or prerequisite videos and stuff, but maybe they would be more useful if they opened in a new tab. I don't want to lose where I was on this video, when I open and go through the annotated video.
@JacksMacintosh
@JacksMacintosh 8 лет назад
This guy is awesome
@gabetower
@gabetower 8 лет назад
I love Mike's videos on image processing.. Keep em up!
@alexrossouw7702
@alexrossouw7702 8 лет назад
You should use growing media-free hydroponic systems to for viewing healthy roots. I'm sure you are aware of the "speaking plant approach" or perhaps the benefit for using imaging and CNN's for monitoring crops, as plants don't talk binary...
@GoodWoIf
@GoodWoIf 8 лет назад
Presumably images is just one type of information you could feed into one of these. You could feed in text, or patient vitals/symptoms, economic data, etc..
@IceMetalPunk
@IceMetalPunk 8 лет назад
+GoodWoIf Yep. Mike tends to focus on image analysis in his videos because he is, in fact, an image analyst, but CNNs can be used on pretty much any data set you can think of--as long as you have enough data to train them and a decent number of possible convolutions to apply.
@tryfonmichalopoulos5656
@tryfonmichalopoulos5656 3 года назад
4:00 - That is a somewhat misleading statement right there my friend! The reason why convolution is used is not due to the fact that it is needed to downsample the input space so that the computer will not melt; the reason is that if you dont use the kernel based method typically known as convolutions you lose ANY topological information that relates the pixels with each other. There is literally no chance that a multilayer perceptron type of neural network could achieve accuracy anywhere near a convolution neural network and the reason is as stated previously, that the topological relationship of the pixels is lost once its reshaped in a one dimenionsal vector with space equal the total amount of the pixels. In fact, latest architectures do not even attempt to downsample the information and pooling techiniques are considered to be obsolete, further proving how wrong the statement you made at 4:00 is.
@klam77
@klam77 8 лет назад
A beautiful description! Well done.
@tho207
@tho207 8 лет назад
wow that was delightfully well explained, I enjoyed the video so much. please ask him to talk about RNNs too!
@FisforFenton
@FisforFenton 3 года назад
So it turns out these are in fact a big deal.
@carlhopkinson
@carlhopkinson 4 года назад
Just a tricky form of curve-fitting that is primed to blow up at the most inopportune moments.
@kp-ce1uk
@kp-ce1uk 8 лет назад
Surprisingly simple.
@michaelhunt6313
@michaelhunt6313 8 лет назад
all of this knowledge with only three books in background
@sikor02
@sikor02 8 лет назад
Great video! I would love to see more A.I. related stuff.
@judgeomega
@judgeomega 8 лет назад
My intuitiion says that using an HSI image format would have much better results than rgb as shadows would be simplified.
@evelynfinegan4687
@evelynfinegan4687 8 лет назад
RGB was just the example used here, I don't think he literally meant they use RGB to produce different versions of the image.
@Wizarth
@Wizarth 8 лет назад
Heck, three of the convolutions might well convert from RGB to H, to S, to I, and/or some combination of them.
@dezmoanded
@dezmoanded 8 лет назад
right, the information is exactly the same so it might not matter to the Net
@RedNNet
@RedNNet 8 лет назад
The type of input matters because it helps when the input correlates strongly with reality. You could map every possible input to a pre-determined random representation, and it would probably have a lot more trouble, especially with generalization. It helps if similar inputs correspond to similar outputs.
@edskev7696
@edskev7696 8 лет назад
That is AMAZING!
@miningmanna5967
@miningmanna5967 8 лет назад
Dat pen spin at 0:30
@linkVIII
@linkVIII 8 лет назад
Slightly related, mostly not: I had to implement a simple k means clustering (assign random clusters to each data point, compute centroids, assign new clusters based on centroids) in r and run it on jpegs. did not run well (fast) lol
@karlkastor
@karlkastor 8 лет назад
+linkviii lol, I actually wrote a k means implementation just two hours ago (in Octave, because vectorization rocks) and compressed a png with it
@IceMetalPunk
@IceMetalPunk 8 лет назад
+linkviii As simplistic as K means clustering is, I freaking love it. Probably *because* it's so simple, yet can be quite powerful depending on the task.
@thenayancat8802
@thenayancat8802 8 лет назад
+linkviii That's a weird assignment! You should have used the C code from the stats package implementation ;)
@photon_phi902
@photon_phi902 3 года назад
Is it possible to gather all 64 of Kernel in one kernel which cell identity arrow in category theory ?
@tady159
@tady159 8 лет назад
I think Artificial Neural Networks really interesting. As mentioned in the video, I understand that to process image data, the original is usually downsampled. But I still don't get how ANNs can process audio data
@IceMetalPunk
@IceMetalPunk 8 лет назад
+tady159 The same way, only instead of using (R,G,B) as the dimensions to analyze, it uses (T, F, V)--that is, it uses the volume of each frequency interval at each sample time. With that data, you can do similar things as mentioned in this video.
@tady159
@tady159 8 лет назад
+IceMetalPunk I don't understand what (T F, V ) stands for, could you please enlighten me? Also I still don't get how to process arbitrarily long sound samples. How their length can the be normalized before being processed by a network?
@IceMetalPunk
@IceMetalPunk 8 лет назад
tady159 As I said, I was using (T,F,V) to refer to the dimensions Time, Frequency, and Volume. So think of a 3D plot, where the dimensions are (x,y,z), except here the dimensions aren't spacial, they're samples. Keep in mind that sounds stored on computers, when uncompressed, are stored as discrete samples, not continuous waveforms (sometimes you can build a waveform up by connecting the samples, but that's interpolated). So to normalize it, you just break the sound into frequency intervals using a Fourier transform (smaller intervals = higher resolution), then for each interval, you divide every sample's volume by the maximum volume over all samples for that interval. Voila!
@calebmcnevin
@calebmcnevin 8 лет назад
Lol, the only time CNN changed the game
@msolomonbush
@msolomonbush 6 лет назад
fake news
@mickmickymick6927
@mickmickymick6927 7 лет назад
0:41 'i would furrily recommend people watch that video'
@photon_phi902
@photon_phi902 3 года назад
What does he mean nonlinear? does he means quantum computer or some thing different?
@greenrocket23
@greenrocket23 8 лет назад
Huh, I wonder if this could be used with drones to survey things like tropical rain forests or ocean ecosystems completely in say, less then a year?
@dickhamilton3517
@dickhamilton3517 8 лет назад
you better be quick - another few years and there won't be those things to survey...
@greenrocket23
@greenrocket23 8 лет назад
Dick Hamilton Too true man! Let's hope we're wrong on this one prediction, right?
@dickhamilton3517
@dickhamilton3517 8 лет назад
+greenrocket23 with you, there.
@daisy3067
@daisy3067 8 лет назад
I have a problem ,I have about 30 digital 8 mini 90 min tapes back in the 90s I filmed houses being built I would film a few minutes then go to the next house and so on . My question is could one of these networks edit all the film together spread out over dozens of tape
@akshayborse5756
@akshayborse5756 6 лет назад
So, does that mean that if I use CNN for image classification, there is no need to use methods like feature extraction or use of filters like Gaussian filter or 2D Gabor Fillter or LBP/uniform LBP
@sohweekian
@sohweekian 7 лет назад
Show some DEMO please! Did your Machine Learning Code Work?
@aspie96
@aspie96 7 лет назад
Do kernels themselves have depth? Are they as deep as the image? I am confused, becuase the examples use a 2D kernel, but shouldn't the kernel be 3D as well?
@Alaplaya9
@Alaplaya9 8 лет назад
Ridiculously photogenic computer scientist
@MrBOB-hj8jq
@MrBOB-hj8jq 4 года назад
Sorry to the camera man. This was mean.
@WistrelChianti
@WistrelChianti 2 года назад
closer... but still feel like someone threw all the parts of an engine on the floor and told me I'd know how to put them together,... but at least this time some of connectors are colour coded.
@ZerofeverOfficial
@ZerofeverOfficial 5 лет назад
If you feed a CNN all of the maths that have ever been thought up and then feed it a problem in physics that we havent solved yet, will it give us new maths that solve it?
@habibabirached1
@habibabirached1 5 лет назад
no
@alcesmir
@alcesmir 8 лет назад
Neat, I just did my bachelor thesis on convolutional neural networks. We built and trained a sign language interpreter that worked pretty well. I can affirm that neural networks are equal parts wisdom and witchcraft.
@riccardoorlando2262
@riccardoorlando2262 8 лет назад
+Alcesmire Aha! Now that I know neural networks exist, I can start my Mathematics thesis with: "Let N be a neural network..."
@IceMetalPunk
@IceMetalPunk 8 лет назад
+Alcesmire That's actually pretty awesome! It could seriously improve the lives of deaf people, especially seeing as how we're moving more and more into the whole voice-controlled, NLP virtual assistant world.
@brcha
@brcha 8 лет назад
Frankly, I don't see the wisdom part. Sure, when you design a NN, you do have to scale it correctly to the problem and so on, but once you've got everything setup, the rest is magic.
@verynicehuman
@verynicehuman 8 лет назад
Is'nt Nueral networks just math? Ive studied the backpropagation algo,stacked neural networks,etc..etc and the thing that struck me is that its all just math that you learn in an engineering course especially stats and linear algebra to solve equations. Why did you say its "witchcraft"?
@brcha
@brcha 8 лет назад
Sreekar Nimbalkar Because NNs are large sets of linear or non-linear equations with dynamically generated coefficients that mean absolutely nothing to the designers of NNs, but somehow work. Hence witchcraft. I mean, I could basically simulate a NN on paper and it would still work, while I would still have no idea why it works.
@NardiPaffon
@NardiPaffon 6 месяцев назад
I guess the debate he mentions over whether neural networks will change everything, is settled now in 2024?
@ninjamaster224
@ninjamaster224 8 лет назад
"...check whether the photo is of a bird." "give me a research team and five years"
@baconology
@baconology 7 лет назад
YES!!!!!!!!! GRANT MONEY!!! PAY ME! This is an ACADEMIC!!!!!!!!!!! $$$$$$$$$$$$$$
@baconology
@baconology 7 лет назад
this is the best comment i've ever read. I feel you bro. Lived it!!!!!!!
@KnakuanaRka
@KnakuanaRka 5 лет назад
SaltyBrains I don’t get how that changes the meaning at all.
@KnakuanaRka
@KnakuanaRka 5 лет назад
Another xkcd fan?
@NareshTur
@NareshTur 4 года назад
@@KnakuanaRka That changes meaning because a computer did it. Now you can automate that process and probably hundreds such processes. Automated object detection can be used in a multitude of processes and industries. Google it and surprise yourself.
@ZimoNitrome
@ZimoNitrome 8 лет назад
3 deep 5 me learning
@winecheese2185
@winecheese2185 8 лет назад
i dont understand what you mean
@JakeDownsWuzHere
@JakeDownsWuzHere 8 лет назад
2 deep 4 them + inception
@ZimoNitrome
@ZimoNitrome 5 лет назад
not 3 deep 5 me anymore tho
@IceMetalPunk
@IceMetalPunk 8 лет назад
Not long ago, I read about a machine learning system that was able to classify planes, trees, and people in nearly live video, all without ever having any hard-coded feature sets. The math was way over my head (despite being a computer scientist, specialized areas can still stump me at times). Now I look back at it, and it was in fact a CNN being used! This was a few years ago now, but if they just started becoming popular in 2012, that makes sense. Thank you for the higher-level explanation that allows me to understand it after all this time XD
@rockrollinnolan8521
@rockrollinnolan8521 7 лет назад
Dang kernel convolutions. My least favorite thing to happen when I'm making popcorn.
@Wardropulous
@Wardropulous 8 лет назад
This guy made all that really easy to follow. I admire his ability to explain such complicated things. He's really good at identifying and skipping over the irrelevant stuff, and focusing on the core problem/solution.
@_tyrannus
@_tyrannus 8 лет назад
Extremely good explanation of things that, until this series on deep learning, were just black magic to me !
@compulsive_curiosity
@compulsive_curiosity 8 лет назад
+turarwanaa Lucky you, watched it twice and I still think he is a dark wizard.
@HexerPsy
@HexerPsy 8 лет назад
+turarwanaa +J Simmons But doesnt it mean that its all just an optimization program that does the magic? The training images get run through the process, it produces a value. The settings are tweaked slightly, the result are compared and one is better than the other. Rince and repeat. Working with some optimization programs myself, the trick is in how the algorithm is programmed to make large or small tweaks to settings... Its like finding the tallest mountain in the area while blind... Does crossing the valley lead to a taller mountain or should you just go up hill? It all seems CPU horse power dependent to me o.o
@thomasgandalf4111
@thomasgandalf4111 8 лет назад
+HexerPsy yes it's pretty much brute force, no magic. as with most things machine...
@Vulcapyro
@Vulcapyro 8 лет назад
Thomas Gandalf It isn't anywhere near brute force either. The "magic" is in why neural networks work so well at all compared to other methods. At a low level it looks fairly similar to other optimization methods, but the structure of the network and how it abstracts is very important. It makes much less sense than a naive perspective suggests.
@thomasgandalf4111
@thomasgandalf4111 8 лет назад
+Vulcapyro something that runs to iteratively tweak parameters of mathematical formulae until it finds the best possible solution, i.e. explores or exhausts the state space, is pretty much the definition of brute force.... granted CNN don't usually exhaust the state space but make informed decisions on which parameter set to try next
@MistThief
@MistThief 7 лет назад
I am a neural network watching videos about neural networks.
@musicjetstream2476
@musicjetstream2476 7 лет назад
we need to go deeper
@timconnors3386
@timconnors3386 6 лет назад
this is amazing
@dzlcrd9519
@dzlcrd9519 6 лет назад
Moeシt wow
@kaidatong1704
@kaidatong1704 6 лет назад
aren't we all?
@ianallen738
@ianallen738 4 года назад
I am a neural network inputting and outputting comments about neural networks watching videos about neural networks. The singularity is nigh.
@dries2965
@dries2965 8 лет назад
I could`t have explained it better, given the limitations of a youtube video. Well done computerphile!
@flits1
@flits1 4 месяца назад
update: they are indeed a big deal
@spaminbox
@spaminbox 8 лет назад
this is all rather convoluted.
@Wowthatsfail
@Wowthatsfail 8 лет назад
fidelio your profile pic wins the internet for the day!
@AkshayAradhya
@AkshayAradhya 7 лет назад
I guess if you applied the sharpen kernel it would make things more clearer
@baconology
@baconology 7 лет назад
This is hilarious.
@paulkossey7543
@paulkossey7543 4 года назад
Google where is A visible date of Video oooplooopad inthisRU-vid
@shariarpapaon5305
@shariarpapaon5305 Год назад
i love watching mike out of all the other ppl on this channel. this man just sounds right
@TrabberShir
@TrabberShir 8 лет назад
"I'd have to start by programming up linux" he says while sitting in front of a WPF book
@DarkmoonUK
@DarkmoonUK 8 лет назад
...because Computer Scientists are only allowed to reference one Operating System? I don't get it.
@amirabudubai2279
@amirabudubai2279 8 лет назад
Linux is better for long(multiday) computations because it is more stable and uses less resources in the background; it also happens to make the project more reproducible because versions of linux don't become dysfunctional with time like windows. Even if a CS has windows on their personal, there is no reason to think they wouldn't use linux on there workstation.
@baconology
@baconology 7 лет назад
agreed he is not capable of this but he doesn't care because he has WORK TO DO.
@WillNewton10
@WillNewton10 Год назад
So happy for Frodo Baggins and his new career as AI teacher
@igors1131
@igors1131 Год назад
7 years later Hey, ChatGPTwhat do you think of Computerphile channel ChatGPT As of my last knowledge update in September 2021, Computerphile is a popular RU-vid channel that focuses on computer science and technology-related topics. The channel is known for its educational content presented in an engaging and accessible manner. It covers a wide range of subjects within computer science, including programming languages, algorithms, networking, security, and more.
@lewisb8634
@lewisb8634 8 лет назад
This guy seems cool - I like the videos he presents! :)
@harleyspeedthrust4013
@harleyspeedthrust4013 4 года назад
I wrote a neural network framework in Java that allows you to build neural networks with arbitrary shapes and structures. You can chain layers together and as long as you implement forward and backward functions, your layer will work. I implemented a lot of layers (fully connected, convolutional, pooling, etc.) And yes it's Java but the process was a valuable learning experience, much better for me than learning keras or something without knowing how it works
@IgorRoztr
@IgorRoztr Год назад
The best way of learning is building stuff that you want to understand😉
@harleyspeedthrust4013
@harleyspeedthrust4013 Год назад
@@IgorRoztr absolutely agree. i wouldn't say i understand something unless i've built something like it
@ItsMine-fd3lq
@ItsMine-fd3lq 9 месяцев назад
Hey... I tried to implement a cnn from scratch using python... But it is not working properly... I found few issues but not the solution for it.. I searched abt it in every website but couldn't find proper solution.. can u clarify my doubts, pls?
@ItsMine-fd3lq
@ItsMine-fd3lq 9 месяцев назад
I jst want to clarify whether whatever ik is right or wrong and want to know the solutions for the issues
@ASLUHLUHC3
@ASLUHLUHC3 6 лет назад
Maybe the only human job left in the future will be labelling images
@badshabz1
@badshabz1 Год назад
This is by far the best video I've watched on CNNs and I've watched 4 others. It really describes the back propagation and image compression to a single dimension.
@jaffarbh
@jaffarbh 2 года назад
Interestingly, there is a platform out there called Accuval and they claim their house valuation is by far the best because they use ANN (fully connected rather than CNN).
@Gutagi
@Gutagi 3 года назад
For years down the line and look where we are now! What a time to be alive!
@shashankkothari8066
@shashankkothari8066 3 года назад
2 minutes paper with Dr. Gibberish
@chebkhaled1985
@chebkhaled1985 8 лет назад
Couldn't but to observe WPF C# book , nice to see another one specialised in these two things
@omegasrevenge
@omegasrevenge 8 лет назад
I would have loved to hear examples of where these are getting used and what kind of impact they have on our way of life!
@Kram1032
@Kram1032 8 лет назад
+abschussrampe Google is currently pushing them onto basically everything. I'm not sure that _all_ their services use them yet but increasingly they do. For the following they either have talked before about _planning_ to use this technology or they already use it directly in the services you may or may not love to use by them: Google Maps Google Car driving Google Now suggestions Google Search RU-vid Thumbnails RU-vid video suggestions Google Translate Google Photos DeepMind (the guys behind AlphaGo) Allo, their new messaging app probably lots more Other big guns who either talk about or already do use these: Apple Microsoft Facebook Amazon probably lots more Nowadays, if you are on the internet, chances are you are using a service that in one form or another relies on deep learning and convolutional networks. You can do an insane number of semi-cognitive tasks with them. They _do_ have their limits in their current form but development goes rapidly.
@IceMetalPunk
@IceMetalPunk 8 лет назад
+abschussrampe They're used in quite a few places. One example I saw used a CNN to classify the activities occurring in a video--for example, to learn how to tell whether a video is of someone hiking, mountain biking, swimming, or canoeing. It could be expanded with more data sets to classify other types of activity, which in turn would allow our future AIs to understand what's happening around them instead of having to be told what's going on before knowing how to react.
@Vulcapyro
@Vulcapyro 8 лет назад
+abschussrampe Self-driving cars (Autos, you might say) are essentially all implemented as systems of CNNs at this point, if you want a particular example that will likely significantly change our way of life.
@mrkwse4415
@mrkwse4415 8 лет назад
If you're outside the EU/Canada, Facebook uses CNN for facial recognition to tag photos
@thallium200
@thallium200 8 лет назад
Rest assured the government is using them to identify not only who you are but what you're doing. We're going from facial recognition to activity recognition.
@syawkcab
@syawkcab 8 лет назад
What's the first sentence he says? "This is kind of a full opt vice's videos on deep learning?" I replayed it like 30 times and I can't figure out what he's saying
@Computerphile
@Computerphile 8 лет назад
+syawkcab "This is kind-of a follow up to Brais' video on deep learning"
@syawkcab
@syawkcab 8 лет назад
OHHHH!
@jerrodmilton5776
@jerrodmilton5776 8 лет назад
Is this the process that Google used for its image recognition software that can be run backwards to "dream up" images of the things it can recognize. So if the program can recognize an image of a cat you can reverse it and have it generate a picture of what it thinks a cat looks like.
@karlkastor
@karlkastor 8 лет назад
+Jerrod Milton You are correct! How they basically do this is instead of adjusting the weights of the network to get the correct output value, they change the pixels of the input image, so that the CNN predicts e.g. a cat with higher certainty.
@black_platypus
@black_platypus 8 лет назад
+Karl Kastor Wait, this exists? Are there front-ends to those applications available?
@karlkastor
@karlkastor 8 лет назад
Benjamin Philipp Google Deep Dream. People have done several implementations for this since the original paper.
@karlkastor
@karlkastor 8 лет назад
Benjamin Philipp google Deep Dream. People have done several implementations for this since the original paper
@black_platypus
@black_platypus 8 лет назад
Karl Kastor Tanks - I've since found Deep Dream, but thanks for coming back for me :)
@VicFreg19
@VicFreg19 8 лет назад
How would you handle different sized images in the training data set? According to what I understood, the number of neurons (and weigths) depends on the number of pixels.
@johnthegod
@johnthegod 8 лет назад
This sounds very interesting, it gave me a nice flashback to my AI studies a few years ago. How would this method hold up against noisy images or partially occluded images once the network is trained? For example if you trained a CNN to recognise your face from n images, could you wear a phantom of the opera mask and still expect it to recognise you?
@cmdody
@cmdody 7 лет назад
We want video about PNN(Probabilistic Neural Network)
@Carofdoom1126
@Carofdoom1126 3 года назад
'"someone" came around and applied this to imagenet and got great results' Well those someone's won the 2018 Turing award for that work LOL (LeCun for that work in particular. Bengio and Hinton for similar work)
@seasong7655
@seasong7655 2 года назад
Watching this again really helped me improve my network. Thanks
@Lougehrig10
@Lougehrig10 7 лет назад
So really, Machine learning is creating an automated task to find enough differences that are unique to a specific thing so that you can then assume an outcome with enough confidence
@Wowthatsfail
@Wowthatsfail 8 лет назад
This is the only time CNN is useful.
@volka2199
@volka2199 7 лет назад
Wowthatsfail Truth
@pebre79
@pebre79 3 года назад
I wouldnt have understood the gentleman's explanation without first watching 3BLUE1BROWN'S nn series first. Go watch that first if youre interested
@BeCurieUs
@BeCurieUs 8 лет назад
Wow, "convolution process" just sounds a lot like abstraction that brains do....I think these guys are really onto something here...I dig it
@chris_1337
@chris_1337 8 лет назад
+Christopher Willis really interesting way of looking at it.. what an exciting time to be alive!
@chris_1337
@chris_1337 8 лет назад
+bibbly bobbly Thanks! Very interesting. I once read that the patterned hallucinations of LSD are probably caused by the acid disrupting the signal in our retina. Pretty interesting stuff.. Is that plausible in your opinion?
@thomasgandalf4111
@thomasgandalf4111 8 лет назад
+bibbly bobbly thanks well said
@bug2k4
@bug2k4 8 лет назад
As far as I know, the kernels that CNNs learn on the first hidden layer (without prior knowledge) are also very similar to patterns our visual cortex reacts to. So it's maybe even closer than you thought it is. I definitely find this quite fascinating :)
@RedNNet
@RedNNet 8 лет назад
The part of the brain responsible for most mammalian intelligence (the neocortex) is considered hierarchical by many, sort of like neural networks. But it's not like there's a single layer of neurons in each level. Each level has millions of microcolumns of neurons (around 100 neurons per column), 3 to 9 or more layers (depending on how you count and the location in the hierarchy) with distinct properties, and multiple types of connections (inhibitory, excitatory, modulatory, different durations of effect, etc.) There are hundreds or thousands of underlying common characteristics in the neocortex alone, whereas neural networks have maybe one or two dozen underlying characteristics. I suspect some of that is just plumbing to deal with things like metabolism, but neural networks (except hierarchical temporal memory, which doesn't do anything the brain definitely doesn't do) are pretty unlikely to lead to brain-like intelligence. They're really useful, but the way they are designed is like trying to reinvent the computer by mimicking transistors and no other characteristics of computers.
@luffyorama
@luffyorama 8 лет назад
My friend had some project with ANN. And I have some project with image analysis. I never knew both can be linked with this technique! This might help me in my research! Thanks Sean, thanks Dr Mike!
@andreylebedenko1260
@andreylebedenko1260 4 года назад
Looks like the next step will be combination of CNN with temporal one. Like this: keep on moving CN kernel over the image using this path (as learned before) and keep on feeding TN with data, until we have 99% detection certainty.
@JamieTwells
@JamieTwells 8 лет назад
Can we have the links in the description please? Regards everyone on a mobile.
@JamieTwells
@JamieTwells 8 лет назад
+Jamie Twells Ah, they're there now - Thank you!
@JamieTwells
@JamieTwells 8 лет назад
*****​ to be honest I wish RU-vid would get rid of annotations, the i and everything else that pops up over the videos. I just want to watch the video, annotations are too often abused and spammed to be useful. 
@davidm.johnston8994
@davidm.johnston8994 6 лет назад
Thank you so much! Aside for entertaining me for years now, this video has actually helped me in my personal little research in programing an AI in a simple game using Tensorflow. (Is it overkill ? Sure. Is it fun to do and learn? Heck yeah!)
@arminneashrafi2846
@arminneashrafi2846 3 года назад
Idk how your game went, but you're the man.
@АндрейДжевага-й3с
@АндрейДжевага-й3с 4 года назад
In the case of convolutional neural networks the weight changes the opacity between the different "channels" or "features", calculated by different kernels, am I understand right?
@aurinator
@aurinator 3 года назад
Absolutely fascinating! Great stuff, thanks for making it & sharing. I'm suspecting convolutional neural networks (CNNs) are possibly the solution for any potentially subjective classification, like the images in your video, but now wondering if a CNN is eventually equal to a Support Vector Machine for OBJECTIVE classifications, e.g. solutions to a mathematics problem.
@Rottensteam
@Rottensteam 8 лет назад
R.I.P. COD Battlefield Won
@qwertyuiopzxcfgh
@qwertyuiopzxcfgh 8 лет назад
How is that relevant?
@Rottensteam
@Rottensteam 8 лет назад
qwertyuiopzxcfgh This is the internet, so to bash COD is relevant.
@Rottensteam
@Rottensteam 8 лет назад
Jimmy De'Souza Try and watch the trailer for the new Call of Duty Infinity Warfare and see the likes and dislikes. Also watch the new Battlefield 1 trailer.
@sufficientlyoldskool
@sufficientlyoldskool 7 лет назад
Plant science, huh? It would be cool if you could just take a picture of a plant with your phone and it could tell you the genus and species, whether it's poisonous, etc...
@tabaks
@tabaks 6 лет назад
Ok, convoluted, blurred, simplified, self-evident, muddled. This clip was made by a cnn process.
@siotsoni9854
@siotsoni9854 8 лет назад
"... and there'll be a different representation of my face transformed in some way to be useful." BRILLIANT
@clays6359
@clays6359 5 лет назад
Please do a video on how CNN's are applied to Natural Language Processing (NLP). Usually RNN's are, but CNN's can also be used.
@gpt-jcommentbot4759
@gpt-jcommentbot4759 2 года назад
Transformers too
@photon_phi902
@photon_phi902 3 года назад
Is possible the Terminal object in category theory have unique arrow as network ?
@Brainbuster
@Brainbuster 7 лет назад
I wish he would take the marshmallows out of his mouth before he starts talking.
@jonysonic3595
@jonysonic3595 3 года назад
when you learning basic for a project and you find it's your supervisor in this video XD
@bzqp2
@bzqp2 2 года назад
Can we get visual transformers please? There is only one video on NLP transformers.
@geminix365
@geminix365 7 лет назад
So let"s see if I understood. Did the secret services knew they were falling in a trap?
@valentinidk6101
@valentinidk6101 7 лет назад
I f#cken love these vids
@Ousainoujaiteh
@Ousainoujaiteh 5 лет назад
Hi the software you are saying that you build did you open source it. If yes can you share it.Thank you
@overdrivegain
@overdrivegain 3 года назад
I don't know why, but Dr Mike explains things so nice and clear. Thanks!
@vegahimsa3057
@vegahimsa3057 3 года назад
Recorded in 2016, the year Alpha Go defeated 9 dan professionals.
@Hgkbukk
@Hgkbukk 6 лет назад
Isn't CNN basically just a network with fixed layer structure and limited connections? Ex: 2nd layer, 1st node is a combination of all top 10x10 pixels in top corner?
@kingdrogo6124
@kingdrogo6124 3 года назад
4 years later nueral networks are only getting bigger abd better now u can literally talk to s nueral networl like a real person eg gpt 2 and gpt3
@vertox4837
@vertox4837 4 года назад
Kinda weird to think that we are neural networks ourselves watching videos about neural networks
Далее
Inside a Neural Network - Computerphile
15:42
Просмотров 427 тыс.
But what is a convolution?
23:01
Просмотров 2,6 млн
Офицер, я всё объясню
01:00
Просмотров 3,9 млн
Has Generative AI Already Peaked? - Computerphile
12:48
The moment we stopped understanding AI [AlexNet]
17:38
Watching Neural Networks Learn
25:28
Просмотров 1,3 млн
ChatGPT Jailbreak - Computerphile
11:41
Просмотров 353 тыс.
Recurrent Neural Networks (RNNs), Clearly Explained!!!
16:37
How AI 'Understands' Images (CLIP) - Computerphile
18:05