Тёмный

Coding OCR with machine learning from scratch in Python - no libraries or imports! (From Scratch #2) 

clumsy computer
Подписаться 1,8 тыс.
Просмотров 84 тыс.
50% 1

In this video, we implement OCR/image recognition using simple machine learning in Python with no imports! This was streamed live on / clumsycomputer on 2020/07/06.
Our program uses the k-nearest neighbors algorithm to classify images from the mnist and fashion-mnist datasets, meaning it takes digits (1, 2, 3, …) and garments (shoe, t-shirt, …) and guesses what they are (classifies them). It can get up to 93% accuracy on mnist. I wanted this to be a simple introduction to machine learning for those not very familiar with it.
The full code including more details is available at git.sr.ht/~vladh/clumsycomput...
Because this is a recording of a live stream, you'll hear me sometimes talking to the chat.
If you'd like to see more, please follow the clumsy computer Twitch channel at / clumsycomputer .
If you have any questions, write a comment or contact me on / clumsycomputer .
What would you like to see me code from scratch next? Let me know in the comments! :)
Wishing you a swell day,
Vlad

Наука

Опубликовано:

 

6 июл 2020

Поделиться:

Ссылка:

Скачать:

Готовим ссылку...

Добавить в:

Мой плейлист
Посмотреть позже
Комментарии : 59   
@flashbao1922
@flashbao1922 3 года назад
He explains everything from scratch. This is the scratch I actually needed!
@Muphet
@Muphet 3 года назад
most underrated software programmer and evangelist on youtube. i find this explanation/tutorial better than any other youtube video.
@ArabianKnight63
@ArabianKnight63 11 дней назад
followed, that the from scratch thing everyone was looking for.
@haniajarlaczynska4518
@haniajarlaczynska4518 4 года назад
I've just started learning machine learning and I really like this video - you explain everything very clearly. It's nice to see how to do this from scratch instead of using some huge libraries.
@clumsycomputer
@clumsycomputer 4 года назад
Thank you for the feedback! It makes me really happy to hear that.
@ANSELAbitsxb
@ANSELAbitsxb Год назад
@@clumsycomputer Dude you make really good videos. If you get time again in the future please make more.
@zodiaminc1474
@zodiaminc1474 3 года назад
Please don't get dis-heartened or demotivated, this video is a total gem (sorry too late for encouragement after 10 months) but this really helped, thanks bruv, keep up the good work !!!!!
@youssef-attai
@youssef-attai 3 года назад
I swear to god you are a legend, no one could have explained it better. Thank you so much.
@clumsycomputer
@clumsycomputer 3 года назад
That means a lot, thank you! ❤
@frindle18
@frindle18 Месяц назад
Brilliant tutorial, everything is so clearly explained, and it actually is from scratch.
@adhamsalama4336
@adhamsalama4336 2 года назад
This is amazing! Thanks for this awesome video! Subscribed!
@myquoraismine2953
@myquoraismine2953 3 года назад
I was looking around for this stuff, it's amazing, You nailed it clumsy computer, can you please implement hyper parameters in it ?
@ruoyuguo3945
@ruoyuguo3945 Год назад
Thanks for sharing, PaddleOCR seems friendly for training the OCR models. Could you please shoot a video about how to use PaddleOCR?
@benoitd94
@benoitd94 10 месяцев назад
Hi Clumsy VERY GOOD JOB ! Just one question, is it possible to decode Water Counter with that method ? do i need a special MNIST DATASET to download ? thx
@munteanionut3993
@munteanionut3993 2 года назад
Does this tutorial only build a digit ocr? Or does it also work for characters as well?
@nelhdez3126
@nelhdez3126 3 года назад
This was so refreshing, you made it way better than some full-of-pride professors at some universities. Thanks so much mate. Could you do at some point the most basic “non-lazy-training” algorithm from scratch? Whichever you deem is fine
@prakharpathak2358
@prakharpathak2358 2 года назад
How to preprocess a image of PNG file format downloaded from the web to recognise the digit and what changes to do in read images function in the above code to test that image
@SourabhChavan-1997
@SourabhChavan-1997 Год назад
Hey, Thanks for explaining everything, i like this video. which IDE are you using ?
@clumsycomputer
@clumsycomputer Год назад
Hi! I don't use an IDE, I only ever use vim.
@mechpunk
@mechpunk Год назад
@@clumsycomputer your content is really amazing , If you have any spear time consider making more videos.
@jaycobclavel7098
@jaycobclavel7098 7 месяцев назад
How do you make your own custom datasets.? Provide your own dataset froms cratch. Just want to detect numbers. sometimes keras ocr detects 3 as 5.
@SarthakRauts
@SarthakRauts 3 года назад
Amazing video
@snenscoleman7095
@snenscoleman7095 3 года назад
What features does "extract_features" extract?
@abdullahshoukat7848
@abdullahshoukat7848 2 года назад
603. you deserve more, I hope you'll get soon
@ElectroProgrammingFreakJuanGCJ
Very useful.
@PrakashKing8342
@PrakashKing8342 2 года назад
Do you have solutions for alpha numeric recognition I have tried all the method finally I’m getting fail tesseract easy ocr etc for example ABC00OO1 and SI1234H
@munteanionut3993
@munteanionut3993 2 года назад
Did you manage to find a solution for alpha numeric characters too?
@arvindshelke8889
@arvindshelke8889 2 года назад
It really helped👌👌
@johnmele720
@johnmele720 Год назад
Hey, the website you've got those files from now requires a password and username to get in. Is there another place we can get the files from?
@clumsycomputer
@clumsycomputer Год назад
Hi! I can download the files fine, could you tell me which website you're referring to?
@naufalafels
@naufalafels 10 месяцев назад
@@clumsycomputer this one, I also tried to get into the website but it needs password and username.
@konalaravichandanreddy9886
@konalaravichandanreddy9886 2 года назад
How to run this on jupyter notebook? Can someone please tell?
@iamHamza
@iamHamza 3 года назад
Can u please tell us which editor did u used?
@simondeis7432
@simondeis7432 3 года назад
It's vim
@simondeis7432
@simondeis7432 3 года назад
Or more precise nvim
@AJ93-lx6on
@AJ93-lx6on 3 месяца назад
Can this work on seven segment font though
@Tech_Inside.
@Tech_Inside. 3 месяца назад
I want to ask how to test it on your img file?
@trungthanhbp
@trungthanhbp 2 года назад
where are you now, we need your video :D
@hemantchauhan6437
@hemantchauhan6437 3 месяца назад
NEED HELP! I am making a website where user can upload a pdf but I want that pdf to upload only if that pdf has images of only HANDWRITTEN text. Thank you for reading.
@adwaithks
@adwaithks 3 года назад
@JorgeGonzalez-jo3zv
@JorgeGonzalez-jo3zv 4 года назад
Instructional video about machine learning, it would be good if you could make a video on how to configure vi with python with the basic instructions. Best regards.
@clumsycomputer
@clumsycomputer 4 года назад
It's planned! :)
@ilyaosver4987
@ilyaosver4987 3 года назад
Nice
@jonas1351
@jonas1351 Год назад
why did you stop dude, your explanation is so well
@clumsycomputer
@clumsycomputer Год назад
Thank you! New videos are coming, I just have a lot of other stuff to do. ❤
@beserious2773
@beserious2773 Год назад
I wanted to create an ocr app for Android for myself only. But I need to implement maths also equations. Like Mathpix software did. I don't have money. To pay. I want to learn from very much basic. Like I am noob don't know coding also. But I wanted to learn. And wanted to create my own style data to get extracted from image.
@kumaraguru3193
@kumaraguru3193 2 года назад
can i use my webcam to recognize any face
@xhyhbdka
@xhyhbdka 3 года назад
Good video but christ, presentation wise and viewer experience, an absolute car crash!! Max res 720P, you should have zoomed a bit more!!! That background chance from black to white just fucked my eyes up at 14:25
@Champe19
@Champe19 7 месяцев назад
Wtf are the || swiggle characters
@truongzgiang
@truongzgiang 8 месяцев назад
Too many videos trianing copy from each other but no video talk about how to use it after trianed ,
@MadhuKumar-hi2qs
@MadhuKumar-hi2qs 3 года назад
507th subscriber
@fahadsajeem2897
@fahadsajeem2897 3 года назад
next time please code in light mode. In dark mode its hard to read. Your videos are really useful and helpful. loved it 🙂
@shivu.sonwane4429
@shivu.sonwane4429 2 года назад
777
@siyabonganxumalo4574
@siyabonganxumalo4574 3 года назад
I like the video but I was looking for something that will teach me from scratch because I don't have a experience in programming so I feel like you skipped a lot of steps because some people are already familiar with programming and I downloaded Python onto my laptop, so your coding was confusing me because I have gotten used to using Python that I downloaded.
@clumsycomputer
@clumsycomputer 3 года назад
Thank you for watching! If you just started learning Python, I don't think this is a good introduction, and I definitely don't think you should start with machine learning. In the future I plan to make more basic tutorials too, but until then, I recommend you start with some more beginner-level Python resources! :)
@siyabonganxumalo4574
@siyabonganxumalo4574 3 года назад
Thanks man, noted.
@sammy9453
@sammy9453 3 года назад
I think you should remove ocr tag. I am dissappointed.
Далее
100+ Linux Things you Need to Know
12:23
Просмотров 56 тыс.
Optical Character Recognition (OCR) - Computerphile
14:16
How Does Optical Character Recognition (OCR) Work?
5:48
Let's build GPT: from scratch, in code, spelled out.
1:56:20
15 Python Libraries You Should Know About
14:54
Просмотров 370 тыс.
Optical Character Recognition (OCR)
6:16
Просмотров 65 тыс.
5 Useful Dunder Methods In Python
16:10
Просмотров 54 тыс.
How I’d learn ML in 2024 (if I could start over)
7:05
Linear Regression From Scratch in Python (Mathematical)
24:38
All Rust string types explained
22:13
Просмотров 152 тыс.
Игровой Комп с Авито за 4500р
1:00
Все Смартфоны vivo Серии V30!
24:54
Просмотров 20 тыс.