How Does Optical Character Recognition (OCR) Work? 

Techquickie
4.3M subscribers
430K views

How do computers read text on a page, and how has the technology improved?
Freshbooks message: Head over to freshbooks.com/techquickie and don’t forget to enter Tech Quickie in the “How Did You Hear About Us” section when signing up for your free trial.
Techquickie Merch Store: www.designbyhumans.com/shop/L...
Techquickie Movie Poster: shop.crowdmade.com/collection...
Follow: / linustech
Join the community: linustechtips.com

Science

Published: 6 Apr 2017

Comments: 419
@TheOriginalFayari 7 years ago
That was the smoothest transition to a sponsor spot I've ever seen.
@thepalettewhispererasmr1227 3 years ago
I didn't even realize it was happening
@freedomofmotion 7 years ago
Irish travelers will be deeply hurt that OCR and even you don't accept that dag is a word. Has no one ever tried to sell you a dag? Or admired your dag?
@chantafreak 7 years ago
Ya like dags?
@ataksnajpera 7 years ago
Knackers do not even speak English ;)
@GewelReal 7 years ago
hey kid, you wanna buy some dags?
@EvadingFate 7 years ago
Oh, dogs. Sure, I like dags. I like caravans more.
@chantafreak 7 years ago
This is the post I was waiting for.
@jamesklein4399 7 years ago
FILE FORMATS AS FAST AS POSSIBLE! png vs jpg mp4 vs mkv mp3 vs ...?
@laser5317 7 years ago
James Klein MP3 vs WAV
@RobertHildebrandt 7 years ago
mp3 vs flac
@coffeen8128 7 years ago
James Klein PNG keeps the quality
@smarthd7749 7 years ago
MP4 and .mkv aren't codecs, they're container formats. And there isn't much difference between MKV and MP4; the main one is that MKV can hold a few more codecs.
@cldream 7 years ago
SmartFyrHD Matroska can also embed multiple subtitle formats (SRT, SSA/Advanced SSA)
@DisbelieverH2o 7 years ago
I gotta say, I really liked this one! Very informative but what really made it for me was the seamless sponsor spot. I'd love to see more in such a way!
@DustinRodriguez1_0 7 years ago
OCR was one of the first practical uses of neural networks back in the 70s or 80s. Maybe even earlier? When I took an AI class in college, we wrote a simple OCR neural net and it was pretty easy.
@ziyitan8996 7 years ago
I love how Luke explains stuff :D
@ShreyPandya150 7 years ago
When Luke said it wouldn't look as crisp and the video resolution went down I instantly checked if I was at 1080p
@soroushjm1011 4 years ago
Yeah me too
@jandresshade 7 years ago
OCR can use different techniques to recognize characters. One is to build a model from data on different characters and train the software to recognize them (artificial neural networks are an example of this).
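As a rough illustration of the "train the software on example characters" idea from the comments above, here is a minimal sketch of a multi-class perceptron, which is about the simplest relative of a neural network. Everything in it is an assumption made for illustration: the 3x5 glyph bitmaps are made up, the names `to_vector`, `predict` and `train` are hypothetical, and real OCR engines use far larger models and training sets.

```python
# Toy 3x5 bitmaps ('#' = ink). These glyphs are invented for this sketch.
GLYPHS = {
    "I": ["###", ".#.", ".#.", ".#.", "###"],
    "L": ["#..", "#..", "#..", "#..", "###"],
    "O": ["###", "#.#", "#.#", "#.#", "###"],
}


def to_vector(rows):
    """Flatten a bitmap into 0/1 pixels and append a constant bias input."""
    return [1 if ch == "#" else 0 for row in rows for ch in row] + [1]


def predict(weights, x):
    """Return the label whose weight vector gives x the highest score."""
    scores = {
        label: sum(w * xi for w, xi in zip(ws, x))
        for label, ws in weights.items()
    }
    return max(scores, key=scores.get)


def train(samples, epochs=1000):
    """Multi-class perceptron: adjust weights only after a wrong guess."""
    n = len(samples[0][1])
    weights = {label: [0] * n for label, _ in samples}
    for _ in range(epochs):
        mistakes = 0
        for label, x in samples:
            guess = predict(weights, x)
            if guess != label:
                mistakes += 1
                for i, xi in enumerate(x):
                    weights[label][i] += xi  # pull the right class closer
                    weights[guess][i] -= xi  # push the wrong class away
        if mistakes == 0:  # every training glyph is classified correctly
            break
    return weights


samples = [(label, to_vector(rows)) for label, rows in GLYPHS.items()]
weights = train(samples)

# An "L" with one pixel missing from its bottom-right corner.
noisy_l = to_vector(["#..", "#..", "#..", "#..", "##."])
print(predict(weights, noisy_l))  # L
```

The training loop stops as soon as one full pass over the glyphs produces no mistakes, which happens quickly here because there are only three clean examples.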
@sabaamin3179 2 years ago
Just what I was looking for. Good Job!
@OMNIA_RH 5 years ago
Thank you so much for explaining, sir.
@TheDyingFox 7 years ago
I was going to ask "How about Voice Recognition next?" but searched your channel, and I'll be damned, 1 year ago, you guys work fast! (Not sure how I've been missing it though, a lot of content much?). It's a shame neither is "How to create your own Voice Recognition and Optical Character Recognition as fast as possible"
@cestsibon2468 3 years ago
This is the first time I've watched a tech video and actually not had a headache after. Waiting for the interpretive Google dance hehe
@arnatsemtappra3822 6 years ago
Very useful and easy-to-understand knowledge for newcomers to this technology.
@Mr.FastZombie 7 years ago
There are also programs for character recognition on your screen. Project Naptha is a Chrome extension that lets you copy and paste words in an image. And ShareX has OCR that you can use for any program.
@HirooKoslov 7 years ago
My ScanSnap IX500 uses software to make scans readable. It works pretty well and the IX500 is blisteringly fast.
@quenjankosky7348 7 years ago
Well, with OCR, there is an exception to the lack of accuracy. When basic modern OCR was being developed, they made a series of fonts designed to be read as accurately as possible. These fonts were OCR-A and OCR-B. They are super accurate with OCR, and there is rarely any error with them.
@rry1994 7 years ago
I love u guys man
@bradad1111 7 years ago
Saw OCR and immediately thought it had something to do with the Exam Board.
@craigmalcom6294 7 years ago
bradad111 Lool same
@StickyBagel 5 years ago
So did YouTube, I was watching a revision playlist and here I am??
@fleksimir 4 years ago
Linus ad (Pulseway) on a Linus video. I love this ahahaha
@jankomirovic2866 3 years ago
same gahahahahah
@moenbase1 2 years ago
I work in electronics. We use OCR in our automated optical machine to read component markings on parts as small as micro BGAs that are about 400 microns wide. It's amazing to see how far you can push its limits. Sometimes, though, when there's enough flux on the components, the markings become impossible to read.
@Ahmed71616 2 years ago
What is the best scanner that does the same job as your devices?
@SnypeSin 7 years ago
That's good and all, but I would have thought you'd give us an idea of what kind of devices use OCR for consumer/business.
@Lorten369 7 years ago
YEES More history please. love knowledge.
@macpclinux1 7 years ago
Luke, are you finally using Linux? I saw that little Ubuntu font box :D good job mate!
@hillppari 7 years ago
Google Translate app with OCR is pretty nifty when you can translate foreign signs etc.
@dav2mai 7 years ago
Will it also recognize language? Because "dag" translates to "day" in Danish
@Meg_A_Byte 7 years ago
Is there anything in this world that recognizes Danish?
@22RH544 7 years ago
Nope, as a Dutch guy I can read it just fine, but when it is spoken.................I quit.
@TheDyingFox 7 years ago
Same result when translated to Swedish xD
@Mr.FastZombie 7 years ago
I would assume it sticks to one language, but some can probably change their language. Also perhaps some could determine the language based on what they have already recognized.
@crewskater06 7 years ago
It's from the movie Snatch
@HolarMusic 7 years ago
Is that an 8k green-screen video? Looks super clean
@jehdo144 7 years ago
great video!
@leivadaros 7 years ago
Haven't read a single comment regarding the video's topic.... only "First", "Notification Squad where you at" and comments trying to be witty..... Great video by the way, I love getting general introductory information on the subject of my studies (computer engineer). Keep at it TechQuickie :D
@vapexxx 7 years ago
Luke - I actually watched the ad because of your fresh moves!
@littletomatomonkeysmeeeeel8324
Highly recommend PaddleOCR! 80 languages supported! Good performance! Easy to use! It would be great if bloggers could do a comparative evaluation of the popular OCR tools.
@JRDev4All 7 years ago
You should do an as fast as possible on assistive technologies such as screen readers
@JOELwindows7 7 years ago
Wow, I saw this video right before my National exam days.
@sebon11 4 years ago
Cool! Thx a lot.
@narutosasuke30 5 years ago
Which OCR recognizes the handwritten text that you showed at the end? I couldn't find anything that actually does that within an acceptable error rate :/
@pearls9133 7 years ago
Could you do videos explaining how mastering audio and video works? (if it doesn't already exist)
@rediculousman 7 years ago
Convolutional and LSTM neural networks are the cutting edge for these applications
@howardt12345 7 years ago
Dennis: "You are dancing?"
@ulashofficial 4 years ago
Sir, can you tell me how I can find duplicate numbers with any OCR app, or how I should go about making an app for that?
@teksight9714 7 years ago
Good video. Thumbs up!
@jean-lucasymptotic5083 7 years ago
Speaking of machine learning..... that would make a good techquickie :D
@jamilangon5798 7 years ago
Well, Google released an OCRT (optical character recognition translator), which translates even characters outside ASCII (Chinese, Japanese, Thai and other non-alphabetic scripts)... It's useful for those who travel and find themselves trapped in a place where no one can speak or understand English.
@TheZorch 7 years ago
I've got a Chrome extension that does OCR within images. Sometimes comes in really handy.
@KX36 7 years ago
I did some OCR recently. Tesseract on Linux was the best at recognising the text accurately, but it outputs plain text only. There are 3rd party GUIs, but still none really preserve formatting. ABBYY FineReader on Windows (the gold standard for home use) was quite good at preserving formatting but worse at recognising text accurately. My scan was 200 pages of black 12pt Times New Roman on white paper scanned at 300dpi, which should be one of the easiest things to process, and it regularly made mistakes on 1 vs l vs I, y vs v, H vs II etc. And these were often in places where the dictionary should have easily known what it should have been. How often do you get a lower case L in the middle of a long number, or a double upper case I at the start of a word, or a v at the end of a word? It took 3 hours to go through the document correcting the mistakes it highlighted. Don't know how many mistakes are in there that it didn't highlight.
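The kind of dictionary check described above (a lower case L inside a number, a v where a word wants a y) can be sketched as a small post-processing pass over OCR output. This is only an illustration under stated assumptions: the `CONFUSABLE` table, the `variants` and `correct` helpers and the tiny word list are all hypothetical, it handles only single-character look-alikes (not H vs II), and it says nothing about how Tesseract or FineReader actually work internally.

```python
from itertools import product

# Characters that are easy to confuse in print (an assumed, partial table).
CONFUSABLE = {"1": "1lI", "l": "1lI", "I": "1lI", "v": "vy", "y": "vy"}


def variants(token):
    """Yield every spelling obtained by swapping look-alike characters.

    The number of variants grows quickly with the count of confusable
    characters, so this brute-force approach only suits short tokens.
    """
    options = [CONFUSABLE.get(ch, ch) for ch in token]
    return ("".join(combo) for combo in product(*options))


def correct(token, vocabulary):
    """Return a corrected token, or the original if nothing is certain."""
    if token in vocabulary:
        return token
    # A token made only of digits and digit look-alikes is treated as a number.
    looks_numeric = any(ch.isdigit() for ch in token) and all(
        ch.isdigit() or ch in "lI" for ch in token
    )
    if looks_numeric:
        return "".join("1" if ch in "lI" else ch for ch in token)
    # Otherwise accept a swap only when exactly one variant is a known word.
    matches = [v for v in variants(token) if v in vocabulary]
    return matches[0] if len(matches) == 1 else token


vocabulary = {"very", "heavy", "Illinois"}
for raw in ["verv", "1llinois", "20l7", "cat"]:
    print(raw, "->", correct(raw, vocabulary))
```

Refusing to change a token when several dictionary words match keeps the pass conservative, which matters because a wrong "correction" is harder to spot later than an obvious OCR slip.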
@rushabmehta 7 years ago
Can you do a video on virtualization, such as hardware, network and storage virtualization?
@MotivationAdonis 7 years ago
Linus tech tips as fast as possible
@Jinni_SD 7 years ago
I really like Tesseract with Homebrew on Mac for OCR.
@unguidedone 5 years ago
We need a Firefox plugin that will log which YouTube uploads have paid promotions, skip past them, and end the video when the promotion happens. This video is an example of native advertising.
@Golde2Good 7 years ago
You should explain core parking in the near future.
@94213915 5 years ago
Can you please tell me about any low-cost OCR software for Devanagari?
@Juiceman777 2 years ago
I couldn't help but think of the line from the movie Snatch when Brad Pitt said "ya like dags?" lol
@antonjohansson1384 7 years ago
Dag is Swedish for day
@MiMiOrt 3 years ago
I downloaded it, but I thought it would recognize the different fonts that are sometimes on just ONE page. Does anyone know an app/program that can recognize the font on a scanned document?
@thornejman6467 7 years ago
Thumbs up if anyone else checked the video quality at 0:36 xD
@Quack201 7 years ago
So I guess the real question here is why is Luke only wearing socks while recording this? Doesn't Linus give sandals to all the employees?
@rinoy_43 7 years ago
I've tried Tesseract. It's free and pretty accurate.
@DanRobards 7 years ago
Man, the ACR was great. Hardly any recoil
@NineToFiveGamer 7 years ago
I used to use an augmented translator app for my French tests. Shit just about worked half the time
@bassmickey 7 years ago
Funny, I used OCR last night. What a coincidence
@stayprofessional2453 7 years ago
Make an episode on network topologies
@terrybell898 7 years ago
Micky: Ya like dags? Tommy: Dags? Micky: Yea, dags Tommy: OH, dogs, sure I like dags
@donaldfilbert4832 7 years ago
OneNote has a pretty good built in OCR for small text articles - and it's free !! ABBYY FineReader does an excellent job converting image PDFs into searchable text based PDFs !!
@Mihnea729 7 years ago
Interesting !
@araddadi2 5 years ago
Watching this 10 minutes before class because I have a home and I’m a highly functional college student
@pikotechsolutions 2 years ago
awesome
@johneygd 7 years ago
But can OCR ever distinguish handwritten numbers and letters from each other? Such as 0's & o's, G's & 6's, 1's & i's, H's & 4's, j's & i's, 7's & 1's, 0's & 8's etc., because numbers and letters look similar to each other.
@todddembsky8321 7 years ago
Luke, you have to tell me when you go on tour -- I need to leave the country at that point....
@joerider5063 7 years ago
Do speech recognition as fast as possible please.
@zcuipylo 7 years ago
TPS reports!!!!!! What a perfect example. Almost an easter egg.
@Exploreyourlife88 3 years ago
Thanks
@bas116677 7 years ago
Dag actually means Hey or day in Dutch!
@kdm_6799 7 years ago
Bas Roelofs dag means bye too
@182ndNegociator 7 years ago
What if it's supposed to say dag? That's also a completely legitimate word used in Australian English, plus it could also be used to describe a Directed Acyclic Graph (a tree is one example of those).
@isabellaereshki 7 years ago
I liked your dancing, ignore Dennis. Great video.
@_Disi 7 years ago
What about if you're trying to copy the line "D'ya like dags?" from Snatch?
@angelstrife 7 years ago
Hi! Could you do an FPS 1% low explanation? I have seen so many tech reviewers use this term but I have no idea what it means.
@sniperunrepeat752 7 years ago
Long Nguyen Games tend to have "stutters" (e.g. briefly running out of VRAM on, say, a 1060 3GB) which can temporarily bring the minimum FPS incredibly low. So 1% lows are used. All they mean is the minimum FPS that doesn't factor in the bottom 1% of frames, to give a more realistic minimum
@Bayonet1809 7 years ago
Could also be called the 99th percentile.
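The definition given in the replies above (the minimum FPS once the slowest 1% of frames is ignored) can be sketched in a few lines. This follows that one reading only; some reviewers instead report the average of the slowest 1% of frames, so numbers from different sources may not be comparable. The function name `one_percent_low` and the sample frame times are made up for illustration.

```python
def one_percent_low(frame_times_ms):
    """Lowest per-frame FPS after discarding the slowest 1% of frames."""
    # Convert each frame time to an instantaneous FPS value, slowest first.
    fps = sorted(1000.0 / t for t in frame_times_ms)
    # Number of slowest frames to ignore (zero for fewer than 100 frames).
    cut = len(fps) // 100
    return fps[cut]


# 99 smooth frames at 10 ms each plus a single 100 ms stutter.
frame_times = [10.0] * 99 + [100.0]
print(min(1000.0 / t for t in frame_times))  # 10.0 (plain minimum)
print(one_percent_low(frame_times))          # 100.0 (stutter ignored)
```

A single stutter drags the plain minimum down to 10 FPS, while the 1% low stays at 100 FPS, which is why reviewers tend to prefer it as a description of how a game feels.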
@SuperManitu1 7 years ago
Tesseract is the best OCR program out there. It is Open Source and runs on all major OSes
@94213915 5 years ago
How can I run it on Windows?
@Seag-Gaming 7 years ago
Who else had nostalgia @ 0:36?
@blingerang 6 years ago
3:33 dag is actually morning in Dutch
@DeppImAll 7 years ago
I mean tbh ... when I write some text in OneNote and Microsoft can figure out what I just wrote and convert it into real characters, I'm always astonished since my handwriting is horrible.
@nitini.764 6 years ago
I liked this "don't worry, be happy" in your video. Are you a Meher Baba lover too!!!!
@aislius9200 7 years ago
Printing costs like 150 dollars for new ink if you go to retail, if you manage to go online it costs like 10-20 bucks. What the actual fuck?!!?
@mickeyhage 7 years ago
OCRs don't work. I've tried them but they don't work properly. They don't read encrypted documents; they spit out random incorrect letters.
@saisagarmrcool4610 2 years ago
It was the simplest way to understand
@metashrew 7 years ago
If the software were Dutch, the word would be "dag" (which means day in English), and not "dog".
@BenPotts 7 years ago
Nice dancing, Luke
@sahotaquack1 7 years ago
Oxford Cambridge RSA
@UNPhantom93 7 years ago
Would be much better if it was foldable, or detachable at least, to use it as a tablet
@ThePiGuy24 7 years ago
I WANT INTERPRETIVE DANCE TRANSLATOR NOW!!!
@MrTuffarts 7 years ago
Dag is a word. OCR software would not pick this up; spellcheck does not pick this up either
@Shirojm 7 years ago
So use a normal "photographic" scanner, then use OCR services such as Google Drive.
@1OldWriter 7 years ago
Techquickie, you do know most scanning software does this as part of its operation. If yours doesn't, perhaps you should get a new one.
@Ghjklt544 7 years ago
I want to see the Google interpretive dance translator
@marcusleung8985 7 years ago
What about Fourier transform?
@GroovingPict 7 years ago
do you like dags?
@supervegito2277 7 years ago
3:38 soft g, it's day in Danish actually.
@22RH544 7 years ago
Also in Dutch, Swedish & Norwegian
@TheMasonX23 7 years ago
OCR is not for "pikies" apparently...
@johnmarston1155 6 years ago
Brad Pitt will be furious
@MrEsChannelYT 7 years ago
d'ya like dags?
@levingthedream 7 years ago
Is there any awesome free software that does this? Linux or PC. Besides Google Drive, that is
@svsrkpraveen 6 years ago
When did Dan Reynolds start doing tech stuff?
@cal920c 7 years ago
I thought Luke was missing for a while... now we know where he's been...
@thepalettewhispererasmr1227 3 years ago
Arizona's audit brought me here 🇺🇸