Тёмный

Text Detection with OpenCV in Python | OCR using Tesseract (2020) 

Murtaza's Workshop - Robotics and AI
Подписаться 405 тыс.
Просмотров 419 тыс.
50% 1

In this video, we are going to learn how to detect text in images. We will learn how to detect individual characters and words and how to place bounding boxes around them. We will also look at how to detect only digits.
🚀🚀 My Urdu/Hindi AI RU-vid Channel 🚀🚀
/ @murtazahassan01
Premium Courses:
✔️ Computer Vision Game Development Course:
bit.ly/3ttLZ2s
✔️ Computer Vision with Arduino Course:
bit.ly/3wzLB4m
✔️ Advanced Drone Programming Course:
bit.ly/3qs3v5g
✔️ Learn to Build Computer Vision Mobile Apps:
bit.ly/3uioY1J
✔️ Jetson Nano Premium Course:
bit.ly/3L8uIlF
⚙️⚙️⚙️--My Gear - ⚙️⚙️⚙️
👉 Complete Gear 💈: www.computervision.zone/tech-...
👉 My PC Specs 🖥️: www.computervision.zone/tech-...
👉 My Video Shooting Gear📽️ : www.computervision.zone/tech-...
👉 My Laptops 💻: www.computervision.zone/tech-...
👉 Educational Products🧑‍🎓: www.computervision.zone/tech-...
👉 RU-vid Starter Kit 🔴: www.computervision.zone/tech-...
Follow Me:
TikTok: bit.ly/3Vo76OQ
Facebook Group: bit.ly/3irDcb7
Discord: bit.ly/3JvyxAM
Facebook Page: bit.ly/3IvpU7W
Instagram : bit.ly/3NdGME3
Website: bit.ly/3ICFTS0
Github: bit.ly/3woU6PS
#ComputerVision
#OpenCV
#CVZone

Опубликовано:

 

30 июл 2024

Поделиться:

Ссылка:

Скачать:

Готовим ссылку...

Добавить в:

Мой плейлист
Посмотреть позже
Комментарии : 359   
@nithinprincejohn7762
@nithinprincejohn7762 4 года назад
Your tutorials are awesome. I already shared your channel with my students. I visited so many channels for learning opencv and out of that your videos stands out.
@murtazasworkshop
@murtazasworkshop 4 года назад
I am glad i could be of help. and thanx for sharing
@WLeigh-pt6qs
@WLeigh-pt6qs 2 года назад
Hi Murtaza, I indeed learnt something new! It's amazing! I really appreciate your time and effort and sharing :) !
@edSabio572
@edSabio572 3 года назад
Thanks Doctor Murtaza! This lesson is a class act!
@Marek-Marek
@Marek-Marek 4 года назад
Great job! I was learn something new, thanks a lot.
@johntamplin
@johntamplin 4 года назад
Good video. From 20:20 onwards you are using the variable 'x' for both the loop count and the x-dimension of the bounding box. You got lucky that it didn't impact the results of the code.
@surajprakash3558
@surajprakash3558 4 года назад
This is an amazing video and effort. Thanks a lot !
@akhilpandey
@akhilpandey 3 года назад
Thank you, just what I needed.
@ishtiaqahmad4118
@ishtiaqahmad4118 3 года назад
By chance I entered last 5 years before in programming languages. You will note my background is Accounting. For now it is sufficient that I am 75 years old man but I am learning Django, Ai, Machine Learning and etc., Last five years I asked my instructor of Java (Note= At that time, I did not know A,B, C of any programming but I know a sufficient knowledge about Programming - I am a lot thankful of Respectable Great Zia Khan) that I can check cross medical bill of the any person, but he could not replay. I am very happy of this watching Video, I can solve this last 5 years dream can be solved by python with AI Insha-Allah. Now I am trying to launch my E - Commerce Website very soon with Python, I shall contact very soon with you for a big project Insha-Allah. Ishtiaq Ahmed Siddiqui KARACHI
@davidgarza7355
@davidgarza7355 2 года назад
This was sick!! Thanks, It was so easy to follow :)
@YazeedAlkosai
@YazeedAlkosai 4 года назад
thanks so much, Dear Murtaza...could you do a tutorial for detecting the shape feature and make the dimensions of the shape of the workpiece appear once the camera start to detect.
@aham3687
@aham3687 3 года назад
Awesome video. I was looking for technology that could read image captcha and extract text in the backend. Then submit text to the server without users requiring to type the captcha and your video solved my issue.
@choogiesaur
@choogiesaur 3 года назад
amazing, would you share your code as an example?!
@quasa0
@quasa0 2 года назад
@@choogiesaur did you figure it out?
@shreyanshdwivedi7871
@shreyanshdwivedi7871 3 года назад
Hello Murtaza, It was good explanatory video which was easy to digest and understand, loved your work, I want to know that if you can make a video on creating the language model of sanskrit for tesseract from the certain bunch of images consisting of sanskrit text , that would be appreciated. Hope to hear from you soon.
@ADNANAHMED-eo5xx
@ADNANAHMED-eo5xx 3 года назад
Amazing TUtorial man, thanks a lot
@fakhruddintahery1561
@fakhruddintahery1561 3 года назад
Great explanation. Thank you so much. That was useful
@sohailali5741
@sohailali5741 4 года назад
Hi Murtaza, Your videos are awesome and very detailed and easy to understand each and every part. Can you please make a tutorial on Semantic Segmentation with Keras and tensorflow?
@murtazasworkshop
@murtazasworkshop 4 года назад
i will look into it
@shailavijay1
@shailavijay1 3 года назад
Thanks for your video. Very well explained :)
@birdeye700
@birdeye700 3 года назад
Excellent work! You might want to create functions to show digits, show alphabets and etc. This will improve readability and reusability of code.
@animeshmaji9515
@animeshmaji9515 3 года назад
Keep making video like this and we keep learning
@1phong.djk.392
@1phong.djk.392 4 года назад
wonderful. I comment before watching. Keep creating more wonderful tutor videos
@murtazasworkshop
@murtazasworkshop 4 года назад
Many thanks
@sidhantipatil1930
@sidhantipatil1930 4 года назад
Great content! Keep going!😇
@sudhakarp1
@sudhakarp1 3 года назад
Excellent explanation, Thanks Murtaza
@gonzalohernan4169
@gonzalohernan4169 3 года назад
Thanks so much for this!
@srvanikar
@srvanikar Год назад
wow....mighty helpful.. Thank you Murtaza
@imadsaddik
@imadsaddik Год назад
Thanks, this helped me.
@youcefahmanache3298
@youcefahmanache3298 3 года назад
Hi Murtaza, you see your loop to DISPLAY SOME SAMPLES IMAGES OF ALL THE CLASSES, if the folder names were categorical/strings instead of numerical, i.e. (sign1, sign2, sign3...etc instead of 0,1,2,3..etc) how would you iterate through these folders? would you still use iterrows? Thank you
@mandrivaal
@mandrivaal 3 года назад
You're a legend! thanks
@iaconst4.0
@iaconst4.0 9 месяцев назад
MUCHAS GRACIAS MURTAZA!! ME AYUDASTE MUCHO!, TE DEJE LIKE!
@makelabsindia3092
@makelabsindia3092 4 года назад
your video and explaining method amazing ,keep it up
@murtazasworkshop
@murtazasworkshop 4 года назад
Thanks a lot
@ashok_ign5623
@ashok_ign5623 4 года назад
Thank you so much your all tutorials are helpful keep It up 😍😍
@murtazasworkshop
@murtazasworkshop 4 года назад
Glad you like them!
@mabo2031
@mabo2031 3 года назад
hi Murtaza I have question: I want to convert to pdf(with ocr text) only few page from multipage tif image - is it possible without any other conversion ? multipage tif - 70 pages - but I want to convert to readable pdf pages from 10 to 20? best regards :)
@vcjayan8206
@vcjayan8206 2 года назад
Great help..Thank you
@harshitsrivastava9903
@harshitsrivastava9903 4 года назад
Great work..very helpful!!
@murtazasworkshop
@murtazasworkshop 4 года назад
Glad to hear that!
@jairajsahgal5062
@jairajsahgal5062 3 года назад
23:13 " I have no idea why they did this" made me chuckle.
@arthurswanson3285
@arthurswanson3285 4 года назад
Very useful. Thanks.
@Abcsciencemusic
@Abcsciencemusic 3 года назад
For windows , Instead of down manually downloading and installing just do : pip install tesseract
@shuterino9454
@shuterino9454 3 года назад
thx bro
@jurajzovinec
@jurajzovinec 3 года назад
tx man
@victorgiaff8625
@victorgiaff8625 3 года назад
Tnx
@jonascampos5564
@jonascampos5564 3 года назад
How can we know if that's official?
@d.e.a.dgaming654
@d.e.a.dgaming654 Год назад
Nah bro thats not official
@riztya7846
@riztya7846 2 года назад
that's so preety cool i maen tips with ctrl + left click, is so helpfull for me just learning python, thank you ! btw i'm already subscribe you 👍
@akashrastogi6431
@akashrastogi6431 3 года назад
great job and thank you so much
@fixitunes30
@fixitunes30 4 года назад
Awesome videos....Any plan for video on Deep Learning Chat bot in near future?
@ambijat
@ambijat 4 года назад
I want to use clipboard image for text read in Ubuntu Linux. Can you tell me how to do. Windows has other option of using pillow library, but that does not work in Linux.
@riyachougule6356
@riyachougule6356 3 года назад
That's really so useful ! Thanks a lot ! Can you please explain how we can extract the column of y point from "boxes"?
@nikethadinesh7123
@nikethadinesh7123 3 года назад
Thank you sir for this video. Cud u also explain with handwritten text recognition?
@Jake79
@Jake79 4 года назад
Very clear! TY
@murtazasworkshop
@murtazasworkshop 4 года назад
Glad it was helpful!
@ekizahmet
@ekizahmet 3 года назад
Great video and great tutorial, Thank you! I wish I could see before. Btw Detection only Digits is not working. Does anyone have any idea why?
@sanchimittal9614
@sanchimittal9614 4 года назад
Amazing tutorial! Great content!! Can you do a video or suggest something for noise removal like unrequired non-text portion and other kind of preprocessing before text detection in images? Thank You
@fortfanop
@fortfanop 3 года назад
I think denoising with autoencoders will do the job...
@Shraddhanath
@Shraddhanath 4 года назад
Thanks for this workshop. Could you also post a video on how to detect tables(ruled as well as not ruled) in document images or segregate tables, text, charts and figures. Tx.
@murtazasworkshop
@murtazasworkshop 4 года назад
Thanks. i will look into it
@innotech4079
@innotech4079 4 года назад
Hi, it's amazing but is there any train part? 1. detecting the text edges, 2. img to text training
@aaryangupta4815
@aaryangupta4815 4 года назад
Love ❤ this tutorial, can we do this same on live video also???
@SliderBlues
@SliderBlues 3 года назад
Thanks for the tutorial! It was very helpful for me getting a handle on OCR and Opencv for a redaction project I was working on. Now looking at fictionalizing text in an image. Ideally would like to replace words, like for an address, with different ones using the same font and size for the text. Not finding any solid examples of this that work. Would love to hear your take on how that might be accomplished.
@murtazasworkshop
@murtazasworkshop 3 года назад
You could do it with opencv python but it you want an end product you might want to look at unity.
@SliderBlues
@SliderBlues 3 года назад
@@murtazasworkshop - Hi. Worked on it a bit and watched more of your videos but still stuck. Posted on stackoverflow and so far no responses that might work. If you have the time would like to know your high level ideas on resolution. Thanks! stackoverflow.com/questions/64541909/how-to-fictionalize-text-data-like-an-address-within-a-document-image
@MrNadeem1984
@MrNadeem1984 3 года назад
It doesn't work on text in larger and smaller font size. Can you help me how I can use extract those text from image.
@liberocabec47
@liberocabec47 2 года назад
hello thanks you for your tutorial, i have a question, i try to make bounding box around sentence who have same b['top'] coordinate, i managed to make the separation to the word and the bounding box that correspond to it but i can't entend it to the sentence or to the same line of text, someone can help me ?
@dimitheodoro
@dimitheodoro 3 года назад
The Binaries for Windows ( 1:06) has not this selection to download, so i cannot find how to download the file for Windows. Can you help me pls?
@DrWho2008t101
@DrWho2008t101 3 года назад
Great Video!
@kevinalexander4959
@kevinalexander4959 3 года назад
used this with a webcam to take an image and parse text from (like a piece of paper or menu on a phone), then you can take the .png snapped by the webcam and analyze with image_to_string(), takes about half a second to read text presented and the webcam's settings has to be on point with brightness and all that stuff
@mehmettatli07
@mehmettatli07 3 года назад
so we cannot use this one as a real time word detection metod? How many frames did you get when you use with your webcam? i just want to use it for real time detection ,if you answer to me. Thx a lot
@hirdeshchhipa8238
@hirdeshchhipa8238 2 года назад
Pp
@WorkHardThinkBetter
@WorkHardThinkBetter 2 года назад
Hi Murtaza, Your videos are excellent. I need to ask where is 1.png since on your website I can't find it in the projects section. Thank you so much for everything!
@tecint
@tecint 3 года назад
Thank you!! Good Job !!!!
@murtazasworkshop
@murtazasworkshop 3 года назад
Glad it helped!
@shan2739
@shan2739 4 года назад
Thank you very much for this video. Can you also make some video on Deep learning self driving car using raspberry pi and jeston nano
@murtazasworkshop
@murtazasworkshop 4 года назад
I have already started the raspberry pi series . once the basics are covered we will move to more advance topics like selfdriving, ML, DL
@shan2739
@shan2739 4 года назад
@@murtazasworkshop yea..I already got them waiting for you.. Thank you
@Lindvni
@Lindvni 3 года назад
How do you guys fix the path problem? Everything looks fine ,until I run it. It just gives me an error
@smrutidabhole961
@smrutidabhole961 2 года назад
thank you so much sir...
@namtongdosa9626
@namtongdosa9626 4 года назад
Thank you so much, its very helpful for me. Could you please make a tutorial about train model to detect text in identity image ?
@murtazasworkshop
@murtazasworkshop 4 года назад
I have one tutorial on classifying digits. you might want to have a look at that.
@tehreemsyed8621
@tehreemsyed8621 4 года назад
Hi, how can we extract each character and digit from the license plate?
@MasterZeroFX
@MasterZeroFX 2 года назад
Nice vid pahjeet
@muthuji8053
@muthuji8053 4 года назад
videos really helpful for me ,but if we apply this dull image or text are present in bottom that won't detected
@colbrac720
@colbrac720 3 года назад
Thanks a lot !
@radoslavtotev4401
@radoslavtotev4401 2 года назад
Hey, thank you for the lesson but do you have it for linux because I want to use Raspberry pi 3 for similar project?
@nicoarasamuel1173
@nicoarasamuel1173 2 года назад
Love the video... I've try to make a project on wich I try to send a text to a monitor, more like a power point presentation or a teleprompter app... Do you have any suggestions...or can you make a video about something like that?
@andresnet1827
@andresnet1827 4 года назад
Very good tutorial :)
@murtazasworkshop
@murtazasworkshop 4 года назад
Glad you think so!
@vittal255
@vittal255 3 года назад
hello Sir, our videos are really informative. They motivate me to build things that I have in my brain. Thanks for those videos. I have small doubt can we implement the same project using raspberry pi instead of a laptop/pc. If yes please suggest the necessary modification I need to make. I am really to new to IoT and computer vision started learning about them
@_kayos_
@_kayos_ Год назад
did you find an answer?
@rajeshmeesala9470
@rajeshmeesala9470 3 года назад
Nice explanation
@anajajkkkk8490
@anajajkkkk8490 3 года назад
Thanks sir!!
@minecrafttipstricks9537
@minecrafttipstricks9537 3 года назад
Can i use this if i need to read a code from cells tru a camera? and then sort them by codes. is that possible?
@anmolpunetha3994
@anmolpunetha3994 4 года назад
There is no change when i add the config, it is still detecting text too. Can anyone pls tell why is it so?
@shuaishuai2009
@shuaishuai2009 3 года назад
good video~ Thanks for sharing
@silviacolabianchi8261
@silviacolabianchi8261 2 года назад
Hi! thanks for the video. What do you think of tesseract version 5.0.0. Which one do you suggest to download? Version 4 or 5? thank you
@safabelhadjromdhane2922
@safabelhadjromdhane2922 2 года назад
Hello, I would like to say thanks very much for this channel your videos have been very helpful while learning OpenCV and python. I'm having a problem while executing this code and I would like you to help me fix this "cv2.rectangle( img(x, y), (x+w, y+h), (0, 0, 255), 1) TypeError: 'numpy.ndarray' object is not callable." What do you suggest I do to handle it? Thanks in advance.
@firegames2741
@firegames2741 3 года назад
Thank you, Mortaza for such a useful video. I need help from you, can you convert captcha file to text. I'm trying, but not converting properly.
@saurabhkalra3042
@saurabhkalra3042 3 года назад
can you tell me how to do like this with python: detect an image on an application and if the image is not there search for the image again and if it's there then do something.
@silenceronald9204
@silenceronald9204 3 года назад
how can write a code that allows the user to select an area on the screen with the mouse, and the software recognizes the text in this area
@shashwatshankar6647
@shashwatshankar6647 3 года назад
Hello brother, first of all a great job with it. I need some help with text detection from a image. Actually i have to extract the headlines from a article present in a image form. But whenever a special character appears the data after it dosen't appear in a array. Can you help me?It would be great
@sudhanshukurle4024
@sudhanshukurle4024 4 года назад
sir,can you show the recognisation of devangri(hindi characters and numbers) using pytesseract
@9319964634
@9319964634 4 года назад
Good video, can we also find micr font and store using this method?
@goat6836
@goat6836 2 года назад
Thanks Man
@MegaR97
@MegaR97 2 года назад
Hello, sorry I can't donwload the tesseract version you are using the page says "currently there is no official windows installer for newer versions
@naresh2934
@naresh2934 3 года назад
thanks a lot.
@raghavgarg1258
@raghavgarg1258 4 года назад
Awesome, Please upload more videos...
@murtazasworkshop
@murtazasworkshop 4 года назад
Thank you, I will
@praffulbisht8520
@praffulbisht8520 Месяц назад
For macUser : first write at terminal - which tesseract then copy that path and paste it in tesseract cmd path pytesseract.pytesseract.tesseract_cmd = ‘path '
@MegaLobo000
@MegaLobo000 4 года назад
Gracias, excelente video, disculpa como puedo hacer para detectar palabras en videos. Saludos
@santibogo
@santibogo 3 года назад
Hi murtaza! A question: If i only want to retrieve specific text from the image, how can i do it? i'm already have the coordinates of the text that i want to get
@richmail
@richmail 3 года назад
take screen shot useing those cords, and send it via internet, tho, you need to know whee you store the images, hope this helped (srry for the bad spelling, I type fast)
@MustajabAhmed-or8ud
@MustajabAhmed-or8ud Год назад
hello all, if anyone can answer. do we have to install opencv and pytesseract in pycharm when we have already installed using pip?
@amortalbeing
@amortalbeing 2 года назад
Thanks a lot. The `image_to_boxes()` returns `x1, height1, x2, height2`. if you renamed them like this it would be much more clear.
@nisarggandhewar8242
@nisarggandhewar8242 4 года назад
Its very helpful video, Thanks for preparing it, can you please prepare video on to detect text by designing & training our own model on some dataset just like digit detection, that will be more helpful.
@murtazasworkshop
@murtazasworkshop 4 года назад
Yes, soon
@MadanLal-hl5mw
@MadanLal-hl5mw 2 года назад
@@murtazasworkshop l
@chickenz4604
@chickenz4604 3 года назад
Nice video, btw could you give me suggestion when it comes to recognize handwritten digit and then manipulate the data?? should I use OpenCV & tesseract?? or tensorflow & keras? I am still not sure about this, since im new for this application. Thank You !
@Mikasks
@Mikasks 3 года назад
So what did you use? I'm having this problem aswell.
@ezequielbecerra6992
@ezequielbecerra6992 4 года назад
Hi, how can i install pytesseract in a RASPI 4?
@dennisasamoah2213
@dennisasamoah2213 3 года назад
great again
@EduardoAdameSalles
@EduardoAdameSalles 4 года назад
I have a table of prices (with money values), and I have an algorithm in Python to calculate the final value (with the taxes I want for me, and rounded). But I write each value into the software, and then it returns me the value I want, and then I put in a excel table. Is there a way to take the values from the first table, run the algorithm and then return me a csv or xlsx table with the new values? You don't need to do a full search, I just want a way to go to learn (opencv, etc). In Python I just know tkinter, but the bases.
@interpreter001
@interpreter001 2 года назад
may be possible using pythonautogui(controlling keyboard & mouse)
@javierpachon4424
@javierpachon4424 2 года назад
Thanks for your tutorial, I want to capture an email from an image but Tesseract does not recognize the @ symbol, how can I solve it?
@gawaderajesh
@gawaderajesh 3 года назад
Great...Murtaza's Workshop - Robotics and AI... Posted on Facebook....... :-)
@sagihaviv5675
@sagihaviv5675 3 года назад
sir i want to extract only the specific word or phrase, how do i do that? do i need some nlp?
@manugomez7319
@manugomez7319 3 года назад
At 9:27 the w and h are the diagonal points means? Which diagonal point? Bottom or up?
@vimsen2221
@vimsen2221 4 года назад
I get following error. --- pytesseract.pytesseract.TesseractError: (1, 'Error opening data file C:\\Program Files\\Tessaract-OCR/eng.traineddata Please make sure the TESSDATA_PREFIX environment variable is set to your "tessdata" directory ---- Where do I set this?
@gowthamns8228
@gowthamns8228 4 года назад
Wow very good, but the problem is "If the text is very clear and crisp its is showing output correctly", But I want to know if the image has multiple data not only text for example "bills, taken calendar photo or any kind of images", How to print the string from that, I tried my self it's not printing anything, Any idea for this?
Далее
Facial Landmarks and Face Filter using OpenCV Python
42:57
5 Useful F-String Tricks In Python
10:02
Просмотров 285 тыс.
Text recognition (OCR) with Tesseract and Python
31:32