Getting Started With Azure Document AI Document Intelligence API In Python (Source Code In Desc) 

Jie Jenn
63K subscribers
5K views

Published: 24 Aug 2024

Comments: 39
@jiejenn 5 months ago
What else do you want to see? Let me know in the comments below!
@Mohammed-go6he 4 months ago
Excellent explanation. Thank you Jie!
@jiejenn 4 months ago
👍👍👍
@gfranco137 4 days ago
Hi Jie, would it be possible to show how to ingest a raw, unformatted PDF document (e.g. clinical trials) to analyze its data, and how to store the results in Azure SQL or Synapse using Data Factory to feed an ML model used in Copilot or another bot? Thank you.
@jiejenn 4 days ago
Thank you for the suggestion. This is a great idea, added to my video list.
@arturgomes1654 4 months ago
You just saved me bro, thank you so much for this content
@jiejenn 4 months ago
Glad to hear it.
@aswathssr5955 4 months ago
How do I use Python code in Azure after getting the results from Document Intelligence?
@yashub9580 2 months ago
Can you please show how to use custom models?
@jiejenn 2 months ago
I will add it to my list.
@tuikkumies1 16 days ago
Where did you get the beginning template for the project?
@tuikkumies1 16 days ago
Found it with ChatGPT's help! To get what he had at the beginning, run this command from the folder of your choice (in Git Bash, for example): python -m venv pyview
@sumanpaudel1997 2 months ago
Hi, I am stuck on the bounding box part. How do I create the bounding box from the given polygons? Can you tell me how to convert them into x, y, w, h format? Thanks.
@jiejenn 2 months ago
You can use Pillow.
@RaiOkami 9 days ago
The polygon coordinates are ordered Top-Left (1), Top-Right (2), Bottom-Right (3), Bottom-Left (4). The units are inches (for PDFs), with the origin (0,0) at the top left of the page. polygon: [ x1, y1, x2, y2, x3, y3, x4, y4 ]
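The conversion asked about above can be sketched in a few lines of Python. This is only an illustration, assuming the flat [x1, y1, ..., x4, y4] layout described in the comment; the function names polygon_to_xywh and inches_to_pixels are made up for this example and are not part of any Azure SDK. Taking the min and max of the coordinates gives an axis-aligned box even when the text is slightly rotated.

```python
from typing import Sequence, Tuple


def polygon_to_xywh(polygon: Sequence[float]) -> Tuple[float, float, float, float]:
    """Convert a flat [x1, y1, x2, y2, ...] polygon to (x, y, w, h).

    The result is the axis-aligned box that encloses all points,
    with (x, y) being its top-left corner (origin at top left of the page).
    """
    if len(polygon) < 2 or len(polygon) % 2 != 0:
        raise ValueError("polygon must contain an even, non-zero number of values")
    xs = polygon[0::2]  # every even index is an x coordinate
    ys = polygon[1::2]  # every odd index is a y coordinate
    x_min, y_min = min(xs), min(ys)
    return x_min, y_min, max(xs) - x_min, max(ys) - y_min


def inches_to_pixels(box: Sequence[float], dpi: int = 72) -> Tuple[int, ...]:
    """Scale a box given in inches to whole pixels (the dpi value is an assumption)."""
    return tuple(round(value * dpi) for value in box)


# Example polygon in inches: top-left, top-right, bottom-right, bottom-left
box = polygon_to_xywh([1.0, 2.0, 4.0, 2.0, 4.0, 3.5, 1.0, 3.5])
pixel_box = inches_to_pixels(box, dpi=100)
```

To draw the box with Pillow, the corners [x, y, x + w, y + h] can be passed to ImageDraw.rectangle once the values are in pixels; the dpi must match the resolution at which the page was rendered to an image.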
@NSLABTUTORIAIS 3 months ago
Very useful and very good (Muito util e muito bom). Tks (Obrigado)
@jiejenn 3 months ago
👍👍👍
@heshamelkouha7281 3 months ago
Great tutorial. What color theme are you using in VS Code?
@jiejenn 3 months ago
I'm using One Dark Pro color theme with some color customization.
@ethanphan6136 5 months ago
Great video! Thanks for sharing. Can you please share with us your github repo as well? I see that you are importing utility in the invoice extraction code, but I couldn't find it anywhere. Would really appreciate it.
@jiejenn 5 months ago
Good catch. I just added the utility.py source code to the page.
@joaoarthurbandeira 2 months ago
Hey, great video! Which approach do you think is better for extracting specific document/image patterns (say, an ID from a country) in another language (maybe even handwritten) when a user uploads the file, and for returning the data to them: Tesseract, Google Cloud Vision OCR, Azure, or AWS Textract? How can I make it read the file the user uploaded and return the data so they can copy it, for example? Many thanks!
@jiejenn 2 months ago
Tesseract is mostly used to help simplify formatting output from AWS Textract. If your documents all share the same format, my ranking would be Azure Document AI > AWS Textract > Google Document AI. Otherwise, any of the Vision APIs would work if you just want to extract text and images.
@joaoarthurbandeira 1 month ago
@jiejenn Thanks for the response! Could you maybe show us how we could upload something like a PDF or JPG of a foreign ID (one that doesn't have a pre-built model) and get back its content?
@IBAAN89 4 months ago
Hi, just a question. I have this project for my bachelor thesis. The PDF files are sent from the frontend (Angular) to the backend (C# .NET Framework). Now that I have the list of PDF files in my backend, how can I send them to Document Intelligence? I have already trained my models and I have blob storage, but I can't figure out the next step: how do I send the files to my custom model?
@jiejenn 4 months ago
Your model should have an ID. When you send a request, specify the ID of the model you trained.
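As a rough illustration of the reply above, the sketch below builds (but does not send) an analyze request that names a custom model by its ID. It is not taken from the video: the URL path, the api-version value 2024-11-30, the urlSource body field, and the Ocp-Apim-Subscription-Key header are assumptions based on the Document Intelligence REST API and should be checked against the API version your resource supports. The function name build_analyze_request, the endpoint, the key, the model ID, and the document URL are all placeholders.

```python
import json
import urllib.parse
import urllib.request


def build_analyze_request(
    endpoint: str,
    api_key: str,
    model_id: str,
    document_url: str,
    api_version: str = "2024-11-30",  # assumed version; adjust to your resource
) -> urllib.request.Request:
    """Build a POST request asking a specific (custom) model to analyze a document.

    The model ID goes into the URL path, which is how the service knows
    which trained model to run.
    """
    quoted_model = urllib.parse.quote(model_id, safe="")
    url = (
        f"{endpoint.rstrip('/')}/documentintelligence/documentModels/"
        f"{quoted_model}:analyze?api-version={api_version}"
    )
    # The document is referenced by URL, e.g. a blob storage URL the service can read.
    body = json.dumps({"urlSource": document_url}).encode("utf-8")
    headers = {
        "Ocp-Apim-Subscription-Key": api_key,
        "Content-Type": "application/json",
    }
    return urllib.request.Request(url, data=body, headers=headers, method="POST")


# Placeholder values for illustration only
request = build_analyze_request(
    endpoint="https://example.cognitiveservices.azure.com/",
    api_key="<your-api-key>",
    model_id="my-custom-model",
    document_url="https://example.blob.core.windows.net/docs/sample.pdf",
)
```

Sending the request (for instance with urllib.request.urlopen) starts an asynchronous analysis; under the same assumptions, the response carries an Operation-Location header whose URL is polled for the result. The same pattern applies from a C# backend, since only the URL, headers, and JSON body matter.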
@surrendereverything244 4 months ago
Can you extract from .doc files? Document Analysis seems to work only for .docx.
@jiejenn 4 months ago
Unfortunately, .doc is not supported. Your best option is to convert to .docx or PDF.
@surrendereverything244 4 months ago
@jiejenn Thank you for your response. Would you happen to have some code that does the conversion? I am building a file upload program and want it to work with .doc files as well. Any guidance is appreciated.
@F2F_Bechlor_food 3 months ago
Can you share the code so we can practice?
@jiejenn 3 months ago
It's in the description.
@UiPath_ESP 5 months ago
What about Azure Computer Vision? I don't know much about Azure, but I thought Azure CV was the tool used to extract information from PDFs or images. Is Document AI some sort of evolution of it? Again, I'm new, so excuse my ignorance.
@jiejenn 5 months ago
Some of the features overlap, but in summary, Azure Vision is for image processing, while Document AI is for documents like forms, receipts, invoices, etc.
@ohcrapitsmrG 5 months ago
There are open source Python OCR libraries. How is this different?
@jiejenn 5 months ago
You are basically paying for pre-trained models and the servers that process the requests. Most open source libraries don't work well when it comes to extracting fields and tables from forms.
@arturgomes1654 4 months ago
I tried to use Pytesseract, but unfortunately I just didn't get good results. Azure OCR and the others are much better.
@hello_world704 26 days ago
Has anyone been getting the following error? azure.core.exceptions.ResourceNotFoundError: (404) Resource not found Code: 404 Message: Resource not found
@tuikkumies1 16 days ago
I was getting the same error and found the solution. The resource only works when it is created in one of these locations: East US, West US2, West Europe, North Central US. So just create a new resource in one of them and update the endpoint and api_key.