Тёмный

Custom Semantic Segmentation using DeepLabv3 for a Document Scanning application 

LearnOpenCV
Подписаться 56 тыс.
Просмотров 5 тыс.
50% 1

Document Scanning is a background segmentation problem that can be solved using various methods. It is an extensively used application of computer vision. Here we consider Document Scanning as a semantic segmentation problem.
We use DeepLabv3 semantic segmentation architecture to train a Document Segmentation model on a custom dataset.
We also talk about the following topics:
✅Creating synthetic data to augment the dataset.
✅Creating custom dataset classes in PyTorch.
✅Fine-tuning DepLabv3 with custom loss functions.
✅Deploy the application using Streamlit.
📚 Blog post link: learnopencv.com/deep-learning...
🖥️ On our blog - learnopencv.com we also share tutorials and code on topics like Image Processing, Image Classification, Object Detection, Face Detection, Face Recognition, YOLO, Segmentation, Pose Estimation, and many more using OpenCV(Python/C++), PyTorch, and TensorFlow.
🤖 Learn from the experts on AI: Computer Vision and AI Courses
YOU have an opportunity to join the over 5300+ (and counting) researchers, engineers, and students that have benefited from these courses and take your knowledge of computer vision, AI, and deep learning to the next level.
opencv.org/courses
#️⃣ Social Media #️⃣
📝 Linkedin: / satyamall. .
📱 Twitter: / learnopencv
🔊 Facebook: profile.php?...
📸 Instagram: / learnopencv
🔗 Reddit: / spmallick
🔖Hashtags🔖
#AI #segmentation #scannedocument #deeplabv3 #machinelearning #objectdetection #deeplearning #computervision #artificiailntelligence

Опубликовано:

 

29 авг 2022

Поделиться:

Ссылка:

Скачать:

Готовим ссылку...

Добавить в:

Мой плейлист
Посмотреть позже
Комментарии : 11   
@LearnOpenCV
@LearnOpenCV Год назад
Checkout our blogpost on Training a custom Semantic Segmentation DeepLabv3 model for a Document Scanning application: learnopencv.com/custom-document-segmentation-using-deep-learning
@tofiqraza5025
@tofiqraza5025 11 месяцев назад
G hu gifbd Dr no hi🎉🎉
@zappist751
@zappist751 Год назад
Hey, awesome blog btw! I loved it! I'm wondering do you have any recommendations for creating my own synthetic dataset on prices of food in supermarket?
@sandeepchetanpendam4253
@sandeepchetanpendam4253 Год назад
Hi! I am trying to do semantic segmentation on Resumes/CV. I would pretty much appreciate any suggestions. Thank you bro.
@LearnOpenCV
@LearnOpenCV Год назад
Hi! This blog post should help you: learnopencv.com/deep-learning-based-document-segmentation-using-semantic-segmentation-deeplabv3-on-custom-dataset/ Additionally, extracting textual data from the segmented image is the next step. You should also look into OCR (Optical Character Recognition). Checkout this post to learn more: learnopencv.com/optical-character-recognition-using-paddleocr/
@janakibrit8109
@janakibrit8109 Год назад
Hello! Is there anywhere to find the weights?
@LearnOpenCV
@LearnOpenCV Год назад
Hi, you can find the weights file here: github.com/spmallick/learnopencv/tree/master/Document-Scanner-Custom-Semantic-Segmentation-using-PyTorch-DeepLabV3#dataset-and-trained-model-download-links
@eklavya_kumar
@eklavya_kumar Год назад
I'm doing the same training but jupyter kernel is crashing on epoch 2. I don't know what's the issue
@LearnOpenCV
@LearnOpenCV Год назад
Hi, it might be GPU or RAM issue. Keep an eye on the resources used tab. Try using a smaller batch size and smaller models like deeplabv3_mobilenet_v3_large or lraspp_mobilenet_v3_large.
@eklavya_kumar
@eklavya_kumar Год назад
@@LearnOpenCV Thanks, yeah the issue was the batch size, I reduced it to 16. I trained it for 60 epochs and performance was good enough
@LearnOpenCV
@LearnOpenCV Год назад
Thanks for sharing!
Далее
Document Scanner | OpenCV
15:37
Просмотров 10 тыс.
Enhance a Document Scan using Python and OpenCV
9:11
What is U-NET?
1:49
Просмотров 57 тыс.