Custom Semantic Segmentation using DeepLabv3 for a Document Scanning application

Подписаться 56 тыс.

Просмотров 5 тыс.

50% 1

Document Scanning is a background segmentation problem that can be solved using various methods. It is an extensively used application of computer vision. Here we consider Document Scanning as a semantic segmentation problem.
We use DeepLabv3 semantic segmentation architecture to train a Document Segmentation model on a custom dataset.
We also talk about the following topics:
✅Creating synthetic data to augment the dataset.
✅Creating custom dataset classes in PyTorch.
✅Fine-tuning DepLabv3 with custom loss functions.
✅Deploy the application using Streamlit.
📚 Blog post link: learnopencv.com/deep-learning...
🖥️ On our blog - learnopencv.com we also share tutorials and code on topics like Image Processing, Image Classification, Object Detection, Face Detection, Face Recognition, YOLO, Segmentation, Pose Estimation, and many more using OpenCV(Python/C++), PyTorch, and TensorFlow.
🤖 Learn from the experts on AI: Computer Vision and AI Courses
YOU have an opportunity to join the over 5300+ (and counting) researchers, engineers, and students that have benefited from these courses and take your knowledge of computer vision, AI, and deep learning to the next level.
opencv.org/courses
#️⃣ Social Media #️⃣
📝 Linkedin: / satyamall. .
📱 Twitter: / learnopencv
🔊 Facebook: profile.php?...
📸 Instagram: / learnopencv
🔗 Reddit: / spmallick
🔖Hashtags🔖
#AI #segmentation #scannedocument #deeplabv3 #machinelearning #objectdetection #deeplearning #computervision #artificiailntelligence

Опубликовано:

29 авг 2022

Ссылка:

Скачать:

Готовим ссылку...

Добавить в:

Мой плейлист

Посмотреть позже

Комментарии : 11

@LearnOpenCV Год назад

Checkout our blogpost on Training a custom Semantic Segmentation DeepLabv3 model for a Document Scanning application: learnopencv.com/custom-document-segmentation-using-deep-learning

@tofiqraza5025 11 месяцев назад

G hu gifbd Dr no hi🎉🎉

@zappist751 Год назад

Hey, awesome blog btw! I loved it! I'm wondering do you have any recommendations for creating my own synthetic dataset on prices of food in supermarket?

@sandeepchetanpendam4253 Год назад

Hi! I am trying to do semantic segmentation on Resumes/CV. I would pretty much appreciate any suggestions. Thank you bro.

@LearnOpenCV Год назад

Hi! This blog post should help you: learnopencv.com/deep-learning-based-document-segmentation-using-semantic-segmentation-deeplabv3-on-custom-dataset/ Additionally, extracting textual data from the segmented image is the next step. You should also look into OCR (Optical Character Recognition). Checkout this post to learn more: learnopencv.com/optical-character-recognition-using-paddleocr/

@janakibrit8109 Год назад

Hello! Is there anywhere to find the weights?

@LearnOpenCV Год назад

Hi, you can find the weights file here: github.com/spmallick/learnopencv/tree/master/Document-Scanner-Custom-Semantic-Segmentation-using-PyTorch-DeepLabV3#dataset-and-trained-model-download-links

@eklavya_kumar Год назад

I'm doing the same training but jupyter kernel is crashing on epoch 2. I don't know what's the issue

@LearnOpenCV Год назад

Hi, it might be GPU or RAM issue. Keep an eye on the resources used tab. Try using a smaller batch size and smaller models like deeplabv3_mobilenet_v3_large or lraspp_mobilenet_v3_large.

@eklavya_kumar Год назад

@@LearnOpenCV Thanks, yeah the issue was the batch size, I reduced it to 16. I trained it for 60 epochs and performance was good enough

@LearnOpenCV Год назад

Thanks for sharing!