paint-brush
Complex Document Recognition: OCR Doesn’t Work and Here’s How You Fix Itby@olegkokorin
805 reads
805 reads

Complex Document Recognition: OCR Doesn’t Work and Here’s How You Fix It

by Oleg Kokorin6mOctober 12th, 2023
Read on Terminal Reader
Read this story w/o Javascript
tldt arrow

Too Long; Didn't Read

OCR software alone can't handle complex documents — special symbols, rotated text, low-quality scans. Using deep learning, one can augment ready-made OCR solutions and allow for processing of complex documents. From removing false positives to using binary matrices to detect complex spreadsheets, deep learning can handle any document. In this article I describe my experience with developing a system for detecting technical drawings of floor plans, the perfect example of applying modern CV and AI to complex document digitization.
featured image - Complex Document Recognition: OCR Doesn’t Work and Here’s How You Fix It
Oleg Kokorin HackerNoon profile picture
Oleg Kokorin

Oleg Kokorin

@olegkokorin

CEO of Businessware Technologies, machine learning engineer

Learn More
LEARN MORE ABOUT @OLEGKOKORIN'S
EXPERTISE AND PLACE ON THE INTERNET.
0-item

STORY’S CREDIBILITY

Guide

Guide

Walkthroughs, tutorials, guides, and tips. This story will teach you how to do something new or how to do something better.

L O A D I N G
. . . comments & more!

About Author

Oleg Kokorin HackerNoon profile picture
Oleg Kokorin@olegkokorin
CEO of Businessware Technologies, machine learning engineer

TOPICS

THIS ARTICLE WAS FEATURED IN...

Permanent on Arweave
Read on Terminal Reader
Read this story in a terminal
 Terminal
Read this story w/o Javascript
Read this story w/o Javascript
 Lite