Google-Vision-inline-text-rectifier

Problem:

Font mismatch causes word level bounding boxes to vary in height within a sentence. Google vision may interpret a word with higher height as belonging to an upper line than original and vice versa. This repo aims towards solving this problem

Main Idea:

Connect the two bounding boxes whose centroids is differs less than the average word length. This can be achived using both two methods.
The first method is a dsu based approach where you can treat each word's centroid as a node and find connected component.
The second is using an unsupervised learning algorithm to cluster words based on their centroids y-coordinates.

Name		Name	Last commit message	Last commit date
Latest commit History 16 Commits
README.md		README.md
analysing ocr.ipynb		analysing ocr.ipynb
bounding_box_rectifier.py		bounding_box_rectifier.py
difference		difference
sample_image.png		sample_image.png
yolo2vision.py		yolo2vision.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Google-Vision-inline-text-rectifier

Problem:

Main Idea:

About

Releases

Packages

Languages

Jay-523/Google-Vision-inline-text-rectifier

Folders and files

Latest commit

History

Repository files navigation

Google-Vision-inline-text-rectifier

Problem:

Main Idea:

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages