[models] Add detection & recognition models with much lighter backbones #255

Closed
3 tasks done
fg-mindee opened this issue May 12, 2021 · 1 comment
Assignees
Labels
help wanted Extra attention is needed module: models Related to doctr.models topic: character classification Related to the task of character classification topic: text detection Related to the task of text detection topic: text recognition Related to the task of text recognition type: enhancement Improvement
Milestone

Comments

@fg-mindee
Contributor

fg-mindee commented May 12, 2021

Compared with similar vision tasks, the segmentation part of text detection is easier than real-life scene segmentation (on Cityscapes, for instance), which uses ResNet-like backbones. It is even easier than medical segmentation, which often uses lighter feature extractors (cf. U-Net).

Additionally, the feature extraction step of text recognition is similar to image classification on single-character images (like MNIST, but with more classes), a task that is typically handled with very light backbones.
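To give a rough sense of scale, here is a minimal sketch that counts the learnable parameters of two plain stacks of 3x3 convolutions. The channel widths are hypothetical and chosen only for illustration; they do not correspond to any existing or planned docTR backbone, and the helper names `conv_params` and `stack_params` are made up for this example.

```python
def conv_params(k, c_in, c_out, bias=True):
    """Number of learnable parameters in a k x k 2D convolution."""
    return k * k * c_in * c_out + (c_out if bias else 0)


def stack_params(channels, k=3, c_in=3):
    """Total parameters of a plain stack of k x k convolutions.

    `channels` lists the output width of each layer; the input is
    assumed to be a 3-channel image unless `c_in` says otherwise.
    """
    total = 0
    for c_out in channels:
        total += conv_params(k, c_in, c_out)
        c_in = c_out  # the next layer consumes this layer's output
    return total


# Hypothetical channel widths, for illustration only
light = stack_params([16, 32, 64])
heavy = stack_params([64, 128, 256, 512])

print(f"light stack: {light} parameters")
print(f"heavy stack: {heavy} parameters")
print(f"ratio: {heavy / light:.1f}x")
```

Parameter count is only a proxy: it says nothing about accuracy, and latency also depends on feature map resolution, so any real backbone swap would still need to be benchmarked.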

So here is a suggestion:

This means that DocTR will extend the list of supported tasks by adding:

@fg-mindee fg-mindee added type: enhancement Improvement help wanted Extra attention is needed module: models Related to doctr.models labels May 12, 2021
@fg-mindee fg-mindee added this to the 0.3.0 milestone May 12, 2021
@fg-mindee fg-mindee modified the milestones: 0.3.0, 0.4.0 Jul 1, 2021
@fg-mindee fg-mindee self-assigned this Jul 1, 2021
@charlesmindee charlesmindee pinned this issue Aug 11, 2021
@charlesmindee charlesmindee unpinned this issue Aug 11, 2021
@fg-mindee fg-mindee added topic: text detection Related to the task of text detection topic: text recognition Related to the task of text recognition topic: character classification Related to the task of character classification labels Aug 25, 2021
@fg-mindee
Contributor Author

Closed by #516 & #517
