[models] Complete the list of pretrained available models in TensorFlow and PyTorch #969

Closed
18 of 20 tasks
Tracked by #1304
felixdittrich92 opened this issue Jun 30, 2022 · 6 comments
Labels
framework: pytorch, framework: tensorflow, module: models, topic: character classification, topic: text detection, topic: text recognition, type: enhancement

Comments

@felixdittrich92
Contributor

felixdittrich92 commented Jun 30, 2022

🚀 The feature

Ensure that each model has a pretrained checkpoint available

TensorFlow:

classification:

  • ViT
  • MAGC
  • MobileNetV3

detection:

  • DB_MobileNetV3
  • LinkNet (all)

recognition:

  • SAR
  • MASTER
  • VitSTR (small)
  • VitSTR (base)
  • PARSeq
  • CRNN (mobilenet_v3_large)

PyTorch:

classification:

detection:

  • DB_ResNet34
  • LinkNet (all)

recognition:

  • SAR
  • MASTER
  • VitSTR (small)
  • VitSTR (base)
  • PARSeq

Motivation, pitch

  • Users want to test each available model and need the pretrained versions to fine-tune on their own data
felixdittrich92 added the labels type: enhancement, module: models, framework: pytorch, framework: tensorflow, topic: text detection, topic: text recognition, topic: character classification on Jun 30, 2022
felixdittrich92 added this to the 0.6.0 milestone on Jun 30, 2022
@felixdittrich92
Contributor Author

felixdittrich92 commented Jun 30, 2022

@charlesmindee I think you are the only one who can do it (data) 😃

@nikokks
Contributor

nikokks commented Jul 30, 2022

@charlesmindee, what kind of data do you use to train your text recognition models?
Do you use custom data? And do you use special data for French documents?

@frgfm
Collaborator

frgfm commented Aug 1, 2022

@nikokks for text recognition, we use cropped words from various types of documents, which add up to about 11M images.
The bias is indeed in the choice of characters (mostly European) and the language (some character sequences are more frequent than others).

However, a while back, we started a fully synthetic approach (cf. CharacterGenerator and WordGenerator). So far, the results are not on par, but still very decent for a fully synthetic approach :)
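
To make the "fully synthetic" idea concrete, here is a minimal sketch of only the label-sampling half of such a dataset. It is not docTR's CharacterGenerator or WordGenerator implementation (those also render images from fonts); the `VOCAB` string and the `sample_words` helper are hypothetical names chosen for illustration.

```python
import random

# Hypothetical character set for illustration only; the real vocabularies
# are defined inside docTR and are not reproduced here.
VOCAB = "abcdefghijklmnopqrstuvwxyzéèàç0123456789"


def sample_words(num_samples, min_chars, max_chars, vocab=VOCAB, seed=0):
    """Draw random words with characters sampled uniformly from `vocab`."""
    rng = random.Random(seed)  # local RNG so the samples are reproducible
    words = []
    for _ in range(num_samples):
        length = rng.randint(min_chars, max_chars)  # both bounds inclusive
        words.append("".join(rng.choice(vocab) for _ in range(length)))
    return words


if __name__ == "__main__":
    for word in sample_words(5, 2, 6):
        print(word)
```

Because the characters are drawn uniformly, every sequence of a given length is equally likely, so this sketch does not reproduce the language statistics of real documents.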

@odulcy-mindee
Collaborator

odulcy-mindee commented May 26, 2023

Training started on Linknet_Resnet50 (TensorFlow), we'll post the results soon 😉

@odulcy-mindee
Collaborator

Detection models in TensorFlow are done! Now we are starting on the recognition models in TensorFlow. The first one is MASTER!

@felixdittrich92
Contributor Author

Closing because we track it in #1364

felixdittrich92 unpinned this issue on Nov 8, 2023

5 participants