[models] ViT add checkpoints and some rework to use pretrained ViT backbone in ViTSTR #1072

felixdittrich92 · 2022-09-21T14:13:40Z

This PR:

- makes pretrained checkpoints available for ViT (TF and PT)
- reworks patchify to make it dynamic to the input shape
- reworks pretrained backbone usage for ViTSTR with checkpoints
- fixes an issue with bicubic scaling and onnxruntime (uses bilinear instead: minor cost in quality, and a speed-up)

All added checkpoints reach >98% accuracy.

Any feedback is welcome

doctr/models/recognition/vitstr/tensorflow.py

felixdittrich92 · 2022-09-23T11:38:08Z

@frgfm but the good thing is that I have learned a lesson in this PR: if we build TF models, each layer should have a name, otherwise we are lost in space when we iterate through the weights to find the problem 😅

codecov · 2022-09-23T13:28:15Z

Codecov Report

Merging #1072 (7365d49) into main (e538cc2) will decrease coverage by 0.05%.
The diff coverage is 100.00%.

@@            Coverage Diff             @@
##             main    #1072      +/-   ##
==========================================
- Coverage   95.26%   95.21%   -0.06%     
==========================================
  Files         145      142       -3     
  Lines        6046     6018      -28     
==========================================
- Hits         5760     5730      -30     
- Misses        286      288       +2

| Flag | Coverage Δ | |
|---|---|---|
| unittests | `95.21% <100.00%> (-0.06%)` | ⬇️ |

| Impacted Files | Coverage Δ | |
|---|---|---|
| doctr/models/recognition/vitstr/pytorch.py | `97.46% <ø> (ø)` | |
| doctr/models/classification/vit/pytorch.py | `100.00% <100.00%> (ø)` | |
| doctr/models/classification/vit/tensorflow.py | `100.00% <100.00%> (ø)` | |
| doctr/models/modules/vision_transformer/pytorch.py | `97.72% <100.00%> (-2.28%)` | ⬇️ |
| ...tr/models/modules/vision_transformer/tensorflow.py | `97.72% <100.00%> (-2.28%)` | ⬇️ |
| doctr/models/recognition/vitstr/tensorflow.py | `97.56% <100.00%> (ø)` | |
| doctr/transforms/functional/base.py | `95.65% <0.00%> (ø)` | |
| doctr/utils/__init__.py | | |

... and 2 more

frgfm

Thanks Felix!
Perhaps we should specify the change in perf in the documentation or somewhere? It's gonna become hard to track those changes otherwise

felixdittrich92 · 2022-09-25T19:42:47Z

Yep will add
Patches: (H, W) -> (H / 8, W / 8)
to the docstrings
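
For anyone reading along, here is a minimal sketch of what that docstring note could mean. It assumes one reading of `(H, W) -> (H / 8, W / 8)`: the input is cut into 8x8-pixel patches, so the patch grid follows the input shape instead of being hard-coded. The helper name `patch_grid`, the default patch size and the example input sizes are hypothetical and are not taken from the PR's code.

```python
from typing import Tuple


def patch_grid(height: int, width: int, patch: Tuple[int, int] = (8, 8)) -> Tuple[int, int]:
    """Return the (rows, cols) patch grid for an input of size (height, width).

    ASSUMPTION: 8x8-pixel patches, i.e. one possible reading of
    "Patches: (H, W) -> (H / 8, W / 8)". This is not the PR's actual code.
    """
    ph, pw = patch
    if height % ph or width % pw:
        # A real implementation might pad or resize instead of failing.
        raise ValueError(f"input {(height, width)} is not divisible by patch size {patch}")
    return height // ph, width // pw


def num_patches(height: int, width: int, patch: Tuple[int, int] = (8, 8)) -> int:
    """Number of patches produced for the given input size."""
    rows, cols = patch_grid(height, width, patch)
    return rows * cols


# Illustrative input sizes only (square vs. wide input).
print(patch_grid(32, 32), num_patches(32, 32))
print(patch_grid(32, 128), num_patches(32, 128))
```

Under this reading the same code handles square and non-square inputs, which seems to be the point of making patchify dynamic.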

doctr/models/classification/vit/pytorch.py

doctr/models/classification/vit/tensorflow.py

doctr/models/recognition/vitstr/tensorflow.py

odulcy-mindee

LGTM ! 🚀

add checkpoints

6ce38de

felixdittrich92 added this to the 0.6.0 milestone Sep 21, 2022

felixdittrich92 self-assigned this Sep 21, 2022

felixdittrich92 added 4 commits September 21, 2022 16:21

fix size mismatch

c4f3458

update

9e5b412

update

0613e24

mypy

4e02616

felixdittrich92 commented Sep 22, 2022

doctr/models/recognition/vitstr/tensorflow.py

name layers

18505c5

felixdittrich92 added 7 commits September 23, 2022 13:49

patchify dynamic add checkpoints and make it work

4d2bd37

update scale

4d6d12d

upgrade opset

25433a6

fix onnxruntime issue for non quadliteral use bilinear

9287309

update

ec0dc13

typo

2d369f8

update

54476c1

felixdittrich92 changed the title ~~[DRAF] [models] ViT add checkpoints and some rework to use pretrained ViT backbone in ViTSTR~~ [models] ViT add checkpoints and some rework to use pretrained ViT backbone in ViTSTR Sep 23, 2022

felixdittrich92 marked this pull request as ready for review September 23, 2022 13:15

felixdittrich92 requested review from frgfm and odulcy-mindee September 23, 2022 13:33

frgfm reviewed Sep 25, 2022

add patchify info to docstring

d7c29a5

felixdittrich92 requested a review from frgfm September 26, 2022 05:52

felixdittrich92 mentioned this pull request Sep 26, 2022

update version for minor release #1073

Merged

odulcy-mindee requested changes Sep 26, 2022

update urls

7365d49

felixdittrich92 requested a review from odulcy-mindee September 26, 2022 07:36

odulcy-mindee approved these changes Sep 26, 2022

felixdittrich92 merged commit 6b9f375 into mindee:main Sep 26, 2022

felixdittrich92 deleted the vit-checkpoints branch September 26, 2022 08:06

felixdittrich92 mentioned this pull request Sep 26, 2022

Release tracker - v0.6.0 #791

Closed

85 tasks
