You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I am trying to fine tune the TATR model with a proprietary dataset. I am currently trying to convert the dataset to the same format as FinTabNet and then using the script in this repository (scripts/process_fintabnet.py) to transform that into the Pascal VOC format required by TATR.
I am then training it using the main.py file in this repo, with just one change - loading the tatr table detection (microsoft/table-transformer-detection, revision="no_timm") and tatr table structure recognition model (microsoft/table-transformer-structure-recognition-v1.1-all) from hugging face, instead of the DETR model that is built in this repository.
When I train it on the DETR model that is in this repository, I am able to train it without any issue.
The command I run to train is - python3 main.py --data_root_dir <data directory> --config_file structure_config.json
I am getting this error -
Details on runtime env -
Cuda version - 12.2
Nvidia driver version - 535.154.05
torch version - 1.13.1
torch audio version - 0.13.1
torch vision version - 0.14.1
transformers version - 4.38.0.dev0
Would be great if anybody could help me with this!
Thanks,
Srivatsan.
The text was updated successfully, but these errors were encountered:
Hi @srivatsan-sridhar99. How did you create your custom dataset ? I have my own table images. How do I preprocess them and convert it to required format ? Any code help ?
thanks In advance.
Hi @srivatsan-sridhar99 and @ali4friends71, wanna follow up if you have successfully fine tuned the TATR model with your own datasets? If yes, may I ask for any guidelines or references for this process, thanks so much.
Hi @srivatsan-sridhar99. How did you create your custom dataset ? I have my own table images. How do I preprocess them and convert it to required format ? Any code help ? thanks In advance.
Hi @ali4friends71 , You will require table images along with the cell, column and row level bounding boxes to train the model, so you will have to annotate your data to get these bounding boxes.
I would suggest you to annotate your data (or convert your annotated data) in the format of the pubtabnet/fintabnet dataset. There are scripts present in the scripts folder in this repo to convert that, to the format required by the model.
Hi!
I am trying to fine tune the TATR model with a proprietary dataset. I am currently trying to convert the dataset to the same format as FinTabNet and then using the script in this repository (scripts/process_fintabnet.py) to transform that into the Pascal VOC format required by TATR.
I am then training it using the main.py file in this repo, with just one change - loading the tatr table detection (microsoft/table-transformer-detection, revision="no_timm") and tatr table structure recognition model (microsoft/table-transformer-structure-recognition-v1.1-all) from hugging face, instead of the DETR model that is built in this repository.
When I train it on the DETR model that is in this repository, I am able to train it without any issue.
The command I run to train is -
python3 main.py --data_root_dir <data directory> --config_file structure_config.json
I am getting this error -
Details on runtime env -
Would be great if anybody could help me with this!
Thanks,
Srivatsan.
The text was updated successfully, but these errors were encountered: