Question on fine-tuning TATR with a proprietary dataset #169

srivatsan-sridhar99 · 2024-02-17T12:22:35Z

Hi!

I am trying to fine tune the TATR model with a proprietary dataset. I am currently trying to convert the dataset to the same format as FinTabNet and then using the script in this repository (scripts/process_fintabnet.py) to transform that into the Pascal VOC format required by TATR.

I am then training it using the main.py file in this repo, with just one change - loading the tatr table detection (microsoft/table-transformer-detection, revision="no_timm") and tatr table structure recognition model (microsoft/table-transformer-structure-recognition-v1.1-all) from hugging face, instead of the DETR model that is built in this repository.

When I train it on the DETR model that is in this repository, I am able to train it without any issue.

The command I run to train is -
python3 main.py --data_root_dir <data directory> --config_file structure_config.json

I am getting this error -

Details on runtime env -

Cuda version - 12.2
Nvidia driver version - 535.154.05
torch version - 1.13.1
torch audio version - 0.13.1
torch vision version - 0.14.1
transformers version - 4.38.0.dev0

Would be great if anybody could help me with this!
Thanks,
Srivatsan.

The text was updated successfully, but these errors were encountered:

ali4friends71 · 2024-04-17T06:14:15Z

Hi @srivatsan-sridhar99. How did you create your custom dataset ? I have my own table images. How do I preprocess them and convert it to required format ? Any code help ?
thanks In advance.

tan-yong-sheng · 2024-07-08T17:53:50Z

Hi @srivatsan-sridhar99 and @ali4friends71, wanna follow up if you have successfully fine tuned the TATR model with your own datasets? If yes, may I ask for any guidelines or references for this process, thanks so much.

srivatsan-sridhar99 · 2024-07-10T11:11:24Z

Hi @srivatsan-sridhar99. How did you create your custom dataset ? I have my own table images. How do I preprocess them and convert it to required format ? Any code help ? thanks In advance.

Hi @ali4friends71 , You will require table images along with the cell, column and row level bounding boxes to train the model, so you will have to annotate your data to get these bounding boxes.

I would suggest you to annotate your data (or convert your annotated data) in the format of the pubtabnet/fintabnet dataset. There are scripts present in the scripts folder in this repo to convert that, to the format required by the model.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Question on fine-tuning TATR with a proprietary dataset #169

Question on fine-tuning TATR with a proprietary dataset #169

srivatsan-sridhar99 commented Feb 17, 2024

ali4friends71 commented Apr 17, 2024

tan-yong-sheng commented Jul 8, 2024

srivatsan-sridhar99 commented Jul 10, 2024

Question on fine-tuning TATR with a proprietary dataset #169

Question on fine-tuning TATR with a proprietary dataset #169

Comments

srivatsan-sridhar99 commented Feb 17, 2024

ali4friends71 commented Apr 17, 2024

tan-yong-sheng commented Jul 8, 2024

srivatsan-sridhar99 commented Jul 10, 2024