-
Notifications
You must be signed in to change notification settings - Fork 11
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
FUNSD predictor #11
Comments
@yudezhi123456 |
Thank you very much for your answer, There's a sentence In your paper "In the entity task,our model shown in Tab.2, achieved a F1 score of 0.80575 for entity-labeling and 0.77031 for entity-linking, using 32.98 million parameters." |
@yudezhi123456 You can find the implementation details here: evaluating entity-linking (Node_F1): Layout2Graph/metrics/graph_layout_metrics.py Lines 52 to 65 in 60586aa
evaluating entity-labeling(Pair_F1): Layout2Graph/metrics/graph_layout_metrics.py Lines 75 to 87 in 60586aa
|
Thank you very much for your answer. I can understand what you mean and find the corresponding primary metric. |
@weishu27 Could you please check out it and answer this questions? |
@yudezhi123456 It's quite weird that you can achieved a Moreover, I notice from your log: |
I redownloaded the FUNSD dataset and examined the data and the test_convert_Funsd2Graph function, but still couldn't find the problem. The training data set consists of 149 samples and the test data set consists of 50 samples. Below is the structure of my dataset. Because when training the “word” model and verifying it, there are no such problems. Consistent with the paper, while training “entity”, the above problems occurred. I didn't make any unnecessary changes to your code, just the yaml file configuration. The problem may not be in the data handling, it may be in the validation code itself. When I was training on the newly downloaded FUNSD, the following PairF1MACRO1.00000 appeared when I saved the model in the first round. I am very confused, thank you for your patient answer. |
@yudezhi123456 Alright, I think I find out what causes this issue. We provide two kinds of dataset and corresponding CollectFn as shown below: The Layout2Graph/networks/graph_net/graph_layout_net.py Lines 186 to 195 in ca627bf
However, if relational linking information is already available, as is the case with the FUNSD_entity_graph dataset, we can directly use it. The dataset and collectFn in config file should be set to In a word:
datasets:
train:
dataset:
type: GraphLayoutDataset
collate_fn:
type: GraphCollateFn
datasets:
train:
dataset:
type: GraphLayoutEntityDataset
collate_fn:
type: GraphEntityCollateFn Sorry for the confusion. I will add the explanations in our documentation. |
Thank you for your answer, which is of great help to the advancement of my research. I will continue to conduct research in this direction. |
how to train on funsd dataset |
According to the README given by the author, it is very clear that it can be trained step by step according to the requirements |
I trained FUNSD-entity to get the best model, verified eval, and obtained entity-labeling consistent with your paper. However, the results of entity-Linking for 'Pair_F1_MACRO' and 'Pair_F1_MICRO' both reach 1.0. I wonder why? That's the metric, right? Below are screenshots from my yaml file, along with the experimental results.
The text was updated successfully, but these errors were encountered: