Overview of the FLERT-Matcher model at LivingNER shared task (https://temu.bsc.es/livingner/)

Subtask 1: We trained a NER model based on the FLERT approach. This model consists of fine-tuning a pre-trained language model but considering the document-level context.
Subtask 2: Based on the predictions of subtask 1, we matched the entity mentions with the definitions of codes found in the training corpus and the NCBI taxonomy. This was done using the Levenshtein Distance.
Subtask 3: We trained a NER model based on the FLERT approach to address each binary classification problem. We merged the output of each model with the predictions of subtask 2. Finally, we grouped the mentions by document and transformed the predictions to document-level.

Install

Create an enviroment: python -m venv venv and activate it.
Run pip install -r requirements.txt to install all dependencies
Download the statistical model to perform the tokenization: python -m spacy download es_core_news_lg
In case you use a GPU NVIDIA RTX 3090, then install this PyTorch version: pip install torch==1.10.1+cu111 torchvision==0.11.2+cu111 torchaudio==0.10.1 -f https://download.pytorch.org/whl/torch_stable.html

Creating data for NER Training

Place the folder training_valid_test_background_multilingual (https://zenodo.org/record/6768606) inside the folder src/data/ner_utils.

Subtask 1 (Species)

cd src/data/ner_utils python main.py --subtask 1

Subtask 3 (Food, Animal Injury, Pet, Nosocomial)

cd src/data/ner_utils python main.py --subtask 3 --subtask3_entity_type FOOD python main.py --subtask 3 --subtask3_entity_type ANIMALINJURY python main.py --subtask 3 --subtask3_entity_type PET python main.py --subtask 3 --subtask3_entity_type NOSOCOMIAL

NER files will be placed in ner_data folder.

Training NER Models

cd src/models/ner-flair/src

Training parameters can be changed in config.yaml file

Run the script python main.py. The results will be stored in the results folder as a csv. Inside the main.py are the methods called for predictions in each subtask, it is easy to change for what to split to predict (training, validation, test_background) by passing an argument of the split.

Predictions

Go back to the home directory of this repository. Place the trained NER models in the models folder with the following names: species.pt, animal.pt, pet.pt, nosocomial.pt, and food.pt.

Run the script in the main folder. python main.py

Contact

Authors: - Matías Rojas - Jose Barros - Mauricio Araneda - Jocelyn Dunstan

Mail: [email protected]

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
models		models
results		results
src		src
.DS_Store		.DS_Store
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
main.py		main.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Overview of the FLERT-Matcher model at LivingNER shared task (https://temu.bsc.es/livingner/)

Install

Creating data for NER Training

Training NER Models

Predictions

Contact

About

Releases

Packages

Contributors 2

Languages

License

plncmm/flert-matcher

Folders and files

Latest commit

History

Repository files navigation

Overview of the FLERT-Matcher model at LivingNER shared task (https://temu.bsc.es/livingner/)

Install

Creating data for NER Training

Training NER Models

Predictions

Contact

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages