We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
We have a draft improvement to the PDF parsing logic. This will enable us to eliminate Spacy as a dependency.
The training code is here: https://github.com/harmonydata/pdf-text-models-amol
The API modification is here https://github.com/harmonydata/harmonyapi branch nospacy
nospacy
The modification to the main python library is in
git clone -b updated_files_for_forntend https://github.com/Notysoty/harmony.git
Please quality control this branch and then merge it into main in all repositories and remove spacy from all requirements.txt and toml files.
requirements.txt
toml
Pdf extraction needs improvement
The text was updated successfully, but these errors were encountered:
Random Forest to address PDF parsing issues. #23 #28 #11 #4 #39
a1c4561
Remove Spacy completely from negation #39 #11
1fd9b12
Switched to Sklearn CRF Suite
Sorry, something went wrong.
No branches or pull requests
Description
We have a draft improvement to the PDF parsing logic. This will enable us to eliminate Spacy as a dependency.
The training code is here:
https://github.com/harmonydata/pdf-text-models-amol
The API modification is here
https://github.com/harmonydata/harmonyapi branch
nospacy
The modification to the main python library is in
Please quality control this branch and then merge it into main in all repositories and remove spacy from all
requirements.txt
andtoml
files.Rationale
Pdf extraction needs improvement
The text was updated successfully, but these errors were encountered: