iceParsingPipeline

Pipelines to parse plain text files using either the Berkeley neural parser or the Berkeley parser. Both models are trained on IcePaHC.

The two pipelines and the Berkeley neural parser are licensed under the MIT license while the Berkeley parser is licensed under GPLv2.

Setting up the pipeline

The pipeline requires both Python 3.6>= and Java. Once both programs have been installed, the rest of the dependencies can be installed. Run ./setup.sh to install all necessary dependencies. All dependencies are listed below.

Using the neural pipeline

Download the parsing model from here (2.2 GB) and save under the /tools/neuralParser/ directory. Make sure not to change the name of the model. Run the command:

$ ./runallNeural.sh inputfile.txt textOutputfile.txt outputfile.psd

file1: plain text input

file2: plain text output, split into matrix clauses

file3: parsed .psd file formatted like IcePaHC

Using the previous pipeline

Run the command:

$ ./runall.sh inputfile.txt textOutputfile.txt outputfile.psd

file1: plain text input

file2: plain text output, split into matrix clauses

file3: parsed .psd file formatted like IcePaHC

Dependencies

python3

-- package detectormorse (pip3 install detectormorse)

java

Additional dependencies needed for the neural parsing pipeline:

-- package tokenizer (pip3 install tokenizer)

Cython (pip3 install cython)

numpy (pip3 install numpy)

PyTorch version 0.4.1 or 1.0/1.1 (pip3 install torch==1.1.0 torchvision==0.3.0)

pytorch-pretrained-bert (pip3 install pytorch-pretrained-bert)

Name		Name	Last commit message	Last commit date
Latest commit History 45 Commits
micepahc		micepahc
temp		temp
tools		tools
GPLv2 LICENSE		GPLv2 LICENSE
LICENSE		LICENSE
README.md		README.md
demoOutput.psd		demoOutput.psd
demoTextOutput.txt		demoTextOutput.txt
demoinput.txt		demoinput.txt
runall.sh		runall.sh
runallNeural.sh		runallNeural.sh
setup.sh		setup.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

iceParsingPipeline

Setting up the pipeline

Using the neural pipeline

Using the previous pipeline

Dependencies

About

Releases

Packages

Contributors 2

Languages

License

antonkarl/iceParsingPipeline

Folders and files

Latest commit

History

Repository files navigation

iceParsingPipeline

Setting up the pipeline

Using the neural pipeline

Using the previous pipeline

Dependencies

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages