Privacy Policies Language Understanding Evaluation

This repository contains code for downloading data and implementations of baseline systems for PLUE.

Setup

Downloading PLUE Datasets

We have already provided all PLUE datasets and the correponding preprocessing scripts in the data folder. Except PolicyIE, we have also uploaded the all pre-processed datasets.

To pre-process policyIE, please run

cd data/policyie
bash run.sh

If you want to checkout how we preprocess PrivacyQA, PIExtract, APP-350, and OPP-115, please run

cd data
bash setup.sh

Downloading Pre-training Corpus

To download our pre-training corpus, please run

cd pretraining/data
bash download.sh

Usage

Pre-training

In each pre-trained model folder, please download all the required dependencies

pip install -r requirements.txt

Note that the dependencies are associated with each pre-trained models. After all dependencies are properly installed, please run

bash train.sh

Fine-tuning

All fine-tuning tasks share the same environment

cd finetuning
pip install -r requirements.txt

for each task, please run the run.sh in the corresponding folder. For example, if we want to run APP-350 with pp-roberta, we run

cd finetuning/classification/app350/
bash run.sh 0 policy_roberta # 0 indicate the gpu_id

License

Contents of this repository is under the MIT license. The license applies to the released model checkpoints as well.

Citation

@inproceedings{chi2023plue,
  author = {Chi, Jianfeng and Ahmad, Wasi Uddin and Tian, Yuan and Chang, Kai-Wei},
  title = {PLUE: Language Understanding Evaluation Benchmark for Privacy Policies in English},
  booktitle = {ACL (short)},
  year = {2023}
}

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
data		data
finetuning		finetuning
pretraining		pretraining
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
setup.py		setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Privacy Policies Language Understanding Evaluation

Setup

Downloading PLUE Datasets

Downloading Pre-training Corpus

Usage

Pre-training

Fine-tuning

License

Citation

About

Releases

Packages

Languages

License

JFChi/PLUE

Folders and files

Latest commit

History

Repository files navigation

Privacy Policies Language Understanding Evaluation

Setup

Downloading PLUE Datasets

Downloading Pre-training Corpus

Usage

Pre-training

Fine-tuning

License

Citation

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages