SEEKR: Selective Attention-Guided Knowledge Retention for Continual Learning of Large Language Models
Official implementation of our paper "SEEKR: Selective Attention-Guided Knowledge Retention for Continual Learning of Large Language Models" in EMNLP 2024.
conda create -n seekr python=3.10
conda activate seekr
pip install torch==2.0.1 torchvision==0.15.2 torchaudio==2.0.2 --index-url https://download.pytorch.org/whl/cu118
pip install -r requirements.txt
pip install flash-attn==2.5.5
-
Download Trace Benchmark
-
Download SuperNI Benchmark
-
Modify
path/to/datasets
inscripts/exp_seq_seekr.sh
-
Modify
path/to/base_models
inscripts/exp_seq_seekr.sh
bash scripts/exp_seq_seekr.sh llama2 tracer1
This project is built on top of TRACE