SEEKR: Selective Attention-Guided Knowledge Retention for Continual Learning of Large Language Models

Official implementation of our EMNLP 2024 paper "SEEKR: Selective Attention-Guided Knowledge Retention for Continual Learning of Large Language Models".

Install

conda create -n seekr python=3.10
conda activate seekr
pip install torch==2.0.1 torchvision==0.15.2 torchaudio==2.0.2 --index-url https://download.pytorch.org/whl/cu118
pip install -r requirements.txt
pip install flash-attn==2.5.5

Prepare datasets and models

  • Download the TRACE benchmark

  • Download the SuperNI benchmark

  • Replace path/to/datasets in scripts/exp_seq_seekr.sh with your local dataset directory

  • Replace path/to/base_models in scripts/exp_seq_seekr.sh with your local base-model directory
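The two path edits above can be made with an in-place substitution instead of a manual edit. A minimal sketch, assuming the repository is your working directory; /data/trace and /models/llama2 are hypothetical locations, so substitute wherever you actually stored the downloads:

```shell
# Replace the placeholder paths in the run script with local copies
# (/data/trace and /models/llama2 are example locations, not defaults).
sed -i 's#path/to/datasets#/data/trace#g' scripts/exp_seq_seekr.sh
sed -i 's#path/to/base_models#/models/llama2#g' scripts/exp_seq_seekr.sh
```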

Continual learning with SEEKR

bash scripts/exp_seq_seekr.sh llama2 tracer1

Acknowledgement

This project is built on top of TRACE.
