LLM Evaluation

This project evaluates the performance of Large Language Models on different NLP tasks, in combination with various prompts.

Environment

Create a virtualenv and install the requirements:

make virtualenv

Then pull the data:

dvc pull

Note that DVC only works if you have access to Mantis AWS.
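
If the pull fails, a quick pre-flight check can help narrow down the cause. The sketch below is not part of this repository and rests on assumptions this README does not confirm: that the DVC remote is an S3 bucket, and that the AWS CLI is installed. `require_tool` and `preflight` are hypothetical helper names.

```shell
# Hypothetical helper: fail with a message if a command is not on PATH.
require_tool() {
  if ! command -v "$1" >/dev/null 2>&1; then
    echo "missing required tool: $1" >&2
    return 1
  fi
}

# Assumption: the DVC remote lives on S3, so valid AWS credentials are needed.
# `aws sts get-caller-identity` prints the identity the current credentials
# resolve to, and fails if no valid credentials are configured.
preflight() {
  require_tool dvc && require_tool aws && aws sts get-caller-identity
}
```

With these functions defined in the current shell, `preflight && dvc pull` only attempts the pull once both tools are found and the credentials resolve.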

Data

To be filled