Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Import EAI tasks #9

Closed
ibeltagy opened this issue Mar 2, 2022 · 3 comments
Closed

Import EAI tasks #9

ibeltagy opened this issue Mar 2, 2022 · 3 comments

Comments

@ibeltagy
Copy link

ibeltagy commented Mar 2, 2022

EAI tasks that are not on CrossFit.
Total task files in EAI: 47
Missing from CrossFit: 28

Not on HF dataset

  • arithmetic
  • asdiv - A Diverse Corpus for Evaluating and Developing English Math Word Problem Solvers
  • coqa
  • gsm8k - Training Verifiers to Solve Math Word Problems
  • hendrycks_ethics, math, test
  • lambada-cloze
  • lambada-multilingual
  • logiqa
  • mutual - MuTual: A Dataset for Multi-Turn Dialogue Reasoning
  • naturalqs
  • pile
  • qa4mre
  • quac
  • sat
  • storycloze
  • translation
  • triviaqa
  • truthfulqa
  • unscramble
  • wikitext

Available on HF dataset

  • head_qa
  • lambada
  • prost - PROST: Physical Reasoning about Objects Through Space and Time
  • pubmedqa
  • qasper
  • wsc273 - another version of winogrande
  • cbt - Children’s Book
  • drop
@ibeltagy
Copy link
Author

ibeltagy commented Mar 2, 2022

Not all the tasks listed above are important. Listing the important ones that are not on CrossFit and that we still need to import from EAI.

  • arithmetic
  • coqa
  • drop
  • logiqa
  • naturalqs
  • sat
  • storycloze
  • triviaqa
  • wikitext
  • head_qa
  • lambada
  • prost
  • pubmedqa
  • qasper

@ibeltagy
Copy link
Author

ibeltagy commented Mar 2, 2022

Also, looking at the prompts of EAI, many of them are either no prompt at all (arithmetic, wikitext, lambada) or has a mostly unified QA format

@dirkgr
Copy link
Member

dirkgr commented Apr 20, 2022

Superseded by #8.

@dirkgr dirkgr closed this as completed Apr 20, 2022
OyvindTafjord added a commit that referenced this issue May 19, 2023
Improve perplexity custom task
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants