test leaderboard submission #3

Closed

Owner

@zhudotexe zhudotexe commented Mar 13, 2024

Eval run succeeded! Link to run: link

Here are the results of the submission(s):

GPT-3.5-turbo

OpenAI, 2023

Closed Book

  • Loose: 0.0
  • Strict: 0.0
  • ROUGE-1: 0.0
  • ROUGE-2: 0.0
  • ROUGE-L: 0.0
  • BLEURT: 0.0
  • GPT Judge: 0.0

Open Book

  • Loose: 0.0
  • Strict: 0.0
  • ROUGE-1: 0.0
  • ROUGE-2: 0.0
  • ROUGE-L: 0.0
  • BLEURT: 0.0
  • GPT Judge: 0.0

Evidence Provided

  • Loose: 0.0
  • Strict: 0.0
  • ROUGE-1: 0.0
  • ROUGE-2: 0.0
  • ROUGE-L: 0.0
  • BLEURT: 0.0
  • GPT Judge: 0.0

GPT-4

OpenAI, 2023

Closed Book

  • Loose: 0.0
  • Strict: 0.0
  • ROUGE-1: 0.0
  • ROUGE-2: 0.0
  • ROUGE-L: 0.0
  • BLEURT: 0.0
  • GPT Judge: 0.0

Open Book

  • Loose: 0.0
  • Strict: 0.0
  • ROUGE-1: 0.0
  • ROUGE-2: 0.0
  • ROUGE-L: 0.0
  • BLEURT: 0.0
  • GPT Judge: 0.0

Evidence Provided

  • Loose: 0.0
  • Strict: 0.0
  • ROUGE-1: 0.0
  • ROUGE-2: 0.0
  • ROUGE-L: 0.0
  • BLEURT: 0.0
  • GPT Judge: 0.0

If all looks well, a maintainer will come by soon to merge this PR and your entry/entries will appear on the leaderboard. If you need to make any changes, feel free to push new commits to this PR. Thanks for submitting to FanOutQA!

@zhudotexe zhudotexe closed this Mar 14, 2024
@zhudotexe zhudotexe deleted the leaderboard-submissions-test branch April 16, 2024 18:41