Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[BFCL] Set Model Temperature to 0.001 for All Models #574

Merged
merged 4 commits into from
Aug 10, 2024

Conversation

HuanzhiMao
Copy link
Collaborator

The current model response generation script uses a default temperature of 0.7 for inference. This introduces some degree of randomness into the model output generation, leading to potential variability in the evaluation scores from run to run.
For benchmarking purposes, we set it to 0.001 for consistency and reliability of the evaluation results.

resolves #500 , resolves #562

This will affect the leaderboard score. We will update it shortly.

@HuanzhiMao HuanzhiMao marked this pull request as ready for review August 7, 2024 23:34
Copy link
Collaborator

@CharlieJCJ CharlieJCJ left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Can put default values in constants file, address in refactor pr later, this PR concerns on enabling functionality, LGTM

@HuanzhiMao
Copy link
Collaborator Author

Can put default values in constants file, address in refactor pr later, this PR concerns on enabling functionality, LGTM

Why should we put it in the constants file? I think that will only make the code more complicated? Cause you will then need to import that global constant in every single model handler.
If you are concerned with the default values for argument temperature=0.001, top_p=1, max_tokens=1000 when init the model handler, we could technically remove them as they will never be used optionally.

@ShishirPatil ShishirPatil merged commit e8c9c5a into ShishirPatil:main Aug 10, 2024
@HuanzhiMao HuanzhiMao deleted the temperature branch August 13, 2024 06:04
aw632 pushed a commit to vinaybagade/gorilla that referenced this pull request Aug 22, 2024
The current model response generation script uses a default temperature
of 0.7 for inference. This introduces some degree of randomness into the
model output generation, leading to potential variability in the
evaluation scores from run to run.
For benchmarking purposes, we set it to 0.001 for consistency and
reliability of the evaluation results.

resolves ShishirPatil#500 , resolves ShishirPatil#562 

This will affect the leaderboard score. We will update it shortly.

---------

Co-authored-by: Shishir Patil <[email protected]>
@HuanzhiMao HuanzhiMao added the BFCL-General General BFCL Issue label Aug 25, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
BFCL-General General BFCL Issue
Projects
None yet
Development

Successfully merging this pull request may close these issues.

Set Model Temperature to 0 for Consistent Leaderboard Results
3 participants