Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add RewardModelScore step #840

Merged
merged 8 commits into from
Jul 30, 2024
Merged

Add RewardModelScore step #840

merged 8 commits into from
Jul 30, 2024

Conversation

gabrielmbmb
Copy link
Member

Description

This PR adds the new RewardModelScore step which uses transformers to load a reward model to assign and score to an instruction-response or a conversation.

@gabrielmbmb gabrielmbmb added the enhancement New feature or request label Jul 29, 2024
@gabrielmbmb gabrielmbmb added this to the 1.3.0 milestone Jul 29, 2024
@gabrielmbmb gabrielmbmb self-assigned this Jul 29, 2024
Copy link

Documentation for this PR has been built. You can view it at: https://distilabel.argilla.io/pr-840/

Copy link

codspeed-hq bot commented Jul 29, 2024

CodSpeed Performance Report

Merging #840 will not alter performance

Comparing reward-model-step (62ec8a5) with develop (974b45e)

Summary

✅ 1 untouched benchmarks

@gabrielmbmb gabrielmbmb merged commit 20bd1e3 into develop Jul 30, 2024
5 of 7 checks passed
@gabrielmbmb gabrielmbmb deleted the reward-model-step branch July 30, 2024 07:59
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
Status: Done
Development

Successfully merging this pull request may close these issues.

1 participant