Commit

adding citation of part 2
fakerbaby authored Feb 4, 2024
1 parent 657e796 commit 49cf766
Showing 1 changed file with 10 additions and 0 deletions.
10 changes: 10 additions & 0 deletions README.md
@@ -186,3 +186,13 @@ bash train_rm.sh
primaryClass={cs.CL}
}
```
```bibtex
@misc{wang2024secrets,
title={Secrets of RLHF in Large Language Models Part II: Reward Modeling},
author={Binghai Wang and Rui Zheng and Lu Chen and Yan Liu and Shihan Dou and Caishuang Huang and Wei Shen and Senjie Jin and Enyu Zhou and Chenyu Shi and Songyang Gao and Nuo Xu and Yuhao Zhou and Xiaoran Fan and Zhiheng Xi and Jun Zhao and Xiao Wang and Tao Ji and Hang Yan and Lixing Shen and Zhan Chen and Tao Gui and Qi Zhang and Xipeng Qiu and Xuanjing Huang and Zuxuan Wu and Yu-Gang Jiang},
year={2024},
eprint={2401.06080},
archivePrefix={arXiv},
primaryClass={cs.AI}
}
```
