- My research work primarily focuses on RLHF (Reinforcement Learning from Human Feedback).
- Here is my academic page
🎯
Focusing
Focus on LLM Alignment (RLHF)
-
Fudan University
- shanghai
-
08:30
(UTC -12:00)
Pinned Loading
-
-
fakerbaby.github.io
fakerbaby.github.io PublicForked from academicpages/academicpages.github.io
Github Pages template for academic personal websites, forked from mmistakes/minimal-mistakes
JavaScript
Something went wrong, please refresh the page to try again.
If the problem persists, check the GitHub status page or contact support.
If the problem persists, check the GitHub status page or contact support.