[cp] apply fsdp to model when CP is enabled without DP for correct loss and lower mem usage #685

Merged
XilunWu merged 22 commits into main from gh/XilunWu/12/head on Dec 11, 2024

Commits

Commits on Nov 20, 2024

Commits on Dec 4, 2024