You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I've read your paper and am intrigued by the MetaRM-optimization algorithm. Could you share details on the gradient computation and any specific conditions for the update step (θ_t+1 ← θ_t - α ∇θ_t Lθ'(X_t))? Any reference in open-source code would also be appreciated. Thanks a lot!
The text was updated successfully, but these errors were encountered:
Hi, thank you very much for your attention!
The release of the MetaRM code has been delayed as we have been occupied with paper submissions. We will be publishing the open-source code soon.
Can @liuyan please provide an explanation of the technical details of MetaRM here beforehand?
During each iteration, we back up the model parameters and then sequentially perform Meta-process and MetaRM-optimization.
Regarding the MetaRM-optimization you are concerned about, we first conduct the backpropagation to obtain gradients. Subsequently, we copy the backed-up model parameters here and proceed with gradient descent.
Since gradient descent involves updating parameters within the optimizer for backing up and resetting parameters we particularly use the deepspeed implementation's interfaces, safe_get_full_fp32_param and safe_set_full_fp32_param.
For specific details of the interfaces, please refer to the official document below. https://deepspeed.readthedocs.io/en/stable/zero3.html#deepspeed.utils.safe_get_full_fp32_param
I hope this answer proves helpful to you! Feel free to raise any questions if you have any.
Hi there,
I've read your paper and am intrigued by the MetaRM-optimization algorithm. Could you share details on the gradient computation and any specific conditions for the update step (θ_t+1 ← θ_t - α ∇θ_t Lθ'(X_t))? Any reference in open-source code would also be appreciated. Thanks a lot!
The text was updated successfully, but these errors were encountered: