-
Notifications
You must be signed in to change notification settings - Fork 177
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Add normalization to HessianDiagonalAggregator #184
Comments
The PR looks good to me in general, but I haven't got a chance to look at it carefully. Can you have other folks to take a look at the PR as well, for example @ankans or Ajith, so that you guys get the help needed and won't get blocked? |
Yes, we are having it reviewed by Ankan too. What it does is only setup normalization contexts in GAME and then call the same machinery as photon. If it was correct in photon after the normalization contexts were set, it should be correct in GAME, that hasn't changed at all. What happens to the Hessian diagonal in photon, in case of normalization? |
@XianXing - what is the real impact of not having this? Is it correctness? Is it performance? Is it required to compute variances correctly? |
Correctness. The impact would be downstream applications. For example, I don't know if we are using the estimated variance for multi-armed bandit and explore-exploit tradeoffs at LNKD or not, but based on our experience the quality of estimated variance would have direct impact on those applications (can't be more specific than this without getting into trouble :) ). Having said that, if there are no such use cases at LNKD, please feel free to de-prioritize it. |
So, it is correctness of variances, only. The correctness of coefficient means themselves does not depend on that, right? |
From a comment by @XianXing on PR #131:
"The computation and approximation for Hessian diagonal was designed and implemented independent of Photon's normalization, because previously we didn't enable normalization for GLMix.
But it will be helpful if we take the normalization into account as well here for Hessian diagonal, in order to make it consistent with the rest of code."
The text was updated successfully, but these errors were encountered: