Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

HSTU streaming training #178

Open
WangPengGG opened this issue Jan 23, 2025 · 0 comments
Open

HSTU streaming training #178

WangPengGG opened this issue Jan 23, 2025 · 0 comments

Comments

@WangPengGG
Copy link

I understand that this code does not currently include a streaming training implementation, but I wanted to raise a concern about this topic for discussion.

In industrial streaming training settings, user behavior data typically arrives incrementally over several days, e.g., [d1, d2, ..., dn], with new data continuously appended, e.g., [d1, d2, ..., dn, dn+1]. A key issue in such scenarios is that historical data is gradually forgotten over time.

This can lead to a situation where historical data is trained on multiple times before it is forgotten, while newly arrived data—often the most critical for model updates—receives similar importance in terms of loss weighting as the historical data. This imbalance in the treatment of new and historical data can result in suboptimal model performance, as the model may fail to adequately prioritize the most recent and relevant user behavior patterns.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant