Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

September Work Plan #110

Closed
feifeibear opened this issue Sep 8, 2021 · 0 comments
Closed

September Work Plan #110

feifeibear opened this issue Sep 8, 2021 · 0 comments
Assignees

Comments

@feifeibear
Copy link
Collaborator

feifeibear commented Sep 8, 2021

派大星现在整块的功能已经满足开源的条件了。9月份需要重点关注如何释放开源的影响力。主要是应用效果和性能效果两个方面

应用效果

  1. 我们自己训练的一个大语言模型,和PatrickStar一起发布,这个不需要依赖业务方,完全自己可控。
    CLUE数据集准备 #97
  2. 和PRC的算法同学,对longformer/xlnet/bert等训练场景进行提升,这些合作有助于增加派大星倍数,但是应用方可能存在模型不够大的情况。
  3. 对外拉一些用户试用,形成反馈的闭环。
    支持TencentPretrain #57

性能效果

  1. 我们继续提升benchmark指标,和DeepSpeed,Megatron对比
    子任务包括memory profiler,可以有助于我们profile现在的性能瓶颈
@zhuzilin zhuzilin pinned this issue Sep 8, 2021
@zhuzilin zhuzilin unpinned this issue Oct 26, 2021
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants