Add static GPT inference configs for 89B and 175B #2504

Merged: 2 commits, Jun 14, 2022

LiYuRio (Contributor) commented Jun 14, 2022

PR types

New features

PR changes

Others

Description

Adds parameter configurations for GPT-175B and GPT-89B, aligned with FasterTransformer and intended mainly for inference deployment.
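For readers unfamiliar with these configs, here is a rough sketch of what a 175B entry could look like and how to sanity-check its size. The numbers are the publicly reported GPT-3 175B hyperparameters, not values copied from this PR's diff. The dictionary name, the key names, and the `estimate_params` helper are assumptions made for illustration. The 89B values are not shown in this thread, so they are left out.

```python
# Hypothetical illustration only: these are the published GPT-3 175B
# hyperparameters, NOT the values from this PR's diff. Key names are assumed.
GPT3_175B_SKETCH = {
    "vocab_size": 50257,              # assumption: GPT-2/GPT-3 BPE vocabulary
    "hidden_size": 12288,
    "num_hidden_layers": 96,
    "num_attention_heads": 96,
    "intermediate_size": 4 * 12288,   # feed-forward width, 4x hidden size
    "max_position_embeddings": 2048,  # assumption: GPT-3 context length
}


def estimate_params(cfg):
    """Estimate the parameter count of a standard decoder-only transformer."""
    h = cfg["hidden_size"]
    ffn = cfg["intermediate_size"]
    # Attention: Q, K, V and output projections, weights plus biases.
    attn = 4 * h * h + 4 * h
    # Feed-forward: two linear layers, weights plus biases.
    mlp = 2 * h * ffn + ffn + h
    # Two layer norms per block, each with a scale and a shift vector.
    norms = 2 * 2 * h
    per_layer = attn + mlp + norms
    # Token and position embeddings, plus the final layer norm.
    embeddings = (cfg["vocab_size"] + cfg["max_position_embeddings"]) * h
    return cfg["num_hidden_layers"] * per_layer + embeddings + 2 * h


total = estimate_params(GPT3_175B_SKETCH)
head_dim = GPT3_175B_SKETCH["hidden_size"] // GPT3_175B_SKETCH["num_attention_heads"]
print(f"approx. parameters: {total / 1e9:.1f}B, head dim: {head_dim}")
```

The estimate comes out at roughly 175 billion parameters, which is a quick way to check that a config entry matches the model size in its name.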

CLAassistant commented Jun 14, 2022

CLA assistant check
All committers have signed the CLA.

@LiYuRio LiYuRio force-pushed the dev_add_configs branch 2 times, most recently from 26f090e to 5eb4c11 Compare June 14, 2022 03:17
qingqing01 (Collaborator) previously approved these changes Jun 14, 2022 and left a comment:

LGTM

@@ -483,6 +483,36 @@ class GPTPretrainedModel(PretrainedModel):
"bos_token_id": 0,
"eol_token_id": 3,
},
"gpt3-89B-en": { # 89B
Collaborator review comment:

This is for large-model inference deployment, right?

This is the regular GPT model, so the configs do not need to be added here.

They could be added to the dynamic-graph GPT-3 model instead:
examples/language_model/gpt-3/dygraph/modeling.py

Contributor Author reply:

Done, updated.

@ZHUI ZHUI self-assigned this Jun 14, 2022
ZHUI (Collaborator) left a comment:

LGTM

@ZHUI ZHUI merged commit 1f446ff into PaddlePaddle:develop Jun 14, 2022
@LiYuRio LiYuRio deleted the dev_add_configs branch June 14, 2022 11:07