Add static GPT inference configs for 89B and 175B #2504

LiYuRio · 2022-06-14T03:11:52Z

PR types

New features

PR changes

Others

Description

新增GPT-175B和GPT-89B的参数配置，和FasterTransformer对齐，主要用于推理部署。

CLAassistant · 2022-06-14T03:11:59Z

All committers have signed the CLA.

qingqing01

LGTM

ZHUI · 2022-06-14T09:02:14Z

paddlenlp/transformers/gpt/modeling.py

@@ -483,6 +483,36 @@ class GPTPretrainedModel(PretrainedModel):
            "bos_token_id": 0,
            "eol_token_id": 3,
        },
+        "gpt3-89B-en": { # 89B


这个是大模型推理部署是吧。

这里是普通的gpt 模型，这里可以不加。

动态图gpt-3可以添加一下
examples/language_model/gpt-3/dygraph/modeling.py

ZHUI

LGTM

LiYuRio force-pushed the dev_add_configs branch 2 times, most recently from 26f090e to 5eb4c11 Compare June 14, 2022 03:17

qingqing01 previously approved these changes Jun 14, 2022

View reviewed changes

ZHUI reviewed Jun 14, 2022

View reviewed changes

ZHUI self-assigned this Jun 14, 2022

LiYuRio added 2 commits June 14, 2022 10:01

Add configs for 89B and 175B

5f25dc1

remove config for normal gpt, add config for dygraph

3ccb224

LiYuRio dismissed qingqing01’s stale review via 3ccb224 June 14, 2022 10:02

LiYuRio force-pushed the dev_add_configs branch from 959df0d to 3ccb224 Compare June 14, 2022 10:02

LiYuRio requested a review from ZHUI June 14, 2022 10:59

ZHUI approved these changes Jun 14, 2022

View reviewed changes

ZHUI merged commit 1f446ff into PaddlePaddle:develop Jun 14, 2022

LiYuRio deleted the dev_add_configs branch June 14, 2022 11:07

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add static GPT inference configs for 89B and 175B #2504

Add static GPT inference configs for 89B and 175B #2504

LiYuRio commented Jun 14, 2022

CLAassistant commented Jun 14, 2022 •

edited

Loading

qingqing01 left a comment

ZHUI Jun 14, 2022

LiYuRio Jun 14, 2022

ZHUI left a comment

Add static GPT inference configs for 89B and 175B #2504

Add static GPT inference configs for 89B and 175B #2504

Conversation

LiYuRio commented Jun 14, 2022

PR types

PR changes

Description

CLAassistant commented Jun 14, 2022 • edited Loading

qingqing01 left a comment

Choose a reason for hiding this comment

ZHUI Jun 14, 2022

Choose a reason for hiding this comment

LiYuRio Jun 14, 2022

Choose a reason for hiding this comment

ZHUI left a comment

Choose a reason for hiding this comment

CLAassistant commented Jun 14, 2022 •

edited

Loading