[Benchmark] Optimize bert using fused_ffn and fused_attention #2523

FeixLiu · 2022-06-15T07:44:06Z

PR types

Others

PR changes

Others

Description

Optimize the BERT benchmark performance by using the fused_ffn and fused_attention operators.

model_zoo/bert/run_pretrain.py

ZHUI · 2022-06-16T03:41:25Z

+                                          custom_white_list=[
+                                              "layer_norm", "softmax", "gelu",
+                                              "fused_attention",
+                                              "fused_feedforward"


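Under AMP, the inputs to fused_attention and fused_feedforward are fp32. Does the computation inside the op then run in fp16 or in fp32?

And in the non-AMP case, does the operator compute in fp32 internally?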
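For readers following this thread, here is a toy model of how an O1-style white list is usually understood to pick the compute dtype for an op that receives fp32 inputs. It is a sketch under assumptions, not Paddle's actual dispatch logic: `compute_dtype` and `ASSUMED_DEFAULT_WHITE_LIST` are hypothetical names, and the default list contents are placeholders. Only the five custom entries come from the diff above.

```python
# Toy model of O1-style AMP dtype selection. Illustrative only.
# ASSUMED_DEFAULT_WHITE_LIST is a placeholder, not the framework's real default list.
ASSUMED_DEFAULT_WHITE_LIST = frozenset({"matmul", "conv2d"})


def compute_dtype(op_name, amp_enabled, custom_white_list=()):
    """Return the dtype this simplified model picks for an op fed fp32 inputs."""
    if not amp_enabled:
        # Without AMP nothing is cast, so the op runs in its input dtype.
        return "float32"
    white_list = ASSUMED_DEFAULT_WHITE_LIST | set(custom_white_list)
    # White-listed ops get their inputs cast down and run in fp16;
    # in this model every other op keeps its fp32 inputs.
    return "float16" if op_name in white_list else "float32"


# The custom white list entries shown in the diff above.
custom = ["layer_norm", "softmax", "gelu", "fused_attention", "fused_feedforward"]

for op in ("fused_attention", "fused_feedforward", "dropout"):
    print(op, compute_dtype(op, True, custom), compute_dtype(op, False, custom))
```

Under this model, adding the fused ops to the white list is what moves them to fp16 when AMP is on, and with AMP off they stay in fp32. Whether the real kernels behave exactly this way should be confirmed against the framework.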

paddlenlp/transformers/bert/modeling.py

ZHUI · 2022-06-16T03:46:25Z

+        self.fuse = fuse
+        if self.fuse:
+            self.encoder = nn.LayerList([
+                FusedTransformerEncoderLayer(


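If the layer type changes here, do the parameter names in the state_dict change as well?

Do you mean starting fused training from a non-fused checkpoint? That is probably not supported.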
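One way to make this kind of mismatch visible before loading is to compare the key sets of the checkpoint and the model. The sketch below is framework-agnostic and uses plain lists of names. `compare_state_dict_keys` is a hypothetical helper, and the parameter names are invented for illustration. They are not the names the fused or non-fused layers actually produce.

```python
def compare_state_dict_keys(checkpoint_keys, model_keys):
    """Report parameter names present on one side but not the other."""
    ckpt, model = set(checkpoint_keys), set(model_keys)
    return {
        "missing_in_checkpoint": sorted(model - ckpt),
        "unexpected_in_checkpoint": sorted(ckpt - model),
    }


# Hypothetical parameter names, chosen only to show a mismatch.
non_fused_ckpt = [
    "encoder.layers.0.self_attn.q_proj.weight",
    "encoder.layers.0.linear1.weight",
]
fused_model = [
    "encoder.0.fused_attn.qkv_weight",
    "encoder.0.ffn.linear1_weight",
]

report = compare_state_dict_keys(non_fused_ckpt, fused_model)
print(report["missing_in_checkpoint"])
print(report["unexpected_in_checkpoint"])
```

If both lists in the report come back empty, the names line up. Otherwise the checkpoint would need a renaming or conversion step, which this PR does not claim to provide.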

ZHUI

LGTM

FeixLiu changed the title ~~[WIP] Bert benchmark optimize~~ Bert benchmark optimize Jun 16, 2022

FeixLiu added 5 commits June 16, 2022 11:01

framework for fused transformer

a25a8a4

fused ffn and fused attention

85ef143

update script

f289898

for dygraph

34e1988

update

3ec2b7a

ZHUI self-requested a review June 16, 2022 03:11

update

5756269

ZHUI reviewed Jun 16, 2022

address some comments

a8db129

ZHUI approved these changes Jun 16, 2022

Merge branch 'develop' into bert_benchmark_optimize

f939455

ZHUI merged commit bc84454 into PaddlePaddle:develop Jun 16, 2022

FeixLiu deleted the bert_benchmark_optimize branch June 16, 2022 06:21

ZHUI changed the title ~~Bert benchmark optimize~~ [Benchmark] Optimize bert using fused_ffn and fused_attention Jun 16, 2022

ZHUI mentioned this pull request Jun 28, 2022

PaddleNLP 2.3.4 Release Note Candidate #2632

Closed
