Add attention_bias argument in transformer block and transformer layer modules, addressing change in MCore #6398


Job Run time
1s
2m 19s
1m 49s
2m 52s
2m 54s
12m 57s
2m 59s
1m 51s
7m 34s
3m 40s
1m 39s
2m 56s
1m 43s
21m 16s
58s
39s
1m 17s
1m 20s
39s
1m 49s
2m 9s
2m 47s
1m 1s
49s
27s
2m 58s
1m 53s
1m 45s
1m 51s
1m 50s
56s
1m 46s
58s
56s
1m 50s
4m 52s
2m 57s
2m 26s
2m 31s
54s
1m 32s
46s
45s
43s
48s
43s
2m 1s
50s
44s
45s
58s
1m 42s
1m 39s
1m 46s
1m 36s
44s
2m 43s
2m 6s
1m 55s
3m 38s
1m 34s
3m 29s
2m 5s
2m 39s
57s
51s
3m 31s
3m 40s
3m 43s
4m 5s
3m 33s
2m 55s
3m 35s
1m 43s
2m 0s
1m 11s
57s
1m 12s
55s
1m 46s
1m 57s
2m 23s
2m 19s
2m 30s
1m 22s
2m 47s
1m 51s
3m 21s
3m 8s
4m 18s
3m 3s
2m 2s
3m 20s
3m 0s
2m 0s
2m 1s
2m 35s
4m 39s
2m 44s
1m 55s
2m 49s
3m 50s
1m 9s
1m 56s
3m 44s
2m 10s
2m 52s
2m 55s
2m 51s
3m 4s
1m 57s
2m 59s
4m 0s
4m 15s
1m 55s
1m 44s
1m 45s
1m 1s
3m 58s
4m 47s
4m 19s
4m 44s
4m 36s
4m 23s
5m 33s
3m 39s
3m 45s
4m 21s
4m 23s
4m 22s
2m 2s
4m 9s
4m 9s
3m 36s
3m 39s
3m 33s
3m 36s
4m 13s
2m 24s
2m 22s
2m 16s
2m 26s
2m 30s
3m 1s
3m 1s
3s
6h 25m 34s