llama 3.1 has correct max_seq_len
for all versions
#6526
Job | Run time |
---|---|
4m 37s | |
4m 26s | |
4m 28s | |
13m 31s |
max_seq_len
for all versions
#6526
Job | Run time |
---|---|
4m 37s | |
4m 26s | |
4m 28s | |
13m 31s |