Add HPU specific arguments to benchmark_throughput #406

kdamaszk · 2024-10-18T09:41:04Z

Modify `benchmark_throughput.py` to allow running with FP8 on HPU (KV cache dtype `fp8_inc`) and to use padding-aware scheduling.

michalkuligowski · 2024-10-21T11:36:59Z

benchmarks/benchmark_throughput.py

+    parser.add_argument("--weights-load-device",
+                        type=str,
+                        default=None,
+                        choices=["cuda", "neuron", "hpu", "cpu"],


Why do we need choices other than cpu and hpu?

I didn't want to block other paths that are theoretically possible, based on the `EngineArgs` defaults. I see that this value is currently based on `DEVICE_OPTIONS`, so I can switch to that.
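For illustration, here is a minimal sketch of what deriving the choices from a shared list could look like, instead of hard-coding them in the benchmark. The `DEVICE_OPTIONS` list below is a local stand-in with assumed values, not the actual list from the engine arguments module, and `build_parser` is a hypothetical helper name.

```python
import argparse

# Assumption: a local stand-in for the shared DEVICE_OPTIONS list mentioned
# in the thread. The real list lives alongside EngineArgs and its exact
# contents may differ from these illustrative values.
DEVICE_OPTIONS = ["cuda", "neuron", "cpu", "hpu"]


def build_parser() -> argparse.ArgumentParser:
    """Build a parser whose device choices come from one shared list."""
    parser = argparse.ArgumentParser(description="Throughput benchmark (sketch)")
    parser.add_argument("--weights-load-device",
                        type=str,
                        default=None,
                        choices=DEVICE_OPTIONS,
                        help="Device to which model weights are loaded.")
    return parser


if __name__ == "__main__":
    # Parse an explicit argument list so the example does not depend on sys.argv.
    args = build_parser().parse_args(["--weights-load-device", "hpu"])
    print(args.weights_load_device)
```

Reusing one list keeps the benchmark's accepted values in step with the engine's, so a newly supported device does not have to be added in two places.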


kdamaszk added 3 commits October 18, 2024 12:39

Add HPU specific arguments to benchmark_throughput

76af871

Add --max-num-seqs argument

98e30a4

format.sh

3775d2a

kdamaszk requested review from michalkuligowski and kzawora-intel October 18, 2024 12:50

michalkuligowski reviewed Oct 21, 2024


Change choices for --weights-load-device

28c334b

kdamaszk requested a review from michalkuligowski October 21, 2024 12:17

kdamaszk added the habana label (Issues or PRs submitted by Habana Labs) Oct 21, 2024

michalkuligowski approved these changes Oct 22, 2024


michalkuligowski merged commit acde882 into habana_main Oct 22, 2024
19 checks passed

michalkuligowski deleted the dev/kdamaszke/update-benchmark-throughput branch October 22, 2024 08:24
