Skip to content

Commit

Permalink
Using a more precise profiling on ROCm to properly account for weight…
Browse files Browse the repository at this point in the history
…s padding (#394)

Signed-off-by: Gregory Shtrasberg <[email protected]>
  • Loading branch information
gshtras committed Jan 31, 2025
1 parent 7a8987d commit 8388cf1
Showing 1 changed file with 1 addition and 1 deletion.
2 changes: 1 addition & 1 deletion vllm/platforms/rocm.py
Original file line number Diff line number Diff line change
Expand Up @@ -164,4 +164,4 @@ def get_current_memory_usage(cls,
device: Optional[torch.types.Device] = None
) -> float:
torch.cuda.reset_peak_memory_stats(device)
return torch.cuda.max_memory_allocated(device)
return torch.cuda.mem_get_info()[1] - torch.cuda.mem_get_info()[0]

0 comments on commit 8388cf1

Please sign in to comment.