Create `PlacementGroup` for steps using `vLLM` #842

gabrielmbmb · 2024-07-30T11:17:53Z

Description

This PR updates RayPipeline to create a PlacementGroup when an step is using vLLM. The created placement group, contains as many GPU bundles as tensor_parallel_size specified in the vLLM initialisation and it uses the STRICT_PACK to have the GPU bundles in the same node. It also creates the placement group specifying the _soft_target_node_id, assuring that the _RayStepWrapper actor for the step using vLLM will be created in a specific node. This avoid having vLLM raising the exception Ray does not allocate any GPUs on the driver node. Consider adjusting the Ray placement group or running the driver on a GPU node, as it assures that the driver _StepWrapperRay created resides in the same node as the ray actors created by vLLM for the distributed inference.

github-actions · 2024-07-30T11:19:22Z

Documentation for this PR has been built. You can view it at: https://distilabel.argilla.io/pr-842/

codspeed-hq · 2024-07-30T11:25:27Z

CodSpeed Performance Report

Merging #842 will not alter performance

_{Comparing ray-placement-group (f56a9bb) with develop (2aa977f)}

Summary

✅ 1 untouched benchmarks

gabrielmbmb added 2 commits July 30, 2024 11:56

Create placement group for vLLM

9b60f8b

Merge branch 'develop' into ray-placement-group

6d3b74e

gabrielmbmb added the enhancement New feature or request label Jul 30, 2024

gabrielmbmb added this to the 1.3.0 milestone Jul 30, 2024

gabrielmbmb self-assigned this Jul 30, 2024

Use SPREAD if pipeline_parallel_size>1

b689881

gabrielmbmb added 4 commits July 30, 2024 14:00

Fix bundle initialization

7b85497

Fix wrong dictionary

2449ed7

Remove using SPMD from ray docs

b679206

Refactor creating PlacementGroup for vLLM

f56a9bb

gabrielmbmb marked this pull request as ready for review July 30, 2024 12:42

gabrielmbmb merged commit be61d20 into develop Jul 30, 2024
7 checks passed

gabrielmbmb deleted the ray-placement-group branch July 30, 2024 12:56

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Create `PlacementGroup` for steps using `vLLM` #842

Create `PlacementGroup` for steps using `vLLM` #842

gabrielmbmb commented Jul 30, 2024 •

edited

Loading

github-actions bot commented Jul 30, 2024

codspeed-hq bot commented Jul 30, 2024 •

edited

Loading

Create PlacementGroup for steps using vLLM #842

Create PlacementGroup for steps using vLLM #842

Conversation

gabrielmbmb commented Jul 30, 2024 • edited Loading

Description

github-actions bot commented Jul 30, 2024

codspeed-hq bot commented Jul 30, 2024 • edited Loading

CodSpeed Performance Report

Merging #842 will not alter performance

Summary

Create `PlacementGroup` for steps using `vLLM` #842

Create `PlacementGroup` for steps using `vLLM` #842

gabrielmbmb commented Jul 30, 2024 •

edited

Loading

codspeed-hq bot commented Jul 30, 2024 •

edited

Loading