[router] add base_gpu_id server args & merged radix tree python reference #2115

Merged
merged 3 commits into from
Nov 22, 2024

Collaborator

@ByronHsu ByronHsu commented Nov 21, 2024

Motivation

This PR prepares for #2114 by:

  1. Adding a base_gpu_id server argument that sets the device ID offset used when starting a TP server. This makes it possible to start multiple TP servers on different base GPU IDs when launching several servers and the router together with a single command. It also removes the need to set CUDA_VISIBLE_DEVICES when launching on specific GPUs.

  2. Adding a merged radix tree Python implementation. This is an experimental Python version that I used as a reference for my Rust implementation.
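
To illustrate the device ID offset from item 1, here is a minimal sketch of how a base GPU ID could map TP ranks to devices and how several servers could be placed on disjoint GPUs. The helper names `gpu_id_for_rank` and `plan_base_gpu_ids` are hypothetical and not taken from this PR, and the sketch assumes a single node where each TP rank uses exactly one GPU.

```python
from typing import List


def gpu_id_for_rank(base_gpu_id: int, tp_rank: int) -> int:
    """Return the device ID for a TP rank, offset by base_gpu_id.

    Hypothetical helper: assumes one GPU per TP rank on a single node.
    """
    if base_gpu_id < 0 or tp_rank < 0:
        raise ValueError("base_gpu_id and tp_rank must be non-negative")
    return base_gpu_id + tp_rank


def plan_base_gpu_ids(num_servers: int, tp_size: int, first_gpu_id: int = 0) -> List[int]:
    """Pick a base GPU ID per server so that servers occupy disjoint GPU ranges.

    Hypothetical helper: server i gets GPUs
    [first_gpu_id + i * tp_size, first_gpu_id + (i + 1) * tp_size).
    """
    if num_servers < 1 or tp_size < 1:
        raise ValueError("num_servers and tp_size must be at least 1")
    return [first_gpu_id + i * tp_size for i in range(num_servers)]


if __name__ == "__main__":
    # Two servers with tp_size=2 on one machine, starting at GPU 0.
    for base in plan_base_gpu_ids(num_servers=2, tp_size=2):
        devices = [gpu_id_for_rank(base, rank) for rank in range(2)]
        print(f"base_gpu_id={base} -> devices {devices}")
```

With this layout each server receives its own base_gpu_id value, so all servers can be started from one process without a per-server CUDA_VISIBLE_DEVICES setting.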

Modifications

Checklist

  • Format your code according to the Contributor Guide.
  • Add unit tests as outlined in the Contributor Guide.
  • Update documentation as needed, including docstrings or example tutorials.

@ByronHsu ByronHsu changed the title [router] add base_gpu_id server args & router tree python reference [router] add base_gpu_id server args & merged radix tree python reference Nov 21, 2024
@ByronHsu
Collaborator Author

@merrymercy can you take another look?

@merrymercy merrymercy merged commit 30af7df into sgl-project:main Nov 22, 2024
13 checks passed
timethink pushed a commit to timethink/sglang that referenced this pull request Mar 9, 2025