
[BFCL] Support for pre-existing completion endpoint #864

Merged

Conversation

ThomasRochefortB
Contributor

@ThomasRochefortB ThomasRochefortB commented Jan 2, 2025

URL Endpoint Support for BFCL

This PR is a product of the discussion in #850.

Description

This PR adds support for using pre-existing OpenAI-compatible endpoints in BFCL, allowing users to bypass the built-in vLLM/sglang server setup. This is particularly useful for distributed environments like SLURM clusters where model serving and benchmarking need to be handled as separate jobs.

Changes

  • Added --skip-server-setup flag to CLI
  • Added environment variable support for endpoint configuration:
    • VLLM_ENDPOINT (defaults to 'localhost')
    • VLLM_PORT (defaults to existing VLLM_PORT constant)
  • Modified OSSHandler to support external endpoints (see the sketch after this list)
  • Updated documentation for new configuration options
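
For orientation, here is a minimal sketch of the kind of endpoint resolution this change enables. The helper name, default port value, and client wiring below are illustrative assumptions, not BFCL's actual implementation:

```python
import os

from openai import OpenAI

# Illustrative default only; BFCL defines its own VLLM_PORT constant.
DEFAULT_VLLM_PORT = "8000"

def resolve_base_url() -> str:
    """Build the base URL for an OpenAI-compatible completion endpoint.

    Mirrors the defaults described above: VLLM_ENDPOINT falls back to
    'localhost' and VLLM_PORT to the pre-existing port constant.
    """
    endpoint = os.getenv("VLLM_ENDPOINT", "localhost")
    port = os.getenv("VLLM_PORT", DEFAULT_VLLM_PORT)
    return f"http://{endpoint}:{port}/v1"

# vLLM's OpenAI-compatible server accepts any placeholder API key.
client = OpenAI(base_url=resolve_base_url(), api_key="EMPTY")
```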

Usage

Users can now specify custom endpoints in two ways:

1. Using environment variables:

   ```bash
   export VLLM_ENDPOINT="custom.host.com"
   export VLLM_PORT="8000"
   ```

2. Using a `.env` file:

   ```
   VLLM_ENDPOINT=custom.host.com
   VLLM_PORT=8000
   ```

Then run BFCL with the --skip-server-setup flag:

```bash
python -m bfcl generate --model MODEL_NAME --backend vllm --skip-server-setup
```
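
For the SLURM-style scenario described above, the two separate jobs might look like the following sketch; the `vllm serve` invocation and hostname are assumptions about the user's serving setup, not part of this PR:

```bash
# Job 1 (serving node): start any OpenAI-compatible server, e.g. vLLM.
vllm serve MODEL_NAME --port 8000

# Job 2 (benchmark node): point BFCL at the already-running server.
export VLLM_ENDPOINT="serving-node.cluster.local"  # assumed hostname
export VLLM_PORT="8000"
python -m bfcl generate --model MODEL_NAME --backend vllm --skip-server-setup
```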

Related Issue

Closes #850

Collaborator

@HuanzhiMao HuanzhiMao left a comment

Thanks for the PR @ThomasRochefortB !

@HuanzhiMao HuanzhiMao merged commit 1729c9b into ShishirPatil:main Jan 3, 2025
@sghyan16
Contributor

sghyan16 commented Jan 6, 2025

Hi, I followed the instructions for using a .env file together with the --skip-server-setup flag, but when I specify the MODEL_NAME, I get this error:

```
zz-8444 is not a local folder and is not a valid model identifier listed on 'https://huggingface.co/models'
If this is a private repository, make sure to pass a token having permission to this repo either by logging in with `huggingface-cli login` or by passing `token=<your_token>`
```
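
For anyone hitting the same message: this is the standard error raised by Hugging Face `transformers` when `from_pretrained` receives a name that is neither a local directory nor a Hub repo id, as the minimal reproduction below shows. Whether BFCL's handler triggers it via tokenizer loading is an assumption, not confirmed here:

```python
from transformers import AutoTokenizer

# 'zz-8444' is neither a local folder nor a model id on the Hugging
# Face Hub, so this raises an OSError with exactly the wording quoted
# in the comment above.
tokenizer = AutoTokenizer.from_pretrained("zz-8444")
```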

@HuanzhiMao
Collaborator

> Hi, I followed the instructions for using a .env file together with the --skip-server-setup flag, but when I specify the MODEL_NAME, I get this error:
>
> ```
> zz-8444 is not a local folder and is not a valid model identifier listed on 'https://huggingface.co/models'
> If this is a private repository, make sure to pass a token having permission to this repo either by logging in with `huggingface-cli login` or by passing `token=<your_token>`
> ```

Hey @sghyan16,
Could you open an issue for this and attach the console output as well? Thank you!
