[Feature]: Avoid actual chat request in model provider availability check. #598

oasisfeng · 2024-12-31T13:47:45Z

Is your feature request related to a problem?

Please don't use an actual chat completion request as the implementation method of model provider availability check. It's costly for advanced models (e.g. OpenAI o1).

The current implementation also sends chat completion request for all the models in the specified provider. This is very slow and costly if many models are enabled for that provider.

Describe the solution you'd like

Consider using just "/v1/models" to the check the availability (of base URL and API key), as it is free of charge.

Describe alternatives you've considered

No response

Additional Context

No response

kangfenmao · 2025-01-02T05:00:23Z

Not all service providers offer the /v1/models interface

kangfenmao · 2025-01-02T05:00:50Z

What are some other good suggestions?

oasisfeng · 2025-01-03T14:09:47Z

Probably try "/models" first, if absent, fallback to "/completion"?

github-actions bot assigned kangfenmao Dec 31, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Feature]: Avoid actual chat request in model provider availability check. #598

[Feature]: Avoid actual chat request in model provider availability check. #598

oasisfeng commented Dec 31, 2024

kangfenmao commented Jan 2, 2025

kangfenmao commented Jan 2, 2025

oasisfeng commented Jan 3, 2025

[Feature]: Avoid actual chat request in model provider availability check. #598

[Feature]: Avoid actual chat request in model provider availability check. #598

Comments

oasisfeng commented Dec 31, 2024

Is your feature request related to a problem?

Describe the solution you'd like

Describe alternatives you've considered

Additional Context

kangfenmao commented Jan 2, 2025

kangfenmao commented Jan 2, 2025

oasisfeng commented Jan 3, 2025