Fix passing `base_url` in `model_id` in `InferenceEndpointsLLM` #924

gabrielmbmb · 2024-08-23T09:49:21Z

Description

We were passing the base_url using the model_id argument to the huggingface_hub.AsyncInferenceClient. This worked if using Inference Endpoint solutions, but if using a local TGI deployment, it didn't causing the chat_completion endpoint to return a 422.

github-actions · 2024-08-23T09:50:51Z

Documentation for this PR has been built. You can view it at: https://distilabel.argilla.io/pr-924/

codspeed-hq · 2024-08-23T09:55:36Z

CodSpeed Performance Report

Merging #924 will not alter performance

_{Comparing fix-inference-endpoints (85b66cd) with main (c76d4a7)}

Summary

✅ 1 untouched benchmarks

Fix passing base_url in model_id

8d7d296

gabrielmbmb added the fix label Aug 23, 2024

gabrielmbmb requested a review from plaguss August 23, 2024 09:49

gabrielmbmb self-assigned this Aug 23, 2024

plaguss approved these changes Aug 23, 2024

View reviewed changes

gabrielmbmb added 7 commits August 23, 2024 12:52

Merge branch 'main' into fix-inference-endpoints

4389fbe

Print ruff version

05908e1

Install dev dependencies after

4aa5325

Update ruff

a957bab

noqa

2858db3

Skip ray tests on 3.12

e1abd2c

Do not run ray tests in 3.12

85b66cd

gabrielmbmb merged commit 379c756 into main Aug 23, 2024
5 of 7 checks passed

gabrielmbmb deleted the fix-inference-endpoints branch August 23, 2024 12:53

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix passing `base_url` in `model_id` in `InferenceEndpointsLLM` #924

Fix passing `base_url` in `model_id` in `InferenceEndpointsLLM` #924

gabrielmbmb commented Aug 23, 2024

github-actions bot commented Aug 23, 2024

codspeed-hq bot commented Aug 23, 2024 •

edited

Loading

Fix passing base_url in model_id in InferenceEndpointsLLM #924

Fix passing base_url in model_id in InferenceEndpointsLLM #924

Conversation

gabrielmbmb commented Aug 23, 2024

Description

github-actions bot commented Aug 23, 2024

codspeed-hq bot commented Aug 23, 2024 • edited Loading

CodSpeed Performance Report

Merging #924 will not alter performance

Summary

Fix passing `base_url` in `model_id` in `InferenceEndpointsLLM` #924

Fix passing `base_url` in `model_id` in `InferenceEndpointsLLM` #924

codspeed-hq bot commented Aug 23, 2024 •

edited

Loading