add support for anthropic, azure, aleph alpha, ai21, togetherai, cohere, replicate, huggingface inference endpoints, etc. #660
Conversation
@ErikBjare what do you think of this? It feels like it's too much to have both:
We should simplify and pick a subset (maybe just one?) of all of these!
@AntonOsika happy to make any changes as necessary 🙂
Thought about it and I think we should go for it! @krrishdholakia Do we still need the special Azure parameters after this? Would be great to have that taken care of by env variables + litellm instead of code bloat! 🚀
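For context, a minimal sketch of what that could look like, assuming litellm's Azure support reads its standard environment variables (AZURE_API_KEY, AZURE_API_BASE, AZURE_API_VERSION); the deployment name below is hypothetical:

```python
# Assumes AZURE_API_KEY, AZURE_API_BASE, and AZURE_API_VERSION are set in the
# environment, so no Azure-specific parameters need to live in the code.
from litellm import completion

response = completion(
    model="azure/my-gpt4-deployment",  # hypothetical Azure deployment name
    messages=[{"role": "user", "content": "Hello, world"}],
)
print(response["choices"][0]["message"]["content"])
```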
Hey @krrishdholakia, I got this error and reverted. Do you have any ideas why?
What version of langchain were you using?
Hey @AntonOsika, here's ChatLiteLLM working for Replicate and Cohere. Here it is for OpenAI + Anthropic. Can you please let me know the version you're using so I can debug this?
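For anyone reproducing this, a minimal ChatLiteLLM sketch; the Cohere model name is just one example, and the import path may differ depending on the langchain version in use:

```python
from langchain.chat_models import ChatLiteLLM
from langchain.schema import HumanMessage

# Cohere as one example; swap the model string for a Replicate,
# Anthropic, or OpenAI model to test the other providers.
chat = ChatLiteLLM(model="command-nightly")
print(chat([HumanMessage(content="Write a haiku about builds")]))
```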
…re, replicate, huggingface inference endpoints, etc. (AntonOsika#660)
* fix streaming bug
* fixing streaming bug and adding litellm to pyproject.toml
* adding provider details to env template
…ai, cohere, replicate, huggingface inference endpoints, etc. (AntonOsika#660)" (AntonOsika#685) This reverts commit 9079f84.
How does someone use GPT-3.5 Turbo fine-tuned models with it?
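Not confirmed in this thread, but assuming litellm forwards OpenAI fine-tuned model IDs (the `ft:` prefix) to the OpenAI API unchanged, it would look something like this (the model ID is a placeholder):

```python
from litellm import completion

# Placeholder fine-tuned model ID in OpenAI's ft: format; assumes litellm
# passes it through to the OpenAI API unchanged.
response = completion(
    model="ft:gpt-3.5-turbo:my-org::abc123",
    messages=[{"role": "user", "content": "Hello"}],
)
```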
Hi @AntonOsika,
Following up on this PR - #574.
I fixed the streaming bugs and confirmed it works with azure.
Here is the gpt-engineer working with my deployed azure instance:
Here is the gpt-engineer working with ai21's "j2-mid" model:
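As a reference for the streaming fix, a minimal sketch of consuming a litellm stream; the deployment name is hypothetical, and the exact chunk shape can vary between litellm versions:

```python
from litellm import completion

response = completion(
    model="azure/my-gpt4-deployment",  # hypothetical Azure deployment name
    messages=[{"role": "user", "content": "Hello"}],
    stream=True,
)
for chunk in response:
    # Chunks mimic the OpenAI streaming format; content may be None.
    content = chunk["choices"][0]["delta"].get("content")
    if content:
        print(content, end="")
```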
For those trying to use WizardCoder / Phind-CodeLlama, we (+ this PR) also provide support for Huggingface Inference Endpoints and Baseten.
https://docs.litellm.ai/docs/completion/supported
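For example, a hypothetical call against a deployed Huggingface Inference Endpoint (both the model name and endpoint URL below are placeholders for your own deployment):

```python
from litellm import completion

# Both the model name and api_base are placeholders for your own endpoint.
response = completion(
    model="huggingface/WizardLM/WizardCoder-Python-34B-V1.0",
    messages=[{"role": "user", "content": "def fibonacci(n):"}],
    api_base="https://my-endpoint.endpoints.huggingface.cloud",
)
```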
If anyone's using gpt-engineer in production and would like to split traffic between a fine-tuned model and gpt-4, they can do that too: https://docs.litellm.ai/docs/tutorials/ab_test_llms
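The linked tutorial covers litellm's own mechanism; as a rough sketch of the idea independent of that API (the model IDs and the 80/20 split are illustrative):

```python
import random
from litellm import completion

# Illustrative 80/20 split between a fine-tuned model and gpt-4;
# both model IDs are placeholders.
model = "ft:gpt-3.5-turbo:my-org::abc123" if random.random() < 0.8 else "gpt-4"
response = completion(model=model, messages=[{"role": "user", "content": "Hi"}])
```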
Please let me know if this PR looks good or you'd still prefer to stash it.
Happy to make any changes / update docs if the initial PR looks good.