
Allow using Azure OpenAI #431

Closed · wants to merge 3 commits

Conversation

@UmerHA (Collaborator) commented Jun 27, 2023

Allows using Azure OpenAI instead of OpenAI (solves #325):

gpt-engineer [project_path] --azure --engine [your_azureopenai_model_name]

To be consistent with Azure OpenAI terminology, the model is called an engine instead.

Requires setting AZURE_OPENAI_ENDPOINT and AZURE_OPENAI_KEY as environment variables.
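A minimal sketch of that environment check (the helper name `check_azure_env` is hypothetical; the variable names are the ones from the PR description):

```python
import os

def check_azure_env() -> None:
    # Both variables are required by this PR: AZURE_OPENAI_ENDPOINT is the
    # resource URL, AZURE_OPENAI_KEY the API key.
    for var in ("AZURE_OPENAI_ENDPOINT", "AZURE_OPENAI_KEY"):
        if not os.getenv(var):
            raise ValueError(
                f"To use Azure OpenAI, please set the {var} environment variable."
            )
```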

@UmerHA (Collaborator, Author) commented Jun 27, 2023

@herbutant Because you opened #325: could you test the PR for me? I don't have Azure OpenAI access. Thanks!

@UmerHA changed the title from "Allowed using Azure OpenAI" to "Allow using Azure OpenAI" on Jun 27, 2023
@herbutant commented Jun 28, 2023

@UmerHA , thanks for this.

I made one small change locally to also pull the api_version from the ENV, because this can also vary (e.g. in my case it is "2023-03-15-preview"). With that it runs as expected.

if not os.getenv("AZURE_OPENAI_VERSION"):
    raise ValueError("To use Azure OpenAI, please set the "
                     "AZURE_OPENAI_VERSION environment variable.")

openai.api_version = os.getenv("AZURE_OPENAI_VERSION")

Should an amendment to the README.md with Azure specific instructions also be part of the pr?

@UmerHA (Collaborator, Author) commented Jun 28, 2023

@herbutant Agree: I have adjusted the README and made AZURE_OPENAI_VERSION an optional env variable that defaults to 2023-05-15.

openai.api_version = os.getenv("AZURE_OPENAI_VERSION", "2023-05-15")
openai.api_type = "azure"
openai.api_base = os.getenv("AZURE_OPENAI_ENDPOINT")
openai.api_key = os.getenv("AZURE_OPENAI_KEY")
@AntonOsika (Owner) commented on the diff:

Would be great to create a separate function handling this to keep it tidy.

    response = openai.ChatCompletion.create(model=self.model, **shared_params)
else:
    # use Azure OpenAI
    response = openai.ChatCompletion.create(engine=self.model, **shared_params)
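Along the lines of the review comment, the branch above could be collapsed into a small helper; `completion_kwargs` is a hypothetical name, and this sketches only the keyword selection:

```python
def completion_kwargs(model: str, azure: bool) -> dict:
    # Azure OpenAI addresses a deployed model via `engine`,
    # while vanilla OpenAI uses `model`.
    key = "engine" if azure else "model"
    return {key: model}
```

The call site then becomes a single `openai.ChatCompletion.create(**completion_kwargs(self.model, azure), **shared_params)`.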
@AntonOsika (Owner) commented on the diff:

I realize that if we want to support this, we should in fact have a separate object: the LLM model.

I'd be happy to merge if you find a good way to "abstract away" the LLM object!

@AntonOsika (Owner) commented:

I think we need to create a more general abstraction for LLMs if we want to support Azure, so as not to make the code messy!

Let me know if you have thoughts

@UmerHA (Collaborator, Author) commented Jul 3, 2023

@AntonOsika Agree, I think we have 2 options:

  1. write a thin wrapper around LangChain's abstraction (BaseChatModel for chat models / BaseLLM for non-chat models)
  2. create our own simplified abstraction

I would vote for using LangChain, as LLM integration is not this project's purpose and LangChain already does that well. I'd be happy to implement the wrapper, as I'm a LangChain contributor as well. :)

If you think option 2 is better, I propose the following abstraction:

class AI with methods:

  • factory
  • response
  • update_token_count

subclasses of AI implement these methods:

  • __init__
  • _response (called by response)
  • get_token_usage

Details:
Option 2 in Pseudocode:

class AI:
	self.model
	self.token_usage_logs = []

	@classmethod
	def create(cls, model_family_name, model_name, ai_config_filename):
		# read ai_config_filename
		# create a new instance with type depending on model_family_name (e.g. new OpenAI, or new AzureOpenAI)
		# 	and pass params from ai_config_filename to it

	@abstractmethod
	def __init__(self, model_name, ai_config_filename, **kwargs):
		# determine model_name or use default (eg gpt-3.5-turbo for OpenAI)
		# load configs (eg api key) from kwargs or from env
		# initialize model (eg  validate configs)
		
	@abstractmethod
	def _response(self, messages, prompt=None):
		# get response & meta-data (eg token_usage)

	@abstractmethod
	def get_token_usage(self, meta_data):
		# get & return token usage (handle both streaming & non-streaming cases)

	def response(self, messages, prompt=None):  # corresponds to current `next` method
		# logger.debug("starting response")
		
		# ai_response_object, meta_data = self._response(messages, prompt=None)
		
		# if is_generator(ai_response_object):  # streaming
		#	stream & print response chunks
		# 	ai_response = join(chat)
		# else:  # not streaming
		#	ai_response = ai_response_object[...]
		# 	print(ai_response)
		
		# logger.debug("response completed")
		# 	messages += [{"role": "assistant", "content": "".join(chat)}]
		
		# meta_data['answer'] = ai_response
		# self.update_token_count(self.get_token_usage(meta_data))
		
	def update_token_count(self, token_usage):
		# update token usage log

then the OpenAI / AzureOpenAI would look like this:

class OpenAI(AI):
	def __init__():
		# determine model_name or use default (eg gpt-3.5-turbo for OpenAI)
		# initialize model (eg ensure all configs are given)
		
	@override
	def _response(self, messages, prompt=None):
		# get response & meta-data (ie token_usage)

	@override
	def get_token_usage(self, meta_data):
		# get & return token usage (handle both streaming & non-streaming cases)

and

class AzureOpenAI(OpenAI):
	def __init__():
		# determine model_name
		# initialize model (eg ensure all configs are given)
		
	@override
	def _response(self, messages, prompt=None):
		# get response & meta-data (ie token_usage)

	# get_token_usage(self, meta_data) is same as in OpenAI

This would work for all kinds of models (chat/non-chat, streaming/non-streaming).
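For illustration, here is a minimal runnable sketch of option 2. The class and method names follow the pseudocode above; the actual API call is stubbed out, since the point is only the factory and template-method structure:

```python
from abc import ABC, abstractmethod

class AI(ABC):
    # maps model_family_name -> concrete subclass (registry-based factory)
    registry: dict = {}

    def __init__(self):
        self.token_usage_log = []

    @classmethod
    def create(cls, model_family_name, **kwargs):
        # factory: instantiate the subclass registered for this family
        return cls.registry[model_family_name](**kwargs)

    @abstractmethod
    def _response(self, messages):
        ...

    @abstractmethod
    def get_token_usage(self, meta_data):
        ...

    def response(self, messages):
        # template method: fetch the answer, then log token usage
        ai_response, meta_data = self._response(messages)
        self.update_token_count(self.get_token_usage(meta_data))
        return ai_response

    def update_token_count(self, token_usage):
        self.token_usage_log.append(token_usage)

class OpenAI(AI):
    def __init__(self, model_name="gpt-3.5-turbo"):
        super().__init__()
        self.model = model_name

    def _response(self, messages):
        # stubbed: a real implementation would call the OpenAI API here
        return "stub answer", {"total_tokens": 7}

    def get_token_usage(self, meta_data):
        return meta_data["total_tokens"]

AI.registry["openai"] = OpenAI
```

An AzureOpenAI subclass would register itself the same way and only override `__init__` and `_response`, exactly as in the pseudocode.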

Let me know!

@AntonOsika (Owner) commented:
Hey, appreciate the thinking on this!

Based on discussions in other PRs, it looks like we will use langchain abstractions.

Because of that, and since we have conflicts, I'm closing this for now. Feel free to reopen.

@AntonOsika closed this on Jul 6, 2023
@UmerHA mentioned this pull request on Jul 7, 2023