Support for dynamic quota in azurerm_cognitive_deployment #23988
Comments
Thank you for taking the time to open this feature request!
Good request 🍻 I observe that if you toggle this from the portal it sends:
and backward
Notice that the Interestingly, on the I try out later if these are equal or not 👀 I suspect no as If we are lucky, they get inherited 🤣 |
The dynamic_throttling_enabled setting at the cognitive account resource level is different from the one on the actual model deployments, as you mentioned. I tried it and it gives:
Thanks for opening this initial issue. I would love to know if there is any planned update for this; otherwise I will investigate implementing it via the azapi Terraform provider.
Any news on this? From https://learn.microsoft.com/en-us/azure/ai-services/openai/how-to/dynamic-quota it is indeed
@JorisAndrade you can accomplish this via azapi provider in the meantime. Hope this helps!
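For readers following along, a minimal sketch of what such an azapi workaround could look like. It assumes the deployment-level property is `dynamicThrottlingEnabled` under API version 2023-10-01-preview (the version referenced elsewhere in this thread), that an `azurerm_cognitive_deployment.example` resource already exists, and that the azapi provider is a 1.x release where `body` is a JSON string. The resource label `dynamic_quota` is illustrative only.

```hcl
# Sketch only, not a verified configuration.
# Assumes azapi 1.x (body as a JSON string); newer azapi releases
# expect an HCL object for `body` instead of jsonencode(...).
resource "azapi_update_resource" "dynamic_quota" {
  type        = "Microsoft.CognitiveServices/accounts/deployments@2023-10-01-preview"
  resource_id = azurerm_cognitive_deployment.example.id # assumed existing deployment

  body = jsonencode({
    properties = {
      # Assumed property name for the "Dynamic Quota" toggle.
      dynamicThrottlingEnabled = true
    }
  })
}
```

Because `azapi_update_resource` patches an existing resource, the deployment itself can stay managed by azurerm while only this one property is set through azapi.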
@illgitthat thanks for sharing. Will try this out.
Thanks for reaching out. Unfortunately, I do not have an ETA on this enhancement. Any future work via the team or the community should end up linked here via a PR.
This version adds many options. Should be useful for hashicorp/terraform-provider-azurerm#23988
I opened a PR on pandora to add https://github.com/Azure/azure-rest-api-specs/tree/main/specification/cognitiveservices/resource-manager/Microsoft.CognitiveServices/preview/2023-10-01-preview to https://github.com/hashicorp/go-azure-sdk. Once that is done, we will be able to import it in the provider to add this feature.
Hi, thanks for raising this issue. Is there any idea of timeframe for this to get implemented please? Thank you again. |
I'm going to lock this issue because it has been closed for 30 days ⏳. This helps our maintainers find and focus on the active issues. |
Is there an existing issue for this?
Community Note
Description
An Azure Cognitive Services deployment now supports dynamic scaling of quota when capacity is available in the account. Please add this setting to the azurerm_cognitive_deployment Terraform resource so it can be auto-provisioned. The setting is called "Dynamic Quota" in the UI.
New or Affected Resource(s)/Data Source(s)
azurerm_cognitive_deployment
Potential Terraform Configuration
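One hypothetical shape for the requested setting, sketched under assumptions: the attribute name `dynamic_throttling_enabled` is borrowed from the account-level setting mentioned in the comments and is not confirmed for this resource, the `scale` block follows the azurerm 3.x schema, and the account reference, deployment name, and model values are placeholders.

```hcl
# Hypothetical sketch of the requested feature, not an existing API at the time of this request.
resource "azurerm_cognitive_deployment" "example" {
  name                 = "example-deployment"                 # placeholder
  cognitive_account_id = azurerm_cognitive_account.example.id # assumed existing account

  model {
    format  = "OpenAI"
    name    = "gpt-35-turbo" # placeholder model
    version = "0613"         # placeholder version
  }

  scale {
    type = "Standard"
  }

  # Assumed attribute name for the "Dynamic Quota" toggle.
  dynamic_throttling_enabled = true
}
```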
References
https://microsoftlearning.github.io/mslearn-openai/Instructions/Labs/01-get-started-azure-openai.html#deploy-a-model