Bedrock: Agent construct fails with Claude 3.5 v2 & Haiku 3.5 #796
Comments
Thanks for reporting this issue @mccauleyp, this should be fixed when #683 is implemented.
@mccauleyp are you able to perform model invocations with the permissions you provided in your code snippet above?
Yep! But note that it's not just permissions that complicates using Sonnet 3.5 v2 and Haiku 3.5. If you have an Action Group, the OpenAPI schema must declare the I spent a few hours yesterday debugging to find the 18 character limit; I can't find that documented anywhere in the AWS or Anthropic docs. When I opened this ticket last week, I had it working for two agents but the actions I was using happened to have I assume it's the service team that should be alerted of that issue but I'm not sure where best to submit a ticket for that. Could you communicate the character limit issue to them?
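The constraint described in the comment above could be checked before deployment. What follows is only a minimal sketch: it assumes the 18 character limit applies to each operation's `operationId` (the thread does not spell this out, and the limit is reportedly undocumented), and that a length of exactly 18 is still accepted. The function name `check_operation_ids` and the constant `MAX_OPERATION_ID_LEN` are hypothetical and not part of any AWS library.

```python
from typing import Any, Dict, List

# HTTP methods that may appear as operation keys under an OpenAPI path item.
HTTP_METHODS = {"get", "put", "post", "delete", "patch", "head", "options", "trace"}

# Assumption: the limit reported in this thread is 18 characters, inclusive.
MAX_OPERATION_ID_LEN = 18


def check_operation_ids(
    schema: Dict[str, Any], max_len: int = MAX_OPERATION_ID_LEN
) -> List[str]:
    """Return a list of human-readable problems found in an OpenAPI schema dict."""
    problems: List[str] = []
    for path, path_item in schema.get("paths", {}).items():
        for method, operation in path_item.items():
            # Skip non-operation keys such as "parameters" or "summary".
            if method.lower() not in HTTP_METHODS or not isinstance(operation, dict):
                continue
            label = f"{method.upper()} {path}"
            operation_id = operation.get("operationId")
            if not operation_id:
                problems.append(f"{label}: missing operationId")
            elif len(operation_id) > max_len:
                problems.append(
                    f"{label}: operationId '{operation_id}' has "
                    f"{len(operation_id)} characters (limit {max_len})"
                )
    return problems
```

Running this over the parsed Action Group schema before `cdk deploy` would flag offending operations early instead of surfacing them as an "Internal server error" at invocation time.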
Thanks @mccauleyp! I was asking because there is currently a bug in the console: if you create an agent, use a model with CRIS (cross-region inference), and select the option to generate a new role, the generated permissions are not sufficient to invoke the model through the agent (I was using the console as a reference to add support in this lib). I reported this issue to the service team. I am working on adding support for CRIS in #800. With the current changes I am able to support CRIS for Agents and Prompts. Note: application inference profiles are not supported yet in CloudFormation, but the code will be ready on our end. Thanks for the note on the operationId; I will contact the service team and post here as soon as I have an update.
Ah, yeah I noticed the console bug too and referred to some other docs page for my snippet. One other note for the service or CloudFormation team: If the Action Group schema doesn't include HTTP-verb-prefixed
Hi @mccauleyp, closing this ticket as https://github.com/awslabs/generative-ai-cdk-constructs/releases/tag/v0.1.283, which was just released, adds support for inference profiles. The documentation has an example of how to use CRIS with an agent. For the other points you mentioned, I opened an issue with the service team and will update you as soon as I have an answer. Thank you!
@mccauleyp the service team mentioned that the issue has been fixed (deploying the two models with CRIS for agents). If you face any issues, please let us know! Thank you!
Describe the bug
Attempting to use Claude 3.5 v2 or Haiku 3.5 with the Agent construct will produce a successful deployment but a broken agent that produces "Internal server error" responses. That's because these models require invocation via an inference profile but the construct provisions them in an "on demand" mode that isn't compatible.
Expected Behavior
Should be able to deploy agents using these models.
Current Behavior
Agent deployment succeeds but produces "Internal server error" responses.
Reproduction Steps
Create an agent using Sonnet 3.5 v2 or Haiku 3.5, e.g.:
Possible Solution
I am working around the issue by using the CDK escape hatch to override the CloudFormation foundation model property, which might provide some hints as to how the construct could be modified:
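The original snippet did not survive on this page. As a rough illustration of this kind of workaround (not the reporter's actual code), the sketch below builds an inference profile ARN and applies it through `add_property_override`, the standard CDK escape hatch on L1 (`Cfn*`) resources. It rests on several assumptions: the profile IDs are the US cross-region ones for these two models, the ARN layout is `arn:aws:bedrock:<region>:<account>:inference-profile/<id>`, and the overridden CloudFormation property on `AWS::Bedrock::Agent` is `FoundationModel`. The helper names are hypothetical, and how you obtain the underlying `CfnAgent` from the construct depends on the library version.

```python
from typing import Any

# Assumed US cross-region inference profile IDs for the two affected models.
INFERENCE_PROFILE_IDS = {
    "sonnet-3.5-v2": "us.anthropic.claude-3-5-sonnet-20241022-v2:0",
    "haiku-3.5": "us.anthropic.claude-3-5-haiku-20241022-v1:0",
}


def inference_profile_arn(model_key: str, region: str, account: str) -> str:
    """Build the (assumed) ARN of a system-defined inference profile."""
    profile_id = INFERENCE_PROFILE_IDS[model_key]
    return f"arn:aws:bedrock:{region}:{account}:inference-profile/{profile_id}"


def override_foundation_model(cfn_agent: Any, profile_arn: str) -> None:
    """Point the agent's L1 resource at an inference profile.

    `cfn_agent` is expected to be the underlying AWS::Bedrock::Agent resource
    (a CfnResource); add_property_override rewrites the synthesized template.
    """
    cfn_agent.add_property_override("FoundationModel", profile_arn)
```

Inside a stack this might be called as `override_foundation_model(cfn_agent, inference_profile_arn("sonnet-3.5-v2", self.region, self.account))`. The agent's execution role would presumably also need permission to invoke the model through the profile, which this sketch does not cover.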
Additional Information/Context
No response
CDK CLI Version
2.166.0
Framework Version
0.1.279
Node.js Version
v20.11.0
OS
OSX
Language
Python
Language Version
3.12
Region experiencing the issue
us-east-1
Code modification
No
Other information
No response
Service quota