This repo provides samples to demonstrate how to build your own Generative AI solutions using AWS Generative AI CDK Constructs.
Use Case | Description | Type | Language |
---|---|---|---|
SageMaker JumpStart model | This sample provides a sample application which deploys a SageMaker real-time endpoint hosting a Llama 2 foundation model developed by Meta from Amazon JumpStart, and an AWS Lambda function to run inference requests against that endpoint. | Backend | TypeScript |
SageMaker Hugging Face model | This sample provides a sample application which deploys a SageMaker real-time endpoint hosting a model (Mistral 7B) from Hugging Face, and an AWS Lambda function to run inference requests against that endpoint. | Backend | TypeScript |
SageMaker Hugging Face model on AWS Inferentia2 | This sample provides a sample application which deploys a SageMaker real-time endpoint hosting a model (Zephyr 7B) from Hugging Face, and an AWS Lambda function to run inference requests against that endpoint. This sample uses Inferentia 2 as the hardware accelerator. | Backend | TypeScript |
SageMaker custom endpoint | This sample provides a sample application which deploys a SageMaker real-time endpoint hosting a model with artifacts stored in an Amazon Simple Storage Service (S3) bucket, and an AWS Lambda function to run inference requests against that endpoint. This sample uses Inferentia2 as the hardware accelerator. | Backend | TypeScript |
SageMaker multimodal custom endpoint | This sample provides a sample application which deploys a SageMaker real-time endpoint hosting llava-1.5-7b, with artifacts stored in an Amazon Simple Storage Service (S3) bucket, a custom inference script, and an AWS Lambda function to run inference requests against that endpoint. | Backend | TypeScript |
SageMaker image to video endpoint | This sample provides a sample application which deploys a SageMaker async endpoint hosting stable-video-diffusion-img2vid-xt-1-1, with artifacts stored in an Amazon Simple Storage Service (S3) bucket, a custom inference script, and an AWS Lambda function to run inference requests against that endpoint. | Backend | TypeScript |
LLM on SageMaker in GovCloud PDT | This sample provides a sample application which deploys a SageMaker real-time endpoint hosting Falcon-40b on GovCloud PDT. | Backend | TypeScript |
Amazon Bedrock Agents | This sample provides a sample application which deploys an Amazon Bedrock Agent and Knowledge Base backed by an OpenSearch Serverless Collection and documents in S3. It demonstrates how to use the Amazon Bedrock CDK construct. | Backend | TypeScript |
Python Samples | This project showcases the utilization of the 'generative-ai-cdk-constructs' package from the Python Package Index (PyPI). | Backend | Python |
.NET Samples | This project showcases the utilization of the 'Cdklabs.GenerativeAiCdkConstructs' package from nuget library. | Backend | .NET |
Contract Compliance Analysis | This project automates the analysis of contracts by splitting them into clauses, determining clause types, evaluating compliance against a customer's legal guidelines, and assessing overall contract risk based on the number of compliant clauses. This is achieved through a workflow that leverages Large Language Models via Amazon Bedrock. | Backend + Frontend | Python for Backend, TypeScript (React) for Frontend |
RFP Answers with GenAI | This project provides guidance on how you can use Knowledge Bases for Amazon Bedrock with custom transformations to create draft answers for Request for Proposal (RFP) documents, streamlining the answer of new potential contracts. This is achieved through a workflow that leverages Large Language Models via Amazon Bedrock. | Backend + Frontend | Python for Backend, TypeScript (React) for Frontend |
Please refer to the CONTRIBUTING document for further details on contributing to this repository.