AWS Database Blog

Build generative AI applications with Amazon Aurora and Knowledge Bases for Amazon Bedrock

Amazon Bedrock is the easiest way to build and scale generative AI applications with foundation models (FMs). FMs are trained on vast quantities of data, allowing them to be used to answer questions on a variety of subjects. However, if you want to use an FM to answer questions about your private data that you have stored in your Amazon Simple Storage Service (Amazon S3) bucket or Amazon Aurora PostgreSQL-Compatible Edition database, you need to use a technique known as Retrieval Augmented Generation (RAG) to provide relevant answers for your customers.

Knowledge Bases for Amazon Bedrock is a fully managed RAG capability that allows you to customize FM responses with contextual and relevant company data. Knowledge Bases for Amazon Bedrock automates the end-to-end RAG workflow, including ingestion, retrieval, prompt augmentation, and citations, eliminating the need for you to write custom code to integrate data sources and manage queries.

Integrating Amazon Bedrock with Amazon Aurora PostgreSQL lets you use features that help accelerate the performance of vector similarity search for RAG. Aurora delivers queries 20 times faster with pgvector's HNSW indexing than with other indexing methods. Additionally, Amazon Aurora Optimized Reads can increase performance for vector search with pgvector by up to nine times for workloads that exceed regular instance memory. This is on top of the performance and availability features that let you operate Aurora cost-effectively at global scale, including Aurora Serverless and Amazon Aurora Global Database.

In this post, we explore how to use Amazon Aurora to build generative AI applications using RAG. We walk through setting up an Aurora cluster to be a knowledge base for Amazon Bedrock. We also demonstrate how to use the Amazon Aurora Machine Learning extension to generate embeddings using Amazon Bedrock from simple SQL commands.

Solution overview

The following diagram illustrates an example of a RAG workflow.

The RAG workflow contains two parts. The first part takes unstructured data, such as text, images, and video, converts it into embeddings (vectors) using an embeddings model, and stores it in a vector database (Steps 1–3). An embedding is a numerical representation that you can use in a similarity search to find content that is most related to a query. The second part of the workflow is the request itself. The request is turned into an embedding and used to query the vector database to find content to augment the prompt (Steps 4–5). The output of the query is sent to the FM, which then sends the response to the user (Steps 6–7). Building a generative AI application that uses RAG requires orchestrating this workflow, which can mean additional work beyond building the application logic.
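
To make the retrieval step concrete, the vector-database query at the heart of Steps 4–5 is a nearest-neighbor search. The following is a minimal, self-contained sketch using pgvector with made-up three-dimension embeddings (real embeddings models, such as Amazon Titan, produce vectors with hundreds or thousands of dimensions); the items table and its values are hypothetical:

CREATE EXTENSION IF NOT EXISTS vector;

-- Hypothetical store of embedded content (toy 3-dimension vectors)
CREATE TABLE items (id int PRIMARY KEY, chunks text, embedding vector(3));
INSERT INTO items VALUES
    (1, 'first passage',  '[0.1, 0.2, 0.3]'),
    (2, 'second passage', '[0.9, 0.1, 0.0]');

-- Steps 4-5: embed the user's question (here, a made-up vector) and
-- retrieve the closest stored chunk by cosine distance (<=>)
SELECT chunks FROM items ORDER BY embedding <=> '[0.1, 0.2, 0.25]' LIMIT 1;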

Recently, AWS announced the general availability of Knowledge Bases for Amazon Bedrock. With Knowledge Bases for Amazon Bedrock, you can give FMs and agents contextual information from your company’s private data sources for RAG to deliver more relevant, accurate, and customized responses. You can then use your knowledge base with Agents for Amazon Bedrock to orchestrate multi-step tasks and facilitate prompt engineering. For more information, refer to Knowledge Bases now delivers fully managed RAG experience in Amazon Bedrock.

In the following sections, we demonstrate how to set up Aurora as a knowledge base for Amazon Bedrock. We’ll also see how to use Aurora Machine Learning (ML) to generate embeddings using SQL commands.

Prerequisites

This post assumes familiarity with navigating the AWS Management Console. For this example, you’ll also need access in your AWS account to the services used in this walkthrough: Amazon Bedrock (including access to the models you plan to use), Amazon Aurora, Amazon S3, AWS Secrets Manager, and AWS Identity and Access Management (IAM).

Set up Aurora as a knowledge base for Amazon Bedrock

The first step to creating a knowledge base for Amazon Bedrock is to have content that can be used to augment a foundation model. In this example, we use a PDF version of the PostgreSQL 16 manual. At the time of writing, PostgreSQL 16 was a new release and was not available for FM training on public data. Datasets used in RAG often have much more data, but we chose to keep this example simple.

Configure an S3 bucket

To begin setting up a knowledge base for Amazon Bedrock, you’ll need to create an S3 bucket. Make sure this bucket is private. This example uses a bucket called bedrock-kb-demo-aurora. After you create the bucket, upload the PostgreSQL 16 manual to the bucket. The following screenshot shows what the upload looks like when it’s complete.

Configure an Aurora PostgreSQL cluster

Next, create an Aurora PostgreSQL cluster. For full instructions, refer to Using Aurora PostgreSQL as a Knowledge Base for Amazon Bedrock. We highlight a few specific configuration options used in this example:

  1. On the Aurora console, create a new cluster.
  2. For Engine options, select Aurora (PostgreSQL Compatible).
  3. For Engine version, choose your engine version.

We selected PostgreSQL 15.5 for this example; we recommend using PostgreSQL 15.5 or higher so you can use the latest version of the open source pgvector extension.

  4. For Configuration options, select either Aurora Standard or Aurora I/O-Optimized.

We selected Aurora I/O-Optimized, which provides improved performance with predictable pricing for I/O-intensive applications.

  5. For DB instance class, select your instance class.

We opted to use Amazon Aurora Serverless v2, which automatically scales your compute based on your application workload, so you only pay based on the capacity used.

  6. Enable the RDS Data API, which is used by Amazon Bedrock to access your Aurora cluster.
  7. Create your Aurora cluster.
  8. While your Aurora cluster is provisioning, navigate to the cluster on the Aurora console and choose the Configuration tab.
  9. Note the Amazon Resource Name (ARN) for the cluster, and save it for later.

You’ll need the ARN for configuring the knowledge base for Amazon Bedrock.

It takes about 10 minutes for the Aurora PostgreSQL cluster to finish provisioning. After the cluster is provisioned, you need to run a series of SQL commands to prepare your cluster to be a knowledge base.

  10. Log in to your Aurora cluster either as the admin user (for example, postgres) or a user that has the rds_superuser privilege, and run the following code. Note the password that you create for bedrock_user, because you’ll need it in a later step when configuring a secret in Secrets Manager. Also note the table names and column names, because they’ll be used in the knowledge base workflow on the Amazon Bedrock console.
    -- Install the pgvector extension for vector storage and search
    CREATE EXTENSION IF NOT EXISTS vector;
    -- Create a dedicated schema and login role for Amazon Bedrock
    CREATE SCHEMA bedrock_integration;
    CREATE ROLE bedrock_user LOGIN;
    -- Enter and note the password for this user
    \password bedrock_user
    GRANT ALL ON SCHEMA bedrock_integration to bedrock_user;
    SET SESSION AUTHORIZATION bedrock_user;
    -- Table that stores the knowledge base: one row per chunk
    CREATE TABLE bedrock_integration.bedrock_kb (
        id uuid PRIMARY KEY,
        embedding vector(1536), -- matches Titan Embeddings G1 - Text dimensions
        chunks text,            -- the chunked source text
        metadata json           -- Bedrock-managed metadata
    );
    -- HNSW index to accelerate cosine-distance similarity search
    CREATE INDEX ON bedrock_integration.bedrock_kb
        USING hnsw (embedding vector_cosine_ops);
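
Once the knowledge base sync (covered later in this post) populates this table, the HNSW index accelerates retrieval queries of the following shape. This is a sketch rather than anything Amazon Bedrock runs verbatim; $1 stands for a 1536-dimension query embedding produced by the same embeddings model used at ingestion:

-- Retrieve the five chunks closest to the query embedding,
-- using pgvector's cosine-distance operator (<=>)
SELECT id, chunks, metadata
FROM bedrock_integration.bedrock_kb
ORDER BY embedding <=> $1
LIMIT 5;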

The Aurora cluster is now set up to be used as a knowledge base for Amazon Bedrock. We’ll now create a secret in Secrets Manager that Amazon Bedrock will use to connect to the cluster.

Create a secret in Secrets Manager

Secrets Manager lets you store your Aurora credentials so that they can be securely transmitted to applications. Complete the following steps to create your secret:

  1. On the Secrets Manager console, create a new secret.
  2. For Secret type, select Credentials for Amazon RDS database.
  3. Under Credentials, enter a name for your user (for this post, we use bedrock_user) and the password for that role.
  4. In the Database section, select the cluster you’re using for the knowledge base.
  5. Choose Next.
  6. For Secret name, enter a name for your secret.
  7. Choose Next.
  8. Finish creating the secret and copy the secret ARN.

You’ll need the secret ARN for creating your knowledge base.

We’re now ready to use this Aurora cluster as a knowledge base for Amazon Bedrock.

Create a knowledge base for Amazon Bedrock

We can now use our cluster as a knowledge base. Complete the following steps:

  1. On the Amazon Bedrock console, choose Knowledge base under Orchestration in the navigation pane.
  2. Choose Create knowledge base.
  3. For Knowledge base name, enter a name.
  4. For Runtime role, select Create and use a new service role and enter a service role name.
  5. Choose Next.
  6. For Choose an archive in S3, select the S3 bucket to use as a data source and choose Choose.

For this post, we use the S3 bucket containing the PostgreSQL 16 manual that we uploaded earlier.

  7. For Embeddings model, select your model (for this post, we use Amazon Titan Embeddings G1 – Text).
  8. For Vector database, select Choose a vector store you have created and select Amazon Aurora.
  9. Provide the following additional information (note the examples we use for this post):
    1. For Amazon Aurora DB Cluster ARN, enter the ARN you saved when creating your Aurora cluster.
    2. For Database name, enter postgres.
    3. For Table name, enter bedrock_integration.bedrock_kb.
    4. For Secret ARN, enter the ARN you saved when creating the secret for bedrock_user.
    5. For Vector field, enter embedding.
    6. For Text field, enter chunks.
    7. For Bedrock-managed metadata field, enter metadata.
    8. For Primary key, enter id.
  10. Choose Next.
  11. Review the summary page and choose Sync.

This begins the process of converting the unstructured data stored in the S3 bucket into embeddings and storing them in your Aurora cluster.

The syncing operation may take minutes to hours to complete, depending on the size of the dataset stored in your S3 bucket. During the sync operation, Amazon Bedrock downloads the documents in your S3 bucket, divides them into chunks (we opt for the “default” strategy in this post), generates the vector embeddings, and stores the embeddings in your Aurora cluster. When the initial sync is complete, the data source shows as Ready.
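
If you want to confirm that the sync wrote embeddings into Aurora, you can inspect the table directly. The following is an optional sanity check we find useful (the exact row count depends on the chunking strategy and the size of your documents):

-- Count the chunks Amazon Bedrock stored, then preview one of them
SELECT count(*) FROM bedrock_integration.bedrock_kb;
SELECT left(chunks, 120) AS chunk_preview, metadata
FROM bedrock_integration.bedrock_kb
LIMIT 1;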

Now you can use your knowledge base in an agent for Amazon Bedrock, or test it directly. For this example, we chose the PostgreSQL 16 manual as our dataset. This lets us see how RAG works in action, because foundation models may not yet have information on all the features of PostgreSQL 16.

In the following example, we use the Test knowledge base feature of Amazon Bedrock, choose the Anthropic Claude 2.1 model, and ask it a question about a PostgreSQL 16 feature—specifically, the pg_stat_io view. The following screenshot shows our answer.

Using the provided question, Amazon Bedrock queried our Aurora cluster to get the additional context needed to answer the question. Our knowledge base contained information on how the new pg_stat_io feature works, and Amazon Bedrock was able to provide an augmented answer via Anthropic Claude. The test tool also cites the chunks it used, providing attribution for its response.

Now that we’ve seen how we can augment foundation model responses with Knowledge Bases for Amazon Bedrock and an Aurora PostgreSQL cluster, let’s learn how we can generate embeddings directly from an Aurora cluster. This is necessary to convert user questions into embeddings that can be compared to the embeddings stored in Aurora with similarity search.

Create an IAM role and policy for the Aurora cluster

Before you can start generating vector embeddings from Amazon Bedrock directly from Aurora, you may need to adjust the network configuration so your Aurora cluster can communicate with Amazon Bedrock. For more information on how to do this, see Enabling network communication from Amazon Aurora MySQL to other AWS services. These directions also apply to Aurora PostgreSQL clusters.

To allow Aurora ML to work with Amazon Bedrock, you must first create an IAM policy that allows the Aurora cluster to communicate with Amazon Bedrock models. Complete the following steps:

  1. On the IAM console, choose Policies in the navigation pane, then choose Create policy.
  2. In the policy editor, expand Bedrock and, under Read, select InvokeModel to allow that action.
  3. Expand Resources and select Specific.
  4. For foundation-model, select Any. As a best practice, make sure to only give access to the foundation models in Amazon Bedrock that your team requires.
  5. For provisioned-model, select Any in this account. As a best practice, make sure to only give access to the provisioned models in Amazon Bedrock that your team requires.
  6. Choose Next.
  7. For Policy name, enter a name for your policy, such as AuroraMLBedrock.
  8. Choose Create policy.
  9. On the IAM console, choose Roles in the navigation pane, then choose Create role.
  10. For Trusted entity type, select AWS service.
  11. For Service or use case, choose RDS.
  12. Select RDS – Add Role to Database.
  13. Choose Next.

Now we assign the IAM policy we created in the previous step to this IAM role.

  14. For Permission policies, find and select the AuroraMLBedrock policy.
  15. Choose Next.
  16. In the Role details section, enter a name (for this post, AuroraMLBedrock) and a description.
  17. Review the IAM role and confirm that the AuroraMLBedrock policy is attached.
  18. Choose Create to create the role.

Now we need to assign the AuroraMLBedrock IAM role to the Aurora cluster.

  19. On the Amazon RDS console, navigate to your Aurora cluster details page.
  20. On the Connectivity & security tab, locate the Manage IAM roles section.
  21. For Add IAM roles to this cluster, choose the AuroraMLBedrock role.
  22. For Feature, choose Bedrock.
  23. Choose Add role.

Your cluster can now invoke models in Amazon Bedrock.

Use Aurora ML to generate vector embeddings

Aurora machine learning is an Aurora feature that lets builders work directly with AWS ML services using SQL commands, including Amazon Bedrock, Amazon SageMaker, and Amazon Comprehend. With recent support for Amazon Bedrock, Aurora ML gives you access to foundation models and embedding generators, helping reduce latency when working with data already stored in your Aurora cluster. This includes two new functions: aws_bedrock.invoke_model, which lets you use a foundation model from a SQL query, and aws_bedrock.invoke_model_get_embeddings, which lets you generate an embedding from a SQL query. For more information, see Using Amazon Aurora machine learning with Aurora PostgreSQL.

Let’s see how we can use Aurora ML to generate an embedding from Amazon Bedrock. First, log in to your Aurora cluster as an account with the rds_superuser privilege (for example, postgres) and install the Aurora ML extension in your database:

CREATE EXTENSION IF NOT EXISTS aws_ml CASCADE;

You should see the following output:

CREATE EXTENSION
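
The extension also installs aws_bedrock.invoke_model, which sends a prompt to an FM in Amazon Bedrock and returns the model’s response to your SQL session. The following is a minimal sketch, assuming the IAM role you attached earlier permits invoking Anthropic Claude 2.1; the request body follows the model’s native JSON format, so adjust it for whichever model you use:

-- Ask a foundation model a question directly from SQL
SELECT aws_bedrock.invoke_model(
    model_id := 'anthropic.claude-v2:1',
    content_type := 'application/json',
    accept_type := 'application/json',
    model_input := '{"prompt": "\n\nHuman: What is the pg_stat_io view in PostgreSQL 16?\n\nAssistant:", "max_tokens_to_sample": 256}'
) AS response;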

You can now create embeddings directly from Aurora. The following example shows how to generate an embedding using the Titan Embeddings G1 – Text embedding model for the phrase “PostgreSQL I/O monitoring views”:

SELECT aws_bedrock.invoke_model_get_embeddings(
model_id:= 'amazon.titan-embed-text-v1',
content_type:= 'application/json',
json_key:= 'embedding',
model_input:= '{ "inputText": "PostgreSQL I/O monitoring views" }') AS embedding;

You should see the following output (abbreviated for clarity):

{-1.0625,-1.1484375,0.578125,-0.08251953,0.39453125,-0.80859375,0.051513672,-0.0016479492,-1.203125,0.49609375,-0.33984375... -0.37695312,-0.16699219,-0.796875,1.21875,-0.36523438 }

You can use the aws_bedrock.invoke_model_get_embeddings function to generate embeddings from data that already exists in your database. However, because calling an embedding model on a single input can take 100–400 milliseconds to complete, you should batch the work with periodic commits, for example using PostgreSQL’s procedure system or an anonymous code block with transaction control, to prevent a single long-running transaction from blocking other processes.

The following example shows how we can add embeddings to an existing table and manage the batch import using an anonymous block. First, make sure the pgvector extension is installed in the database and create a table that will contain example data:

CREATE EXTENSION IF NOT EXISTS vector;
CREATE TABLE documents (
id int GENERATED BY DEFAULT AS IDENTITY PRIMARY KEY,
content text NOT NULL,
embedding vector(1536)
);
INSERT INTO documents (content)
VALUES
('Allow parallelization of FULL and internal right OUTER hash joins'),
('Allow logical replication from standby servers'),
('Allow logical replication subscribers to apply large transactions in parallel'),
('Allow monitoring of I/O statistics using the new pg_stat_io view'),
('Add SQL/JSON constructors and identity functions'),
('Improve performance of vacuum freezing'),
('Add support for regular expression matching of user and database names in pg_hba.conf, and user names in pg_ident.conf');

Now you can generate embeddings for this dataset in bulk using the following code. The example code loops through all of the records in the documents table that don’t have an embedding. For each record, the code calls the Amazon Titan embedding model to generate the embedding, and updates and commits the result to the database.

DO
$embed$
DECLARE
    doc RECORD;
    emb vector(1536);
BEGIN
    -- Process only the rows that don't have an embedding yet
    FOR doc IN SELECT id, content FROM documents WHERE embedding IS NULL LOOP
        -- Generate the embedding for this row's content with Amazon Titan
        EXECUTE $$ SELECT aws_bedrock.invoke_model_get_embeddings(
            model_id:= 'amazon.titan-embed-text-v1',
            content_type:= 'application/json',
            json_key:= 'embedding',
            model_input:= json_build_object('inputText', $1)::text)$$
        INTO emb
        USING doc.content;
        UPDATE documents SET embedding = emb WHERE id = doc.id;
        -- Commit after each row so one long transaction doesn't block others
        COMMIT;
    END LOOP;
END;
$embed$ LANGUAGE plpgsql;

In this section, we saw how to use Aurora ML to generate embeddings using Amazon Bedrock from data that already exists in your Aurora database. This technique helps speed up creating embeddings from the data in your database, because you can reduce latency by calling Amazon Bedrock directly without first transferring the data to a separate system.
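
To close the loop, you can combine both halves of the RAG workflow in a single query: generate an embedding for a question with Amazon Bedrock, then rank stored rows by similarity to it. The following is a sketch against the documents table above; it assumes the array returned by the function casts to the vector type, just as the assignment in the preceding DO block relies on:

-- Embed the question, then find the most similar stored content
WITH question AS (
    SELECT aws_bedrock.invoke_model_get_embeddings(
        model_id := 'amazon.titan-embed-text-v1',
        content_type := 'application/json',
        json_key := 'embedding',
        model_input := '{ "inputText": "How can I monitor I/O in PostgreSQL?" }'
    )::vector AS embedding
)
SELECT content
FROM documents, question
ORDER BY documents.embedding <=> question.embedding
LIMIT 3;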

Clean up

If you don’t need to use any of the resources you created, delete them when you are finished:

  1. Empty and delete your S3 bucket.
  2. If you no longer need to use the Aurora ML extension but would like to continue using your cluster, you can remove it from your Aurora cluster by running the following SQL command:
    DROP EXTENSION aws_ml;
  3. If you removed the Aurora ML extension, you can also remove the IAM role you created to access Amazon Bedrock from your cluster, and you may need to update some of your networking configurations.
  4. To delete your knowledge base, refer to Manage your knowledge base.
  5. If you no longer need your Aurora cluster, follow the instructions inDeleting Aurora DB clusters and DB instances.

Conclusion

RAG is a powerful technique that lets you combine domain-specific information with an FM to enrich responses in your generative AI applications. In this post, we covered multiple examples of how you can use Amazon Bedrock with Aurora to implement RAG for your generative AI applications. This included how to set up Amazon Aurora PostgreSQL as a knowledge base for Amazon Bedrock, and how to use Aurora ML to generate vector embeddings from Amazon Bedrock.

To learn more about using Amazon Aurora PostgreSQL and pgvector for AI and ML workloads, see Leverage pgvector and Amazon Aurora PostgreSQL for Natural Language Processing, Chatbots and Sentiment Analysis.

We invite you to leave feedback in the comments.


About the Authors

Steve Dille is a Senior Product Manager for RDS Aurora, where he leads all generative AI strategy and product initiatives for Aurora databases at AWS. Before this role, Steve founded the performance and benchmark team for Aurora and then built and launched the RDS Data API for Aurora Serverless v2. He has been with AWS for 4 years. Prior to this, he served as a software developer at NCR, product manager at HP, and Data Warehousing Director at Sybase (SAP). He has over 20 years of experience as VP of Product or CMO on the executive teams of companies, resulting in 5 successful company acquisitions and one IPO in the data management, analytics, and big data sectors. Steve earned a Master’s in Information and Data Science from UC Berkeley, an MBA from the University of Chicago Booth School of Business, and a BS in Computer Science/Math with distinction from the University of Pittsburgh.

Jonathan Katz is a Principal Product Manager – Technical on the Amazon RDS team and is based in New York. He is a Core Team member of the open source PostgreSQL project and an active open source contributor.