Build a knowledge base by connecting to a data source
Amazon Bedrock Knowledge Bases supports a variety of file types stored in data sources. In order to interpret the data from a data source, Amazon Bedrock Knowledge Bases requires the conversion of the data into vector embeddings, a numerical representation of the data. These embeddings can be compared to the vector representations of a query to assess similarity and determine which sources to return during data retrieval.
Connecting your knowledge base to a data source involves the following general steps:
-
Connect the knowledge base to a supported data source.
-
If your data source contains multimodal data, including images, audio, and video files, you must choose an appropriate processing approach and embedding model that supports multimodal content.
Note
Multimodal data is only supported with Amazon S3 and custom data sources. For comprehensive guidance on working with multimodal content, see Build a knowledge base for multimodal content.
-
Choose an embeddings model to convert the data in the data source into vector embeddings.
-
Choose a vector store to store the vector representation of your data.
-
Sync your data so it's converted to vector embeddings.
-
If you modify the data in the data source, you must resync the changes.
Topics
Prerequisites for creating an Amazon Bedrock knowledge base with a unstructured data source
Create a knowledge base by connecting to a data source in Amazon Bedrock Knowledge Bases
View data source information for your Amazon Bedrock knowledge base
Delete a data source from your Amazon Bedrock knowledge base