ChunkingConfiguration
Details about how to chunk the documents in the data source. A chunk refers to an excerpt from a data source that is returned when the knowledge base that it belongs to is queried.
Contents
- chunkingStrategy
-
Knowledge base can split your source data into chunks. A chunk refers to an excerpt from a data source that is returned when the knowledge base that it belongs to is queried. You have the following options for chunking your data. If you opt for
NONE, then you may want to pre-process your files by splitting them up such that each file corresponds to a chunk.Type: String
Valid Values:
FIXED_SIZE | NONE | HIERARCHICAL | SEMANTICRequired: Yes
- fixedSizeChunkingConfiguration
-
Configurations for when you choose fixed-size chunking. If you set the
chunkingStrategyasNONE, exclude this field.Type: FixedSizeChunkingConfiguration object
Required: No
- hierarchicalChunkingConfiguration
-
Settings for hierarchical document chunking for a data source. Hierarchical chunking splits documents into layers of chunks where the first layer contains large chunks, and the second layer contains smaller chunks derived from the first layer.
Type: HierarchicalChunkingConfiguration object
Required: No
- semanticChunkingConfiguration
-
Settings for semantic document chunking for a data source. Semantic chunking splits a document into smaller documents based on groups of similar content derived from the text with natural language processing.
Type: SemanticChunkingConfiguration object
Required: No
See Also
For more information about using this API in one of the language-specific AWS SDKs, see the following: