interface SemanticChunkingConfigurationProperty
| Language | Type name |
|---|---|
| .NET | Amazon.CDK.AWS.Wisdom.CfnKnowledgeBase.SemanticChunkingConfigurationProperty |
| Go | github.com/aws/aws-cdk-go/awscdk/v2/awswisdom#CfnKnowledgeBase_SemanticChunkingConfigurationProperty |
| Java | software.amazon.awscdk.services.wisdom.CfnKnowledgeBase.SemanticChunkingConfigurationProperty |
| Python | aws_cdk.aws_wisdom.CfnKnowledgeBase.SemanticChunkingConfigurationProperty |
| TypeScript | aws-cdk-lib » aws_wisdom » CfnKnowledgeBase » SemanticChunkingConfigurationProperty |
Settings for semantic document chunking for a data source.
Semantic chunking splits a document into smaller documents based on groups of similar content derived from the text with natural language processing.
Example
// The code below shows an example of how to instantiate this type.
// The values are placeholders you should change.
import { aws_wisdom as wisdom } from 'aws-cdk-lib';
const semanticChunkingConfigurationProperty: wisdom.CfnKnowledgeBase.SemanticChunkingConfigurationProperty = {
  breakpointPercentileThreshold: 123,
  bufferSize: 123,
  maxTokens: 123,
};
Properties
| Name | Type | Description |
|---|---|---|
| breakpointPercentileThreshold | number | The dissimilarity threshold for splitting chunks. |
| bufferSize | number | The buffer size. |
| maxTokens | number | The maximum number of tokens that a chunk can contain. |
breakpointPercentileThreshold
Type: number
The dissimilarity threshold for splitting chunks.
bufferSize
Type: number
The buffer size.
maxTokens
Type: number
The maximum number of tokens that a chunk can contain.
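This page does not describe how the service applies breakpointPercentileThreshold internally. As a rough intuition only, the sketch below shows one way a percentile-based dissimilarity threshold could decide where chunks split. Everything in it is an assumption made for illustration: the dissimilarity scores, the helper names `percentile` and `groupSentences`, and the nearest-rank percentile rule are hypothetical and are not part of aws-cdk-lib or the service. bufferSize and maxTokens are not modeled.

```typescript
// Toy model of percentile-based splitting. Illustrative only; this is NOT the
// service's algorithm, and none of these helpers exist in aws-cdk-lib.

// Nearest-rank percentile of a non-empty list (p between 1 and 100).
function percentile(values: number[], p: number): number {
  const sorted = [...values].sort((a, b) => a - b);
  const rank = Math.ceil((p * sorted.length) / 100); // 1-based rank
  const index = Math.min(sorted.length - 1, Math.max(0, rank - 1));
  return sorted[index];
}

// distances[i] is a hypothetical dissimilarity score between sentence i and
// sentence i + 1. Returns groups of sentence indices, starting a new group
// wherever the score is strictly above the percentile cutoff.
function groupSentences(
  distances: number[],
  breakpointPercentileThreshold: number,
): number[][] {
  const chunks: number[][] = [];
  let current: number[] = [0];
  if (distances.length > 0) {
    const cutoff = percentile(distances, breakpointPercentileThreshold);
    for (let i = 0; i < distances.length; i++) {
      if (distances[i] > cutoff) {
        chunks.push(current); // close the chunk that ends at sentence i
        current = [];
      }
      current.push(i + 1);
    }
  }
  chunks.push(current);
  return chunks;
}

// Six sentences with made-up dissimilarity scores between neighbors.
const groups = groupSentences([0.1, 0.2, 0.9, 0.15, 0.8], 60);
console.log(JSON.stringify(groups)); // [[0,1,2],[3,4],[5]]
```

In this toy model, a higher threshold produces fewer and larger chunks, because fewer sentence boundaries exceed the cutoff.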
