interface SemanticChunkingConfigurationProperty
| Language | Type name |
|---|---|
.NET | Amazon.CDK.Mixins.Preview.AWS.Wisdom.Mixins.CfnKnowledgeBasePropsMixin.SemanticChunkingConfigurationProperty |
Go | github.com/aws/aws-cdk-go/awscdkmixinspreview/v2/awswisdom/mixins#CfnKnowledgeBasePropsMixin_SemanticChunkingConfigurationProperty |
Java | software.amazon.awscdk.mixins.preview.services.wisdom.mixins.CfnKnowledgeBasePropsMixin.SemanticChunkingConfigurationProperty |
Python | aws_cdk.mixins_preview.aws_wisdom.mixins.CfnKnowledgeBasePropsMixin.SemanticChunkingConfigurationProperty |
TypeScript | @aws-cdk/mixins-preview » aws_wisdom » mixins » CfnKnowledgeBasePropsMixin » SemanticChunkingConfigurationProperty |
Settings for semantic document chunking for a data source.
Semantic chunking splits a document into smaller documents based on groups of similar content derived from the text with natural language processing.
Example
// The code below shows an example of how to instantiate this type.
// The values are placeholders you should change.
import { mixins as wisdom_mixins } from '@aws-cdk/mixins-preview/aws-wisdom';
const semanticChunkingConfigurationProperty: wisdom_mixins.CfnKnowledgeBasePropsMixin.SemanticChunkingConfigurationProperty = {
breakpointPercentileThreshold: 123,
bufferSize: 123,
maxTokens: 123,
};
Properties
| Name | Type | Description |
|---|---|---|
| breakpoint | number | The dissimilarity threshold for splitting chunks. |
| buffer | number | The buffer size. |
| max | number | The maximum number of tokens that a chunk can contain. |
breakpointPercentileThreshold?
Type:
number
(optional)
The dissimilarity threshold for splitting chunks.
bufferSize?
Type:
number
(optional)
The buffer size.
maxTokens?
Type:
number
(optional)
The maximum number of tokens that a chunk can contain.

.NET
Go
Java
Python
TypeScript