ModelTrainingDataChannel - AWS Clean Rooms ML

ModelTrainingDataChannel

Information about the model training data channel. A training data channel is a named data source that the training algorithms can consume.

Contents

channelName

The name of the training data channel.

Type: String

Length Constraints: Minimum length of 1. Maximum length of 64.

Pattern: [A-Za-z0-9\.\-_]+

Required: Yes

mlInputChannelArn

The Amazon Resource Name (ARN) of the ML input channel for this model training data channel.

Type: String

Length Constraints: Minimum length of 20. Maximum length of 2048.

Pattern: arn:aws[-a-z]*:cleanrooms-ml:[-a-z0-9]+:[0-9]{12}:membership/[0-9a-f]{8}-[0-9a-f]{4}-[0-9a-f]{4}-[0-9a-f]{4}-[0-9a-f]{12}/ml-input-channel/[-a-zA-Z0-9_/.]+

Required: Yes

s3DataDistributionType

Specifies how the training data stored in Amazon S3 should be distributed to training instances. This parameter controls the data distribution strategy for the training job:

  • FullyReplicated - The entire dataset is replicated on each training instance. This is suitable for smaller datasets and algorithms that require access to the complete dataset.

  • ShardedByS3Key - The dataset is distributed across training instances based on Amazon S3 key names. This is suitable for larger datasets and distributed training scenarios where each instance processes a subset of the data.

Type: String

Valid Values: FullyReplicated | ShardedByS3Key

Required: No

See Also

For more information about using this API in one of the language-specific AWS SDKs, see the following: