ModelTrainingDataChannel
Information about the model training data channel. A training data channel is a named data source that the training algorithms can consume.
Contents
- channelName
-
The name of the training data channel.
Type: String
Length Constraints: Minimum length of 1. Maximum length of 64.
Pattern:
[A-Za-z0-9\.\-_]+
Required: Yes
- mlInputChannelArn
-
The Amazon Resource Name (ARN) of the ML input channel for this model training data channel.
Type: String
Length Constraints: Minimum length of 20. Maximum length of 2048.
Pattern:
arn:aws[-a-z]*:cleanrooms-ml:[-a-z0-9]+:[0-9]{12}:membership/[0-9a-f]{8}-[0-9a-f]{4}-[0-9a-f]{4}-[0-9a-f]{4}-[0-9a-f]{12}/ml-input-channel/[-a-zA-Z0-9_/.]+
Required: Yes
- s3DataDistributionType
-
Specifies how the training data stored in Amazon S3 should be distributed to training instances. This parameter controls the data distribution strategy for the training job:
-
FullyReplicated
- The entire dataset is replicated on each training instance. This is suitable for smaller datasets and algorithms that require access to the complete dataset. -
ShardedByS3Key
- The dataset is distributed across training instances based on Amazon S3 key names. This is suitable for larger datasets and distributed training scenarios where each instance processes a subset of the data.
Type: String
Valid Values:
FullyReplicated | ShardedByS3Key
Required: No
-
See Also
For more information about using this API in one of the language-specific AWS SDKs, see the following: