ProcessingS3Input
Configuration for downloading input data from Amazon S3 into the processing container.
Contents
- S3DataType
-
Whether you use an
S3Prefixor aManifestFilefor the data type. If you chooseS3Prefix,S3Uriidentifies a key name prefix. Amazon SageMaker uses all objects with the specified key name prefix for the processing job. If you chooseManifestFile,S3Uriidentifies an object that is a manifest file containing a list of object keys that you want Amazon SageMaker to use for the processing job.Type: String
Valid Values:
ManifestFile | S3PrefixRequired: Yes
- S3Uri
-
The URI of the Amazon S3 prefix Amazon SageMaker downloads data required to run a processing job.
Type: String
Length Constraints: Minimum length of 0. Maximum length of 1024.
Pattern:
(https|s3)://([^/]+)/?(.*)Required: Yes
- LocalPath
-
The local path in your container where you want Amazon SageMaker to write input data to.
LocalPathis an absolute path to the input data and must begin with/opt/ml/processing/.LocalPathis a required parameter whenAppManagedisFalse(default).Type: String
Length Constraints: Minimum length of 0. Maximum length of 256.
Pattern:
.*Required: No
- S3CompressionType
-
Whether to GZIP-decompress the data in Amazon S3 as it is streamed into the processing container.
Gzipcan only be used whenPipemode is specified as theS3InputMode. InPipemode, Amazon SageMaker streams input data from the source directly to your container without using the EBS volume.Type: String
Valid Values:
None | GzipRequired: No
- S3DataDistributionType
-
Whether to distribute the data from Amazon S3 to all processing instances with
FullyReplicated, or whether the data from Amazon S3 is shared by Amazon S3 key, downloading one shard of data to each processing instance.Type: String
Valid Values:
FullyReplicated | ShardedByS3KeyRequired: No
- S3InputMode
-
Whether to use
FileorPipeinput mode. In File mode, Amazon SageMaker copies the data from the input source onto the local ML storage volume before starting your processing container. This is the most commonly used input mode. InPipemode, Amazon SageMaker streams input data from the source directly to your processing container into named pipes without using the ML storage volume.Type: String
Valid Values:
Pipe | FileRequired: No
See Also
For more information about using this API in one of the language-specific AWS SDKs, see the following: