RecommendationJobContainerConfig
Specifies mandatory fields for running an Inference Recommender job directly in the
CreateInferenceRecommendationsJob
API. The fields specified in ContainerConfig override the corresponding fields in the model package. Use
ContainerConfig if you want to specify these fields for the recommendation job but don't want to edit them in your model package.
Contents
- DataInputConfig
-
Specifies the name and shape of the expected data inputs for your trained model with a JSON dictionary form. This field is used for optimizing your model using SageMaker Neo. For more information, see DataInputConfig.
Type: String
Length Constraints: Minimum length of 1. Maximum length of 1024.
Pattern:
[\S\s]+Required: No
- Domain
-
The machine learning domain of the model and its components.
Valid Values:
COMPUTER_VISION | NATURAL_LANGUAGE_PROCESSING | MACHINE_LEARNINGType: String
Required: No
- Framework
-
The machine learning framework of the container image.
Valid Values:
TENSORFLOW | PYTORCH | XGBOOST | SAGEMAKER-SCIKIT-LEARNType: String
Required: No
- FrameworkVersion
-
The framework version of the container image.
Type: String
Length Constraints: Minimum length of 3. Maximum length of 10.
Pattern:
[0-9]\.[A-Za-z0-9.-]+Required: No
- NearestModelName
-
The name of a pre-trained machine learning model benchmarked by Amazon SageMaker Inference Recommender that matches your model.
Valid Values:
efficientnetb7 | unet | xgboost | faster-rcnn-resnet101 | nasnetlarge | vgg16 | inception-v3 | mask-rcnn | sagemaker-scikit-learn | densenet201-gluon | resnet18v2-gluon | xception | densenet201 | yolov4 | resnet152 | bert-base-cased | xceptionV1-keras | resnet50 | retinanetType: String
Required: No
- PayloadConfig
-
Specifies the
SamplePayloadUrland all other sample payload-related fields.Type: RecommendationJobPayloadConfig object
Required: No
- SupportedEndpointType
-
The endpoint type to receive recommendations for. By default this is null, and the results of the inference recommendation job return a combined list of both real-time and serverless benchmarks. By specifying a value for this field, you can receive a longer list of benchmarks for the desired endpoint type.
Type: String
Valid Values:
RealTime | ServerlessRequired: No
- SupportedInstanceTypes
-
A list of the instance types that are used to generate inferences in real-time.
Type: Array of strings
Required: No
- SupportedResponseMIMETypes
-
The supported MIME types for the output data.
Type: Array of strings
Length Constraints: Minimum length of 0. Maximum length of 1024.
Pattern:
[-\w]+\/.+Required: No
- Task
-
The machine learning task that the model accomplishes.
Valid Values:
IMAGE_CLASSIFICATION | OBJECT_DETECTION | TEXT_GENERATION | IMAGE_SEGMENTATION | FILL_MASK | CLASSIFICATION | REGRESSION | OTHERType: String
Required: No
See Also
For more information about using this API in one of the language-specific AWS SDKs, see the following: