InferenceComponentSchedulingConfig - Amazon SageMaker

InferenceComponentSchedulingConfig

The scheduling configuration that determines how inference component copies are placed across available instances when copies are added or removed.

Contents

PlacementStrategy

The strategy for placing inference component copies across available instances. If you also set AvailabilityZoneBalance, this strategy applies to placement within each Availability Zone.

SPREAD

Distributes copies evenly across available instances for better resilience.

BINPACK

Packs copies onto fewer instances to optimize resource utilization.

Type: String

Valid Values: SPREAD | BINPACK

Required: Yes

AvailabilityZoneBalance

Configuration for balancing inference component copies across Availability Zones.

Type: InferenceComponentAvailabilityZoneBalance object

Required: No

See Also

For more information about using this API in one of the language-specific AWS SDKs, see the following: