InferenceComponentSchedulingConfig
The scheduling configuration that determines how inference component copies are placed across available instances when copies are added or removed.
Contents
- PlacementStrategy
-
The strategy for placing inference component copies across available instances. If you also set
AvailabilityZoneBalance, this strategy applies to placement within each Availability Zone.- SPREAD
-
Distributes copies evenly across available instances for better resilience.
- BINPACK
-
Packs copies onto fewer instances to optimize resource utilization.
Type: String
Valid Values:
SPREAD | BINPACKRequired: Yes
- AvailabilityZoneBalance
-
Configuration for balancing inference component copies across Availability Zones.
Type: InferenceComponentAvailabilityZoneBalance object
Required: No
See Also
For more information about using this API in one of the language-specific AWS SDKs, see the following: