Sets whether the endpoint that hosts the inference component caches the model artifacts and container image.
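
As an illustration, the sketch below shows how a boolean caching flag like this one might be packaged into a request fragment. The source names neither the field nor its parent structure, so `DataCacheConfig`, `EnableCaching`, and the helper `build_cache_settings` are assumptions made for this example and should be checked against the actual API reference before use.

```python
from typing import Any, Dict


def build_cache_settings(enable_caching: bool) -> Dict[str, Any]:
    """Return a hypothetical request fragment carrying the caching flag.

    The key names below are assumed for illustration; they are not
    confirmed by the description above.
    """
    # Reject truthy non-bool values such as "false" or 1, which would
    # otherwise silently turn caching on or be rejected by the service.
    if not isinstance(enable_caching, bool):
        raise TypeError("enable_caching must be a bool")
    return {"DataCacheConfig": {"EnableCaching": enable_caching}}


# Caching on: the endpoint would keep model artifacts and the container image.
cache_on = build_cache_settings(True)

# Caching off: the endpoint would not keep that data.
cache_off = build_cache_settings(False)

print(cache_on)
```

The fragment is built as a plain dictionary so it can be inspected or merged into a larger request before any service call is made.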