Deploy Foundation models - Amazon SageMaker Unified Studio

Deploy Foundation models

  1. From the Models page, select the foundation model that you want to deploy.

  2. On the model details page, review the model information, including:

    • Model architecture and capabilities

    • Supported languages and input modalities

    • Training data information

    • License requirements

  3. Choose Deploy to access deployment options. This creates a sample notebook that you can use to review the deployment settings and deploy the model.

  4. Review and run the sample notebook, which demonstrates:

    • Model selection and configuration

    • Endpoint deployment procedures

    • Example inference code

  5. Wait for the deployment to complete. The deployment process may take several minutes.

  6. After the model deploys successfully, choose Endpoints in the left navigation menu.

  7. View your deployed endpoints in the endpoints list, which displays:

    • Endpoint name and status

    • Creation and modification timestamps

    • Endpoint configuration details

  8. Monitor endpoint status:

    • In Service indicates the endpoint is ready for inference requests

    • Creating shows the endpoint is being deployed

    • Failed indicates deployment issues that need attention
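
The endpoint list and status checks in steps 7 and 8 can also be done programmatically. The following is a minimal sketch, not the content of the generated sample notebook. It assumes a boto3 SageMaker client whose credentials and Region can see the endpoint. The helper names `list_endpoint_summaries` and `wait_for_endpoint` are hypothetical. Note that the SageMaker API reports the ready state as `InService` (no space), while the console displays it as In Service.

```python
import time

# Statuses from the SageMaker DescribeEndpoint API that mean work is still in progress.
IN_PROGRESS = {"Creating", "Updating", "SystemUpdating", "RollingBack"}


def list_endpoint_summaries(client):
    """Return (name, status, created, modified) for the first page of endpoints.

    Pagination via NextToken is omitted to keep the sketch short.
    """
    response = client.list_endpoints()
    return [
        (
            ep["EndpointName"],
            ep["EndpointStatus"],
            ep.get("CreationTime"),
            ep.get("LastModifiedTime"),
        )
        for ep in response.get("Endpoints", [])
    ]


def wait_for_endpoint(client, endpoint_name, poll_seconds=30, timeout_seconds=1800):
    """Poll DescribeEndpoint until the endpoint leaves an in-progress state.

    Returns the final status string, for example "InService" or "Failed".
    Raises TimeoutError if the endpoint is still in progress at the deadline.
    """
    deadline = time.monotonic() + timeout_seconds
    while True:
        response = client.describe_endpoint(EndpointName=endpoint_name)
        status = response["EndpointStatus"]
        if status not in IN_PROGRESS:
            if status == "Failed":
                # FailureReason is only present when the deployment failed.
                print("Failure reason:", response.get("FailureReason", "unknown"))
            return status
        if time.monotonic() >= deadline:
            raise TimeoutError(f"{endpoint_name} still {status} after {timeout_seconds}s")
        time.sleep(poll_seconds)
```

With boto3 installed and credentials configured, you could call `wait_for_endpoint(boto3.client("sagemaker"), "<your endpoint name>")`, replacing the placeholder with the endpoint name shown in the endpoints list. Passing the client in as an argument keeps the helpers easy to test with a stub.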