Deploy Foundation models
-
From the Models page, select a Foundation Model you want to deploy.
-
On the model details page, review the model information including:
-
Model architecture and capabilities
-
Supported languages and input modalities
-
Training data information
-
License requirements
-
-
Choose Deploy to access deployment options. This will create a Sample notebook that you can use to review and deploy.
-
Review and execute the sample notebook content which demonstrates:
-
Model selection and configuration
-
Endpoint deployment procedures
-
Example inference code
-
-
Wait for the deployment to complete. The deployment process may take several minutes.
-
After successful model deployment, navigate to Endpoints from the left navigation menu.
-
View your deployed endpoints in the endpoints list, which displays:
-
Endpoint name and status
-
Creation and modification timestamps
-
Endpoint configuration details
-
-
Monitor endpoint status:
-
In Service indicates the endpoint is ready for inference requests
-
Creating shows the endpoint is being deployed
-
Failed indicates deployment issues that need attention
-