部署 JumpStart 模型

您可以使用 CLI 或 SDK 部署预训练 JumpStart 模型进行推理。

使用 CLI

运行以下命令部署 JumpStart 模型：


hyp create hyp-jumpstart-endpoint \
    --version 1.0 \
    --model-id deepseek-llm-r1-distill-qwen-1-5b \
    --model-version 2.0.4 \
    --instance-type ml.g5.8xlarge \
    --endpoint-name endpoint-jumpstart \
    --tls-output-s3-uri s3://tls-bucket-inf1-beta2

使用 SDK

使用以下内容创建一个 Python 脚本：


from sagemaker.hyperpod.inference.config.hp_jumpstart_endpoint_config import Model, Server, SageMakerEndpoint, TlsConfig
from sagemaker.hyperpod.inference.hp_jumpstart_endpoint import HPJumpStartEndpoint

model = Model(
    model_id="deepseek-llm-r1-distill-qwen-1-5b",
    model_version="2.0.4"
)

server = Server(
    instance_type="ml.g5.8xlarge"
)

endpoint_name = SageMakerEndpoint(name="endpoint-jumpstart")

tls_config = TlsConfig(tls_certificate_output_s3_uri="s3://tls-bucket-inf1-beta2")

js_endpoint = HPJumpStartEndpoint(
    model=model,
    server=server,
    sage_maker_endpoint=endpoint_name,
    tls_config=tls_config
)

js_endpoint.create()

调用端点

使用 CLI

使用示例输入测试端点：


hyp invoke hyp-jumpstart-endpoint \
    --endpoint-name endpoint-jumpstart \
    --body '{"inputs":"What is the capital of USA?"}'

使用 SDK

将以下代码添加到你的 Python 脚本中：


data = '{"inputs":"What is the capital of USA?"}'
response = js_endpoint.invoke(body=data).body.read()
print(response)

管理终端节点

使用 CLI

列出并检查端点：


hyp list hyp-jumpstart-endpoint
hyp get hyp-jumpstart-endpoint --name endpoint-jumpstart

使用 SDK

将以下代码添加到你的 Python 脚本中：


endpoint_iterator = HPJumpStartEndpoint.list()
for endpoint in endpoint_iterator:
    print(endpoint.name, endpoint.status)

logs = js_endpoint.get_logs()
print(logs)

清理资源

完成后，请删除终端节点以避免不必要的费用。

使用 CLI


hyp delete hyp-jumpstart-endpoint --name endpoint-jumpstart

使用 SDK


js_endpoint.delete()

后续步骤

现在，您已经训练了 PyTorch 模型，将其部署为自定义终端节点，并使用 HyperPod的 CLI 和 SDK 部署了 JumpStart 模型，请探索高级功能：

多节点训练：跨多个实例扩展训练
自定义容器：构建专门的训练环境
与 SageMaker 流水线集成：自动化您的机器学习工作流程
高级监控：设置自定义指标和警报

有关更多示例和高级配置，请访问SageMaker HyperPod GitHub 存储库。

Javascript 在您的浏览器中被禁用或不可用。

要使用 Amazon Web Services 文档，必须启用 Javascript。请参阅浏览器的帮助页面以了解相关说明。

文档惯例

部署自定义模型

在集群上运行作业