Available foundation models - Amazon SageMaker AI

Available foundation models

Amazon SageMaker JumpStart offers state-of-the-art, built-in foundation models, both publicly available and proprietary, that you can customize and integrate into your generative AI workflows.

To list all available JumpStart foundation models:

    import boto3

    sagemaker_client = boto3.client("sagemaker", region_name="us-west-2")

    # List all available models in the SageMakerPublicHub
    all_models = []
    next_token = None
    while True:
        params = {"HubName": "SageMakerPublicHub", "HubContentType": "Model"}
        if next_token:
            params["NextToken"] = next_token
        response = sagemaker_client.list_hub_contents(**params)
        all_models.extend(response["HubContentSummaries"])
        next_token = response.get("NextToken")
        if not next_token:
            break

    print(f"Total models available: {len(all_models)}")
    for model in all_models[:5]:
        print(model["HubContentName"], model["HubContentVersion"])

To get supported instance types and other metadata for a specific model:

    import json

    import boto3

    sagemaker_client = boto3.client("sagemaker", region_name="us-west-2")

    model_id = "meta-textgeneration-llama-3-3-70b-instruct"
    response = sagemaker_client.describe_hub_content(
        HubName="SageMakerPublicHub",
        HubContentType="Model",
        HubContentName=model_id,
    )

    # Parse the model metadata
    model_doc = json.loads(response["HubContentDocument"])
    print(f"Supported instances: {model_doc['SupportedInferenceInstanceTypes']}")
    print(f"Default instance: {model_doc['DefaultInferenceInstanceType']}")
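Because `HubContentDocument` is returned as a JSON string, it can help to isolate the parsing step so that it can be checked without calling the service. The sketch below relies only on the two keys shown in the metadata example. The function name `summarize_model_doc` and the sample document are hypothetical, and the use of `.get` reflects an assumption that some documents might omit one of the keys.

```python
import json
from typing import Any, Dict


def summarize_model_doc(hub_content_document: str) -> Dict[str, Any]:
    """Extract the inference instance information from a hub content document string."""
    doc = json.loads(hub_content_document)
    supported = doc.get("SupportedInferenceInstanceTypes") or []
    default = doc.get("DefaultInferenceInstanceType")
    return {
        "supported": sorted(supported),
        "default": default,
        # Flags a document whose default is not in its own supported list.
        "default_is_supported": default in supported,
    }


# Illustrative document only; real documents contain many more fields.
sample_document = json.dumps(
    {
        "DefaultInferenceInstanceType": "ml.g5.12xlarge",
        "SupportedInferenceInstanceTypes": ["ml.g5.24xlarge", "ml.g5.12xlarge"],
    }
)
summary = summarize_model_doc(sample_document)
print(summary)
```

With a live response, the call would be `summarize_model_doc(response["HubContentDocument"])`.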

The following tables list available models with their license and supported inference instance types as of April 30, 2026. For the most current information, use the API examples above or see the model hub in SageMaker Studio.
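The instance type lists on this page appear in plain string order, so `ml.g5.12xlarge` comes before `ml.g5.2xlarge`. When comparing options, ordering by family and then by size can be easier to read. The following sketch assumes every name follows the `ml.<family>.<size>` pattern seen on this page; `instance_sort_key` and `sort_instance_types` are hypothetical helper names, not part of any AWS SDK.

```python
import re
from typing import List, Tuple

# Matches "xlarge", "2xlarge", "12xlarge", and so on, capturing the multiplier.
_SIZE_PATTERN = re.compile(r"^(\d*)xlarge$")


def instance_sort_key(instance_type: str) -> Tuple[str, int]:
    """Return (family, size multiplier) for a name such as 'ml.g5.12xlarge'."""
    _, family, size = instance_type.split(".")
    match = _SIZE_PATTERN.match(size)
    # Sizes without an "xlarge" suffix (for example "large") sort first.
    multiplier = int(match.group(1) or 1) if match else 0
    return (family, multiplier)


def sort_instance_types(instance_types: List[str]) -> List[str]:
    """Sort instance types by family, then by ascending size within a family."""
    return sorted(instance_types, key=instance_sort_key)


# Values copied from the Mistral Lite row of the open-weight table.
listed = ["ml.g5.12xlarge", "ml.g5.24xlarge", "ml.g5.2xlarge", "ml.g5.48xlarge", "ml.p4d.24xlarge"]
ordered = sort_instance_types(listed)
print(ordered)
```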

Open-Weight Models (417)

Open-weight foundation models are publicly available models from providers such as Meta, Google, Mistral AI, and others. You can deploy, fine-tune, and customize these models for your use case.

To get started with one of these models, see JumpStart foundation model usage or explore one of the available Example notebooks. In a given example notebook, try switching out the model ID to experiment with different models within the same model family.
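One way to find candidate model IDs to swap in is to filter the catalog by a shared ID prefix. This is a sketch under the assumption that models in the same family share an ID prefix, which holds for the IDs used here but is not a documented guarantee. The function `models_in_family` is hypothetical, and the IDs are copied from the table on this page.

```python
from typing import Iterable, List


def models_in_family(model_ids: Iterable[str], family_prefix: str) -> List[str]:
    """Return the sorted model IDs that start with the given family prefix."""
    return sorted(model_id for model_id in model_ids if model_id.startswith(family_prefix))


catalog = [
    "huggingface-llm-gemma-2-2b",
    "huggingface-llm-gemma-2-2b-instruct",
    "huggingface-llm-gemma-2-9b",
    "huggingface-llm-gemma-2-27b",
    "huggingface-llm-gemma-2b",
    "huggingface-llm-yi-1-5-9b",
]

# The trailing hyphen keeps the first-generation "gemma-2b" ID out of the Gemma 2 results.
gemma_2_ids = models_in_family(catalog, "huggingface-llm-gemma-2-")
print(gemma_2_ids)
```

Any of the returned IDs can then be substituted for the model ID in a notebook, subject to the instance types listed for that model.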

For more information on model IDs and resources on deploying publicly available JumpStart foundation models with the SageMaker Python SDK, see Use foundation models with the SageMaker Python SDK.

Provider Model Name Model ID Task License Fine-tunable Supported Inference Instance Types

01-Ai

Yi-1.5-34B

huggingface-llm-yi-1-5-34b

Text Generation

apache-2.0

No

ml.g5.12xlarge, ml.g5.24xlarge, ml.g5.48xlarge, ml.p4d.24xlarge

01-Ai

Yi-1.5-9B

huggingface-llm-yi-1-5-9b

Text Generation

apache-2.0

No

ml.g5.12xlarge, ml.g5.16xlarge, ml.g5.24xlarge, ml.g5.2xlarge, ml.g5.48xlarge, ml.g5.4xlarge, ml.g5.8xlarge, ml.p4d.24xlarge, ml.p4de.24xlarge, ml.p5.48xlarge

Ahxt

Lite Llama 460M 1T

huggingface-llm-ahxt-litellama-460m-1t

Text Generation

mit

No

ml.g5.2xlarge

Ai-Forever

Multilingual GPT

huggingface-llm-ai-forever-mgpt

Text Generation

apache-2.0

No

ml.g5.2xlarge

Aisingapore

Llama3 8B SEA-Lion v2.1 Instruct

huggingface-llm-llama3-8b-sealionv21-instruct

Text Generation

llama3

No

ml.g5.12xlarge, ml.g5.24xlarge, ml.g5.48xlarge, ml.g5.4xlarge, ml.g5.8xlarge, ml.g6.12xlarge, ml.g6.24xlarge, ml.g6.48xlarge, ml.g6.4xlarge, ml.g6.8xlarge, ml.p4d.24xlarge, ml.p5.48xlarge

Aisingapore

SEA-LION 3B

huggingface-llm-sealion-3b

Text Generation

mit

No

ml.g4dn.2xlarge, ml.g4dn.xlarge, ml.g5.2xlarge, ml.g5.4xlarge, ml.g5.xlarge, ml.p3.8xlarge

Aisingapore

SEA-LION 7B

huggingface-llm-sealion-7b

Text Generation

mit

No

ml.g4dn.12xlarge, ml.g5.12xlarge, ml.g5.24xlarge, ml.g5.48xlarge, ml.p4d.24xlarge

Aisingapore

SEA-LION 7B Instruct

huggingface-llm-sealion-7b-instruct

Text Generation

mit

No

ml.g4dn.12xlarge, ml.g5.12xlarge, ml.g5.24xlarge, ml.g5.48xlarge, ml.p4d.24xlarge

Alibaba-Nlp

Alibaba-NLP gte-Qwen2-7B-instruct

huggingface-textembedding-gte-qwen2-7b-instruct

Text Embedding

Apache-2.0

No

ml.g5.12xlarge, ml.g5.24xlarge, ml.g5.2xlarge, ml.g5.48xlarge, ml.g5.4xlarge, ml.g5.8xlarge, ml.g5.xlarge, ml.g6.12xlarge, ml.g6.24xlarge, ml.g6.2xlarge, ml.g6.48xlarge, ml.g6.4xlarge, ml.g6.8xlarge, ml.g6.xlarge, ml.p4d.24xlarge

Allenai

Olmo-3-7B-Instruct

huggingface-textgeneration-olmo-3-7b-instruct

Text Generation

apache-2.0

No

ml.g6e.24xlarge

Amazon

Chronos 2

pytorch-forecasting-chronos-2

Time Series Forecasting

apache-2.0

No

ml.c5.2xlarge, ml.c5.4xlarge, ml.c5.xlarge, ml.g4dn.2xlarge, ml.g4dn.4xlarge, ml.g4dn.xlarge, ml.g5.2xlarge, ml.g5.xlarge, ml.g6.2xlarge, ml.g6.xlarge, ml.g6e.2xlarge, ml.g6e.xlarge, ml.m5.2xlarge, ml.m5.4xlarge, ml.m5.xlarge

Amazon

Falcon Lite

huggingface-llm-amazon-falconlite

Text Generation

apache-2.0

No

ml.g5.12xlarge, ml.g5.24xlarge, ml.g5.48xlarge

Amazon

Falcon Lite 2

huggingface-llm-amazon-falconlite2

Text Generation

apache-2.0

No

ml.g5.12xlarge, ml.g5.24xlarge, ml.g5.48xlarge

Amazon

Mistral Lite

huggingface-llm-amazon-mistrallite

Text Generation

apache-2.0

No

ml.g5.12xlarge, ml.g5.24xlarge, ml.g5.2xlarge, ml.g5.48xlarge, ml.p4d.24xlarge

Autogluon

Chronos-Bolt (Base)

autogluon-forecasting-chronos-bolt-base

Time Series Forecasting

apache-2.0

No

ml.c5.2xlarge, ml.c5.4xlarge, ml.c5.xlarge, ml.g4dn.2xlarge, ml.g4dn.4xlarge, ml.g4dn.xlarge, ml.g5.2xlarge, ml.g5.4xlarge, ml.g5.xlarge, ml.m5.2xlarge, ml.m5.4xlarge, ml.m5.xlarge, ml.p3.2xlarge

Autogluon

Chronos-Bolt (Small)

autogluon-forecasting-chronos-bolt-small

Time Series Forecasting

apache-2.0

No

ml.c5.2xlarge, ml.c5.4xlarge, ml.c5.xlarge, ml.g4dn.2xlarge, ml.g4dn.4xlarge, ml.g4dn.xlarge, ml.g5.2xlarge, ml.g5.4xlarge, ml.g5.xlarge, ml.m5.2xlarge, ml.m5.4xlarge, ml.m5.xlarge, ml.p3.2xlarge

Autogluon

Chronos-T5 (Base)

autogluon-forecasting-chronos-t5-base

Time Series Forecasting

apache-2.0

No

ml.g4dn.2xlarge, ml.g4dn.4xlarge, ml.g4dn.xlarge, ml.g5.2xlarge, ml.g5.4xlarge, ml.g5.xlarge, ml.p3.2xlarge

Autogluon

Chronos-T5 (Large)

autogluon-forecasting-chronos-t5-large

Time Series Forecasting

apache-2.0

No

ml.g4dn.2xlarge, ml.g4dn.4xlarge, ml.g4dn.xlarge, ml.g5.2xlarge, ml.g5.4xlarge, ml.g5.xlarge, ml.p3.2xlarge

Autogluon

Chronos-T5 (Small)

autogluon-forecasting-chronos-t5-small

Time Series Forecasting

apache-2.0

No

ml.g4dn.2xlarge, ml.g4dn.4xlarge, ml.g4dn.xlarge, ml.g5.2xlarge, ml.g5.4xlarge, ml.g5.xlarge, ml.p3.2xlarge

Baai

BAAI bge-base-en-v1.5

huggingface-textembedding-bge-base-en-v1-5

Text Embedding

MIT

No

ml.g5.12xlarge, ml.g5.24xlarge, ml.g5.2xlarge, ml.g5.48xlarge, ml.g5.4xlarge, ml.g5.8xlarge, ml.g5.xlarge, ml.g6.12xlarge, ml.g6.24xlarge, ml.g6.2xlarge, ml.g6.48xlarge, ml.g6.4xlarge, ml.g6.8xlarge, ml.g6.xlarge, ml.p4d.24xlarge

Baai

BGE Base En

huggingface-sentencesimilarity-bge-base-en

Sentence Similarity

mit

Yes

ml.c6i.xlarge, ml.g4dn.2xlarge, ml.g5.2xlarge, ml.g5.4xlarge, ml.g5.8xlarge, ml.p3.2xlarge

Baai

BGE Base En V1.5

huggingface-sentencesimilarity-bge-base-en-v1-5

Sentence Similarity

mit

Yes

ml.c6i.xlarge, ml.g4dn.2xlarge, ml.g5.2xlarge, ml.g5.4xlarge, ml.g5.8xlarge, ml.p3.2xlarge

Baai

BGE Large En

huggingface-sentencesimilarity-bge-large-en

Sentence Similarity

mit

Yes

ml.c6i.xlarge, ml.g4dn.2xlarge, ml.g5.2xlarge, ml.g5.4xlarge, ml.g5.8xlarge, ml.p3.2xlarge

Baai

BGE Large En V1.5

huggingface-sentencesimilarity-bge-large-en-v1-5

Sentence Similarity

mit

Yes

ml.c6i.xlarge, ml.g4dn.2xlarge, ml.g5.2xlarge, ml.g5.4xlarge, ml.g5.8xlarge, ml.p3.2xlarge

Baai

BGE M3

huggingface-sentencesimilarity-bge-m3

Sentence Similarity

mit

Yes

ml.c6i.xlarge, ml.g4dn.2xlarge, ml.g5.2xlarge, ml.g5.4xlarge, ml.g5.8xlarge, ml.p3.2xlarge

Baai

BGE Small En

huggingface-sentencesimilarity-bge-small-en

Sentence Similarity

mit

Yes

ml.c6i.xlarge, ml.g4dn.2xlarge, ml.g5.2xlarge, ml.g5.4xlarge, ml.g5.8xlarge, ml.p3.2xlarge

Baai

BGE Small En V1.5

huggingface-sentencesimilarity-bge-small-en-v1-5

Sentence Similarity

mit

Yes

ml.c6i.xlarge, ml.g4dn.2xlarge, ml.g5.2xlarge, ml.g5.4xlarge, ml.g5.8xlarge, ml.p3.2xlarge

Berkeley-Nest

Starling LM 7B alpha

huggingface-llm-berkeley-nest-starling-lm-7b-alpha

Text Generation

apache-2.0

No

ml.g5.12xlarge, ml.g5.24xlarge, ml.g5.2xlarge, ml.g5.48xlarge

Bharatgenai

Param2-17B-A2.4B-Thinking

huggingface-llm-param2-17b-a2-4b-thinking

Text Generation

No

ml.g7e.2xlarge, ml.g7e.4xlarge, ml.g7e.8xlarge

Bigcode

StarCoder

huggingface-llm-starcoder

Text Generation

bigcode-openrail-m

No

ml.g5.12xlarge, ml.g5.24xlarge, ml.g5.48xlarge, ml.p4d.24xlarge

Bigcode

StarCoderBase

huggingface-llm-starcoderbase

Text Generation

bigcode-openrail-m

No

ml.g5.12xlarge, ml.g5.24xlarge, ml.g5.48xlarge, ml.p4d.24xlarge

Bigscience

Bloom 1b1

huggingface-textgeneration-bloom-1b1

Text Generation

bigscience-bloom-rail-1.0

No

ml.g5.16xlarge, ml.g5.2xlarge, ml.g5.4xlarge, ml.g5.8xlarge

Bigscience

Bloom 3B

huggingface-textgeneration1-bloom-3b

Text Generation

bigscience-bloom-rail-1.0

Yes

ml.g5.16xlarge, ml.g5.2xlarge, ml.g5.4xlarge, ml.g5.8xlarge

Bigscience

Bloom 7B1

huggingface-textgeneration1-bloom-7b1

Text Generation

bigscience-bloom-rail-1.0

Yes

ml.g5.16xlarge, ml.g5.2xlarge, ml.g5.4xlarge, ml.g5.8xlarge

Bigscience

Bloomz 1b7

huggingface-textgeneration-bloomz-1b7

Text Generation

bigscience-bloom-rail-1.0

No

ml.g5.16xlarge, ml.g5.2xlarge, ml.g5.4xlarge, ml.g5.8xlarge

Bigscience

BloomZ 7B1 FP16

huggingface-textgeneration1-bloomz-7b1-fp16

Text Generation

bigscience-bloom-rail-1.0

Yes

ml.g5.16xlarge, ml.g5.2xlarge, ml.g5.4xlarge, ml.g5.8xlarge

Bingsu

Bingsu my korean stable diffusion v1 5

huggingface-txt2img-bingsu-my-korean-stable-diffusion-v1-5

Text-to-Image

creativeml-openrail-m

No

ml.g4dn.2xlarge, ml.g4dn.xlarge, ml.p3.2xlarge

Black-Forest-Labs

Black Forest Labs FLUX.1 [schnell]

huggingface-txt2img-black-forest-labs-flux-1-schnell

Text-to-Image

apache-2.0

No

ml.g5.12xlarge, ml.g5.24xlarge, ml.g5.48xlarge, ml.p4d.24xlarge, ml.p5.48xlarge

Cognitivecomputations

Dolphin 2.2.1 Mistral 7B

huggingface-llm-dolphin-2-2-1-mistral-7b

Text Generation

apache-2.0

No

ml.g5.12xlarge, ml.g5.24xlarge, ml.g5.2xlarge, ml.g5.48xlarge

Cognitivecomputations

Dolphin 2.5 Mixtral 8 7B

huggingface-llm-dolphin-2-5-mixtral-8x7b

Text Generation

apache-2.0

No

ml.g5.48xlarge

Cognitivecomputations

Dolphin 2.7 Mixtral 8 7B

huggingface-llm-dolphin-2-7-mixtral-8x7b

Text Generation

apache-2.0

No

ml.g5.48xlarge

Cognitivecomputations

Dolphin 2.9 Llama 3 8b

huggingface-llm-cognitive-dolphin-29-llama3-8b

Text Generation

other

No

ml.g5.16xlarge, ml.g5.2xlarge, ml.g5.4xlarge, ml.g5.8xlarge

Cohereforai

Aya 101

huggingface-llm-aya-101

Text Generation

apache-2.0

No

ml.g5.12xlarge, ml.g5.24xlarge, ml.g5.48xlarge, ml.p4d.24xlarge

Cross-Encoder

Cross Encoder Nli Deberta Base

huggingface-zstc-cross-encoder-nli-deberta-base

Zero-Shot Classification

apache-2.0

No

ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge

Cross-Encoder

Cross Encoder Nli Distilroberta Base

huggingface-zstc-cross-encoder-nli-distilroberta-base

Zero-Shot Classification

apache-2.0

No

ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge

Cross-Encoder

Cross Encoder Nli MiniLM2 L6 H768

huggingface-zstc-cross-encoder-nli-minilm2-l6-h768

Zero-Shot Classification

apache-2.0

No

ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge

Cross-Encoder

Cross Encoder Nli Roberta Base

huggingface-zstc-cross-encoder-nli-roberta-base

Zero-Shot Classification

apache-2.0

No

ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge

Cyberagent

CyberAgentLM2-7B-Chat (CALM2-7B-Chat)

huggingface-llm-calm2-7b-chat-bf16

Text Generation

apache-2.0

Yes

ml.g5.12xlarge, ml.g5.24xlarge, ml.g5.2xlarge, ml.g5.4xlarge, ml.g5.xlarge

Databricks

DBRX Base

huggingface-llm-dbrx-base

Text Generation

databricks-open-model

No

ml.p4d.24xlarge, ml.p4de.24xlarge, ml.p5.48xlarge

Databricks

DBRX Instruct

huggingface-llm-dbrx-instruct

Text Generation

databricks-open-model

No

ml.p4d.24xlarge, ml.p4de.24xlarge, ml.p5.48xlarge

Databricks

Dolly V2 12b BF16

huggingface-textgeneration-dolly-v2-12b-bf16

Text Generation

apache-2.0

No

ml.g5.12xlarge, ml.g5.24xlarge, ml.g5.48xlarge, ml.p4d.24xlarge

Databricks

Dolly V2 3b BF16

huggingface-textgeneration-dolly-v2-3b-bf16

Text Generation

apache-2.0

No

ml.g5.16xlarge, ml.g5.2xlarge, ml.g5.4xlarge, ml.g5.8xlarge

Deepseek-Ai

DeepSeek-OCR

deepseek-vlm-deepseek-ocr

Image-Text-to-Text

mit

No

ml.g4dn.16xlarge, ml.g4dn.2xlarge, ml.g4dn.4xlarge, ml.g4dn.8xlarge, ml.g4dn.xlarge, ml.g5.16xlarge, ml.g5.2xlarge, ml.g5.4xlarge, ml.g5.8xlarge, ml.g5.xlarge, ml.g6.16xlarge, ml.g6.2xlarge, ml.g6.4xlarge, ml.g6.8xlarge, ml.g6.xlarge, ml.g6e.16xlarge, ml.g6e.2xlarge, ml.g6e.4xlarge, ml.g6e.8xlarge, ml.g6e.xlarge, ml.p3.2xlarge

Deepseek-Ai

DeepSeek-R1

deepseek-llm-r1

Text Generation

mit

No

ml.p5en.48xlarge

Deepseek-Ai

DeepSeek-R1-0528

deepseek-llm-r1-0528

Text Generation

mit

No

ml.p5en.48xlarge

Deepseek-Ai

DeepSeek-R1-Distill-Llama-70B

deepseek-llm-r1-distill-llama-70b

Text Generation

mit

Yes

ml.g5.48xlarge, ml.g6.48xlarge, ml.p4d.24xlarge

Deepseek-Ai

DeepSeek-R1-Distill-Llama-8B

deepseek-llm-r1-distill-llama-8b

Text Generation

mit

Yes

ml.g5.12xlarge, ml.g5.24xlarge, ml.g5.48xlarge, ml.g6.12xlarge, ml.g6.24xlarge, ml.g6.48xlarge

Deepseek-Ai

DeepSeek-R1-Distill-Qwen-1.5B

deepseek-llm-r1-distill-qwen-1-5b

Text Generation

mit

Yes

ml.g5.12xlarge, ml.g5.16xlarge, ml.g5.24xlarge, ml.g5.2xlarge, ml.g5.48xlarge, ml.g5.4xlarge, ml.g5.8xlarge, ml.g6.12xlarge, ml.g6.16xlarge, ml.g6.24xlarge, ml.g6.2xlarge, ml.g6.48xlarge, ml.g6.4xlarge, ml.g6.8xlarge

Deepseek-Ai

DeepSeek-R1-Distill-Qwen-14B

deepseek-llm-r1-distill-qwen-14b

Text Generation

mit

Yes

ml.g5.12xlarge, ml.g5.24xlarge, ml.g5.48xlarge, ml.g6.12xlarge, ml.g6.24xlarge, ml.g6.48xlarge

Deepseek-Ai

DeepSeek-R1-Distill-Qwen-32B

deepseek-llm-r1-distill-qwen-32b

Text Generation

mit

Yes

ml.g5.48xlarge, ml.g6.48xlarge, ml.p4d.24xlarge

Deepseek-Ai

DeepSeek-R1-Distill-Qwen-7B

deepseek-llm-r1-distill-qwen-7b

Text Generation

mit

Yes

ml.g5.12xlarge, ml.g5.24xlarge, ml.g5.48xlarge, ml.g6.12xlarge, ml.g6.24xlarge, ml.g6.48xlarge

Deepseek-Ai

DeepSeek-V3.1

deepseek-llm-deepseek-v3-1

Text Generation

mit

No

ml.p5en.48xlarge

Deepseek-Ai

DeepSeek-V3.2

deepseek-llm-deepseek-v3-2

Text Generation

mit

No

ml.p5en.48xlarge

Distilbert

DistilBERT Base Cased

huggingface-tc-distilbert-base-cased

Text Classification

apache-2.0

Yes

ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge

Distilbert

DistilBERT Base Cased

huggingface-spc-distilbert-base-cased

Fill-Mask

apache-2.0

Yes

ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge

Distilbert

DistilBERT Base Cased

huggingface-eqa-distilbert-base-cased

Question Answering

apache-2.0

Yes

ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge

Distilbert

DistilBERT Base Multilingual Cased

huggingface-tc-distilbert-base-multilingual-cased

Text Classification

apache-2.0

Yes

ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge

Distilbert

DistilBERT Base Multilingual Cased

huggingface-spc-distilbert-base-multilingual-cased

Fill-Mask

apache-2.0

Yes

ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge

Distilbert

DistilBERT Base Multilingual Cased

huggingface-eqa-distilbert-base-multilingual-cased

Question Answering

apache-2.0

Yes

ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge

Distilbert

DistilGPT 2

huggingface-textgeneration-distilgpt2

Text Generation

apache-2.0

No

ml.g5.16xlarge, ml.g5.2xlarge, ml.g5.4xlarge, ml.g5.8xlarge

Distilbert

DistilRoBERTa Base

huggingface-tc-distilroberta-base

Text Classification

apache-2.0

Yes

ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge

Distilbert

DistilRoBERTa Base

huggingface-eqa-distilroberta-base

Question Answering

apache-2.0

Yes

ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge

Distilbert

Question Answering

huggingface-eqa-distilbert-base-uncased

Question Answering

apache-2.0

Yes

ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge

Distilbert

Sentence Pair Classification

huggingface-spc-distilbert-base-uncased

Fill-Mask

apache-2.0

Yes

ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge

Distilbert

Text Classification

huggingface-tc-distilbert-base-uncased

Text Classification

apache-2.0

Yes

ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge

Elastic

Distilbert Base Uncased

huggingface-ner-distilbert-base-uncased-finetuned-conll03-eng

Token Classification

apache-2.0

No

ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge

Elastic

Named Entity Recognition

huggingface-ner-distilbert-base-cased-finetuned-conll03-english

Token Classification

apache-2.0

No

ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge

Elastic

Named Entity Recognition

huggingface-ner-distilbert-base-cased-finetuned-conll03-eng

Token Classification

apache-2.0

No

ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge

Eleutherai

GPT-J 6B

huggingface-textgeneration1-gpt-j-6b

Text Generation

apache-2.0

Yes

ml.g5.12xlarge, ml.g5.2xlarge, ml.g5.4xlarge, ml.g5.8xlarge

Eleutherai

GPT-Neo 125M

huggingface-textgeneration1-gpt-neo-125m

Text Generation

mit

Yes

ml.g5.16xlarge, ml.g5.2xlarge, ml.g5.4xlarge, ml.g5.8xlarge

Eleutherai

gpt-neox-20b

huggingface-textgeneration2-gpt-neox-20b-fp16

Text Generation

Apache-2.0

No

ml.g4dn.12xlarge, ml.g5.12xlarge, ml.g5.24xlarge, ml.g5.48xlarge, ml.g6.12xlarge, ml.g6.24xlarge, ml.g6.48xlarge, ml.g6e.12xlarge, ml.g6e.24xlarge, ml.p4d.24xlarge

Eleutherai

Pythia 160m Deduped

huggingface-llm-eleutherai-pythia-160m-deduped

Text Generation

apache-2.0

No

ml.g5.12xlarge, ml.g5.24xlarge, ml.g5.2xlarge

Eleutherai

Pythia 7m Deduped

huggingface-llm-eleutherai-pythia-70m-deduped

Text Generation

apache-2.0

No

ml.g5.12xlarge, ml.g5.24xlarge, ml.g5.2xlarge, ml.g5.48xlarge

Elyza

ELYZA-japanese-Llama-2-13b-chat

huggingface-llm-elyza-japanese-llama-2-13b-chat

Text Generation

llama2

Yes

ml.g5.12xlarge, ml.g5.24xlarge, ml.g5.48xlarge, ml.p4d.24xlarge

Elyza

ELYZA-japanese-Llama-2-13b-fast-chat

huggingface-llm-elyza-japanese-llama-2-13b-fast-chat

Text Generation

llama2

Yes

ml.g5.12xlarge, ml.g5.24xlarge, ml.g5.48xlarge, ml.p4d.24xlarge

Elyza

ELYZA-japanese-Llama-2-7b-chat

huggingface-llm-elyza-japanese-llama-2-7b-chat-bf16

Text Generation

llama2

Yes

ml.g5.12xlarge, ml.g5.24xlarge, ml.g5.2xlarge, ml.g5.48xlarge, ml.g5.4xlarge, ml.g5.xlarge

Elyza

ELYZA-japanese-Llama-2-7b-fast-chat

huggingface-llm-elyza-japanese-llama-2-7b-fast-chat-bf16

Text Generation

llama2

Yes

ml.g5.12xlarge, ml.g5.24xlarge, ml.g5.2xlarge, ml.g5.48xlarge, ml.g5.4xlarge, ml.g5.xlarge

Emrecan

Emrecan Bert Base Turkish Cased Allnli Tr

huggingface-zstc-emrecan-bert-base-turkish-cased-allnli-tr

Zero-Shot Classification

mit

No

ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge

Facebook

Zero-Shot Text Classification

huggingface-zstc-facebook-bart-large-mnli

Zero-Shot Classification

mit

No

ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge

Facebookai

RoBERTa Base

huggingface-tc-roberta-base

Text Classification

mit

Yes

ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge

Facebookai

RoBERTa Base

huggingface-spc-roberta-base

Fill-Mask

mit

Yes

ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge

Facebookai

RoBERTa Base

huggingface-eqa-roberta-base

Question Answering

mit

Yes

ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge

Facebookai

RoBERTa Large

huggingface-tc-roberta-large

Text Classification

mit

Yes

ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge

Facebookai

RoBERTa Large

huggingface-spc-roberta-large

Fill-Mask

mit

Yes

ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge

Facebookai

RoBERTa Large

huggingface-eqa-roberta-large

Question Answering

mit

Yes

ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge

Facebookai

XLM CLM English-German

huggingface-tc-xlm-clm-ende-1024

Text Classification

mit

Yes

ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge

Facebookai

XLM CLM English-German

huggingface-spc-xlm-clm-ende-1024

Fill-Mask

mit

Yes

ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge

Facebookai

XLM MLM 15 XNLI Languages

huggingface-spc-xlm-mlm-xnli15-1024

Fill-Mask

cc-by-nc-4.0

Yes

ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge

Facebookai

XLM MLM English-German

huggingface-tc-xlm-mlm-ende-1024

Text Classification

cc-by-nc-4.0

Yes

ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge

Facebookai

XLM MLM English-German

huggingface-spc-xlm-mlm-ende-1024

Fill-Mask

cc-by-nc-4.0

Yes

ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge

Facebookai

XLM MLM English-Romanian

huggingface-tc-xlm-mlm-enro-1024

Text Classification

cc-by-nc-4.0

Yes

ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge

Facebookai

XLM MLM English-Romanian

huggingface-spc-xlm-mlm-enro-1024

Fill-Mask

cc-by-nc-4.0

Yes

ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge

Facebookai

XLM MLM TLM 15 XNLI Languages

huggingface-tc-xlm-mlm-tlm-xnli15-1024

Text Classification

cc-by-nc-4.0

Yes

ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge

Facebookai

XLM MLM TLM 15 XNLI Languages

huggingface-spc-xlm-mlm-tlm-xnli15-1024

Fill-Mask

cc-by-nc-4.0

Yes

ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge

Fictiverse

Fictiverse Stable Diffusion BalloonArt Model

huggingface-txt2img-fictiverse-stable-diffusion-balloonart

Text-to-Image

creativeml-openrail-m

No

ml.g4dn.4xlarge, ml.g4dn.8xlarge, ml.g5.2xlarge, ml.g5.4xlarge, ml.g5.8xlarge, ml.p3.2xlarge

Fictiverse

Fictiverse Stable Diffusion Microscopic model

huggingface-txt2img-fictiverse-stable-diffusion-micro-model

Text-to-Image

openrail

No

ml.g4dn.2xlarge, ml.g4dn.xlarge, ml.p3.2xlarge

Fictiverse

Fictiverse Stable Diffusion PaperCut Model

huggingface-txt2img-fictiverse-stable-diffusion-papercut-model

Text-to-Image

creativeml-openrail-m

No

ml.g4dn.2xlarge, ml.g4dn.xlarge, ml.p3.2xlarge

Fictiverse

Fictiverse Stable Diffusion VoxelArt Model

huggingface-txt2img-fictiverse-stable-diffusion-voxelart-model

Text-to-Image

creativeml-openrail-m

No

ml.g4dn.4xlarge, ml.g4dn.8xlarge, ml.g5.2xlarge, ml.g5.4xlarge, ml.g5.8xlarge, ml.p3.2xlarge

Google

Bigbird Pegasus Large Arxiv

huggingface-summarization-bigbird-pegasus-large-arxiv

Text Summarization

apache-2.0

No

ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge

Google

Bigbird Pegasus Large Pubmed

huggingface-summarization-bigbird-pegasus-large-pubmed

Text Summarization

apache-2.0

No

ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge

Google

Flan-T5 Base

huggingface-text2text-flan-t5-base

Text2Text Generation

apache-2.0

Yes

ml.g5.16xlarge, ml.g5.2xlarge, ml.g5.4xlarge, ml.g5.8xlarge

Google

Flan-T5 Large

huggingface-text2text-flan-t5-large

Text2Text Generation

apache-2.0

Yes

ml.g5.16xlarge, ml.g5.2xlarge, ml.g5.4xlarge, ml.g5.8xlarge

Google

Flan-T5 Small

huggingface-text2text-flan-t5-small

Text2Text Generation

apache-2.0

Yes

ml.g5.16xlarge, ml.g5.2xlarge, ml.g5.4xlarge, ml.g5.8xlarge

Google

Flan-T5 XL

huggingface-text2text-flan-t5-xl

Text2Text Generation

apache-2.0

Yes

ml.g5.16xlarge, ml.g5.2xlarge, ml.g5.4xlarge, ml.g5.8xlarge

Google

Flan-T5 XXL

huggingface-text2text-flan-t5-xxl

Text2Text Generation

apache-2.0

Yes

ml.g5.12xlarge, ml.g5.24xlarge, ml.g5.48xlarge, ml.p4d.24xlarge

Google

Gemma 2 27B

huggingface-llm-gemma-2-27b

Text Generation

gemma

No

ml.g5.48xlarge, ml.p4d.24xlarge

Google

Gemma 2 27B Instruct

huggingface-llm-gemma-2-27b-instruct

Text Generation

gemma

No

ml.g5.48xlarge, ml.p4d.24xlarge

Google

Gemma 2 2B

huggingface-llm-gemma-2-2b

Text Generation

gemma

No

ml.g5.12xlarge, ml.g5.16xlarge, ml.g5.24xlarge, ml.g5.2xlarge, ml.g5.48xlarge, ml.g5.4xlarge, ml.g5.8xlarge, ml.g5.xlarge, ml.p4de.24xlarge, ml.p5.48xlarge

Google

Gemma 2 2B Instruct

huggingface-llm-gemma-2-2b-instruct

Text Generation

gemma

No

ml.g5.12xlarge, ml.g5.16xlarge, ml.g5.24xlarge, ml.g5.2xlarge, ml.g5.48xlarge, ml.g5.4xlarge, ml.g5.8xlarge, ml.g5.xlarge, ml.p4de.24xlarge, ml.p5.48xlarge

Google

Gemma 2 9B

huggingface-llm-gemma-2-9b

Text Generation

gemma

No

ml.g5.12xlarge, ml.g5.24xlarge, ml.g5.48xlarge, ml.p4d.24xlarge

Google

Gemma 2 9B Instruct

huggingface-llm-gemma-2-9b-instruct

Text Generation

gemma

No

ml.g5.12xlarge, ml.g5.24xlarge, ml.g5.48xlarge, ml.p4d.24xlarge

Google

Gemma 2B Instruct

huggingface-llm-gemma-2b-instruct

Text Generation

gemma

Yes

ml.g5.12xlarge, ml.g5.16xlarge, ml.g5.24xlarge, ml.g5.2xlarge, ml.g5.48xlarge, ml.g5.4xlarge, ml.g5.8xlarge, ml.g5.xlarge, ml.p4de.24xlarge, ml.p5.48xlarge

Google

Gemma 3 1B Instruct

huggingface-llm-gemma-3-1b-instruct

Text Generation

gemma

No

ml.g5.16xlarge, ml.g5.2xlarge, ml.g5.4xlarge, ml.g5.8xlarge

Google

Gemma 3 27B Instruct

huggingface-vlm-gemma-3-27b-instruct

Image-Text-to-Text

gemma

No

ml.g5.48xlarge, ml.g6.48xlarge, ml.p4d.24xlarge, ml.p5.48xlarge

Google

Gemma 3 4B Instruct

huggingface-vlm-gemma-3-4b-instruct

Image-Text-to-Text

gemma

No

ml.g5.16xlarge, ml.g5.2xlarge, ml.g5.4xlarge, ml.g5.8xlarge

Google

gemma-2b

huggingface-llm-gemma-2b

Text Generation

gemma

Yes

ml.g4dn.xlarge, ml.g5.xlarge, ml.g6.xlarge, ml.g6e.xlarge, ml.p3.2xlarge, ml.p4d.24xlarge

Google

gemma-4-26b-a4b-it

huggingface-vlm-gemma-4-26b-a4b-it

Text Generation

apache-2.0

No

ml.g6.12xlarge, ml.g6.24xlarge, ml.g6e.12xlarge, ml.g6e.24xlarge, ml.p5.48xlarge, ml.p5en.48xlarge

Google

gemma-4-31b-it

huggingface-vlm-gemma-4-31b-it

Image-Text-to-Text

Apache-2.0

No

ml.g6.24xlarge

Google

gemma-4-e4b-it

huggingface-vlm-gemma-4-e4b-it

Text Generation

apache-2.0

No

ml.g6.12xlarge, ml.g6e.2xlarge, ml.g6e.4xlarge, ml.g6e.xlarge, ml.p5.48xlarge, ml.p5en.48xlarge

Google

gemma-7b

huggingface-llm-gemma-7b

Text Generation

gemma

Yes

ml.g6e.48xlarge, ml.p4d.24xlarge, ml.p5en.48xlarge

Google

gemma-7b-it

huggingface-llm-gemma-7b-instruct

Text Generation

gemma

Yes

ml.g5.48xlarge, ml.g6.48xlarge, ml.g6e.12xlarge, ml.g6e.24xlarge, ml.p4d.24xlarge, ml.p5en.48xlarge

Google-Bert

BERT Base Cased

huggingface-tc-bert-base-cased

Text Classification

apache-2.0

Yes

ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge

Google-Bert

BERT Base Cased

huggingface-spc-bert-base-cased

Fill-Mask

apache-2.0

Yes

ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge

Google-Bert

BERT Base Cased

huggingface-eqa-bert-base-cased

Question Answering

apache-2.0

Yes

ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge

Google-Bert

BERT Base Multilingual Cased

huggingface-tc-bert-base-multilingual-cased

Text Classification

apache-2.0

Yes

ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge

Google-Bert

BERT Base Multilingual Cased

huggingface-spc-bert-base-multilingual-cased

Fill-Mask

apache-2.0

Yes

ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge

Google-Bert

BERT Base Multilingual Cased

huggingface-eqa-bert-base-multilingual-cased

Question Answering

apache-2.0

Yes

ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge

Google-Bert

BERT Base Multilingual Uncased

huggingface-tc-bert-base-multilingual-uncased

Text Classification

apache-2.0

Yes

ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge

Google-Bert

BERT Base Multilingual Uncased

huggingface-spc-bert-base-multilingual-uncased

Fill-Mask

apache-2.0

Yes

ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge

Google-Bert

BERT Base Multilingual Uncased

huggingface-eqa-bert-base-multilingual-uncased

Question Answering

apache-2.0

Yes

ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge

Google-Bert

BERT Base Uncased

huggingface-tc-bert-base-uncased

Text Classification

apache-2.0

Yes

ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge

Google-Bert

BERT Base Uncased

huggingface-spc-bert-base-uncased

Fill-Mask

apache-2.0

Yes

ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge

Google-Bert

BERT Base Uncased

huggingface-eqa-bert-base-uncased

Question Answering

apache-2.0

Yes

ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge

Google-Bert

BERT Large Cased

huggingface-tc-bert-large-cased

Text Classification

apache-2.0

Yes

ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge

Google-Bert

BERT Large Cased

huggingface-spc-bert-large-cased

Fill-Mask

apache-2.0

Yes

ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge

Google-Bert

BERT Large Cased

huggingface-eqa-bert-large-cased

Question Answering

apache-2.0

Yes

ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge

Google-Bert

BERT Large Cased Whole Word Masking

huggingface-spc-bert-large-cased-whole-word-masking

Fill-Mask

apache-2.0

Yes

ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge

Google-Bert

BERT Large Cased Whole Word Masking

huggingface-eqa-bert-large-cased-whole-word-masking

Question Answering

apache-2.0

Yes

ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge

Google-Bert

BERT Large Uncased

huggingface-tc-bert-large-uncased

Text Classification

apache-2.0

Yes

ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge

Google-Bert

BERT Large Uncased

huggingface-spc-bert-large-uncased

Fill-Mask

apache-2.0

Yes

ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge

Google-Bert

BERT Large Uncased Whole Word Masking

huggingface-spc-bert-large-uncased-whole-word-masking

Fill-Mask

apache-2.0

Yes

ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge

Google-Bert

Fill Mask

huggingface-fillmask-bert-base-uncased

Fill-Mask

apache-2.0

Yes

ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge

Google-T5

Machine Translation

huggingface-translation-t5-small

Translation

apache-2.0

No

ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge

Google-T5

t5 Base en de

huggingface-translation-t5-base

Translation

apache-2.0

No

ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge

Google-T5

T5 Large en de

huggingface-translation-t5-large

Translation

apache-2.0

No

ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge

Gradientai

Llama-3 8B Gradient Instruct 1048k

huggingface-llm-llama-3-8b-instruct-gradient

Text Generation

llama3

No

ml.g5.12xlarge, ml.g5.16xlarge, ml.g5.24xlarge, ml.g5.2xlarge, ml.g5.48xlarge, ml.g5.4xlarge, ml.g5.8xlarge, ml.p4d.24xlarge, ml.p4de.24xlarge, ml.p5.48xlarge

Gradientai

Llama-3 8B Gradient Instruct 262k

huggingface-llm-gradientai-llama-3-8B-instruct-262k

Text Generation

llama3

No

ml.g5.12xlarge, ml.g5.16xlarge, ml.g5.24xlarge, ml.g5.2xlarge, ml.g5.48xlarge, ml.g5.4xlarge, ml.g5.8xlarge, ml.p4d.24xlarge, ml.p4de.24xlarge, ml.p5.48xlarge

Helsinki-Nlp

Helsinki opus en es

huggingface-translation-opus-mt-en-es

Translation

apache-2.0

No

ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge

Helsinki-Nlp

Helsinki opus en vi

huggingface-translation-opus-mt-en-vi

Translation

apache-2.0

No

ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge

Helsinki-Nlp

Helsinki opus mul en

huggingface-translation-opus-mt-mul-en

Translation

apache-2.0

No

ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge

Huggingfaceh4

HuggingFaceH4 zephyr-orpo-141b-A35b-v0.1

huggingface-llm-huggingfaceh4-zephyr-orpo-141b-a35b-v01

Text Generation

apache-2.0

No

ml.g5.48xlarge, ml.p4d.24xlarge

Huggingfaceh4

Star Chat Alpha

huggingface-llm-huggingfaceh4-starchat-alpha

Text Generation

bigcode-openrail-m

No

ml.g5.12xlarge, ml.g5.24xlarge, ml.g5.48xlarge

Huggingfaceh4

Star Chat Beta

huggingface-llm-huggingfaceh4-starchat-beta

Text Generation

bigcode-openrail-m

No

ml.g5.12xlarge, ml.g5.24xlarge, ml.g5.48xlarge, ml.p4d.24xlarge

Huggingfaceh4

Zephyr 7B Beta

huggingface-llm-huggingfaceh4-zephyr-7b-beta

Text Generation

mit

No

ml.g5.12xlarge, ml.g5.24xlarge, ml.g5.2xlarge, ml.g5.48xlarge, ml.p4d.24xlarge

Ibm

Quality Controlled Paraphrase Generation

huggingface-text2text-qcpg-sentences

Text2Text Generation

apache-2.0

No

ml.g5.16xlarge, ml.g5.2xlarge, ml.g5.4xlarge, ml.g5.8xlarge

Idea-Ccnl

IDEA CCNL Taiyi Stable Diffusion 1B Chinese EN v0.1

huggingface-txt2img-idea-ccnl-taiyi-1b-chinese-en-v01

Text-to-Image

creativeml-openrail-m

No

ml.g4dn.4xlarge, ml.g4dn.8xlarge, ml.g5.2xlarge, ml.g5.4xlarge, ml.g5.8xlarge, ml.p3.2xlarge

Idea-Ccnl

IDEA CCNL Taiyi Stable Diffusion 1B Chinese v0.1

huggingface-txt2img-idea-ccnl-taiyi-1b-chinese-v0-1

Text-to-Image

creativeml-openrail-m

No

ml.g4dn.4xlarge, ml.g4dn.8xlarge, ml.g5.2xlarge, ml.g5.4xlarge, ml.g5.8xlarge, ml.p3.2xlarge

Intfloat

E5 Base

huggingface-sentencesimilarity-e5-base

Sentence Similarity

mit

Yes

ml.c6i.xlarge, ml.g4dn.2xlarge, ml.g5.2xlarge, ml.g5.4xlarge, ml.g5.8xlarge, ml.p3.2xlarge

Intfloat

E5 Base V2

huggingface-sentencesimilarity-e5-base-v2

Sentence Similarity

mit

Yes

ml.c6i.xlarge, ml.g4dn.2xlarge, ml.g5.2xlarge, ml.g5.4xlarge, ml.g5.8xlarge, ml.p3.2xlarge

Intfloat

E5 Large V2

huggingface-sentencesimilarity-e5-large-v2

Sentence Similarity

mit

Yes

ml.c6i.xlarge, ml.g4dn.2xlarge, ml.g5.2xlarge, ml.g5.4xlarge, ml.g5.8xlarge, ml.p3.2xlarge

Intfloat

Multilingual E5 Base

huggingface-sentencesimilarity-multilingual-e5-base

Sentence Similarity

mit

Yes

ml.c6i.xlarge, ml.g4dn.2xlarge, ml.g5.2xlarge, ml.g5.4xlarge, ml.g5.8xlarge, ml.p3.2xlarge

Intfloat

Multilingual E5 Large

huggingface-sentencesimilarity-multilingual-e5-large

Sentence Similarity

mit

Yes

ml.c6i.xlarge, ml.g4dn.2xlarge, ml.g5.2xlarge, ml.g5.4xlarge, ml.g5.8xlarge, ml.p3.2xlarge

Langboat

Langboat Guohua Diffusion

huggingface-txt2img-langboat-guohua-diffusion

Text-to-Image

creativeml-openrail-m

No

ml.g4dn.2xlarge, ml.g4dn.xlarge, ml.p3.2xlarge

Lighteternal

Lighteternal Nli Xlm R Greek

huggingface-zstc-lighteternal-nli-xlm-r-greek

Zero-Shot Classification

apache-2.0

No

ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge

Meta

Llama-2-7b-hf

meta-textgeneration-llama-2-7b

Text Generation

llama2

Yes

ml.g4dn.12xlarge, ml.g5.12xlarge, ml.g6.12xlarge, ml.g6e.xlarge, ml.p4d.24xlarge, ml.p5.48xlarge

Meta

Llama-3.1-70B-Instruct

meta-textgeneration-llama-3-1-70b-instruct

Text Generation

llama3.1

Yes

ml.g6e.48xlarge, ml.p5en.48xlarge

Meta

Llama-3.1-8B-Instruct

meta-textgeneration-llama-3-1-8b-instruct

Text Generation

llama3.1

Yes

ml.g5.48xlarge, ml.g6.48xlarge, ml.g6e.12xlarge, ml.p3.16xlarge, ml.p4d.24xlarge, ml.p5.48xlarge

Meta

Llama-3.2-1B

meta-textgeneration-llama-3-2-1b

Text Generation

llama3.2

Yes

ml.g4dn.12xlarge, ml.g5.48xlarge, ml.g6.12xlarge, ml.g6e.2xlarge, ml.p3.16xlarge, ml.p4d.24xlarge

Meta

Llama-3.2-1B-Instruct

meta-textgeneration-llama-3-2-1b-instruct

Text Generation

llama3.2

Yes

ml.g4dn.12xlarge, ml.g5.24xlarge, ml.g6.48xlarge, ml.g6e.12xlarge, ml.p4d.24xlarge, ml.p5.48xlarge

Meta

Llama-3.2-3B

meta-textgeneration-llama-3-2-3b

Text Generation

llama3.2

Yes

ml.g5.48xlarge, ml.g6.48xlarge, ml.g6e.24xlarge, ml.g6e.48xlarge, ml.p3.16xlarge, ml.p5en.48xlarge

Meta

Llama-3.2-3B-Instruct

meta-textgeneration-llama-3-2-3b-instruct

Text Generation

llama3.2

Yes

ml.g5.24xlarge, ml.g6.24xlarge, ml.g6e.48xlarge, ml.p3.16xlarge, ml.p4d.24xlarge, ml.p5.48xlarge

Meta

Llama-Guard-3-1B

meta-textgeneration-llama-guard-3-1b

Text Generation

llama3.2

No

ml.g4dn.12xlarge, ml.g5.12xlarge, ml.g5.24xlarge, ml.g5.48xlarge, ml.g5.8xlarge, ml.g5.xlarge, ml.g6.12xlarge, ml.g6.24xlarge, ml.g6.48xlarge, ml.g6e.12xlarge, ml.g6e.16xlarge, ml.g6e.24xlarge, ml.g6e.2xlarge, ml.g6e.4xlarge, ml.g6e.8xlarge, ml.g6e.xlarge, ml.p4d.24xlarge

Meta

Llama-Guard-3-8B

meta-textgeneration-llama-guard-3-8b

Text Generation

llama3.1

No

ml.g5.12xlarge, ml.g5.24xlarge, ml.g5.48xlarge, ml.g6.12xlarge, ml.g6.24xlarge, ml.g6.48xlarge, ml.g6e.12xlarge, ml.g6e.24xlarge, ml.p4d.24xlarge

Meta

Meta Code Llama 13B

meta-textgeneration-llama-codellama-13b

Text Generation

llama2

Yes

ml.g5.12xlarge, ml.g5.24xlarge, ml.g5.48xlarge, ml.g6.12xlarge, ml.p4d.24xlarge

Meta

Meta Code Llama 13B Instruct

meta-textgeneration-llama-codellama-13b-instruct

Text Generation

llama2

No

ml.g5.12xlarge, ml.g5.24xlarge, ml.g5.48xlarge, ml.g6.12xlarge, ml.p4d.24xlarge

Meta

Meta Code Llama 13B Python

meta-textgeneration-llama-codellama-13b-python

Text Generation

llama2

Yes

ml.g5.12xlarge, ml.g5.24xlarge, ml.g5.48xlarge, ml.g6.12xlarge, ml.p4d.24xlarge

Meta

Meta Code Llama 34B

meta-textgeneration-llama-codellama-34b

Text Generation

llama2

Yes

ml.g5.48xlarge, ml.p4d.24xlarge

Meta

Meta Code Llama 34B Instruct

meta-textgeneration-llama-codellama-34b-instruct

Text Generation

llama2

No

ml.g5.48xlarge, ml.p4d.24xlarge

Meta

Meta Code Llama 34B Python

meta-textgeneration-llama-codellama-34b-python

Text Generation

llama2

Yes

ml.g5.48xlarge, ml.p4d.24xlarge

Meta

Meta Code Llama 70B

meta-textgeneration-llama-codellama-70b

Text Generation

llama2

Yes

ml.g5.48xlarge, ml.p4d.24xlarge

Meta

Meta Code Llama 70B Instruct

meta-textgeneration-llama-codellama-70b-instruct

Text Generation

llama2

No

ml.g5.48xlarge, ml.p4d.24xlarge

Meta

Meta Code Llama 70B Neuron

meta-textgenerationneuron-llama-codellama-70b

Text Generation

llama2

No

ml.inf2.48xlarge

Meta

Meta Code Llama 70B Python

meta-textgeneration-llama-codellama-70b-python

Text Generation

llama2

Yes

ml.g5.48xlarge, ml.p4d.24xlarge

Meta

Meta Code Llama 7B

meta-textgeneration-llama-codellama-7b

Text Generation

llama2

Yes

ml.g5.12xlarge, ml.g5.24xlarge, ml.g5.2xlarge, ml.g5.48xlarge, ml.g5.4xlarge, ml.g5.8xlarge, ml.g6.2xlarge, ml.p4d.24xlarge

Meta

Meta Code Llama 7B Instruct

meta-textgeneration-llama-codellama-7b-instruct

Text Generation

llama2

No

ml.g5.12xlarge, ml.g5.24xlarge, ml.g5.2xlarge, ml.g5.48xlarge, ml.g5.4xlarge, ml.g5.8xlarge, ml.g6.12xlarge, ml.g6.2xlarge, ml.p4d.24xlarge

Meta

Meta Code Llama 7B Neuron

meta-textgenerationneuron-llama-codellama-7b

Text Generation

llama2

No

ml.inf2.8xlarge, ml.inf2.xlarge

Meta

Meta Code Llama 7B Python

meta-textgeneration-llama-codellama-7b-python

Text Generation

llama2

Yes

ml.g5.12xlarge, ml.g5.24xlarge, ml.g5.2xlarge, ml.g5.48xlarge, ml.g5.4xlarge, ml.g5.8xlarge, ml.g6.12xlarge, ml.p4d.24xlarge

Meta

Meta Code Llama 7B Python Neuron

meta-textgenerationneuron-llama-codellama-7b-python

Text Generation

llama2

No

ml.inf2.8xlarge, ml.inf2.xlarge

Meta

Meta Llama 2 13B

meta-textgeneration-llama-2-13b

Text Generation

llama2

Yes

ml.g5.12xlarge, ml.g5.24xlarge, ml.g5.48xlarge, ml.g6.12xlarge, ml.p4d.24xlarge, ml.p5.48xlarge

Meta

Meta Llama 2 13B Chat

meta-textgeneration-llama-2-13b-f

Text Generation

llama2

Yes

ml.g5.12xlarge, ml.g5.24xlarge, ml.g5.48xlarge, ml.g6.12xlarge, ml.p4d.24xlarge, ml.p5.48xlarge

Meta

Meta Llama 2 13B Chat Neuron

meta-textgenerationneuron-llama-2-13b-f

Text Generation

llama2

No

ml.inf2.24xlarge, ml.inf2.48xlarge, ml.inf2.8xlarge

Meta

Meta Llama 2 13B Neuron

meta-textgenerationneuron-llama-2-13b

Text Generation

llama2

Yes

ml.inf2.24xlarge, ml.inf2.48xlarge, ml.inf2.8xlarge

Meta

Meta Llama 2 70B

meta-textgeneration-llama-2-70b

Text Generation

llama2

Yes

ml.g5.48xlarge, ml.p4d.24xlarge

Meta

Meta Llama 2 70B Chat

meta-textgeneration-llama-2-70b-f

Text Generation

llama2

Yes

ml.g5.48xlarge, ml.p4d.24xlarge

Meta

Meta Llama 2 70B Chat Neuron

meta-textgenerationneuron-llama-2-70b-f

Text Generation

llama2

No

ml.inf2.48xlarge

Meta

Meta Llama 2 70B Neuron

meta-textgenerationneuron-llama-2-70b

Text Generation

llama2

No

ml.inf2.48xlarge

Meta

Meta Llama 2 7B Chat

meta-textgeneration-llama-2-7b-f

Text Generation

llama2

Yes

ml.g5.12xlarge, ml.g5.24xlarge, ml.g5.2xlarge, ml.g5.48xlarge, ml.g5.4xlarge, ml.g5.8xlarge, ml.g6.12xlarge, ml.p4d.24xlarge

Meta

Meta Llama 2 7B Chat Neuron

meta-textgenerationneuron-llama-2-7b-f

Text Generation

llama2

No

ml.inf2.24xlarge, ml.inf2.48xlarge, ml.inf2.8xlarge, ml.inf2.xlarge

Meta

Meta Llama 2 7B Neuron

meta-textgenerationneuron-llama-2-7b

Text Generation

llama2

Yes

ml.inf2.24xlarge, ml.inf2.48xlarge, ml.inf2.8xlarge, ml.inf2.xlarge

Meta

Meta Llama 3.1 8B Neuron

meta-textgenerationneuron-llama-3-1-8b

Text Generation

llama3.1

No

ml.inf2.24xlarge, ml.inf2.48xlarge, ml.inf2.8xlarge, ml.trn1.2xlarge, ml.trn1.32xlarge, ml.trn1n.32xlarge

Meta

Meta Llama 3 70B

meta-textgeneration-llama-3-70b

Text Generation

llama3

Yes

ml.g5.48xlarge, ml.g6.48xlarge, ml.p4d.24xlarge, ml.p5.48xlarge

Meta

Meta Llama 3 70B Instruct

meta-textgeneration-llama-3-70b-instruct

Text Generation

llama3

Yes

ml.g5.48xlarge, ml.g6.48xlarge, ml.p4d.24xlarge, ml.p5.48xlarge

Meta

Meta Llama 3 70B Instruct Neuron

meta-textgenerationneuron-llama-3-70b-instruct

Text Generation

llama3

No

ml.trn1.32xlarge

Meta

Meta Llama 3 70B Neuron

meta-textgenerationneuron-llama-3-70b

Text Generation

llama3

No

ml.trn1.32xlarge

Meta

Meta Llama 3 8B Instruct

meta-textgeneration-llama-3-8b-instruct

Text Generation

llama3

Yes

ml.g5.12xlarge, ml.g5.24xlarge, ml.g5.2xlarge, ml.g5.48xlarge, ml.g5.4xlarge, ml.g5.8xlarge, ml.g6.12xlarge, ml.g6.2xlarge, ml.p4d.24xlarge, ml.p5.48xlarge

Meta

Meta Llama 3 8B Instruct Neuron

meta-textgenerationneuron-llama-3-8b-instruct

Text Generation

llama3

No

ml.inf2.24xlarge, ml.inf2.48xlarge, ml.inf2.8xlarge

Meta

Meta Llama 3 8B Neuron

meta-textgenerationneuron-llama-3-8b

Text Generation

llama3

No

ml.inf2.24xlarge, ml.inf2.48xlarge, ml.inf2.8xlarge

Meta

Meta Llama 3.1 405B FP8

meta-textgeneration-llama-3-1-405b-fp8

Text Generation

llama3.1

Yes

ml.p4de.24xlarge, ml.p5.48xlarge

Meta

Meta Llama 3.1 405B Instruct FP8

meta-textgeneration-llama-3-1-405b-instruct-fp8

Text Generation

llama3.1

Yes

ml.p4de.24xlarge, ml.p5.48xlarge

Meta

Meta Llama 3.1 70B

meta-textgeneration-llama-3-1-70b

Text Generation

llama3.1

Yes

ml.g5.48xlarge, ml.g6.48xlarge, ml.p4d.24xlarge, ml.p5.48xlarge

Meta

Meta Llama 3.1 70B Instruct Neuron

meta-textgenerationneuron-llama-3-1-70b-instruct

Text Generation

llama3.1

No

ml.inf2.48xlarge, ml.trn1.32xlarge, ml.trn1n.32xlarge

Meta

Meta Llama 3.1 70B Neuron

meta-textgenerationneuron-llama-3-1-70b

Text Generation

llama3.1

No

ml.inf2.48xlarge, ml.trn1.32xlarge, ml.trn1n.32xlarge

Meta

Meta Llama 3.1 8B

meta-textgeneration-llama-3-1-8b

Text Generation

llama3.1

Yes

ml.g5.12xlarge, ml.g5.24xlarge, ml.g5.2xlarge, ml.g5.48xlarge, ml.g5.4xlarge, ml.g5.8xlarge, ml.g6.12xlarge, ml.g6.24xlarge, ml.g6.2xlarge, ml.g6.48xlarge, ml.g6.4xlarge, ml.g6.8xlarge, ml.p4d.24xlarge, ml.p5.48xlarge

Meta

Meta Llama 3.1 8B Instruct Neuron

meta-textgenerationneuron-llama-3-1-8b-instruct

Text Generation

llama3.1

No

ml.inf2.24xlarge, ml.inf2.48xlarge, ml.inf2.8xlarge, ml.trn1.2xlarge, ml.trn1.32xlarge, ml.trn1n.32xlarge

Meta

Meta Llama 3.2 11B Vision

meta-vlm-llama-3-2-11b-vision

Image-Text-to-Text

llama3.2

Yes

ml.g6.12xlarge, ml.g6.24xlarge, ml.g6.48xlarge, ml.p4d.24xlarge, ml.p4de.24xlarge, ml.p5.48xlarge

Meta

Meta Llama 3.2 11B Vision Instruct

meta-vlm-llama-3-2-11b-vision-instruct

Image-Text-to-Text

llama3.2

Yes

ml.g6.12xlarge, ml.g6.24xlarge, ml.g6.48xlarge, ml.p4d.24xlarge, ml.p4de.24xlarge, ml.p5.48xlarge

Meta

Meta Llama 3.2 1B Instruct Neuron

meta-textgenerationneuron-llama-3-2-1b-instruct

Text Generation

llama3.2

No

ml.inf2.24xlarge, ml.inf2.48xlarge, ml.inf2.8xlarge, ml.inf2.xlarge, ml.trn1.2xlarge, ml.trn1.32xlarge, ml.trn1n.32xlarge

Meta

Meta Llama 3.2 1B Neuron

meta-textgenerationneuron-llama-3-2-1b

Text Generation

llama3.2

No

ml.inf2.24xlarge, ml.inf2.48xlarge, ml.inf2.8xlarge, ml.inf2.xlarge, ml.trn1.2xlarge, ml.trn1.32xlarge, ml.trn1n.32xlarge

Meta

Meta Llama 3.2 3B Instruct Neuron

meta-textgenerationneuron-llama-3-2-3b-instruct

Text Generation

llama3.2

No

ml.inf2.24xlarge, ml.inf2.48xlarge, ml.inf2.8xlarge, ml.inf2.xlarge, ml.trn1.2xlarge, ml.trn1.32xlarge, ml.trn1n.32xlarge

Meta

Meta Llama 3.2 3B Neuron

meta-textgenerationneuron-llama-3-2-3b

Text Generation

llama3.2

No

ml.inf2.24xlarge, ml.inf2.48xlarge, ml.inf2.8xlarge, ml.inf2.xlarge, ml.trn1.2xlarge, ml.trn1.32xlarge, ml.trn1n.32xlarge

Meta

Meta Llama 3.2 90B Vision

meta-vlm-llama-3-2-90b-vision

Image-Text-to-Text

llama3.2

Yes

ml.p4d.24xlarge, ml.p4de.24xlarge, ml.p5.48xlarge

Meta

Meta Llama 3.2 90B Vision Instruct

meta-vlm-llama-3-2-90b-vision-instruct

Image-Text-to-Text

llama3.2

Yes

ml.p4d.24xlarge, ml.p4de.24xlarge, ml.p5.48xlarge

Meta

Meta Llama 3.3 70B Instruct

meta-textgeneration-llama-3-3-70b-instruct

Text Generation

llama3.3

Yes

ml.g5.48xlarge, ml.g6.48xlarge, ml.p4d.24xlarge, ml.p5.48xlarge

Meta

Meta Llama 4 Maverick 17B 128E FP8

meta-vlm-llama-4-maverick-17b-128e-instruct-fp8

Image-Text-to-Text

llama4

No

ml.p5en.48xlarge

Meta

Meta Llama 4 Maverick 17B 128E Instruct

meta-vlm-llama-4-maverick-17b-128e-instruct

Image-Text-to-Text

llama4

No

ml.p5en.48xlarge

Meta

Meta Llama 4 Scout 17B 16E Instruct

meta-vlm-llama-4-scout-17b-16e-instruct

Image-Text-to-Text

llama4

Yes

ml.g6e.48xlarge, ml.p5.48xlarge, ml.p5en.48xlarge

Meta

Meta Llama Guard 3 11B Vision

meta-vlm-llama-guard-3-11b-vision

Image-Text-to-Text

llama3.2

No

ml.g6.12xlarge, ml.g6.24xlarge, ml.g6.48xlarge, ml.p4d.24xlarge, ml.p4de.24xlarge, ml.p5.48xlarge

Meta

Meta Llama Guard 3 1B Neuron

meta-textgenerationneuron-llama-guard-3-1b

Text Generation

llama3.2

No

ml.inf2.24xlarge, ml.inf2.48xlarge, ml.inf2.8xlarge, ml.inf2.xlarge, ml.trn1.2xlarge, ml.trn1.32xlarge, ml.trn1n.32xlarge

Meta

Meta Llama Guard 3 8B Neuron

meta-textgenerationneuron-llama-guard-3-8b

Text Generation

llama3.1

No

ml.inf2.24xlarge, ml.inf2.48xlarge, ml.inf2.8xlarge, ml.trn1.2xlarge, ml.trn1.32xlarge, ml.trn1n.32xlarge

Meta

Meta Llama Guard 7B

meta-textgeneration-llama-guard-7b

Text Generation

llama2

No

ml.g5.12xlarge, ml.g5.24xlarge, ml.g5.2xlarge, ml.g5.48xlarge, ml.g5.4xlarge, ml.g5.8xlarge, ml.p4d.24xlarge

Meta

Meta Llama Prompt Guard 86M

meta-tc-llama-prompt-guard-86m

Text Classification

llama3.1

No

ml.g5.12xlarge, ml.g5.24xlarge, ml.g5.2xlarge, ml.g5.48xlarge, ml.g5.4xlarge, ml.g5.8xlarge, ml.g5.xlarge, ml.g6.12xlarge, ml.g6.24xlarge, ml.g6.2xlarge, ml.g6.48xlarge, ml.g6.4xlarge, ml.g6.8xlarge, ml.g6.xlarge, ml.p4d.24xlarge

Meta

Meta SAM 2.1 Hiera Base Plus

meta-vs-sam-2-1-hiera-base-plus

Image Segmentation

apache-2.0

No

ml.g5.24xlarge, ml.g5.48xlarge, ml.g6.24xlarge, ml.g6.48xlarge, ml.p4d.24xlarge, ml.p4de.24xlarge

Meta

Meta SAM 2.1 Hiera Large

meta-vs-sam-2-1-hiera-large

Image Segmentation

apache-2.0

No

ml.g5.24xlarge, ml.g5.48xlarge, ml.g6.24xlarge, ml.g6.48xlarge, ml.p4d.24xlarge, ml.p4de.24xlarge

Meta

Meta SAM 2.1 Hiera Small

meta-vs-sam-2-1-hiera-small

Image Segmentation

apache-2.0

No

ml.g5.24xlarge, ml.g5.48xlarge, ml.g6.24xlarge, ml.g6.48xlarge, ml.p4d.24xlarge, ml.p4de.24xlarge

Meta

Meta SAM 2.1 Hiera Tiny

meta-vs-sam-2-1-hiera-tiny

Image Segmentation

apache-2.0

No

ml.g5.24xlarge, ml.g5.48xlarge, ml.g6.24xlarge, ml.g6.48xlarge, ml.p4d.24xlarge, ml.p4de.24xlarge

Meta

Meta-Llama-3-8B

meta-textgeneration-llama-3-8b

Text Generation

llama3

Yes

ml.g4dn.12xlarge, ml.g5.12xlarge, ml.g6.12xlarge, ml.g6e.xlarge, ml.p3.8xlarge, ml.p4d.24xlarge

Microsoft

Phi-2

huggingface-llm-phi-2

Text Generation

mit

No

ml.g5.12xlarge, ml.g5.16xlarge, ml.g5.24xlarge, ml.g5.2xlarge, ml.g5.48xlarge, ml.g5.4xlarge, ml.g5.8xlarge, ml.g5.xlarge, ml.p4de.24xlarge, ml.p5.48xlarge

Microsoft

Phi-3-Mini-128K-Instruct

huggingface-llm-phi-3-mini-128k-instruct

Text Generation

mit

No

ml.g5.12xlarge, ml.g5.16xlarge, ml.g5.24xlarge, ml.g5.2xlarge, ml.g5.48xlarge, ml.g5.4xlarge, ml.g5.8xlarge, ml.p4d.24xlarge, ml.p4de.24xlarge, ml.p5.48xlarge

Microsoft

Phi-3-mini-4k-instruct

huggingface-llm-phi-3-mini-4k-instruct

Text Generation

mit

No

ml.g4dn.12xlarge, ml.g5.12xlarge, ml.g6.12xlarge, ml.g6e.xlarge, ml.p4d.24xlarge, ml.p5en.48xlarge

Microsoft

Phi-3.5-mini-instruct

huggingface-llm-phi-3-5-mini-instruct

Text Generation

mit

No

ml.g5.48xlarge, ml.p4d.24xlarge

Microsoft

Table Transformer Detection

huggingface-od-microsoft-table-transformer-detection

Object Detection

mit

No

ml.g5.2xlarge, ml.g5.4xlarge, ml.g5.xlarge, ml.g6.2xlarge, ml.g6.4xlarge, ml.g6.xlarge, ml.g6e.2xlarge, ml.g6e.4xlarge, ml.g6e.xlarge

Minimaxai

MiniMax-M2

huggingface-llm-minimax-m2

Text Generation

other

No

ml.p4de.24xlarge

Minimaxai

MiniMax-M2.1

huggingface-llm-minimax-m2-1

Text Generation

other

No

ml.p4de.24xlarge, ml.p5.48xlarge

Minimaxai

MiniMax-M2.5

huggingface-llm-minimax-m2-5

Text Generation

other

No

ml.p5.48xlarge

Minimaxai

MiniMax-M2.7

huggingface-llm-minimax-m2-7

Text Generation

other

No

ml.p5en.48xlarge

Mistralai

Mistral 7B Instruct Neuron

huggingface-llmneuron-mistral-7b-instruct

Text Generation

apache-2.0

No

ml.inf2.8xlarge, ml.inf2.xlarge

Mistralai

Mistral 7B Neuron

huggingface-llmneuron-mistral-7b

Text Generation

apache-2.0

No

ml.inf2.8xlarge, ml.inf2.xlarge

Mistralai

Mistral Nemo Base 2407

huggingface-llm-mistral-nemo-base-2407

Text Generation

apache-2.0

No

ml.g5.12xlarge, ml.g5.24xlarge, ml.g5.48xlarge, ml.g6.12xlarge, ml.g6.24xlarge, ml.g6.48xlarge, ml.p4d.24xlarge

Mistralai

Mistral Nemo Instruct 2407

huggingface-llm-mistral-nemo-instruct-2407

Text Generation

apache-2.0

No

ml.g5.12xlarge, ml.g5.24xlarge, ml.g5.48xlarge, ml.g6.12xlarge, ml.g6.24xlarge, ml.g6.48xlarge, ml.p4d.24xlarge

Mistralai

Mistral Pixtral-12B-2409

huggingface-vlm-mistral-pixtral-12b-2409

Image-Text-to-Text

apache-2.0

No

ml.g6.12xlarge, ml.g6.24xlarge, ml.g6.48xlarge, ml.p4d.24xlarge, ml.p4de.24xlarge, ml.p5.48xlarge

Mistralai

Mistral-7B-Instruct-v0.2

huggingface-llm-mistral-7b-instruct

Text Generation

apache-2.0

No

ml.g5.12xlarge, ml.g6.24xlarge, ml.g6e.2xlarge, ml.p3.16xlarge, ml.p4d.24xlarge, ml.p5.48xlarge

Mistralai

Mistral-7B-Instruct-v0.3

huggingface-llm-mistral-7b-v3

Text Generation

apache-2.0

No

ml.g4dn.12xlarge, ml.g5.12xlarge, ml.g6.12xlarge, ml.g6e.2xlarge, ml.p4d.24xlarge, ml.p5.48xlarge

Mistralai

Mistral-7B-Instruct-v0.3

huggingface-llm-mistral-7b-instruct-v3

Text Generation

apache-2.0

No

ml.g4dn.12xlarge, ml.g5.12xlarge, ml.g6.12xlarge, ml.g6e.2xlarge, ml.p4d.24xlarge, ml.p5.48xlarge

Mistralai

Mistral-7B-v0.1

huggingface-llm-mistral-7b

Text Generation

apache-2.0

Yes

ml.g5.12xlarge, ml.g6.12xlarge, ml.g6e.12xlarge, ml.p3.16xlarge, ml.p4d.24xlarge, ml.p5.48xlarge

Mistralai

Mistral-Small-24B-Instruct-2501

huggingface-llm-mistral-small-24B-Instruct-2501

Text Generation

apache-2.0

No

ml.g5.48xlarge, ml.g6.48xlarge, ml.g6e.12xlarge, ml.g6e.24xlarge, ml.p4d.24xlarge, ml.p5en.48xlarge

Mistralai

Mistral-Small-3.1-24B-Base-2503

huggingface-vlm-mistral-small-3-1-24b-base-2503

Image-Text-to-Text

apache-2.0

No

ml.g6.12xlarge, ml.g6.16xlarge, ml.g6.24xlarge, ml.g6.48xlarge, ml.g6e.12xlarge

Mistralai

Mistral-Small-3.2-24B-Instruct-2506

huggingface-vlm-mistral-small-3-2-24b-instruct-2506

Image-Text-to-Text

apache-2.0

No

ml.g6.12xlarge, ml.g6.16xlarge, ml.g6.24xlarge, ml.g6.48xlarge, ml.g6e.12xlarge

Mistralai

Mixtral 8x7B

huggingface-llm-mixtral-8x7b

Text Generation

apache-2.0

Yes

ml.g5.48xlarge, ml.g6.48xlarge, ml.p4d.24xlarge, ml.p5.48xlarge

Mistralai

Mixtral-8x22B V1

huggingface-llm-mixtral-8x22B

Text Generation

apache-2.0

No

ml.p4d.24xlarge, ml.p4de.24xlarge

Mistralai

Mixtral-8x22B-Instruct-v0.1

huggingface-llm-mistralai-mixtral-8x22B-instruct-v0-1

Text Generation

apache-2.0

No

ml.p4de.24xlarge, ml.p5.48xlarge

Mistralai

Mixtral-8x7B-Instruct-v0.1

huggingface-llm-mixtral-8x7b-instruct

Text Generation

apache-2.0

Yes

ml.g5.48xlarge, ml.g6.48xlarge, ml.g6e.12xlarge, ml.p4d.24xlarge, ml.p5.48xlarge, ml.p5en.48xlarge

Moonshotai

Kimi-K2.5

huggingface-llm-kimi-k2-5

Text Generation

other

No

ml.p5en.48xlarge

Moritzlaurer

MoritzLaurer MDeBERTa V3 Base Xnli Multilingual Nli 2mil7

huggingface-zstc-moritzlaurer-mdeberta3base-xnli-mling-nli-2m7

Zero-Shot Classification

mit

No

ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge

Mosaicml

MPT 7B BF16

huggingface-textgeneration1-mpt-7b-bf16

Text Generation

apache-2.0

No

ml.g5.16xlarge, ml.g5.2xlarge, ml.g5.4xlarge, ml.g5.8xlarge

Mosaicml

MPT 7B Instruct BF16

huggingface-textgeneration1-mpt-7b-instruct-bf16

Text Generation

cc-by-sa-3.0

No

ml.g5.16xlarge, ml.g5.2xlarge, ml.g5.4xlarge, ml.g5.8xlarge

Mrm8488

Bert Small2bert

huggingface-summarization-bert-small2bert-cnn-dailymail-summ

Text Summarization

apache-2.0

No

ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge

Mxnet

Financial Text Embedding

mxnet-tcembedding-robertafin-base-uncased

Text Embedding

apache-2.0

No

ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge

Mxnet

RoBERTa-SEC-Large

mxnet-tcembedding-robertafin-large-uncased

Text Embedding

apache-2.0

No

ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge

Mxnet

RoBERTa-SEC-WIKI-Base

mxnet-tcembedding-robertafin-base-wiki-uncased

Text Embedding

apache-2.0

No

ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge

Mxnet

RoBERTa-SEC-WIKI-Large

mxnet-tcembedding-robertafin-large-wiki-uncased

Text Embedding

apache-2.0

No

ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge

Naclbit

Naclbit Trinart Stable Diffusion V2

huggingface-txt2img-naclbit-trinart-stable-diffusion-v2

Text-to-Image

creativeml-openrail-m

No

ml.g4dn.4xlarge, ml.g4dn.8xlarge, ml.g5.2xlarge, ml.g5.4xlarge, ml.g5.8xlarge, ml.p3.2xlarge

Narsil

Narsil Deberta Large Mnli Zero Cls

huggingface-zstc-narsil-deberta-large-mnli-zero-cls

Zero-Shot Classification

mit

No

ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge

Nexaai

NexaAI Octopus-v2

huggingface-llm-nexaaidev-octopus-v2

Text Generation

cc-by-nc-4.0

No

ml.g5.16xlarge, ml.g5.2xlarge, ml.g5.4xlarge, ml.g5.8xlarge

Nexusflow

Nexusflow Starling-LM-7B-beta

huggingface-llm-nexusflow-starling-lm-7b-beta

Text Generation

apache-2.0

No

ml.g5.16xlarge, ml.g5.2xlarge, ml.g5.4xlarge, ml.g5.8xlarge

Nousresearch

Hermes 2 Pro-Llama-3 8B

huggingface-llm-nousresearch-hermes-2-pro-llama-3-8B

Text Generation

llama3

No

ml.g5.12xlarge, ml.g5.16xlarge, ml.g5.24xlarge, ml.g5.2xlarge, ml.g5.48xlarge, ml.g5.4xlarge, ml.g5.8xlarge, ml.p4d.24xlarge, ml.p4de.24xlarge, ml.p5.48xlarge

Nousresearch

Nous Hermes 2 SOLAR 10.7B

huggingface-llm-nousresearch-nous-hermes-2-solar-10-7b

Text Generation

apache-2.0

No

ml.g5.12xlarge, ml.g5.24xlarge, ml.g5.48xlarge

Nova

Nova 2.0 Lite

nova-textgeneration-lite-v2

Text Generation

AWS Service Terms

Yes

Nova

Nova Lite

nova-textgeneration-lite

Text Generation

AWS Service Terms

Yes

Nova

Nova Micro

nova-textgeneration-micro

Text Generation

AWS Service Terms

Yes

Nova

Nova Pro

nova-textgeneration-pro

Text Generation

AWS Service Terms

Yes

Nvidia

Nvidia Llama3-ChatQA-1.5 8B

huggingface-llm-nvidia-llama3-chatqa-1-5-8B

Text Generation

llama3

No

ml.g5.12xlarge, ml.g5.16xlarge, ml.g5.24xlarge, ml.g5.2xlarge, ml.g5.48xlarge, ml.g5.4xlarge, ml.g5.8xlarge, ml.p4d.24xlarge, ml.p4de.24xlarge, ml.p5.48xlarge

Nvidia

NVIDIA-Nemotron-3-Nano-30B-A3B-BF16

huggingface-reasoning-nvidia-nemotron-3-nano-30b-a3b-bf16

Reasoning

other

No

ml.g6e.24xlarge, ml.g7e.12xlarge

Nvidia

NVIDIA-Nemotron-3-Super-120B-A12B-FP8

huggingface-reasoning-nvidia-nemotron-3-super-120b-a12b-fp8

Reasoning

other

No

ml.g7e.24xlarge

Nvidia

nvidia-nemotron3-nano-omni-30ba3b-reasoning-fp8

huggingface-vlm-nvidia-nemotron3-nano-omni-30ba3b-reasoning-fp8

Image-Text-to-Text

other

No

ml.g6.12xlarge, ml.g6.24xlarge, ml.g6e.12xlarge, ml.g6e.24xlarge

Openai

gpt-oss-120b

openai-reasoning-gpt-oss-120b

Reasoning

apache-2.0

Yes

ml.g6e.48xlarge, ml.p4d.24xlarge, ml.p4de.24xlarge, ml.p5.48xlarge, ml.p5en.48xlarge

Openai

gpt-oss-20b

openai-reasoning-gpt-oss-20b

Reasoning

apache-2.0

Yes

ml.g5.12xlarge, ml.g5.24xlarge, ml.g6.12xlarge, ml.g6.24xlarge, ml.p5.48xlarge, ml.p5en.48xlarge

Openai

Whisper Base

huggingface-asr-whisper-base

Automatic Speech Recognition

apache-2.0

No

ml.g4dn.2xlarge, ml.g5.2xlarge, ml.g5.4xlarge, ml.g5.8xlarge, ml.p3.2xlarge

Openai

Whisper Large

huggingface-asr-whisper-large

Automatic Speech Recognition

apache-2.0

No

ml.g4dn.2xlarge, ml.g5.2xlarge, ml.g5.4xlarge, ml.g5.8xlarge, ml.p3.2xlarge

Openai

Whisper Large V2

huggingface-asr-whisper-large-v2

Automatic Speech Recognition

apache-2.0

No

ml.g4dn.2xlarge, ml.g5.2xlarge, ml.g5.4xlarge, ml.g5.8xlarge, ml.p3.2xlarge

Openai

Whisper Large V3

huggingface-asr-whisper-large-v3

Automatic Speech Recognition

apache-2.0

No

ml.g4dn.2xlarge, ml.g5.2xlarge, ml.g5.4xlarge, ml.g5.8xlarge, ml.p3.2xlarge

Openai

Whisper Large V3 Turbo

huggingface-asr-whisper-large-v3-turbo

Automatic Speech Recognition

mit

No

ml.g4dn.2xlarge, ml.g5.2xlarge, ml.g5.4xlarge, ml.g5.8xlarge, ml.p3.2xlarge

Openai

Whisper Medium

huggingface-asr-whisper-medium

Automatic Speech Recognition

apache-2.0

No

ml.g4dn.2xlarge, ml.g5.2xlarge, ml.g5.4xlarge, ml.g5.8xlarge, ml.p3.2xlarge

Openai

Whisper Small

huggingface-asr-whisper-small

Automatic Speech Recognition

apache-2.0

No

ml.g4dn.2xlarge, ml.g5.2xlarge, ml.g5.4xlarge, ml.g5.8xlarge, ml.p3.2xlarge

Openai

Whisper Tiny

huggingface-asr-whisper-tiny

Automatic Speech Recognition

apache-2.0

No

ml.g4dn.2xlarge, ml.g5.2xlarge, ml.g5.4xlarge, ml.g5.8xlarge, ml.p3.2xlarge

Openai-Community

GPT 2

huggingface-textgeneration-gpt2

Text Generation

mit

No

ml.g5.16xlarge, ml.g5.2xlarge, ml.g5.4xlarge, ml.g5.8xlarge

Openai-Community

GPT-2 XL

huggingface-textgeneration1-gpt-2-xl

Text Generation

mit

Yes

ml.g5.16xlarge, ml.g5.2xlarge, ml.g5.4xlarge, ml.g5.8xlarge

Openai-Community

RoBERTa Base OpenAI Detector

huggingface-tc-roberta-base-openai-detector

Text Classification

mit

Yes

ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge

Openai-Community

RoBERTa Base OpenAI Detector

huggingface-spc-roberta-base-openai-detector

Fill-Mask

mit

Yes

ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge

Openai-Community

RoBERTa Base OpenAI Detector

huggingface-eqa-roberta-base-openai-detector

Question Answering

mit

Yes

ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge

Openai-Community

RoBERTa Large OpenAI Detector

huggingface-tc-roberta-large-openai-detector

Text Classification

mit

Yes

ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge

Openai-Community

RoBERTa Large OpenAI Detector

huggingface-spc-roberta-large-openai-detector

Fill-Mask

mit

Yes

ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge

Openlm-Research

Open Llama 7B V2

huggingface-llm-openlm-research-open-llama-7b-v2

Text Generation

apache-2.0

No

ml.g5.12xlarge, ml.g5.24xlarge, ml.g5.2xlarge, ml.g5.48xlarge

Philschmid

Bart Large CNN samsum

huggingface-summarization-bart-large-cnn-samsum

Text Summarization

mit

No

ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge

Philschmid

Flan-T5 Base Model Fine-tuned on the Samsum Dataset

huggingface-text2text-flan-t5-base-samsum

Text2Text Generation

apache-2.0

No

ml.g5.16xlarge, ml.g5.2xlarge, ml.g5.4xlarge, ml.g5.8xlarge

Pytorch

Alexa TM 20B

pytorch-textgeneration1-alexa20b

Text Generation

amazon-software-license

No

ml.g4dn.12xlarge, ml.g5.16xlarge, ml.p3.16xlarge, ml.p3.8xlarge

Qwen

QVQ 72B Preview

huggingface-vlm-qvq-72b-preview

Image-Text-to-Text

other

No

ml.g5.48xlarge, ml.g6.48xlarge, ml.p4d.24xlarge

Qwen

Qwen2 0.5B

huggingface-llm-qwen2-0-5b

Text Generation

apache-2.0

No

ml.g5.12xlarge, ml.g5.16xlarge, ml.g5.24xlarge, ml.g5.2xlarge, ml.g5.48xlarge, ml.g5.4xlarge, ml.g5.8xlarge, ml.p4d.24xlarge

Qwen

Qwen2 0.5B Instruct

huggingface-llm-qwen2-0-5b-instruct

Text Generation

apache-2.0

No

ml.g5.12xlarge, ml.g5.16xlarge, ml.g5.24xlarge, ml.g5.2xlarge, ml.g5.48xlarge, ml.g5.4xlarge, ml.g5.8xlarge, ml.p4d.24xlarge

Qwen

Qwen2-1.5B

huggingface-llm-qwen2-1-5b

Text Generation

apache-2.0

No

ml.g4dn.12xlarge, ml.g5.12xlarge, ml.g6.12xlarge, ml.g6e.2xlarge, ml.g6e.4xlarge, ml.g6e.xlarge

Qwen

Qwen2-1.5B-Instruct

huggingface-llm-qwen2-1-5b-instruct

Text Generation

apache-2.0

No

ml.g4dn.xlarge, ml.g5.2xlarge, ml.g5.xlarge, ml.g6.2xlarge, ml.g6e.xlarge, ml.p3.2xlarge

Qwen

Qwen2-7B

huggingface-llm-qwen2-7b

Text Generation

apache-2.0

No

ml.g4dn.12xlarge, ml.g5.12xlarge, ml.g5.24xlarge, ml.g6.12xlarge, ml.g6.24xlarge, ml.g6e.12xlarge, ml.g6e.16xlarge, ml.g6e.24xlarge, ml.g6e.2xlarge, ml.g6e.4xlarge, ml.g6e.8xlarge, ml.g6e.xlarge

Qwen

Qwen2-7B-Instruct

huggingface-llm-qwen2-7b-instruct

Text Generation

apache-2.0

No

ml.g4dn.12xlarge, ml.g5.12xlarge, ml.g5.24xlarge, ml.g6.12xlarge, ml.g6.24xlarge, ml.g6e.12xlarge, ml.g6e.16xlarge, ml.g6e.24xlarge, ml.g6e.4xlarge, ml.g6e.8xlarge

Qwen

Qwen2-VL-7B-Instruct

huggingface-vlm-qwen2-vl-7b-instruct

Text Generation

apache-2.0

No

ml.g5.12xlarge, ml.g6.12xlarge, ml.g6e.12xlarge, ml.g6e.4xlarge, ml.g6e.8xlarge, ml.g6e.xlarge

Qwen

Qwen2.5 14B Instruct

huggingface-llm-qwen2-5-14b-instruct

Text Generation

apache-2.0

Yes

ml.g5.24xlarge, ml.g5.48xlarge, ml.g6.24xlarge, ml.g6.48xlarge, ml.p4d.24xlarge

Qwen

Qwen2.5 32B Instruct

huggingface-llm-qwen2-5-32b-instruct

Text Generation

apache-2.0

Yes

ml.g5.48xlarge, ml.g6.48xlarge, ml.p4d.24xlarge

Qwen

Qwen2.5 Coder 32B Instruct

huggingface-llm-qwen2-5-coder-32b-instruct

Text Generation

apache-2.0

No

ml.g5.24xlarge, ml.g5.48xlarge, ml.g6.24xlarge, ml.g6.48xlarge, ml.p4d.24xlarge

Qwen

Qwen2.5 Coder 7B Instruct

huggingface-llm-qwen2-5-coder-7b-instruct

Text Generation

apache-2.0

No

ml.g5.16xlarge, ml.g5.2xlarge, ml.g5.4xlarge, ml.g5.8xlarge, ml.g6.16xlarge, ml.g6.4xlarge, ml.g6.8xlarge, ml.p4d.24xlarge

Qwen

Qwen2.5-72B-Instruct

huggingface-llm-qwen2-5-72b-instruct

Text Generation

other

Yes

ml.g6e.48xlarge, ml.p4d.24xlarge, ml.p5en.48xlarge

Qwen

Qwen2.5-7B-Instruct

huggingface-llm-qwen2-5-7b-instruct

Text Generation

apache-2.0

Yes

ml.g4dn.12xlarge, ml.g5.12xlarge, ml.g6.12xlarge, ml.g6e.2xlarge, ml.g6e.4xlarge, ml.g6e.xlarge

Qwen

Qwen3 14B

huggingface-reasoning-qwen3-14b

Reasoning

apache-2.0

Yes

ml.g5.12xlarge, ml.g5.24xlarge, ml.g5.48xlarge, ml.g6.12xlarge, ml.g6.24xlarge, ml.g6.48xlarge, ml.p4d.24xlarge

Qwen

Qwen3-0.6B

huggingface-reasoning-qwen3-06b

Reasoning

apache-2.0

Yes

ml.g4dn.12xlarge, ml.g5.12xlarge, ml.g6.12xlarge, ml.g6e.2xlarge, ml.p4d.24xlarge, ml.p5en.48xlarge

Qwen

Qwen3-1.7B

huggingface-reasoning-qwen3-1-7b

Reasoning

apache-2.0

Yes

ml.g4dn.12xlarge, ml.g4dn.16xlarge, ml.g4dn.2xlarge, ml.g4dn.4xlarge, ml.g4dn.8xlarge, ml.g5.12xlarge, ml.g5.24xlarge, ml.g5.48xlarge, ml.g5.4xlarge, ml.g6.12xlarge, ml.g6.24xlarge, ml.g6.48xlarge, ml.g6.8xlarge, ml.g6.xlarge, ml.g6e.12xlarge, ml.g6e.16xlarge, ml.g6e.24xlarge, ml.g6e.2xlarge, ml.g6e.4xlarge, ml.g6e.8xlarge, ml.g6e.xlarge, ml.p3.2xlarge, ml.p4d.24xlarge

Qwen

Qwen3-30B-A3B

huggingface-reasoning-qwen3-30b-a3b

Reasoning

apache-2.0

No

ml.g5.24xlarge

Qwen

Qwen3-30B-A3B-Instruct-2507

huggingface-reasoning-qwen3-30b-a3b-instruct-2507

Text Generation

apache-2.0

No

ml.g5.48xlarge, ml.g6.48xlarge, ml.g6e.12xlarge, ml.p4de.24xlarge

Qwen

Qwen3-30B-A3B-Thinking-2507

huggingface-reasoning-qwen3-30b-a3b-thinking-2507

Reasoning

apache-2.0

No

ml.g6e.12xlarge

Qwen

Qwen3-32B

huggingface-reasoning-qwen3-32b

Reasoning

apache-2.0

Yes

ml.g5.48xlarge, ml.g6.48xlarge, ml.g6e.12xlarge, ml.p4d.24xlarge, ml.p5.48xlarge, ml.p5en.48xlarge

Qwen

Qwen3-4B

huggingface-reasoning-qwen3-4b

Reasoning

apache-2.0

Yes

ml.g4dn.12xlarge, ml.g5.12xlarge, ml.g5.16xlarge, ml.g5.24xlarge, ml.g5.2xlarge, ml.g5.48xlarge, ml.g5.4xlarge, ml.g5.8xlarge, ml.g5.xlarge, ml.g6.12xlarge, ml.g6.16xlarge, ml.g6.24xlarge, ml.g6.2xlarge, ml.g6.48xlarge, ml.g6.4xlarge, ml.g6.8xlarge, ml.g6.xlarge, ml.g6e.12xlarge, ml.g6e.16xlarge, ml.g6e.24xlarge, ml.g6e.2xlarge, ml.g6e.4xlarge, ml.g6e.8xlarge, ml.g6e.xlarge, ml.p4d.24xlarge

Qwen

Qwen3-4B-Instruct-2507

huggingface-reasoning-qwen3-4b-instruct-2507

Reasoning

apache-2.0

No

ml.g4dn.12xlarge, ml.g5.12xlarge, ml.g5.24xlarge, ml.g5.48xlarge, ml.g6.12xlarge, ml.g6.24xlarge, ml.g6.48xlarge, ml.g6e.12xlarge, ml.g6e.24xlarge, ml.p4d.24xlarge

Qwen

Qwen3-8B

huggingface-reasoning-qwen3-8b

Reasoning

apache-2.0

Yes

ml.g4dn.12xlarge, ml.g5.12xlarge, ml.g6.12xlarge, ml.g6e.12xlarge, ml.p4d.24xlarge, ml.p5.48xlarge

Qwen

Qwen3-Coder-30B-A3B-Instruct

huggingface-reasoning-qwen3-coder-30b-a3b-instruct

Reasoning

apache-2.0

No

ml.g6e.24xlarge

Qwen

Qwen3-Coder-Next

huggingface-reasoning-qwen3-coder-next

Reasoning

apache-2.0

No

ml.g7e.24xlarge, ml.g7e.48xlarge, ml.p4d.24xlarge, ml.p5en.48xlarge

Qwen

Qwen3-Next-80B-A3B-Instruct

huggingface-reasoning-qwen3-next-80b-a3b-instruct

Text Generation

apache-2.0

No

ml.g6e.48xlarge, ml.p4de.24xlarge

Qwen

Qwen3-VL-8B-Instruct

huggingface-vlm-qwen3-vl-8b-instruct

Text Generation

apache-2.0

No

ml.g6e.24xlarge, ml.g6e.48xlarge, ml.p4d.24xlarge

Qwen

Qwen3.5-0.8B

huggingface-vlm-qwen3-5-0-8b

Text Generation

apache-2.0

No

ml.g6.4xlarge, ml.g6e.12xlarge, ml.g6e.2xlarge, ml.g6e.4xlarge, ml.g6e.xlarge, ml.g7e.4xlarge

Qwen

Qwen3.5-27B

huggingface-vlm-qwen3-5-27b

Text Generation

apache-2.0

Yes

ml.g6.48xlarge

Qwen

Qwen3.5-27B-FP8

huggingface-vlm-qwen3-5-27b-fp8

Image-Text-to-Text

apache-2.0

No

ml.g6.24xlarge

Qwen

Qwen3.5-2B

huggingface-vlm-qwen3-5-2b

Text Generation

apache-2.0

No

ml.g6.12xlarge, ml.g6e.12xlarge, ml.g6e.2xlarge, ml.g6e.4xlarge, ml.g6e.8xlarge, ml.g6e.xlarge

Qwen

Qwen3.5-4B

huggingface-vlm-qwen3-5-4b

Text Generation

apache-2.0

Yes

ml.g6.12xlarge, ml.g6e.24xlarge, ml.g7e.12xlarge, ml.g7e.2xlarge, ml.g7e.4xlarge, ml.g7e.8xlarge

Qwen

Qwen3.5-9B

huggingface-vlm-qwen3-5-9b

Text Generation

apache-2.0

Yes

ml.g6.24xlarge, ml.g6.48xlarge

Qwen

Qwen3.6-27B

huggingface-vlm-qwen3-6-27b

Image-Text-to-Text

apache-2.0

No

ml.g7e.12xlarge, ml.g7e.24xlarge, ml.g7e.48xlarge, ml.p5en.48xlarge

Qwen

Qwen3.6-35B-A3B

huggingface-vlm-qwen3-6-35b-a3b

Text Generation

apache-2.0

No

ml.g6e.24xlarge, ml.g6e.48xlarge, ml.p5.48xlarge, ml.p5en.48xlarge

Qwen

QwQ 32B

huggingface-llm-qwq-32b

Text Generation

apache-2.0

No

ml.g5.24xlarge, ml.g5.48xlarge, ml.g6.24xlarge, ml.g6.48xlarge, ml.p4d.24xlarge

Recognai

Recognai Bert Base Spanish Wwm Cased Xnli

huggingface-zstc-recognai-bert-base-spanish-wwm-cased-xnli

Zero-Shot Classification

mit

No

ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge

Recognai

Recognai Zeroshot Selectra Medium

huggingface-zstc-recognai-zeroshot-selectra-medium

Zero-Shot Classification

apache-2.0

No

ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge

Rinna

Rinna Japanese GPT NeoX 3.6B Instruction PPO

huggingface-llm-rinna-3-6b-instruction-ppo-bf16

Text Generation

mit

No

ml.g5.2xlarge, ml.g5.4xlarge, ml.g5.xlarge

Salesforce

Salesforce SFR-Embedding-2_R

huggingface-textembedding-sfr-embedding-2-r

Text Embedding

CC-BY-NC-4.0

No

ml.g5.12xlarge, ml.g5.24xlarge, ml.g5.2xlarge, ml.g5.48xlarge, ml.g5.4xlarge, ml.g5.8xlarge, ml.g5.xlarge, ml.g6.12xlarge, ml.g6.24xlarge, ml.g6.2xlarge, ml.g6.48xlarge, ml.g6.4xlarge, ml.g6.8xlarge, ml.g6.xlarge, ml.p4d.24xlarge

Salesforce

Salesforce SFR-Embedding-Mistral

huggingface-textembedding-sfr-embedding-mistral

Text Embedding

CC-BY-NC-4.0

No

ml.g5.12xlarge, ml.g5.24xlarge, ml.g5.2xlarge, ml.g5.48xlarge, ml.g5.4xlarge, ml.g5.8xlarge, ml.g5.xlarge, ml.g6.12xlarge, ml.g6.24xlarge, ml.g6.2xlarge, ml.g6.48xlarge, ml.g6.4xlarge, ml.g6.8xlarge, ml.g6.xlarge, ml.p4d.24xlarge

Sentence-Transformers

All MiniLM L6 v2

huggingface-textembedding-all-MiniLM-L6-v2

Text Embedding

Apache-2.0

No

ml.g5.12xlarge, ml.g5.24xlarge, ml.g5.2xlarge, ml.g5.48xlarge, ml.g5.4xlarge, ml.g5.8xlarge, ml.g5.xlarge, ml.g6.12xlarge, ml.g6.24xlarge, ml.g6.2xlarge, ml.g6.48xlarge, ml.g6.4xlarge, ml.g6.8xlarge, ml.g6.xlarge, ml.p4d.24xlarge

Sentence-Transformers

All MiniLM L6 v2

huggingface-sentencesimilarity-all-MiniLM-L6-v2

Sentence Similarity

apache-2.0

Yes

ml.c6i.xlarge, ml.g4dn.2xlarge, ml.g5.2xlarge, ml.g5.4xlarge, ml.g5.8xlarge, ml.p3.2xlarge

Sentence-Transformers

Paraphrase Multilingual MiniLM L12 v2

huggingface-textembedding-paraphrase-multilingual-MiniLM-L12-v2

Text Embedding

Apache-2.0

No

ml.g5.xlarge, ml.g6.xlarge, ml.g6e.xlarge

Shenzhi-Wang

Llama3 8B Chinese Chat

huggingface-llm-shenzhi-wang-llama3-8B-chinese-chat

Text Generation

llama3

No

ml.g5.16xlarge, ml.g5.2xlarge, ml.g5.4xlarge, ml.g5.8xlarge

Snowflake

Snowflake Arctic Instruct vLLM

huggingface-llm-snowflake-arctic-instruct-vllm

Text Generation

apache-2.0

No

ml.p5.48xlarge

Speakleash

Bielik-11B-v3.0-Instruct

huggingface-llm-bielik-11b-v3-0-instruct

Text Generation

apache-2.0

No

ml.g5.12xlarge, ml.g6.12xlarge, ml.g6e.12xlarge, ml.g7e.2xlarge, ml.p4d.24xlarge, ml.p5en.48xlarge

Sshleifer

Distilbart CNN 12-6

huggingface-summarization-distilbart-cnn-12-6

Text Summarization

apache-2.0

No

ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge

Sshleifer

Distilbart CNN 6-6

huggingface-summarization-distilbart-cnn-6-6

Text Summarization

apache-2.0

No

ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge

Sshleifer

Distilbart XSUM 1-1

huggingface-summarization-distilbart-xsum-1-1

Text Summarization

apache-2.0

No

ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge

Stabilityai

Japanese StableLM Instruct Alpha 7B v2

model-textgenerationjp-japanese-stablelm-instruct-alpha-7b-v2

Text Generation

Apache-2.0

No

ml.g5.12xlarge, ml.g5.16xlarge, ml.g5.24xlarge, ml.g5.2xlarge, ml.g5.4xlarge, ml.g5.8xlarge, ml.p4d.24xlarge, ml.p4de.24xlarge

Stabilityai

Stable Diffusion 2

model-txt2img-stabilityai-stable-diffusion-v2

Text-to-Image

creativeml-openrail++-m

No

ml.g4dn.4xlarge, ml.g4dn.8xlarge, ml.g5.2xlarge, ml.g5.4xlarge, ml.g5.8xlarge, ml.p3.2xlarge

Stabilityai

Stable Diffusion 2 Depth FP16

model-depth2img-stable-diffusion-2-depth-fp16

Image-to-Image

creativeml-openrail++-m

No

ml.g4dn.16xlarge, ml.g4dn.2xlarge, ml.g4dn.4xlarge, ml.g4dn.8xlarge, ml.g4dn.xlarge, ml.g5.16xlarge, ml.g5.2xlarge, ml.g5.4xlarge, ml.g5.8xlarge, ml.g5.xlarge, ml.p3.2xlarge

Stabilityai

Stable Diffusion 2 FP16

model-txt2img-stabilityai-stable-diffusion-v2-fp16

Text-to-Image

creativeml-openrail++-m

No

ml.g4dn.4xlarge, ml.g4dn.8xlarge, ml.g5.2xlarge, ml.g5.4xlarge, ml.g5.8xlarge, ml.p3.2xlarge

Stabilityai

Stable Diffusion 2 Inpainting

model-inpainting-stabilityai-stable-diffusion-2-inpainting

Image-to-Image

creativeml-openrail++-m

No

ml.g4dn.4xlarge, ml.g4dn.8xlarge, ml.g5.2xlarge, ml.g5.4xlarge, ml.g5.8xlarge, ml.p3.2xlarge

Stabilityai

Stable Diffusion 2 Inpainting FP16

model-inpainting-stabilityai-stable-diffusion2-inpainting-fp16

Image-to-Image

creativeml-openrail++-m

No

ml.g4dn.xlarge, ml.g5.2xlarge, ml.p3.2xlarge

Stabilityai

Stable Diffusion 2 Inpainting FP16

model-inpainting-stabilityai-stable-diffusion-2-inpainting-fp16

Image-to-Image

creativeml-openrail++-m

No

ml.g4dn.xlarge, ml.g5.2xlarge, ml.p3.2xlarge

Stabilityai

Stable Diffusion 2.1

model-txt2img-stabilityai-stable-diffusion-v2-1-base

Text-to-Image

creativeml-openrail++-m

Yes

ml.g4dn.4xlarge, ml.g4dn.8xlarge, ml.g5.2xlarge, ml.g5.4xlarge, ml.g5.8xlarge, ml.p3.2xlarge

Stabilityai

Stable Diffusion 2.1 Neuron

huggingface-txt2imgneuron-stabilityai-stable-diffusion-v2-1

Text-to-Image

creativeml-openrail++-m

No

ml.inf2.24xlarge, ml.inf2.48xlarge, ml.inf2.8xlarge, ml.inf2.xlarge

Stabilityai

Stable Diffusion x4 upscaler FP16

model-upscaling-stabilityai-stable-diffusion-x4-upscaler-fp16

Image-to-Image

openrail++

No

ml.g4dn.4xlarge, ml.g4dn.8xlarge, ml.g5.2xlarge, ml.g5.4xlarge, ml.g5.8xlarge, ml.p3.2xlarge

Stabilityai

Stable Diffusion XL 1.0 (open-source)

model-imagegeneration-stabilityai-stable-diffusion-xl-base-1-0

Text-to-Image

openrail++

No

ml.g5.16xlarge, ml.g5.4xlarge, ml.g5.8xlarge

Stabilityai

Stable Diffusion XL Base 1.0 Neuron

huggingface-txt2imgneuron-stabilityai-stable-diffusion-xlbase1

Text-to-Image

openrail++

No

ml.inf2.24xlarge, ml.inf2.48xlarge, ml.inf2.8xlarge, ml.inf2.xlarge

Swiss-Ai

Apertus 70B 2509

huggingface-llm-apertus-70b-2509

Text Generation

apache-2.0

No

ml.g5.48xlarge, ml.g6.48xlarge, ml.g6e.48xlarge, ml.p4d.24xlarge, ml.p4de.24xlarge, ml.p5.48xlarge, ml.p5en.48xlarge

Swiss-Ai

Apertus 70B Instruct 2509

huggingface-llm-apertus-70b-instruct-2509

Text Generation

apache-2.0

No

ml.g5.48xlarge, ml.g6.48xlarge, ml.g6e.48xlarge, ml.p4d.24xlarge, ml.p4de.24xlarge, ml.p5.48xlarge, ml.p5en.48xlarge

Swiss-Ai

Apertus 8B Instruct 2509

huggingface-llm-apertus-8b-instruct-2509

Text Generation

apache-2.0

No

ml.g5.12xlarge, ml.g5.16xlarge, ml.g5.24xlarge, ml.g5.2xlarge, ml.g5.48xlarge, ml.g5.4xlarge, ml.g5.8xlarge, ml.g5.xlarge, ml.g6.12xlarge, ml.g6.16xlarge, ml.g6.24xlarge, ml.g6.2xlarge, ml.g6.48xlarge, ml.g6.4xlarge, ml.g6.8xlarge, ml.g6.xlarge, ml.g6e.12xlarge, ml.g6e.16xlarge, ml.g6e.24xlarge, ml.g6e.2xlarge, ml.g6e.48xlarge, ml.g6e.4xlarge, ml.g6e.8xlarge, ml.g6e.xlarge, ml.p4d.24xlarge, ml.p4de.24xlarge, ml.p5.48xlarge, ml.p5en.48xlarge

Thebloke

Mistral 7B OpenOrca AWQ

huggingface-llm-thebloke-mistral-7b-openorca-awq

Text Generation

apache-2.0

No

ml.g5.12xlarge, ml.g5.24xlarge, ml.g5.2xlarge, ml.g5.48xlarge

Thebloke

Mistral 7B OpenOrca GPTQ

huggingface-llm-mistral-7b-openorca-gptq

Text Generation

apache-2.0

No

ml.g5.12xlarge, ml.g5.16xlarge, ml.g5.24xlarge, ml.g5.2xlarge, ml.g5.48xlarge, ml.g5.4xlarge, ml.g5.8xlarge, ml.g5.xlarge, ml.p4d.24xlarge, ml.p4de.24xlarge, ml.p5.48xlarge

Thebloke

Mixtral 8x7B Instruct GPTQ

huggingface-llm-mixtral-8x7b-instruct-gptq

Text Generation

apache-2.0

No

ml.g5.12xlarge, ml.g5.24xlarge, ml.g5.48xlarge, ml.p4d.24xlarge

Thenlper

GTE Large

huggingface-sentencesimilarity-gte-large

Sentence Similarity

mit

Yes

ml.c6i.xlarge, ml.g4dn.2xlarge, ml.g5.2xlarge, ml.g5.4xlarge, ml.g5.8xlarge, ml.p3.2xlarge

Thenlper

GTE Small

huggingface-sentencesimilarity-gte-small

Sentence Similarity

mit

Yes

ml.c6i.xlarge, ml.g4dn.2xlarge, ml.g5.2xlarge, ml.g5.4xlarge, ml.g5.8xlarge, ml.p3.2xlarge

Tiiuae

Falcon 180B BF16

huggingface-llm-falcon-180b-bf16

Text Generation

unknown

No

ml.p4de.24xlarge, ml.p5.48xlarge

Tiiuae

Falcon 180B Chat BF16

huggingface-llm-falcon-180b-chat-bf16

Text Generation

unknown

No

ml.p4de.24xlarge, ml.p5.48xlarge

Tiiuae

Falcon 40B BF16

huggingface-llm-falcon-40b-bf16

Text Generation

apache-2.0

Yes

ml.g5.12xlarge, ml.g5.24xlarge, ml.g5.48xlarge, ml.p4d.24xlarge

Tiiuae

Falcon 40B Instruct BF16

huggingface-llm-falcon-40b-instruct-bf16

Text Generation

apache-2.0

Yes

ml.g5.12xlarge, ml.g5.24xlarge, ml.g5.48xlarge, ml.p4d.24xlarge

Tiiuae

Falcon 7B BF16

huggingface-llm-falcon-7b-bf16

Text Generation

apache-2.0

Yes

ml.g5.16xlarge, ml.g5.2xlarge, ml.g5.4xlarge, ml.g5.8xlarge

Tiiuae

Falcon 7B Instruct BF16

huggingface-llm-falcon-7b-instruct-bf16

Text Generation

apache-2.0

Yes

ml.g5.16xlarge, ml.g5.2xlarge, ml.g5.4xlarge, ml.g5.8xlarge

Tiiuae

Falcon RW 1B

huggingface-llm-tiiuae-falcon-rw-1b

Text Generation

apache-2.0

No

ml.g5.2xlarge

Tiiuae

Falcon-H1-0.5B-Instruct

huggingface-llm-falcon-h1-0-5b-instruct

Text Generation

other

No

ml.g4dn.12xlarge, ml.g4dn.16xlarge, ml.g4dn.2xlarge, ml.g4dn.4xlarge, ml.g4dn.8xlarge, ml.g4dn.xlarge, ml.g5.12xlarge, ml.g5.16xlarge, ml.g5.24xlarge, ml.g5.2xlarge, ml.g5.48xlarge, ml.g5.4xlarge, ml.g5.8xlarge, ml.g5.xlarge, ml.g6.12xlarge, ml.g6.16xlarge, ml.g6.24xlarge, ml.g6.2xlarge, ml.g6.48xlarge, ml.g6.4xlarge, ml.g6.8xlarge, ml.g6.xlarge, ml.g6e.12xlarge, ml.g6e.16xlarge, ml.g6e.24xlarge, ml.g6e.2xlarge, ml.g6e.4xlarge, ml.g6e.8xlarge, ml.g6e.xlarge, ml.p3.2xlarge, ml.p4d.24xlarge

Tiiuae

Falcon-H1-1.5B-Deep-Instruct

huggingface-llm-falcon-h1-1-5b-deep-instruct

Text Generation

other

No

ml.g5.16xlarge, ml.g5.4xlarge, ml.g5.8xlarge, ml.g6.16xlarge, ml.g6.2xlarge, ml.g6.4xlarge, ml.g6.8xlarge, ml.g6.xlarge, ml.g6e.16xlarge, ml.g6e.2xlarge, ml.g6e.4xlarge, ml.g6e.8xlarge, ml.g6e.xlarge

Tiiuae

Falcon-H1-1.5B-Instruct

huggingface-llm-falcon-h1-1-5b-instruct

Text Generation

other

No

ml.g6.12xlarge, ml.g6.16xlarge, ml.g6.24xlarge, ml.g6.2xlarge, ml.g6.48xlarge, ml.g6.4xlarge, ml.g6.8xlarge, ml.g6.xlarge, ml.g6e.12xlarge, ml.g6e.16xlarge, ml.g6e.24xlarge, ml.g6e.2xlarge, ml.g6e.48xlarge, ml.g6e.4xlarge, ml.g6e.8xlarge, ml.g6e.xlarge

Tiiuae

Falcon-H1-34B-Instruct

huggingface-llm-falcon-h1-34b-instruct

Text Generation

other

No

ml.g6e.12xlarge, ml.g6e.24xlarge, ml.g6e.48xlarge

Tiiuae

Falcon-H1-3B-Instruct

huggingface-llm-falcon-h1-3b-instruct

Text Generation

other

No

ml.g6e.16xlarge, ml.g6e.2xlarge, ml.g6e.4xlarge, ml.g6e.8xlarge, ml.g6e.xlarge

Tiiuae

Falcon-H1-7B-Instruct

huggingface-llm-falcon-h1-7b-instruct

Text Generation

other

No

ml.g6e.12xlarge, ml.g6e.24xlarge

Tiiuae

Falcon2-11B

huggingface-llm-falcon2-11b

Text Generation

unknown

No

ml.g5.12xlarge, ml.g5.24xlarge, ml.g5.48xlarge, ml.p4d.24xlarge

Tiiuae

Falcon3-10B-Base

huggingface-llm-falcon-3-10B-base

Text Generation

other

No

ml.g5.12xlarge, ml.g5.24xlarge, ml.g5.48xlarge, ml.g6.12xlarge, ml.g6.24xlarge, ml.g6.48xlarge, ml.p4d.24xlarge, ml.p5.48xlarge

Tiiuae

Falcon3-10B-Instruct

huggingface-llm-falcon-3-10B-Instruct

Text Generation

other

No

ml.g4dn.12xlarge, ml.g5.12xlarge, ml.g5.24xlarge, ml.g6.12xlarge, ml.g6.24xlarge, ml.g6e.12xlarge, ml.g6e.16xlarge, ml.g6e.24xlarge, ml.g6e.2xlarge, ml.g6e.4xlarge, ml.g6e.8xlarge, ml.g6e.xlarge

Tiiuae

Falcon3-1B-Instruct

huggingface-llm-falcon-3-1B-Instruct

Text Generation

other

No

ml.g4dn.xlarge, ml.g5.xlarge, ml.g6.xlarge, ml.g6e.xlarge, ml.p3.2xlarge, ml.p4d.24xlarge

Tiiuae

Falcon3-3B-Base

huggingface-llm-falcon-3-3B-base

Text Generation

other

No

ml.g5.12xlarge, ml.g5.16xlarge, ml.g5.24xlarge, ml.g5.2xlarge, ml.g5.48xlarge, ml.g5.4xlarge, ml.g5.8xlarge, ml.g5.xlarge, ml.g6.12xlarge, ml.g6.16xlarge, ml.g6.24xlarge, ml.g6.2xlarge, ml.g6.48xlarge, ml.g6.4xlarge, ml.g6.8xlarge, ml.g6.xlarge, ml.p4d.24xlarge, ml.p5.48xlarge

Tiiuae

Falcon3-3B-Instruct

huggingface-llm-falcon-3-3B-Instruct

Text Generation

other

No

ml.g4dn.12xlarge, ml.g4dn.16xlarge, ml.g4dn.4xlarge, ml.g5.12xlarge, ml.g5.16xlarge, ml.g5.24xlarge, ml.g5.2xlarge, ml.g5.4xlarge, ml.g5.xlarge, ml.g6.12xlarge, ml.g6.16xlarge, ml.g6.24xlarge, ml.g6.2xlarge, ml.g6.4xlarge, ml.g6.8xlarge, ml.g6e.12xlarge, ml.g6e.16xlarge, ml.g6e.24xlarge, ml.g6e.2xlarge, ml.g6e.8xlarge, ml.g6e.xlarge, ml.p3.2xlarge

Tiiuae

Falcon3-7B-Base

huggingface-llm-falcon-3-7B-base

Text Generation

other

No

ml.g5.12xlarge, ml.g5.24xlarge, ml.g5.48xlarge, ml.g6.12xlarge, ml.g6.24xlarge, ml.g6.48xlarge, ml.p4d.24xlarge, ml.p5.48xlarge

Tiiuae

Falcon3-7B-Instruct

huggingface-llm-falcon-3-7B-Instruct

Text Generation

other

No

ml.g4dn.12xlarge, ml.g5.12xlarge, ml.g5.24xlarge, ml.g6.12xlarge, ml.g6.24xlarge, ml.g6e.12xlarge, ml.g6e.24xlarge, ml.g6e.2xlarge, ml.g6e.4xlarge, ml.g6e.8xlarge, ml.g6e.xlarge

Tinyllama

Tiny Llama 1.1B

huggingface-llm-tinyllama-1-1b-intermediate-step-1431k-3

Text Generation

apache-2.0

No

ml.g5.12xlarge, ml.g5.24xlarge, ml.g5.2xlarge

Tinyllama

Tiny Llama 1.1B Chat V0.6

huggingface-llm-tinyllama-tinyllama-1-1b-chat-v0-6

Text Generation

apache-2.0

No

ml.g5.12xlarge, ml.g5.24xlarge, ml.g5.2xlarge

Tinyllama

Tiny Llama 1.1B Chat V1

huggingface-llm-tinyllama-tinyllama-1-1b-chat-v1-0

Text Generation

apache-2.0

No

ml.g5.12xlarge, ml.g5.24xlarge, ml.g5.2xlarge

Togethercomputer

RedPajama INCITE Chat 3B V1

huggingface-textgeneration1-redpajama-incite-chat-3B-v1-fp16

Text Generation

apache-2.0

Yes

ml.g5.16xlarge, ml.g5.2xlarge, ml.g5.4xlarge, ml.g5.8xlarge

Togethercomputer

RedPajama INCITE Chat 7B V1

huggingface-textgeneration1-redpajama-incite-chat-7B-v1-fp16

Text Generation

apache-2.0

Yes

ml.g5.16xlarge, ml.g5.2xlarge, ml.g5.4xlarge, ml.g5.8xlarge

Togethercomputer

RedPajama INCITE Instruct 3B V1

huggingface-textgeneration1-redpajama-incite-instruct-3Bv1fp16

Text Generation

apache-2.0

Yes

ml.g5.16xlarge, ml.g5.2xlarge, ml.g5.4xlarge, ml.g5.8xlarge

Togethercomputer

RedPajama INCITE Instruct 7B V1

huggingface-textgeneration1-redpajama-incite-instruct-7B1fp16

Text Generation

apache-2.0

Yes

ml.g5.16xlarge, ml.g5.2xlarge, ml.g5.4xlarge, ml.g5.8xlarge

Tokyotech-Llm

Swallow-7b-instruct-hf

huggingface-llm-swallow-7b-instruct-hf

Text Generation

llama2

No

ml.g4dn.12xlarge, ml.g5.12xlarge, ml.g5.16xlarge, ml.g5.24xlarge, ml.g5.2xlarge, ml.g5.48xlarge, ml.g5.4xlarge, ml.g5.8xlarge, ml.g5.xlarge, ml.g6.12xlarge, ml.g6.16xlarge, ml.g6.24xlarge, ml.g6.2xlarge, ml.g6.48xlarge, ml.g6.4xlarge, ml.g6.8xlarge, ml.g6.xlarge, ml.g6e.12xlarge, ml.g6e.16xlarge, ml.g6e.4xlarge, ml.g6e.8xlarge, ml.p3.16xlarge, ml.p3.8xlarge

Volrath50

Volrath50 Fantasy Card Diffusion

huggingface-txt2img-volrath50-fantasy-card-diffusion

Text-to-Image

creativeml-openrail-m

No

ml.g4dn.2xlarge, ml.g4dn.xlarge, ml.p3.2xlarge

Writer

Writer Palmyra Small

huggingface-llm-writer-palmyra-small

Text Generation

apache-2.0

No

ml.g5.2xlarge
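
Each open-weight entry above consists of seven values: provider, model name, model ID, task, license, fine-tunable flag, and a comma-separated list of supported inference instance types. The following is a minimal sketch, not an official utility, of how such rows could be grouped into records and filtered by instance type. The `ModelRow` class, the `parse_rows` and `find_models` helpers, and the field names are hypothetical and chosen only for illustration; the sample values are copied from three rows of the table.

```python
from dataclasses import dataclass

# Assumption: every open-weight row carries exactly these seven values, in this order.
FIELDS_PER_ROW = 7


@dataclass
class ModelRow:
    provider: str
    name: str
    model_id: str
    task: str
    license: str
    fine_tunable: bool
    instance_types: list[str]


def parse_rows(cells: list[str]) -> list[ModelRow]:
    """Group a flat list of table cells into ModelRow records."""
    if len(cells) % FIELDS_PER_ROW != 0:
        raise ValueError("cell count is not a multiple of the row width")
    rows = []
    for start in range(0, len(cells), FIELDS_PER_ROW):
        provider, name, model_id, task, lic, tunable, instances = cells[
            start:start + FIELDS_PER_ROW
        ]
        rows.append(
            ModelRow(
                provider=provider,
                name=name,
                model_id=model_id,
                task=task,
                license=lic,
                fine_tunable=(tunable == "Yes"),
                instance_types=[t.strip() for t in instances.split(",")],
            )
        )
    return rows


def find_models(rows, instance_type, fine_tunable=None):
    """Return rows that list instance_type; optionally match the fine-tunable flag."""
    matches = [r for r in rows if instance_type in r.instance_types]
    if fine_tunable is not None:
        matches = [r for r in matches if r.fine_tunable == fine_tunable]
    return matches


# Sample cells copied from three rows of the table above.
SAMPLE_CELLS = [
    "Qwen", "Qwen3-30B-A3B-Thinking-2507",
    "huggingface-reasoning-qwen3-30b-a3b-thinking-2507",
    "Reasoning", "apache-2.0", "No", "ml.g6e.12xlarge",
    "Qwen", "Qwen3-32B", "huggingface-reasoning-qwen3-32b",
    "Reasoning", "apache-2.0", "Yes",
    "ml.g5.48xlarge, ml.g6.48xlarge, ml.g6e.12xlarge, ml.p4d.24xlarge, "
    "ml.p5.48xlarge, ml.p5en.48xlarge",
    "Writer", "Writer Palmyra Small", "huggingface-llm-writer-palmyra-small",
    "Text Generation", "apache-2.0", "No", "ml.g5.2xlarge",
]

for row in find_models(parse_rows(SAMPLE_CELLS), "ml.g6e.12xlarge"):
    print(row.model_id, "fine-tunable" if row.fine_tunable else "inference only")
```

Because the table is a snapshot, a filter like this is only as current as the rows it is given; the `HubContentDocument` metadata returned by `describe_hub_content` remains the authoritative source for supported instance types.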

Proprietary Models (121)

Proprietary foundation models are offered by third-party providers through Amazon SageMaker JumpStart. To use a proprietary foundation model, you must first subscribe to the model in AWS Marketplace. After subscribing, you can locate and deploy the foundation model in Amazon SageMaker Studio.

Model Name

Model ID

Task

Fine-tunable

Supported Inference Instance Types

A.X 4.0 Light

sk-telecom-ax4-light

Text Generation

No

ml.g5.2xlarge

AI21 Contextual Answers

ai21-contextual-answers

Text Generation

No

ml.g5.12xlarge, ml.g5.48xlarge, ml.p4d.24xlarge, ml.g4dn.12xlarge

AI21 Jurassic-2 Light

ai21-jurassic-2-light

Text Generation

No

ml.g4dn.12xlarge, ml.g5.12xlarge, ml.g5.48xlarge, ml.p4d.24xlarge

AI21 Jurassic-2 Mid

ai21-jurassic-2-grande-instruct

Text Generation

No

ml.g5.12xlarge, ml.g5.48xlarge, ml.p4d.24xlarge, ml.g4dn.12xlarge

AI21 Jurassic-2 Ultra

ai21-jurassic-2-jumbo-instruct

Text Generation

No

ml.g5.48xlarge, ml.p4d.24xlarge, ml.p4de.24xlarge

AI21 Paraphrase

ai21-paraphrase

Text Generation

No

ml.g4dn.2xlarge

AI21 Summarize

ai21-summarization

Text Generation

No

ml.g4dn.4xlarge, ml.g4dn.12xlarge, ml.g5.xlarge

Arcee Lite

arcee-lite

Text Generation

No

ml.g5.xlarge, ml.g5.2xlarge, ml.g5.4xlarge, ml.g5.8xlarge, ml.g5.16xlarge, ml.g6.xlarge, ml.g6.2xlarge, ml.g6.4xlarge, ml.g6.8xlarge, ml.g6.16xlarge

Arcee SuperNova

arcee-supernova

Text Generation

No

ml.p4d.24xlarge, ml.p4de.24xlarge, ml.p5.48xlarge

Arcee Virtuoso Small

arcee-virtuoso-small

Text Generation

No

ml.g5.12xlarge, ml.g5.24xlarge, ml.g6.12xlarge, ml.g6.24xlarge, ml.g6e.12xlarge, ml.g6e.24xlarge, ml.g5.48xlarge, ml.g6.48xlarge, ml.g6e.48xlarge, ml.p4d.24xlarge, ml.p4de.24xlarge, ml.p5.48xlarge

Bria 2.2HD Commercial Text-to-image

bria-ai-2-2-hd-commercial

Text-to-Image

No

ml.g5.2xlarge, ml.g5.12xlarge, ml.g5.48xlarge, ml.p4d.24xlarge, ml.p4de.24xlarge

Bria 2.3 Commercial Text-to-image

bria-ai-2-3-commercial

Text-to-Image

No

ml.g5.2xlarge, ml.g5.12xlarge, ml.g5.48xlarge, ml.p4d.24xlarge, ml.p4de.24xlarge

Bria 2.3Fast Commercial Text-to-image

bria-ai-2-3-fast-commercial

Text-to-Image

No

ml.g5.2xlarge, ml.g5.12xlarge, ml.g5.48xlarge, ml.p4d.24xlarge, ml.p4de.24xlarge

Cohere Command R (A100)

cohere-command-r-a100

Text Generation

No

ml.p4de.24xlarge

Cohere Command R (H100)

cohere-command-r-h100

Text Generation

No

ml.p5.48xlarge

Cohere Command R+ (A100)

cohere-command-r-plus-a100

Text Generation

No

ml.p4de.24xlarge

Cohere Command R+ (H100)

cohere-command-r-plus-h100

Text Generation

No

ml.p5.48xlarge

Cohere Embed 4

cohere-embed-v4-0

Text Embedding

No

ml.g5.xlarge, ml.g5.2xlarge, ml.g6.xlarge, ml.g6.2xlarge

Cohere Embed Light Model v3 - English

cohere-embed-light-english

Text Embedding

No

ml.p3.2xlarge, ml.g4dn.xlarge, ml.g4dn.2xlarge, ml.g5.xlarge, ml.g5.2xlarge

Cohere Embed Light v3 - Multilingual

cohere-embed-light-multilingual

Text Embedding

No

ml.p3.2xlarge, ml.g4dn.xlarge, ml.g4dn.2xlarge, ml.g5.xlarge, ml.g5.2xlarge

Cohere Embed Model 3 - Multilingual

cohere-embed-multilingual

Text Embedding

No

ml.p3.2xlarge, ml.g4dn.xlarge, ml.g4dn.2xlarge, ml.g5.xlarge, ml.g5.2xlarge

Cohere Embed Model v3 - English

cohere-embed-english

Text Embedding

No

ml.p3.2xlarge, ml.g4dn.xlarge, ml.g4dn.2xlarge, ml.g5.xlarge, ml.g5.2xlarge

Cohere Rerank 2 Model - English

cohere-rerank-english-v2

ReRank

No

ml.p3.2xlarge, ml.g4dn.xlarge, ml.g4dn.2xlarge, ml.g5.xlarge, ml.g5.2xlarge

Cohere Rerank 2 Model - Multilingual

cohere-rerank-multilingual-v2

ReRank

No

ml.p3.2xlarge, ml.g4dn.xlarge, ml.g4dn.2xlarge, ml.g5.xlarge, ml.g5.2xlarge

Cohere Rerank 3 Model - English

cohere-rerank-v3-english

ReRank

No

ml.g5.xlarge, ml.g5.2xlarge

Cohere Rerank 3 Model - Multilingual

cohere-rerank-v3-multilingual

ReRank

No

ml.g5.xlarge, ml.g5.2xlarge

Cohere Rerank 3 Nimble Model - English

cohere-rerank-nimble-english

ReRank

No

ml.g5.xlarge, ml.g5.2xlarge

Cohere Rerank 3 Nimble Model - Multi

cohere-rerank-nimble-multi

ReRank

No

ml.g5.xlarge, ml.g5.2xlarge

Cohere Rerank v3.5

cohere-rerank-v3-5

Text Embedding

No

ml.g6.xlarge, ml.g6.2xlarge, ml.g5.xlarge, ml.g5.2xlarge

Cohere Rerank v4.0 Fast

cohere-rerank-v4-0-fast

Text Embedding

No

ml.g5.xlarge, ml.g5.2xlarge, ml.p5.4xlarge

Cohere Rerank v4.0 Pro

cohere-rerank-v4-0-pro

Text Embedding

No

ml.g5.xlarge, ml.g5.2xlarge, ml.p5.4xlarge

Command A (A100)

cohere-command-a-a100

Text Generation

No

ml.p4de.24xlarge

Command A (H100)

cohere-command-a-h100

Text Generation

No

ml.p5.48xlarge

Command R 08-2024 (H100)

cohere-command-r-08-2024-h100

Text Generation

No

ml.p5.48xlarge

Command R+ 08-2024 (H100)

cohere-command-r-plus-08-2024-h100

Text Generation

No

ml.p5.48xlarge

Document OCR

upstage-document-ocr

Optical Character Recognition

No

ml.g5.2xlarge

Document Parse

upstage-document-layout-analysis

Optical Character Recognition

No

ml.g5.2xlarge, ml.g6.2xlarge, ml.p3.2xlarge

Embed

upstage-solar-embedding-large

Text Embedding

No

ml.g5.2xlarge

ESM3-open

evolutionary-scale-esm3

Multimodal Generation

No

ml.g5.4xlarge, ml.g5.8xlarge

Evo2-NIM

nvidia-evo2-nim

Text Generation

No

ml.g6e.12xlarge, ml.g6e.24xlarge, ml.g6e.48xlarge, ml.p4d.24xlarge, ml.p4de.24xlarge, ml.p5.48xlarge, ml.p5en.48xlarge

EXAONE Atelier - Image to Text

lgresearch-exaone

Image2Text Generation

No

ml.g5.xlarge, ml.g5.12xlarge, ml.g5.48xlarge, ml.p4d.24xlarge

Gretel Navigator Tabular

gretel-navigator-tabular

Text Generation

No

ml.g5.xlarge, ml.g5.2xlarge

H-Optimus-0

bioptimus-h-optimus-0

Feature Extraction

No

ml.g5.xlarge

IBM Granite 20B Code Instruct - 8K

ibm-granite-20b-code-instruct-8k

Text Generation

No

ml.g5.12xlarge, ml.g5.24xlarge, ml.g5.48xlarge

IBM Granite 3.0 2B Instruct

granite-3-0-2b-instruct

Text Generation

No

ml.p4d.24xlarge, ml.p5.48xlarge

IBM Granite 3.0 8B Instruct

granite-3-0-8b-instruct

Text Generation

No

ml.p4d.24xlarge, ml.p5.48xlarge

IBM Granite 3.2 Instruct 2B

ibm-granite-3-2-2b-instruct

Text Generation

No

ml.g6e.2xlarge, ml.g6e.4xlarge, ml.g6e.12xlarge, ml.g6e.16xlarge, ml.g6e.24xlarge, ml.g6e.48xlarge, ml.p4d.24xlarge, ml.p5.48xlarge, ml.g6.12xlarge, ml.g6.16xlarge, ml.g6.24xlarge, ml.g6.48xlarge

IBM Granite 3.2 Instruct 8B

ibm-granite-3-2-8b-instruct

Text Generation

No

ml.g6.12xlarge, ml.g6.16xlarge, ml.g6.24xlarge, ml.g6.48xlarge, ml.g6e.2xlarge, ml.g6e.4xlarge, ml.g6e.12xlarge, ml.g6e.16xlarge, ml.g6e.24xlarge, ml.g6e.48xlarge, ml.p4d.24xlarge, ml.p5.48xlarge

IBM Granite 34B Code Instruct - 8K

ibm-granite-34b-code-instruct-8k

Text Generation

No

ml.g5.12xlarge, ml.g5.24xlarge, ml.g5.48xlarge

IBM Granite 3B Code Instruct - 128K

ibm-granite-3b-code-instruct-128k

Text Generation

No

ml.g5.4xlarge, ml.g5.8xlarge, ml.g5.16xlarge, ml.g5.12xlarge, ml.g5.24xlarge, ml.g5.48xlarge

IBM Granite 4.0 h-micro

ibm-granite-4-0-h-micro

Text Generation

No

ml.g6e.2xlarge, ml.g6e.4xlarge, ml.g6e.12xlarge, ml.g6e.16xlarge, ml.g6e.24xlarge, ml.g6e.48xlarge, ml.p4d.24xlarge, ml.p5.48xlarge

IBM Granite 4.0 h-small

ibm-granite-4-0-h-small

Text Generation

No

ml.g6e.2xlarge, ml.g6e.4xlarge, ml.g6e.12xlarge, ml.g6e.16xlarge, ml.g6e.24xlarge, ml.g6e.48xlarge, ml.p4d.24xlarge, ml.p5.48xlarge

IBM Granite 4.0 h-tiny

ibm-granite-4-0-h-tiny

Text Generation

No

ml.g6e.2xlarge, ml.g6e.4xlarge, ml.g6e.12xlarge, ml.g6e.16xlarge, ml.g6e.24xlarge, ml.g6e.48xlarge, ml.p4d.24xlarge, ml.p5.48xlarge

IBM Granite 8B Code Instruct - 128K

ibm-granite-8b-code-instruct-128k

Text Generation

No

ml.g5.4xlarge, ml.g5.8xlarge, ml.g5.16xlarge, ml.g5.12xlarge, ml.g5.24xlarge, ml.g5.48xlarge

ixi-GEN-Fin-7.8B

lg-ixi-gen

Text Generation

No

ml.g5.4xlarge

JetBrains AI Mellum All

jbai-mellum-all

Text2Text Generation

No

ml.g5.2xlarge, ml.g6.2xlarge, ml.g6e.xlarge, ml.p4d.24xlarge, ml.p4de.24xlarge, ml.p5.48xlarge

JetBrains AI Mellum Kotlin

jbai-mellum-kotlin

Text2Text Generation

No

ml.g5.2xlarge, ml.g6.2xlarge, ml.g6e.xlarge, ml.p4d.24xlarge, ml.p4de.24xlarge, ml.p5.48xlarge

JetBrains AI Mellum Python

jbai-mellum-python

Text2Text Generation

No

ml.g5.2xlarge, ml.g6.2xlarge, ml.g6e.xlarge, ml.p4d.24xlarge, ml.p4de.24xlarge, ml.p5.48xlarge

Jina Embeddings v2 Base - en

jinaai-embeddings-v2-base-en

Text Embedding

No

ml.p2.xlarge, ml.p2.8xlarge, ml.p2.16xlarge, ml.p3.2xlarge, ml.p3.8xlarge, ml.p3.16xlarge, ml.g4dn.xlarge, ml.g4dn.2xlarge, ml.g4dn.4xlarge, ml.g4dn.8xlarge, ml.g4dn.12xlarge, ml.g4dn.16xlarge, ml.g5.xlarge, ml.g5.2xlarge, ml.g5.4xlarge, ml.g5.8xlarge, ml.g5.12xlarge, ml.g5.16xlarge, ml.g5.24xlarge, ml.g5.48xlarge

KARAKURI LM 8x7b instruct

karakuri-lm-8x7b-instruct

Text Generation

No

ml.g5.48xlarge, ml.g6.48xlarge

LightOn Lyra-Fr 10B

lighton-lyra-fr

Text Generation

No

ml.p4d.24xlarge

LightOn Mini-instruct 40B

lighton-mini-instruct40b

Text Generation

No

ml.p4d.24xlarge

Liquid LFM 40B (A100)

liquid-lfm-40b-a100

Text Generation

No

ml.p4d.24xlarge

Liquid LFM 40B (H100)

liquid-lfm-40b-h100

Text Generation

No

ml.p5.48xlarge

Liquid LFM 40B (L40S)

liquid-lfm-40b-l40s

Text Generation

No

ml.g6e.12xlarge

Liquid LFM 7B (L40S)

liquid-lfm-7b-l40s

Text Generation

No

ml.g6e.xlarge, ml.g6e.2xlarge, ml.g6e.4xlarge, ml.g6e.8xlarge, ml.g6e.16xlarge

Llama 3.1 Nemotron Nano 8B V1

nvidia-nemotron-nano-8b-nim

Text Generation

No

ml.g5.12xlarge, ml.g5.24xlarge, ml.g5.48xlarge, ml.g6e.2xlarge, ml.g6e.4xlarge, ml.g6e.8xlarge, ml.g6e.16xlarge, ml.g6e.12xlarge, ml.g6e.24xlarge, ml.g6e.48xlarge, ml.p4d.24xlarge, ml.p4de.24xlarge, ml.p5.48xlarge, ml.p5en.48xlarge

Llama 3.1 SuperNova Lite

arcee-llama-3-1-supernova-lite

Text Generation

No

ml.g5.2xlarge, ml.g5.4xlarge, ml.g5.8xlarge, ml.g5.16xlarge, ml.g6.2xlarge, ml.g6.4xlarge, ml.g6.8xlarge, ml.g6.16xlarge

Llama 3.2 NVRerankQA1B NIM microservice

nvidia-llama3-2-nv-rerankqa-1b-v2-nim

Text Generation

No

ml.g5.xlarge, ml.g5.2xlarge, ml.g5.4xlarge, ml.g5.8xlarge, ml.g5.12xlarge, ml.g5.16xlarge, ml.g5.24xlarge, ml.g5.48xlarge, ml.g6.xlarge, ml.g6.2xlarge, ml.g6.4xlarge, ml.g6.8xlarge, ml.g6.12xlarge, ml.g6.16xlarge, ml.g6.24xlarge, ml.g6.48xlarge, ml.p4d.24xlarge, ml.p5.48xlarge, ml.g6e.xlarge, ml.g6e.2xlarge, ml.g6e.4xlarge, ml.g6e.8xlarge, ml.g6e.12xlarge, ml.g6e.16xlarge, ml.g6e.24xlarge, ml.g6e.48xlarge, ml.p4de.24xlarge

Llama 3.3 Nemotron Super 49B V1

nvidia-nemotron-super-49b-nim

Text Generation

No

ml.g5.48xlarge, ml.g6e.12xlarge, ml.g6e.24xlarge, ml.g6e.48xlarge, ml.p4d.24xlarge, ml.p4de.24xlarge, ml.p5.48xlarge, ml.p5en.48xlarge

Llama 3.3 Nemotron Super 49B V1.5

nvidia-nemotron-super-49b-nim-1-5

Text Generation

No

ml.g5.48xlarge, ml.g6e.12xlarge, ml.g6e.24xlarge, ml.g6e.48xlarge, ml.p4d.24xlarge, ml.p4de.24xlarge, ml.p5.48xlarge, ml.p5en.48xlarge

Llama Spark

arcee-llama-spark

Text Generation

No

ml.g5.2xlarge, ml.g5.4xlarge, ml.g5.8xlarge, ml.g5.16xlarge, ml.g6.2xlarge, ml.g6.4xlarge, ml.g6.8xlarge, ml.g6.16xlarge

Llama-3-Varco-Offsetbias-8B

ncsoft-llama-3-varco-offsetbias-8b

Text Generation

No

ml.g5.12xlarge

Llama-3.1-8B-Instruct with Stained Glass Transform Support

protopia-llama-3-1-8b-instruct

Text Generation

No

ml.g4dn.12xlarge, ml.g5.4xlarge, ml.g5.16xlarge

Llama-3.2-NV-EmbedQA-1B-v2

nvidia-llama3-2-nv-embedqa-1b-v2-nim

Text Embedding

No

ml.g5.xlarge, ml.g5.2xlarge, ml.g5.4xlarge, ml.g5.8xlarge, ml.g5.12xlarge, ml.g5.16xlarge, ml.g5.24xlarge, ml.g5.48xlarge, ml.g6.xlarge, ml.g6.2xlarge, ml.g6.4xlarge, ml.g6.8xlarge, ml.g6.12xlarge, ml.g6.16xlarge, ml.g6.24xlarge, ml.g6.48xlarge, ml.p4d.24xlarge, ml.p5.48xlarge, ml.g6e.xlarge, ml.g6e.2xlarge, ml.g6e.4xlarge, ml.g6e.8xlarge, ml.g6e.12xlarge, ml.g6e.16xlarge, ml.g6e.24xlarge, ml.g6e.48xlarge, ml.p4de.24xlarge

MARS6

cambai-mars6

Text To Audio

No

ml.g4dn.2xlarge, ml.g6.2xlarge

Medical LLM - Medium

john-snow-labs-medical-summarization-qa-8b

Text Summarization

No

ml.g5.2xlarge, ml.g4dn.12xlarge

Medical LLM - Small

john-snow-labs-summarization-qa

Text Summarization

No

ml.g5.12xlarge

Medical Text Translation (EN-ES)

john-snow-labs-medical-translation-en-es

Translation

No

ml.g5.2xlarge

Mercury

inception-mercury

Text Generation

No

ml.p5.48xlarge

Mercury Coder

inception-mercury-coder

Text Generation

No

ml.p5.48xlarge

msa-search

nvidia-nim-msa-search-v2-1

Text Generation

No

ml.g6e.12xlarge, ml.g6e.24xlarge, ml.g6e.48xlarge, ml.p4d.24xlarge, ml.p5.48xlarge, ml.p4de.24xlarge, ml.p5en.48xlarge

Nemotron nano 9b v2

nvidia-nemotron-nano-9b-v2

Text Generation

No

ml.g5.2xlarge, ml.g5.4xlarge, ml.g5.8xlarge, ml.g5.16xlarge, ml.g5.12xlarge, ml.g5.24xlarge, ml.g5.48xlarge, ml.g6e.xlarge, ml.g6e.2xlarge, ml.g6e.4xlarge, ml.g6e.8xlarge, ml.g6e.16xlarge, ml.g6e.12xlarge, ml.g6e.24xlarge, ml.g6e.48xlarge, ml.p4d.24xlarge, ml.p4de.24xlarge, ml.p5.4xlarge, ml.p5.48xlarge, ml.p5en.48xlarge

nemotron-parse

nvidia-nemotron-parse

Text Generation

No

ml.g5.xlarge, ml.g5.2xlarge, ml.g5.4xlarge, ml.g5.8xlarge, ml.g5.16xlarge, ml.g5.12xlarge, ml.g5.24xlarge, ml.g5.48xlarge, ml.g6e.xlarge, ml.g6e.2xlarge, ml.g6e.4xlarge, ml.g6e.8xlarge, ml.g6e.16xlarge, ml.g6e.12xlarge, ml.g6e.24xlarge, ml.g6e.48xlarge, ml.p4d.24xlarge, ml.p4de.24xlarge, ml.p5.4xlarge, ml.p5.48xlarge, ml.p5en.48xlarge

NEXUS by Fundamental

fundamental-technologies-nexus

Classification

No

ml.g4dn.8xlarge, ml.p5en.48xlarge

Nomic Embed Text v1.5

nomic-embed-text

Text Embedding

No

ml.p3.2xlarge, ml.p3.8xlarge, ml.p3.16xlarge, ml.g4dn.xlarge, ml.g4dn.2xlarge, ml.g4dn.4xlarge, ml.g4dn.8xlarge, ml.g4dn.12xlarge, ml.g4dn.16xlarge, ml.g5.xlarge, ml.g5.2xlarge, ml.g5.4xlarge, ml.g5.8xlarge, ml.g5.12xlarge, ml.g5.16xlarge, ml.g5.24xlarge, ml.g5.48xlarge

Nomic Embed Vision v1.5

nomic-embed-image

Text Embedding

No

ml.p3.2xlarge, ml.p3.8xlarge, ml.p3.16xlarge, ml.g4dn.xlarge, ml.g4dn.2xlarge, ml.g4dn.4xlarge, ml.g4dn.8xlarge, ml.g4dn.12xlarge, ml.g4dn.16xlarge, ml.g5.xlarge, ml.g5.2xlarge, ml.g5.4xlarge, ml.g5.8xlarge, ml.g5.12xlarge, ml.g5.16xlarge, ml.g5.24xlarge, ml.g5.48xlarge

NVIDIA Cosmos Reason-1-7B

nvidia-nim-cosmos-reason1-7b

Text Generation

No

ml.g5.12xlarge, ml.g5.24xlarge, ml.g5.48xlarge, ml.g6e.xlarge, ml.g6e.2xlarge, ml.g6e.4xlarge, ml.g6e.8xlarge, ml.g6e.16xlarge, ml.g6e.12xlarge, ml.g6e.24xlarge, ml.g6e.48xlarge, ml.p4d.24xlarge, ml.p4de.24xlarge, ml.p5.4xlarge, ml.p5.48xlarge, ml.p5en.48xlarge

NVIDIA Nemotron-4 15B NIM Microservice

nvidia-nemotron-4-15b-nim

Text Generation

No

ml.g5.12xlarge, ml.g5.24xlarge

NVIDIA ParakeetvTDT 0.6B v2

nvidia-parakeetvtdt-0-6b-v2

Audio2Text Generation

No

ml.g6e.2xlarge, ml.g6e.4xlarge, ml.g6e.8xlarge, ml.g6e.12xlarge, ml.g6e.16xlarge, ml.g6e.24xlarge, ml.g6e.48xlarge, ml.g6.2xlarge, ml.g6.4xlarge, ml.g6.8xlarge, ml.g6.12xlarge, ml.g6.16xlarge, ml.g6.24xlarge, ml.g6.48xlarge

NVIDIA-Parakeet-1-1b-CTC-EN-US-ASR

nvidia-parakeet-1-1b-ctc-en-us-asr

Audio2Text Generation

No

ml.g5.2xlarge, ml.g5.4xlarge, ml.g5.8xlarge, ml.g5.16xlarge, ml.g5.12xlarge, ml.g5.24xlarge, ml.g5.48xlarge, ml.g6.2xlarge, ml.g6.4xlarge, ml.g6.8xlarge, ml.g6.16xlarge, ml.g6.12xlarge, ml.g6.24xlarge, ml.g6.48xlarge, ml.g6e.2xlarge, ml.g6e.4xlarge, ml.g6e.8xlarge, ml.g6e.16xlarge, ml.g6e.12xlarge, ml.g6e.24xlarge, ml.g6e.48xlarge, ml.p4d.24xlarge, ml.p4de.24xlarge

Orb

orbital-materials-orb

Scientific Modelling

No

ml.p3.2xlarge, ml.p3.8xlarge, ml.p3.16xlarge, ml.p4d.24xlarge, ml.g4dn.xlarge, ml.g4dn.2xlarge, ml.g4dn.4xlarge, ml.g4dn.8xlarge, ml.g4dn.12xlarge, ml.g4dn.16xlarge

PLaMo Private Deployment

preferred-networks-plamo-api

Text Generation

No

ml.p4d.24xlarge, ml.g5.12xlarge, ml.g5.24xlarge, ml.g5.48xlarge

ProteinMPNN-NIM

nvidia-nim-proteinmpnn-v1-0-2

Text Generation

No

ml.g6e.12xlarge, ml.g6e.24xlarge, ml.g6e.48xlarge, ml.p4d.24xlarge, ml.p4de.24xlarge, ml.p5.48xlarge, ml.p5en.48xlarge

rerank-lite-1 Reranker

voyage-rerank-lite-1-reranker

ReRank

No

ml.g5.xlarge, ml.g5.2xlarge, ml.g5.4xlarge, ml.g5.8xlarge, ml.g5.16xlarge

Solar Mini - Quant

upstage-solar-mini-chat-quant

Text Generation

No

ml.g5.2xlarge

Solar Pro - Quant

upstage-solar-pro-quantized

Text Generation

No

ml.g5.12xlarge

Solar Pro 2

upstage-solar-pro

Text Generation

No

ml.p4d.24xlarge, ml.p5.48xlarge

Sonic 3 SageMaker

cartesia-sonic-3-sagemaker

Text To Audio

No

ml.g6e.xlarge

Stable Diffusion 3.5 Large

stabilityai-stable-diffusion-3-5-large

Text To Image

No

ml.p5.48xlarge

Stable Diffusion XL 1.0

stabilityai-sdxl-1-0

Text To Image

No

ml.g5.2xlarge, ml.p4d.24xlarge, ml.p4de.24xlarge

Stable Diffusion XL Beta 0.8

stabilityai-sdxl-beta-0-8

Text To Image

No

ml.g5.xlarge

Stockmark-LLM-13b

stockmark-llm-13b

Text Generation

No

ml.g5.2xlarge

VARCO LLM KO-1.3B-IST

ncsoft-ko-1-3b-ist

Text Generation

No

ml.g5.2xlarge, ml.g5.4xlarge, ml.g4dn.2xlarge

VARCO LLM KO-6.4B-IST

ncsoft-ko-6-4b-ist

Text Generation

No

ml.g5.4xlarge, ml.g5.8xlarge, ml.g5.12xlarge, ml.g4dn.4xlarge, ml.g4dn.8xlarge

VARCO LLM KO/EN-13B-IST

ncsoft-ko-13b-ist

Text Generation

No

ml.g5.12xlarge, ml.g4dn.12xlarge

voyage-2 Embedding Model

voyage-2-embedding

Text Embedding

No

ml.g5.xlarge, ml.g5.2xlarge, ml.g5.4xlarge, ml.g5.8xlarge, ml.g5.16xlarge

voyage-3 Embedding Model

voyage-3-embedding

Text Embedding

No

ml.g5.xlarge, ml.g5.2xlarge, ml.g5.4xlarge, ml.g5.8xlarge, ml.g5.16xlarge, ml.g6.xlarge, ml.g6.2xlarge, ml.g6.4xlarge, ml.g6.8xlarge, ml.g6.16xlarge

voyage-3 Large Embedding Model

voyage-3-large-embedding

Text Embedding

No

ml.g6.2xlarge, ml.g5.xlarge, ml.g5.8xlarge, ml.g6.4xlarge, ml.g5.2xlarge, ml.g5.4xlarge, ml.g6.8xlarge, ml.g6.xlarge

voyage-3.5 Embedding Model

voyage-3-5-embedding

Text Embedding

No

ml.g5.16xlarge, ml.g6.16xlarge, ml.g6.2xlarge, ml.g5.xlarge, ml.g5.8xlarge, ml.g6.4xlarge, ml.g5.2xlarge, ml.g5.4xlarge, ml.g6.8xlarge, ml.g6.xlarge

voyage-3.5-lite Embedding Model

voyage-3-5-lite-embedding

Text Embedding

No

ml.g6.16xlarge, ml.g6.2xlarge, ml.g5.xlarge, ml.g5.8xlarge, ml.g6.4xlarge, ml.g5.2xlarge, ml.g5.4xlarge, ml.g6.8xlarge, ml.g5.16xlarge, ml.g6.xlarge

voyage-code-2 Embedding Model

voyage-code-2-embedding

Text Embedding

No

ml.g5.xlarge, ml.g5.2xlarge, ml.g5.4xlarge, ml.g5.8xlarge, ml.g5.16xlarge, ml.g6.xlarge, ml.g6.2xlarge, ml.g6.4xlarge, ml.g6.8xlarge, ml.g6.16xlarge

voyage-code-3 Embedding Model

voyage-code-3-embedding

Text Embedding

No

ml.g6.16xlarge, ml.g6.2xlarge, ml.g5.xlarge, ml.g5.8xlarge, ml.g6.4xlarge, ml.g5.2xlarge, ml.g5.4xlarge, ml.g6.8xlarge, ml.g5.16xlarge, ml.g6.xlarge

voyage-large-2 Embedding Model

voyage-large-2-embedding

Text Embedding

No

ml.g5.xlarge, ml.g5.2xlarge, ml.g5.4xlarge, ml.g5.8xlarge, ml.g5.16xlarge, ml.g6.xlarge, ml.g6.2xlarge, ml.g6.4xlarge, ml.g6.8xlarge, ml.g6.16xlarge

Widn Tower Anthill

widn-tower-anthill

Translation

No

ml.g5.xlarge

Widn Tower Sugarloaf

widn-tower-sugarloaf

Translation

No

ml.g5.12xlarge

Widn Tower Vesuvius

widn-llama3-tower-vesuvius

Translation

No

ml.g5.48xlarge, ml.p4d.24xlarge, ml.p4de.24xlarge

Woven City AI Vision Engine

wovenbytoyota-woven-city-ai-vision-engine

Multimodal Generation

No

ml.g5.8xlarge, ml.g5.12xlarge, ml.g5.24xlarge, ml.g5.48xlarge

Writer Palmyra-Fin-70B-32K

writer-palmyra-fin-70b-32k

Text Generation

No

ml.p4d.24xlarge

Writer Palmyra-Med-70B-32K

writer-palmyra-med-70b-32k

Text Generation

No

ml.p4d.24xlarge

Writer Palmyra-X-004

writer-palmyra-x-004

Text Generation

No

ml.p5.48xlarge
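
The supported inference instance types in the table above are comma-separated lists, so a small helper can check whether a given instance type is listed for a model before you attempt a deployment. The following is a minimal sketch, not part of any SageMaker API: the `SUPPORTED` dictionary and the function names are hypothetical, and the three rows are copied by hand from the table above. In practice, you would populate the data from the `describe_hub_content` response instead.

```python
from typing import Dict, List

# Hypothetical lookup table: model ID -> instance-type cell, copied from three rows above.
SUPPORTED: Dict[str, str] = {
    "upstage-solar-pro": "ml.p4d.24xlarge, ml.p5.48xlarge",
    "writer-palmyra-x-004": "ml.p5.48xlarge",
    "cambai-mars6": "ml.g4dn.2xlarge, ml.g6.2xlarge",
}


def parse_instance_types(cell: str) -> List[str]:
    """Split a comma-separated table cell into a clean list of instance types."""
    return [item.strip() for item in cell.split(",") if item.strip()]


def supports(model_id: str, instance_type: str) -> bool:
    """Return True if the instance type is listed for the model ID."""
    if model_id not in SUPPORTED:
        raise KeyError(f"No table row recorded for model ID: {model_id}")
    return instance_type in parse_instance_types(SUPPORTED[model_id])


def models_for_instance(instance_type: str) -> List[str]:
    """Return the recorded model IDs that list the given instance type, sorted by ID."""
    return sorted(
        model_id
        for model_id, cell in SUPPORTED.items()
        if instance_type in parse_instance_types(cell)
    )


if __name__ == "__main__":
    print(supports("upstage-solar-pro", "ml.p5.48xlarge"))
    print(models_for_instance("ml.p5.48xlarge"))
```

Because the table reflects a point-in-time snapshot, treat a lookup like this as a convenience check only and confirm against the hub metadata before you provision an endpoint.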

Built-in Algorithms (135)

Amazon SageMaker JumpStart provides pre-trained, task-specific models for common computer vision, text, and audio tasks such as image classification, object detection, image segmentation, text classification, and text embedding. These models are built on established deep learning frameworks (PyTorch, TensorFlow, MXNet) and are ready to deploy. Models marked as fine-tunable in the following table can also be fine-tuned on your own data.
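
The model IDs in the table that follows appear to share a `framework-taskcode-name` pattern, for example `pytorch-ic-resnet50` or `mxnet-od-yolo3-darknet53-coco`. The following minimal sketch splits an ID into those parts. The `TASK_CODES` mapping and the `split_model_id` function are hypothetical and inferred only from the rows listed on this page, so this is not an official naming specification and other task codes may exist.

```python
from typing import Dict, Tuple

# Task codes inferred from the table rows on this page (an assumption, not an official list).
TASK_CODES: Dict[str, str] = {
    "ic": "Image Classification",
    "icembedding": "Image Classification",  # feature-vector models are listed under this task
    "od": "Object Detection",
    "od1": "Object Detection",
    "semseg": "Image Segmentation",
    "is": "Image Segmentation",  # instance segmentation rows are listed under this task
    "tc": "Text Classification",
    "tcembedding": "Text Embedding",
    "audioembedding": "Audio Classification",
}


def split_model_id(model_id: str) -> Tuple[str, str, str]:
    """Split a built-in algorithm model ID into (framework, task, remainder)."""
    parts = model_id.split("-", 2)
    if len(parts) < 3:
        raise ValueError(f"Unexpected model ID format: {model_id}")
    framework, task_code, remainder = parts
    # Fall back to "Unknown" for task codes that do not appear in the mapping.
    return framework, TASK_CODES.get(task_code, "Unknown"), remainder


if __name__ == "__main__":
    for example in ("pytorch-ic-resnet50", "mxnet-semseg-fcn-resnet101-ade"):
        print(split_model_id(example))
```

A helper like this can be useful for grouping results from `list_hub_contents` by framework or task, but the `Task` column in the table (or the hub metadata) remains the authoritative source.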

Provider | Model Name | Model ID | Task | License | Fine-tunable | Supported Inference Instance Types

Mxnet

Faster RCNN ResNet 101 V1d

mxnet-od-faster-rcnn-resnet101-v1d-coco

Object Detection

apache-2.0

No

ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge

Mxnet

Faster RCNN ResNet 101 V1d

mxnet-od-faster-rcnn-fpn-resnet101-v1d-coco

Object Detection

apache-2.0

No

ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge

Mxnet

Faster RCNN ResNet 50 V1b

mxnet-od-faster-rcnn-resnet50-v1b-voc

Object Detection

apache-2.0

No

ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge

Mxnet

Faster RCNN ResNet 50 V1b

mxnet-od-faster-rcnn-resnet50-v1b-coco

Object Detection

apache-2.0

No

ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge

Mxnet

Faster RCNN ResNet 50 V1b

mxnet-od-faster-rcnn-fpn-resnet50-v1b-coco

Object Detection

apache-2.0

No

ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge

Mxnet

FCN ResNet 101 ADE20K

mxnet-semseg-fcn-resnet101-ade

Image Segmentation

apache-2.0

Yes

ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge

Mxnet

FCN ResNet 101 Pascal VOC

mxnet-semseg-fcn-resnet101-voc

Image Segmentation

apache-2.0

Yes

ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge

Mxnet

FCN ResNet 50 ADE20K

mxnet-semseg-fcn-resnet50-ade

Image Segmentation

apache-2.0

Yes

ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge

Mxnet

Instance Segmentation

mxnet-is-mask-rcnn-fpn-resnet101-v1d-coco

Image Segmentation

apache-2.0

No

ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge

Mxnet

MASK RCNN FPN RESNET18 COCO

mxnet-is-mask-rcnn-fpn-resnet18-v1b-coco

Image Segmentation

apache-2.0

No

ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge

Mxnet

MASK RCNN FPN RESNET50 COCO

mxnet-is-mask-rcnn-fpn-resnet50-v1b-coco

Image Segmentation

apache-2.0

No

ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge

Mxnet

MASK RCNN RESNET18 COCO

mxnet-is-mask-rcnn-resnet18-v1b-coco

Image Segmentation

apache-2.0

No

ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge

Mxnet

Object Detection

mxnet-od-ssd-512-mobilenet1-0-coco

Object Detection

apache-2.0

Yes

ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge

Mxnet

Semantic Segmentation

mxnet-semseg-fcn-resnet101-coco

Image Segmentation

apache-2.0

Yes

ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge

Mxnet

SSD 512 ResNet 50 V1

mxnet-od-ssd-512-resnet50-v1-coco

Object Detection

apache-2.0

Yes

ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge

Mxnet

SSD MobileNet 1.0

mxnet-od-ssd-512-mobilenet1-0-voc

Object Detection

apache-2.0

Yes

ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge

Mxnet

SSD ResNet 50 V1

mxnet-od-ssd-512-resnet50-v1-voc

Object Detection

apache-2.0

Yes

ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge

Mxnet

SSD VGG 16 Atrous 300

mxnet-od-ssd-300-vgg16-atrous-voc

Object Detection

apache-2.0

Yes

ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge

Mxnet

SSD VGG 16 Atrous 512

mxnet-od-ssd-512-vgg16-atrous-coco

Object Detection

apache-2.0

Yes

ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge

Mxnet

SSD VGG16 Atrous 300

mxnet-od-ssd-300-vgg16-atrous-coco

Object Detection

apache-2.0

Yes

ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge

Mxnet

SSD VGG16 Atrous 512

mxnet-od-ssd-512-vgg16-atrous-voc

Object Detection

apache-2.0

Yes

ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge

Mxnet

YOLO V3 DarkNet 53

mxnet-od-yolo3-darknet53-voc

Object Detection

apache-2.0

No

ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge

Mxnet

YOLO V3 DarkNet 53

mxnet-od-yolo3-darknet53-coco

Object Detection

apache-2.0

No

ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge

Mxnet

YOLO V3 MobileNet 1.0

mxnet-od-yolo3-mobilenet1-0-voc

Object Detection

apache-2.0

No

ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge

Mxnet

YOLO V3 MobileNet 1.0

mxnet-od-yolo3-mobilenet1-0-coco

Object Detection

apache-2.0

No

ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge

Pytorch

SqueezeNet 1

pytorch-ic-squeezenet1-1

Image Classification

bsd-3-clause

Yes

ml.c5.2xlarge, ml.c5.xlarge, ml.m4.xlarge, ml.m5.large, ml.m5.xlarge

Pytorch

AlexNet

pytorch-ic-alexnet

Image Classification

bsd-3-clause

Yes

ml.c5.2xlarge, ml.c5.xlarge, ml.m4.xlarge, ml.m5.large, ml.m5.xlarge

Pytorch

DenseNet 121

pytorch-ic-densenet121

Image Classification

bsd-3-clause

Yes

ml.c5.2xlarge, ml.c5.xlarge, ml.m4.xlarge, ml.m5.large, ml.m5.xlarge

Pytorch

DenseNet 169

pytorch-ic-densenet169

Image Classification

bsd-3-clause

Yes

ml.c5.2xlarge, ml.c5.xlarge, ml.m4.xlarge, ml.m5.large, ml.m5.xlarge

Pytorch

FRCNN MobileNet V3 large 320 FPN

pytorch-od1-fasterrcnn-mobilenet-v3-large-320-fpn

Object Detection

bsd-3-clause

No

ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge

Pytorch

FRCNN MobileNet V3 large FPN

pytorch-od1-fasterrcnn-mobilenet-v3-large-fpn

Object Detection

bsd-3-clause

No

ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge

Pytorch

GoogLeNet

pytorch-ic-googlenet

Image Classification

bsd-3-clause

Yes

ml.c5.2xlarge, ml.c5.xlarge, ml.m4.xlarge, ml.m5.large, ml.m5.xlarge

Pytorch

Image Classification

pytorch-ic-mobilenet-v2

Image Classification

bsd-3-clause

Yes

ml.c5.2xlarge, ml.c5.xlarge, ml.m4.xlarge, ml.m5.large, ml.m5.xlarge

Pytorch

Object Detection

pytorch-od1-fasterrcnn-resnet50-fpn

Object Detection

bsd-3-clause

Yes

ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge

Pytorch

ResNet 101

pytorch-ic-resnet101

Image Classification

bsd-3-clause

Yes

ml.c5.2xlarge, ml.c5.xlarge, ml.m4.xlarge, ml.m5.large, ml.m5.xlarge

Pytorch

ResNet 152

pytorch-ic-resnet152

Image Classification

bsd-3-clause

Yes

ml.c5.2xlarge, ml.c5.xlarge, ml.m4.xlarge, ml.m5.large, ml.m5.xlarge

Pytorch

ResNet 18

pytorch-ic-resnet18

Image Classification

bsd-3-clause

Yes

ml.c5.2xlarge, ml.c5.xlarge, ml.m4.xlarge, ml.m5.large, ml.m5.xlarge

Pytorch

ResNet 34

pytorch-ic-resnet34

Image Classification

bsd-3-clause

Yes

ml.c5.2xlarge, ml.c5.xlarge, ml.m4.xlarge, ml.m5.large, ml.m5.xlarge

Pytorch

ResNet 50

pytorch-ic-resnet50

Image Classification

bsd-3-clause

Yes

ml.c5.2xlarge, ml.c5.xlarge, ml.m4.xlarge, ml.m5.large, ml.m5.xlarge

Pytorch

Resnext 50

pytorch-ic-resnext50-32x4d

Image Classification

bsd-3-clause

Yes

ml.c5.2xlarge, ml.c5.xlarge, ml.m4.xlarge, ml.m5.large, ml.m5.xlarge

Pytorch

ShuffleNet V2

pytorch-ic-shufflenet-v2-x1-0

Image Classification

bsd-3-clause

Yes

ml.c5.2xlarge, ml.c5.xlarge, ml.m4.xlarge, ml.m5.large, ml.m5.xlarge

Pytorch

SSD

pytorch-od-nvidia-ssd

Object Detection

bsd-3-clause

No

ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge

Pytorch

VGG 11

pytorch-ic-vgg11

Image Classification

bsd-3-clause

Yes

ml.c5.2xlarge, ml.c5.xlarge, ml.m4.xlarge, ml.m5.large, ml.m5.xlarge

Pytorch

VGG 11-BN

pytorch-ic-vgg11-bn

Image Classification

bsd-3-clause

Yes

ml.c5.2xlarge, ml.c5.xlarge, ml.m4.xlarge, ml.m5.large, ml.m5.xlarge

Pytorch

VGG 13-BN

pytorch-ic-vgg13-bn

Image Classification

bsd-3-clause

Yes

ml.c5.2xlarge, ml.c5.xlarge, ml.m4.xlarge, ml.m5.large, ml.m5.xlarge

Pytorch

VGG 16

pytorch-ic-vgg16

Image Classification

bsd-3-clause

Yes

ml.c5.2xlarge, ml.c5.xlarge, ml.m4.xlarge, ml.m5.large, ml.m5.xlarge

Pytorch

VGG 16-BN

pytorch-ic-vgg16-bn

Image Classification

bsd-3-clause

Yes

ml.c5.2xlarge, ml.c5.xlarge, ml.m4.xlarge, ml.m5.large, ml.m5.xlarge

Pytorch

VGG 19

pytorch-ic-vgg19

Image Classification

bsd-3-clause

Yes

ml.c5.2xlarge, ml.c5.xlarge, ml.m4.xlarge, ml.m5.large, ml.m5.xlarge

Pytorch

VGG-13

pytorch-ic-vgg13

Image Classification

bsd-3-clause

Yes

ml.c5.2xlarge, ml.c5.xlarge, ml.m4.xlarge, ml.m5.large, ml.m5.xlarge

Pytorch

Wide ResNet 101

pytorch-ic-wide-resnet101-2

Image Classification

bsd-3-clause

Yes

ml.c5.2xlarge, ml.c5.xlarge, ml.m4.xlarge, ml.m5.large, ml.m5.xlarge

Tensorflow

ALBERT Base

tensorflow-tc-albert-en-base

Text Classification

apache-2.0

Yes

ml.c5.18xlarge, ml.c5.2xlarge, ml.c5.4xlarge, ml.c5.9xlarge, ml.g4dn.16xlarge, ml.g4dn.2xlarge, ml.g4dn.4xlarge, ml.g4dn.8xlarge, ml.g4dn.xlarge, ml.g5.xlarge, ml.m5.12xlarge, ml.m5.24xlarge, ml.m5.2xlarge, ml.m5.4xlarge, ml.m5.xlarge, ml.p3.16xlarge, ml.p3.2xlarge, ml.p3.8xlarge

Tensorflow

BERT Base Cased

tensorflow-tc-bert-en-cased-L-12-H-768-A-12-2

Text Classification

apache-2.0

Yes

ml.c5.18xlarge, ml.c5.2xlarge, ml.c5.4xlarge, ml.c5.9xlarge, ml.g4dn.16xlarge, ml.g4dn.2xlarge, ml.g4dn.4xlarge, ml.g4dn.8xlarge, ml.g4dn.xlarge, ml.g5.xlarge, ml.m5.12xlarge, ml.m5.24xlarge, ml.m5.2xlarge, ml.m5.4xlarge, ml.m5.xlarge, ml.p3.16xlarge, ml.p3.2xlarge, ml.p3.8xlarge

Tensorflow

BERT Base MEDLINE/PubMed

tensorflow-tc-experts-bert-pubmed-1

Text Classification

apache-2.0

Yes

ml.c5.18xlarge, ml.c5.2xlarge, ml.c5.4xlarge, ml.c5.9xlarge, ml.g4dn.16xlarge, ml.g4dn.2xlarge, ml.g4dn.4xlarge, ml.g4dn.8xlarge, ml.g4dn.xlarge, ml.g5.xlarge, ml.m5.12xlarge, ml.m5.24xlarge, ml.m5.2xlarge, ml.m5.4xlarge, ml.m5.xlarge, ml.p3.16xlarge, ml.p3.2xlarge, ml.p3.8xlarge

Tensorflow

BERT Base Multilingual Cased

tensorflow-tc-bert-multi-cased-L-12-H-768-A-12-2

Text Classification

apache-2.0

Yes

ml.c5.18xlarge, ml.c5.2xlarge, ml.c5.4xlarge, ml.c5.9xlarge, ml.g4dn.16xlarge, ml.g4dn.2xlarge, ml.g4dn.4xlarge, ml.g4dn.8xlarge, ml.g4dn.xlarge, ml.g5.xlarge, ml.m5.12xlarge, ml.m5.24xlarge, ml.m5.2xlarge, ml.m5.4xlarge, ml.m5.xlarge, ml.p3.16xlarge, ml.p3.2xlarge, ml.p3.8xlarge

Tensorflow

BERT Large Uncased

tensorflow-tc-bert-en-uncased-L-24-H-1024-A-16-2

Text Classification

apache-2.0

Yes

ml.c5.18xlarge, ml.c5.2xlarge, ml.c5.4xlarge, ml.c5.9xlarge, ml.g4dn.16xlarge, ml.g4dn.2xlarge, ml.g4dn.4xlarge, ml.g4dn.8xlarge, ml.g4dn.xlarge, ml.g5.xlarge, ml.m5.12xlarge, ml.m5.24xlarge, ml.m5.2xlarge, ml.m5.4xlarge, ml.m5.xlarge, ml.p3.16xlarge, ml.p3.2xlarge, ml.p3.8xlarge

Tensorflow

BERT Small Uncased L-10 H-128

tensorflow-tcembedding-bert-en-uncased-L-10-H-128-A-2-2

Text Embedding

apache-2.0

No

ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge

Tensorflow

BERT Small Uncased L-12 H-256

tensorflow-tcembedding-bert-en-uncased-L-12-H-256-A-4

Text Embedding

apache-2.0

No

ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge

Tensorflow

BERT Small Uncased L-12 H-768

tensorflow-tcembedding-bert-en-uncased-L-12-H-768-A-12-2

Text Embedding

apache-2.0

No

ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge

Tensorflow

BERT Small Uncased L-4 H-768

tensorflow-tcembedding-bert-en-uncased-L-4-H-768-A-12-2

Text Embedding

apache-2.0

No

ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge

Tensorflow

BiT-M R101x1

tensorflow-ic-bit-m-r101x1-ilsvrc2012-classification-1

Image Classification

Apache-2.0

Yes

ml.c5.2xlarge, ml.c5.xlarge, ml.c6i.large, ml.c6i.xlarge, ml.g4dn.xlarge, ml.m4.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge, ml.r5.large, ml.r5.xlarge

Tensorflow

BiT-M R101x1 Feature Vector

tensorflow-icembedding-bit-m-r101x1-ilsvrc2012-featurevector-1

Image Classification

apache-2.0

No

ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge

Tensorflow

BiT-M R50x1 ImageNet-21k

tensorflow-ic-bit-m-r50x1-imagenet21k-classification-1

Image Classification

Apache-2.0

Yes

ml.c5.2xlarge, ml.c5.xlarge, ml.c6i.large, ml.c6i.xlarge, ml.g4dn.xlarge, ml.m4.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge, ml.r5.large, ml.r5.xlarge

Tensorflow

CaiT S 24 224

tensorflow-ic-cait-s24-224

Image Classification

MIT

Yes

ml.c5.2xlarge, ml.c5.xlarge, ml.c6i.large, ml.c6i.xlarge, ml.g4dn.xlarge, ml.m4.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge, ml.r5.large, ml.r5.xlarge

Tensorflow

CenterNet ResNet-v1-101

tensorflow-od-centernet-resnet101v1-fpn-512x512-1

Object Detection

apache-2.0

No

ml.g4dn.xlarge, ml.g5.xlarge, ml.p3.2xlarge

Tensorflow

CenterNet ResNet-v1-50

tensorflow-od-centernet-resnet50v1-fpn-512x512-1

Object Detection

apache-2.0

No

ml.g4dn.xlarge, ml.g5.xlarge, ml.p3.2xlarge

Tensorflow

DeiT Tiny Distilled Patch 16 224

tensorflow-ic-deit-tiny-distilled-patch16-224

Image Classification

MIT

Yes

ml.c5.2xlarge, ml.c5.xlarge, ml.c6i.large, ml.c6i.xlarge, ml.g4dn.xlarge, ml.m4.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge, ml.r5.large, ml.r5.xlarge

Tensorflow

EfficientNet B0

tensorflow-ic-efficientnet-b0-classification-1

Image Classification

Apache-2.0

Yes

ml.c5.2xlarge, ml.c5.xlarge, ml.c6i.large, ml.c6i.xlarge, ml.g4dn.xlarge, ml.m4.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge, ml.r5.large, ml.r5.xlarge

Tensorflow

EfficientNet B1

tensorflow-ic-efficientnet-b1-classification-1

Image Classification

Apache-2.0

Yes

ml.c5.2xlarge, ml.c5.xlarge, ml.c6i.large, ml.c6i.xlarge, ml.g4dn.xlarge, ml.m4.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge, ml.r5.large, ml.r5.xlarge

Tensorflow

EfficientNet B1 Feature Vector

tensorflow-icembedding-efficientnet-b1-featurevector-1

Image Classification

apache-2.0

No

ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge

Tensorflow

EfficientNet B3

tensorflow-ic-efficientnet-b3-classification-1

Image Classification

Apache-2.0

Yes

ml.c5.2xlarge, ml.c5.xlarge, ml.c6i.large, ml.c6i.xlarge, ml.g4dn.xlarge, ml.m4.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge, ml.r5.large, ml.r5.xlarge

Tensorflow

EfficientNet B7

tensorflow-ic-efficientnet-b7-classification-1

Image Classification

Apache-2.0

Yes

ml.c5.9xlarge, ml.g4dn.8xlarge, ml.p3.8xlarge

Tensorflow

EfficientNet V2 ImageNet-1k Small

tensorflow-ic-efficientnet-v2-imagenet1k-s

Image Classification

Apache-2.0

Yes

ml.c5.2xlarge, ml.c5.xlarge, ml.c6i.large, ml.c6i.xlarge, ml.g4dn.xlarge, ml.m4.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge, ml.r5.large, ml.r5.xlarge

Tensorflow

EfficientNet V2 ImageNet-21k FT1k M

tensorflow-ic-efficientnet-v2-imagenet21k-ft1k-m

Image Classification

Apache-2.0

Yes

ml.c5.2xlarge, ml.c5.xlarge, ml.c6i.large, ml.c6i.xlarge, ml.g4dn.xlarge, ml.m4.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge, ml.r5.large, ml.r5.xlarge

Tensorflow

Faster R-CNN Resnet V2 1024x1024

tensorflow-od-faster-rcnn-inception-resnet-v2-1024x1024-1

Object Detection

apache-2.0

No

ml.g4dn.xlarge, ml.g5.xlarge, ml.p3.2xlarge

Tensorflow

Faster R-CNN Resnet-101 V1 640x640

tensorflow-od-faster-rcnn-resnet101-v1-640x640-1

Object Detection

apache-2.0

No

ml.g4dn.xlarge, ml.g5.xlarge, ml.p3.2xlarge

Tensorflow

Faster R-CNN Resnet-50 V1 1024x1024

tensorflow-od-faster-rcnn-resnet50-v1-1024x1024-1

Object Detection

apache-2.0

No

ml.g4dn.xlarge, ml.g5.xlarge, ml.p3.2xlarge

Tensorflow

Faster R-CNN Resnet-50 V1 640x640

tensorflow-od-faster-rcnn-resnet50-v1-640x640-1

Object Detection

apache-2.0

No

ml.g4dn.xlarge, ml.g5.xlarge, ml.p3.2xlarge

Tensorflow

Faster R-CNN Resnet-50 V1 800x1333

tensorflow-od-faster-rcnn-resnet50-v1-800x1333-1

Object Detection

apache-2.0

No

ml.g4dn.xlarge, ml.g5.xlarge, ml.p3.2xlarge

Tensorflow

FRILL

tensorflow-audioembedding-frill-1

Audio Classification

apache-2.0

No

ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge

Tensorflow

Image Classification

tensorflow-ic-swin-base-patch4-window7-224

Image Classification

MIT

Yes

ml.c5.2xlarge, ml.c5.xlarge, ml.c6i.large, ml.c6i.xlarge, ml.g4dn.xlarge, ml.m4.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge, ml.r5.large, ml.r5.xlarge

Tensorflow

Image Embedding

tensorflow-icembedding-imagenet-mobilenet-v2-100-224-fv-4

Image Classification

apache-2.0

No

ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge

Tensorflow

Inception V3

tensorflow-ic-imagenet-inception-v3-classification-4

Image Classification

Apache-2.0

Yes

ml.c5.2xlarge, ml.c5.xlarge, ml.c6i.large, ml.c6i.xlarge, ml.g4dn.xlarge, ml.m4.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge, ml.r5.large, ml.r5.xlarge

Tensorflow

Inception V3 Preview Feature Vector

tensorflow-icembedding-tf2-preview-inception-v3-fv-4

Image Classification

apache-2.0

No

ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge

Tensorflow

MobileNet V2

tensorflow-ic-tf2-preview-mobilenet-v2-classification-4

Image Classification

Apache-2.0

Yes

ml.c5.2xlarge, ml.c5.xlarge, ml.c6i.large, ml.c6i.xlarge, ml.g4dn.xlarge, ml.m4.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge, ml.r5.large, ml.r5.xlarge

Tensorflow

MobileNet V2 0.35 128

tensorflow-ic-imagenet-mobilenet-v2-035-128

Image Classification

Apache-2.0

Yes

ml.c5.2xlarge, ml.c5.xlarge, ml.c6i.large, ml.c6i.xlarge, ml.g4dn.xlarge, ml.m4.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge, ml.r5.large, ml.r5.xlarge

Tensorflow

MobileNet V2 0.50 224

tensorflow-icembedding-imagenet-mobilenet-v2-050-224-fv-4

Image Classification

apache-2.0

No

ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge

Tensorflow

MobileNet V2 1.00 224

tensorflow-ic-imagenet-mobilenet-v2-100-224-classification-4

Image Classification

Apache-2.0

Yes

ml.c5.2xlarge, ml.c5.xlarge, ml.c6i.large, ml.c6i.xlarge, ml.g4dn.xlarge, ml.m4.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge, ml.r5.large, ml.r5.xlarge

Tensorflow

MobileNet V2 1.40 224

tensorflow-ic-imagenet-mobilenet-v2-140-224-classification-4

Image Classification

Apache-2.0

Yes

ml.c5.2xlarge, ml.c5.xlarge, ml.c6i.large, ml.c6i.xlarge, ml.g4dn.xlarge, ml.m4.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge, ml.r5.large, ml.r5.xlarge

Tensorflow

MobileNet V2 Feature Vector

tensorflow-icembedding-tf2-preview-mobilenet-v2-fv-4

Image Classification

apache-2.0

No

ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge

Tensorflow

MobileNet V2 Feature Vector

tensorflow-icembedding-tf2-preview-mobilenet-v2-featurevector-4

Image Classification

apache-2.0

No

ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge

Tensorflow

MobileNet V3 Large 1.00 224

tensorflow-ic-imagenet-mobilenet-v3-large-100-224

Image Classification

Apache-2.0

Yes

ml.c5.2xlarge, ml.c5.xlarge, ml.c6i.large, ml.c6i.xlarge, ml.g4dn.xlarge, ml.m4.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge, ml.r5.large, ml.r5.xlarge

Tensorflow

MobileNet V3 Small 0.75 224

tensorflow-ic-imagenet-mobilenet-v3-small-075-224

Image Classification

Apache-2.0

Yes

ml.c5.2xlarge, ml.c5.xlarge, ml.c6i.large, ml.c6i.xlarge, ml.g4dn.xlarge, ml.m4.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge, ml.r5.large, ml.r5.xlarge

Tensorflow

MobileNet V3 Small 1.00 224

tensorflow-ic-imagenet-mobilenet-v3-small-100-224

Image Classification

Apache-2.0

Yes

ml.c5.2xlarge, ml.c5.xlarge, ml.c6i.large, ml.c6i.xlarge, ml.g4dn.xlarge, ml.m4.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge, ml.r5.large, ml.r5.xlarge

Tensorflow

Object Detection

tensorflow-od1-ssd-resnet50-v1-fpn-640x640-coco17-tpu-8

Object Detection

apache-2.0

Yes

ml.g4dn.xlarge, ml.g5.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge

Tensorflow

ResNet 50

tensorflow-ic-resnet-50-classification-1

Image Classification

Apache-2.0

Yes

ml.c5.9xlarge, ml.g4dn.8xlarge, ml.p3.8xlarge

Tensorflow

ResNet 50 Feature Vector

tensorflow-icembedding-resnet-50-featurevector-1

Image Classification

apache-2.0

No

ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge

Tensorflow

ResNet V1 50

tensorflow-ic-imagenet-resnet-v1-50-classification-4

Image Classification

Apache-2.0

Yes

ml.c5.2xlarge, ml.c5.xlarge, ml.c6i.large, ml.c6i.xlarge, ml.g4dn.xlarge, ml.m4.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge, ml.r5.large, ml.r5.xlarge

Tensorflow

ResNet V2 152 Feature Vector

tensorflow-icembedding-imagenet-resnet-v2-152-featurevector-4

Image Classification

apache-2.0

No

ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge

Tensorflow

ResNet V2 50

tensorflow-ic-imagenet-resnet-v2-50-classification-4

Image Classification

Apache-2.0

Yes

ml.c5.2xlarge, ml.c5.xlarge, ml.c6i.large, ml.c6i.xlarge, ml.g4dn.xlarge, ml.m4.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge, ml.r5.large, ml.r5.xlarge

Tensorflow

ResNet V2 50 Feature Vector

tensorflow-icembedding-imagenet-resnet-v2-50-featurevector-4

Image Classification

apache-2.0

No

ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge

Tensorflow

Retinanet SSD Resnet-101 640x640

tensorflow-od-retinanet-resnet101-v1-fpn-640x640-1

Object Detection

apache-2.0

No

ml.g4dn.xlarge, ml.g5.xlarge, ml.p3.2xlarge

Tensorflow

Small BERT L-10_H-512_A-8

tensorflow-tc-small-bert-bert-en-uncased-L-10-H-512-A-8

Text Classification

apache-2.0

Yes

ml.c5.18xlarge, ml.c5.2xlarge, ml.c5.4xlarge, ml.c5.9xlarge, ml.g4dn.16xlarge, ml.g4dn.2xlarge, ml.g4dn.4xlarge, ml.g4dn.8xlarge, ml.g4dn.xlarge, ml.g5.xlarge, ml.m5.12xlarge, ml.m5.24xlarge, ml.m5.2xlarge, ml.m5.4xlarge, ml.m5.xlarge, ml.p3.16xlarge, ml.p3.2xlarge, ml.p3.8xlarge

Tensorflow

Small BERT L-12_H-256_A-4

tensorflow-tc-small-bert-bert-en-uncased-L-12-H-256-A-4

Text Classification

apache-2.0

Yes

ml.c5.18xlarge, ml.c5.2xlarge, ml.c5.4xlarge, ml.c5.9xlarge, ml.g4dn.16xlarge, ml.g4dn.2xlarge, ml.g4dn.4xlarge, ml.g4dn.8xlarge, ml.g4dn.xlarge, ml.g5.xlarge, ml.m5.12xlarge, ml.m5.24xlarge, ml.m5.2xlarge, ml.m5.4xlarge, ml.m5.xlarge, ml.p3.16xlarge, ml.p3.2xlarge, ml.p3.8xlarge

Tensorflow

Small BERT L-12_H-768_A-12

tensorflow-tc-small-bert-bert-en-uncased-L-12-H-768-A-12

Text Classification

apache-2.0

Yes

ml.c5.18xlarge, ml.c5.2xlarge, ml.c5.4xlarge, ml.c5.9xlarge, ml.g4dn.16xlarge, ml.g4dn.2xlarge, ml.g4dn.4xlarge, ml.g4dn.8xlarge, ml.g4dn.xlarge, ml.g5.xlarge, ml.m5.12xlarge, ml.m5.24xlarge, ml.m5.2xlarge, ml.m5.4xlarge, ml.m5.xlarge, ml.p3.16xlarge, ml.p3.2xlarge, ml.p3.8xlarge

Tensorflow

Small BERT L-2_H-128_A-2

tensorflow-tc-small-bert-bert-en-uncased-L-2-H-128-A-2

Text Classification

apache-2.0

Yes

ml.c5.18xlarge, ml.c5.2xlarge, ml.c5.4xlarge, ml.c5.9xlarge, ml.g4dn.16xlarge, ml.g4dn.2xlarge, ml.g4dn.4xlarge, ml.g4dn.8xlarge, ml.g4dn.xlarge, ml.g5.xlarge, ml.m5.12xlarge, ml.m5.24xlarge, ml.m5.2xlarge, ml.m5.4xlarge, ml.m5.xlarge, ml.p3.16xlarge, ml.p3.2xlarge, ml.p3.8xlarge

Tensorflow

Small BERT L-4_H-512_A-8

tensorflow-tc-small-bert-bert-en-uncased-L-4-H-512-A-8

Text Classification

apache-2.0

Yes

ml.c5.18xlarge, ml.c5.2xlarge, ml.c5.4xlarge, ml.c5.9xlarge, ml.g4dn.16xlarge, ml.g4dn.2xlarge, ml.g4dn.4xlarge, ml.g4dn.8xlarge, ml.g4dn.xlarge, ml.g5.xlarge, ml.m5.12xlarge, ml.m5.24xlarge, ml.m5.2xlarge, ml.m5.4xlarge, ml.m5.xlarge, ml.p3.16xlarge, ml.p3.2xlarge, ml.p3.8xlarge

Tensorflow

Small BERT L-6_H-768_A-12

tensorflow-tc-small-bert-bert-en-uncased-L-6-H-768-A-12

Text Classification

apache-2.0

Yes

ml.c5.18xlarge, ml.c5.2xlarge, ml.c5.4xlarge, ml.c5.9xlarge, ml.g4dn.16xlarge, ml.g4dn.2xlarge, ml.g4dn.4xlarge, ml.g4dn.8xlarge, ml.g4dn.xlarge, ml.g5.xlarge, ml.m5.12xlarge, ml.m5.24xlarge, ml.m5.2xlarge, ml.m5.4xlarge, ml.m5.xlarge, ml.p3.16xlarge, ml.p3.2xlarge, ml.p3.8xlarge

Tensorflow

Small BERT L-8_H-768_A-12

tensorflow-tc-small-bert-bert-en-uncased-L-8-H-768-A-12

Text Classification

apache-2.0

Yes

ml.c5.18xlarge, ml.c5.2xlarge, ml.c5.4xlarge, ml.c5.9xlarge, ml.g4dn.16xlarge, ml.g4dn.2xlarge, ml.g4dn.4xlarge, ml.g4dn.8xlarge, ml.g4dn.xlarge, ml.g5.xlarge, ml.m5.12xlarge, ml.m5.24xlarge, ml.m5.2xlarge, ml.m5.4xlarge, ml.m5.xlarge, ml.p3.16xlarge, ml.p3.2xlarge, ml.p3.8xlarge

Tensorflow

SSD ResNet 152 V1 FPN 1024x1024 COCO '17

tensorflow-od1-ssd-resnet152-v1-fpn-1024x1024-coco17-tpu-8

Object Detection

apache-2.0

Yes

ml.g4dn.xlarge, ml.g5.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge

Tensorflow

SSD EfficientDet D0

tensorflow-od-efficientdet-d0-1

Object Detection

apache-2.0

No

ml.g4dn.xlarge, ml.g5.xlarge, ml.p3.2xlarge

Tensorflow

SSD Efficientdet D0 512x512 COCO '17

tensorflow-od1-ssd-efficientdet-d0-512x512-coco17-tpu-8

Object Detection

apache-2.0

Yes

ml.g4dn.xlarge, ml.g5.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge

Tensorflow

SSD Efficientdet D1 640x640 COCO '17

tensorflow-od1-ssd-efficientdet-d1-640x640-coco17-tpu-8

Object Detection

apache-2.0

Yes

ml.g4dn.xlarge, ml.g5.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge

Tensorflow

SSD EfficientDet D2

tensorflow-od-efficientdet-d2-1

Object Detection

apache-2.0

No

ml.g4dn.xlarge, ml.g5.xlarge, ml.p3.2xlarge

Tensorflow

SSD Efficientdet D2 768x768 COCO '17

tensorflow-od1-ssd-efficientdet-d2-768x768-coco17-tpu-8

Object Detection

apache-2.0

Yes

ml.g4dn.xlarge, ml.g5.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge

Tensorflow

SSD EfficientDet D3

tensorflow-od-efficientdet-d3-1

Object Detection

apache-2.0

No

ml.g4dn.xlarge, ml.g5.xlarge, ml.p3.2xlarge

Tensorflow

SSD Efficientdet D3 896x896 COCO '17

tensorflow-od1-ssd-efficientdet-d3-896x896-coco17-tpu-32

Object Detection

apache-2.0

Yes

ml.g4dn.xlarge, ml.g5.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge

Tensorflow

SSD EfficientDet D5

tensorflow-od-efficientdet-d5-1

Object Detection

apache-2.0

No

ml.g4dn.xlarge, ml.g5.xlarge, ml.p3.2xlarge

Tensorflow

SSD Mobilenet V1 FPN 640x640 COCO '17

tensorflow-od1-ssd-mobilenet-v1-fpn-640x640-coco17-tpu-8

Object Detection

apache-2.0

Yes

ml.g4dn.xlarge, ml.g5.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge

Tensorflow

SSD Mobilenet V2

tensorflow-od-ssd-mobilenet-v2-2

Object Detection

apache-2.0

No

ml.g4dn.xlarge, ml.g5.xlarge, ml.p3.2xlarge

Tensorflow

SSD Mobilenet V2 640x640

tensorflow-od-ssd-mobilenet-v2-fpnlite-640x640-1

Object Detection

apache-2.0

No

ml.g4dn.xlarge, ml.g5.xlarge, ml.p3.2xlarge

Tensorflow

SSD Mobilenet V2 FPN-Lite 320x320 COCO '17

tensorflow-od1-ssd-mobilenet-v2-fpnlite-320x320-coco17-tpu-8

Object Detection

apache-2.0

Yes

ml.g4dn.xlarge, ml.g5.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge

Tensorflow

SSD Mobilenet V2 FPN-lite 640x640 COCO '17

tensorflow-od1-ssd-mobilenet-v2-fpnlite-640x640-coco17-tpu-8

Object Detection

apache-2.0

Yes

ml.g4dn.xlarge, ml.g5.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge

Tensorflow

SSD ResNet 101 V1 FPN 1024x1024 COCO '17

tensorflow-od1-ssd-resnet101-v1-fpn-1024x1024-coco17-tpu-8

Object Detection

apache-2.0

Yes

ml.g4dn.xlarge, ml.g5.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge

Tensorflow

SSD ResNet 101 V1 FPN 640x640 COCO '17

tensorflow-od1-ssd-resnet101-v1-fpn-640x640-coco17-tpu-8

Object Detection

apache-2.0

Yes

ml.g4dn.xlarge, ml.g5.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge

Tensorflow

SSD ResNet 152 V1 FPN 640x640 COCO '17

tensorflow-od1-ssd-resnet152-v1-fpn-640x640-coco17-tpu-8

Object Detection

apache-2.0

Yes

ml.g4dn.xlarge, ml.g5.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge

Tensorflow

SSD ResNet 50 V1 FPN 1024x1024 COCO '17

tensorflow-od1-ssd-resnet50-v1-fpn-1024x1024-coco17-tpu-8

Object Detection

apache-2.0

Yes

ml.g4dn.xlarge, ml.g5.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge

Tensorflow

Swin Base 4 12 384

tensorflow-ic-swin-base-patch4-window12-384

Image Classification

mit

Yes

ml.c5.2xlarge, ml.c5.xlarge, ml.c6i.large, ml.c6i.xlarge, ml.g4dn.xlarge, ml.m4.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge, ml.r5.large, ml.r5.xlarge

Tensorflow

Talking Heads Large

tensorflow-tc-talking-heads-large

Text Classification

apache-2.0

Yes

ml.c5.18xlarge, ml.c5.2xlarge, ml.c5.4xlarge, ml.c5.9xlarge, ml.g4dn.16xlarge, ml.g4dn.2xlarge, ml.g4dn.4xlarge, ml.g4dn.8xlarge, ml.g4dn.xlarge, ml.g5.xlarge, ml.m5.12xlarge, ml.m5.24xlarge, ml.m5.2xlarge, ml.m5.4xlarge, ml.m5.xlarge, ml.p3.16xlarge, ml.p3.2xlarge, ml.p3.8xlarge

Tensorflow

Text Classification

tensorflow-tc-bert-en-uncased-L-12-H-768-A-12-2

Text Classification

apache-2.0

Yes

ml.c5.18xlarge, ml.c5.2xlarge, ml.c5.4xlarge, ml.c5.9xlarge, ml.g4dn.16xlarge, ml.g4dn.2xlarge, ml.g4dn.4xlarge, ml.g4dn.8xlarge, ml.g4dn.xlarge, ml.g5.xlarge, ml.m5.12xlarge, ml.m5.24xlarge, ml.m5.2xlarge, ml.m5.4xlarge, ml.m5.xlarge, ml.p3.16xlarge, ml.p3.2xlarge, ml.p3.8xlarge

Tensorflow

Text Embedding

tensorflow-tcembedding-universal-sentc-encoder-cmlm-en-large-1

Text Embedding

apache-2.0

No

ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge

Tensorflow

TRILL Distilled

tensorflow-audioembedding-trill-distilled-3

Audio Classification

apache-2.0

No

ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge

Tensorflow

TRILLsson 1

tensorflow-audioembedding-trillsson1-1

Audio Classification

apache-2.0

No

ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge

Tensorflow

TRILLsson 2

tensorflow-audioembedding-trillsson2-1

Audio Classification

apache-2.0

No

ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge

Tensorflow

TRILLsson 3

tensorflow-audioembedding-trillsson3-1

Audio Classification

apache-2.0

No

ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge

Tensorflow

Universal Sentence Encoder CMLM Base

tensorflow-tcembedding-universal-sentc-encoder-cmlm-en-base-1

Text Embedding

apache-2.0

No

ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge
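Each Supported Inference Instance Types cell in the table above is a comma-separated list, so it can be filtered programmatically. The following is a minimal sketch, not part of the JumpStart API: `split_instance_types` is a hypothetical helper, and the rule that instance families beginning with `g` or `p` are GPU-backed is an assumption based on the instance names that appear in these tables.

```python
from typing import Dict, List


def split_instance_types(cell: str) -> Dict[str, List[str]]:
    """Split a comma-separated instance-type cell into CPU and GPU lists."""
    result: Dict[str, List[str]] = {"cpu": [], "gpu": []}
    for name in (part.strip() for part in cell.split(",")):
        if not name:
            continue
        parts = name.split(".")
        if len(parts) != 3:
            raise ValueError(f"Unexpected instance type format: {name!r}")
        family = parts[1]  # "ml.g4dn.xlarge" -> "g4dn"
        # Assumption: families starting with "g" or "p" are GPU-backed.
        kind = "gpu" if family[0] in ("g", "p") else "cpu"
        result[kind].append(name)
    return result


# Instance types copied from the Universal Sentence Encoder CMLM Base row.
row = "ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge"
types = split_instance_types(row)
print("CPU:", types["cpu"])
print("GPU:", types["gpu"])
```

The first-letter check would need extending for accelerator families that do not appear in this part of the table.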

Classic ML Algorithms (11)

Amazon SageMaker JumpStart provides ready-to-use implementations of classical machine learning algorithms for tabular data tasks such as classification, regression, and time series forecasting. These algorithms (AutoGluon, CatBoost, LightGBM, TabTransformer, scikit-learn, and XGBoost) are optimized for structured data and can be trained from scratch on your own datasets.

| Provider | Model Name | Model ID | Task | License | Fine-tunable | Supported Inference Instance Types |
| --- | --- | --- | --- | --- | --- | --- |
| Autogluon | AutoGluon Tabular Classification | autogluon-classification-ensemble | Tabular Classification | apache-2.0 | Yes | ml.c5.18xlarge, ml.c5.2xlarge, ml.c5.4xlarge, ml.c5.9xlarge, ml.g4dn.xlarge, ml.m5.12xlarge, ml.m5.24xlarge, ml.m5.2xlarge, ml.m5.4xlarge, ml.p3.2xlarge |
| Autogluon | AutoGluon Tabular Regression | autogluon-regression-ensemble | Tabular Regression | apache-2.0 | Yes | ml.c5.18xlarge, ml.c5.2xlarge, ml.c5.4xlarge, ml.c5.9xlarge, ml.g4dn.xlarge, ml.m5.12xlarge, ml.m5.24xlarge, ml.m5.2xlarge, ml.m5.4xlarge, ml.p3.2xlarge |
| Catboost | CatBoost Classification | catboost-classification-model | Tabular Classification | apache-2.0 | Yes | ml.c5.18xlarge, ml.c5.2xlarge, ml.c5.4xlarge, ml.c5.9xlarge, ml.m4.16xlarge, ml.m5.12xlarge, ml.m5.24xlarge, ml.m5.2xlarge, ml.m5.4xlarge |
| Catboost | CatBoost Regression | catboost-regression-model | Tabular Regression | apache-2.0 | Yes | ml.c4.8xlarge, ml.c5.18xlarge, ml.c5.2xlarge, ml.c5.4xlarge, ml.c5.9xlarge, ml.m5.12xlarge, ml.m5.24xlarge, ml.m5.2xlarge, ml.m5.4xlarge |
| Lightgbm | LightGBM Classification | lightgbm-classification-model | Tabular Classification | mit | Yes | ml.c5.18xlarge, ml.c5.2xlarge, ml.c5.4xlarge, ml.c5.9xlarge, ml.m4.16xlarge, ml.m5.12xlarge, ml.m5.24xlarge, ml.m5.2xlarge, ml.m5.4xlarge |
| Lightgbm | LightGBM Regression | lightgbm-regression-model | Tabular Regression | mit | Yes | ml.c4.8xlarge, ml.c5.18xlarge, ml.c5.2xlarge, ml.c5.4xlarge, ml.c5.9xlarge, ml.m5.12xlarge, ml.m5.24xlarge, ml.m5.2xlarge, ml.m5.4xlarge |
| Pytorch | TabTransformer Classification | pytorch-tabtransformerclassification-model | Tabular Classification | apache-2.0 | Yes | ml.c5.18xlarge, ml.c5.2xlarge, ml.c5.4xlarge, ml.c5.9xlarge, ml.g4dn.xlarge, ml.m5.12xlarge, ml.m5.24xlarge, ml.m5.2xlarge, ml.m5.4xlarge, ml.p3.2xlarge |
| Pytorch | TabTransformer Regression | pytorch-tabtransformerregression-model | Tabular Regression | apache-2.0 | Yes | ml.c5.18xlarge, ml.c5.2xlarge, ml.c5.4xlarge, ml.c5.9xlarge, ml.g4dn.xlarge, ml.m5.12xlarge, ml.m5.24xlarge, ml.m5.2xlarge, ml.m5.4xlarge, ml.p3.2xlarge |
| Sklearn | Linear Classification | sklearn-classification-linear | Tabular Classification | bsd-3-clause | Yes | ml.c4.8xlarge, ml.c5.18xlarge, ml.c5.2xlarge, ml.c5.4xlarge, ml.c5.9xlarge, ml.m5.12xlarge, ml.m5.24xlarge, ml.m5.2xlarge, ml.m5.4xlarge, ml.m5.xlarge |
| Xgboost | XGBoost Classification | xgboost-classification-model | Tabular Classification | apache-2.0 | Yes | ml.c4.8xlarge, ml.c5.18xlarge, ml.c5.2xlarge, ml.c5.4xlarge, ml.c5.9xlarge, ml.m5.12xlarge, ml.m5.24xlarge, ml.m5.2xlarge, ml.m5.4xlarge, ml.m5.xlarge |
| Xgboost | XGBoost Regression | xgboost-regression-model | Tabular Regression | apache-2.0 | Yes | ml.c4.8xlarge, ml.c5.18xlarge, ml.c5.2xlarge, ml.c5.4xlarge, ml.c5.9xlarge, ml.m5.12xlarge, ml.m5.24xlarge, ml.m5.2xlarge, ml.m5.4xlarge, ml.m5.xlarge |
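Because the table is a snapshot, it can help to compare a listed row against the hub's current metadata. The sketch below reuses the `describe_hub_content` call and the `SupportedInferenceInstanceTypes` key from the examples at the top of this page. The helper `instance_type_drift` and the `FakeSageMakerClient` stand-in are hypothetical names introduced here for illustration, the stand-in's response is made up so the sketch runs offline, and the code assumes the metadata field is a list of strings.

```python
import json


def instance_type_drift(client, model_id, documented, hub_name="SageMakerPublicHub"):
    """Compare a documented instance-type list with the hub's current metadata."""
    response = client.describe_hub_content(
        HubName=hub_name, HubContentType="Model", HubContentName=model_id
    )
    doc = json.loads(response["HubContentDocument"])
    # Assumption: the field holds a list of instance-type strings.
    live = set(doc.get("SupportedInferenceInstanceTypes", []))
    listed = {name.strip() for name in documented.split(",") if name.strip()}
    return {"added": sorted(live - listed), "removed": sorted(listed - live)}


class FakeSageMakerClient:
    """Hypothetical offline stand-in for boto3.client("sagemaker")."""

    def describe_hub_content(self, **kwargs):
        # Made-up response for illustration only; not real hub data.
        document = {
            "SupportedInferenceInstanceTypes": ["ml.m5.xlarge", "ml.m5.2xlarge", "ml.m5.large"]
        }
        return {"HubContentDocument": json.dumps(document)}


# Shortened, illustrative list; paste the full cell from the table in practice.
documented = "ml.m5.2xlarge, ml.m5.4xlarge, ml.m5.xlarge"
drift = instance_type_drift(FakeSageMakerClient(), "xgboost-classification-model", documented)
print(drift)
```

To check against the live hub, pass `boto3.client("sagemaker", region_name="us-west-2")` in place of the stand-in. An empty `added` and `removed` result would indicate that the listed row still matches the hub.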

Note

For the most accurate and up-to-date licensing information, see the model detail page in SageMaker Studio or the model's Hugging Face page.