Available foundation models
Amazon SageMaker JumpStart offers state-of-the-art, built-in publicly available and proprietary foundation models to customize and integrate into your generative AI workflows.
To list all available JumpStart foundation models:
import boto3 sagemaker_client = boto3.client("sagemaker", region_name="us-west-2") # List all available models in the SageMakerPublicHub all_models = [] next_token = None while True: params = {"HubName": "SageMakerPublicHub", "HubContentType": "Model"} if next_token: params["NextToken"] = next_token response = sagemaker_client.list_hub_contents(**params) all_models.extend(response["HubContentSummaries"]) next_token = response.get("NextToken") if not next_token: break print(f"Total models available: {len(all_models)}") for model in all_models[:5]: print(model["HubContentName"], model["HubContentVersion"])
To get supported instance types and other metadata for a specific model:
import json import boto3 sagemaker_client = boto3.client("sagemaker", region_name="us-west-2") model_id = "meta-textgeneration-llama-3-3-70b-instruct" response = sagemaker_client.describe_hub_content( HubName="SageMakerPublicHub", HubContentType="Model", HubContentName=model_id ) # Parse the model metadata model_doc = json.loads(response["HubContentDocument"]) print(f"Supported instances: {model_doc['SupportedInferenceInstanceTypes']}") print(f"Default instance: {model_doc['DefaultInferenceInstanceType']}")
The following tables list available models with their license and supported inference instance types as of April 30, 2026. For the most current information, use the API examples above or see the model hub in SageMaker Studio.
- Open-Weight Models (417)
-
Open-weight foundation models are publicly available models from providers such as Meta, Google, Mistral AI, and others. You can deploy, fine-tune, and customize these models for your use case.
To get started with one of these models, see JumpStart foundation model usage or explore one of the available Example notebooks. In a given example notebook, try switching out the model ID to experiment with different models within the same model family.
For more information on model IDs and resources on deploying publicly available JumpStart foundation models with the SageMaker Python SDK, see Use foundation models with the SageMaker Python SDK.
Provider Model Name Model ID Task License Fine-tunable Supported Inference Instance Types 01-Ai
Yi-1.5-34B
huggingface-llm-yi-1-5-34bText Generation
apache-2.0
No
ml.g5.12xlarge, ml.g5.24xlarge, ml.g5.48xlarge, ml.p4d.24xlarge
01-Ai
Yi-1.5-9B
huggingface-llm-yi-1-5-9bText Generation
apache-2.0
No
ml.g5.12xlarge, ml.g5.16xlarge, ml.g5.24xlarge, ml.g5.2xlarge, ml.g5.48xlarge, ml.g5.4xlarge, ml.g5.8xlarge, ml.p4d.24xlarge, ml.p4de.24xlarge, ml.p5.48xlarge
Ahxt
Lite Llama 460M 1T
huggingface-llm-ahxt-litellama-460m-1tText Generation
mit
No
ml.g5.2xlarge
Ai-Forever
Multilingual GPT
huggingface-llm-ai-forever-mgptText Generation
apache-2.0
No
ml.g5.2xlarge
Aisingapore
Llama3 8B SEA-Lion v2.1 Instruct
huggingface-llm-llama3-8b-sealionv21-instructText Generation
llama3
No
ml.g5.12xlarge, ml.g5.24xlarge, ml.g5.48xlarge, ml.g5.4xlarge, ml.g5.8xlarge, ml.g6.12xlarge, ml.g6.24xlarge, ml.g6.48xlarge, ml.g6.4xlarge, ml.g6.8xlarge, ml.p4d.24xlarge, ml.p5.48xlarge
Aisingapore
SEA-LION 3B
huggingface-llm-sealion-3bText Generation
mit
No
ml.g4dn.2xlarge, ml.g4dn.xlarge, ml.g5.2xlarge, ml.g5.4xlarge, ml.g5.xlarge, ml.p3.8xlarge
Aisingapore
SEA-LION 7B
huggingface-llm-sealion-7bText Generation
mit
No
ml.g4dn.12xlarge, ml.g5.12xlarge, ml.g5.24xlarge, ml.g5.48xlarge, ml.p4d.24xlarge
Aisingapore
SEA-LION 7B Instruct
huggingface-llm-sealion-7b-instructText Generation
mit
No
ml.g4dn.12xlarge, ml.g5.12xlarge, ml.g5.24xlarge, ml.g5.48xlarge, ml.p4d.24xlarge
Alibaba-Nlp
Alibaba-NLP gte-Qwen2-7B-instruct
huggingface-textembedding-gte-qwen2-7b-instructText Embedding
Apache-2.0
No
ml.g5.12xlarge, ml.g5.24xlarge, ml.g5.2xlarge, ml.g5.48xlarge, ml.g5.4xlarge, ml.g5.8xlarge, ml.g5.xlarge, ml.g6.12xlarge, ml.g6.24xlarge, ml.g6.2xlarge, ml.g6.48xlarge, ml.g6.4xlarge, ml.g6.8xlarge, ml.g6.xlarge, ml.p4d.24xlarge
Allenai
Olmo-3-7B-Instruct
huggingface-textgeneration-olmo-3-7b-instructText Generation
apache-2.0
No
ml.g6e.24xlarge
Amazon
Chronos 2
pytorch-forecasting-chronos-2Time Series Forecasting
apache-2.0
No
ml.c5.2xlarge, ml.c5.4xlarge, ml.c5.xlarge, ml.g4dn.2xlarge, ml.g4dn.4xlarge, ml.g4dn.xlarge, ml.g5.2xlarge, ml.g5.xlarge, ml.g6.2xlarge, ml.g6.xlarge, ml.g6e.2xlarge, ml.g6e.xlarge, ml.m5.2xlarge, ml.m5.4xlarge, ml.m5.xlarge
Amazon
Falcon Lite
huggingface-llm-amazon-falconliteText Generation
apache-2.0
No
ml.g5.12xlarge, ml.g5.24xlarge, ml.g5.48xlarge
Amazon
Falcon Lite 2
huggingface-llm-amazon-falconlite2Text Generation
apache-2.0
No
ml.g5.12xlarge, ml.g5.24xlarge, ml.g5.48xlarge
Amazon
Mistral Lite
huggingface-llm-amazon-mistralliteText Generation
apache-2.0
No
ml.g5.12xlarge, ml.g5.24xlarge, ml.g5.2xlarge, ml.g5.48xlarge, ml.p4d.24xlarge
Autogluon
Chronos-Bolt (Base)
autogluon-forecasting-chronos-bolt-baseTime Series Forecasting
apache-2.0
No
ml.c5.2xlarge, ml.c5.4xlarge, ml.c5.xlarge, ml.g4dn.2xlarge, ml.g4dn.4xlarge, ml.g4dn.xlarge, ml.g5.2xlarge, ml.g5.4xlarge, ml.g5.xlarge, ml.m5.2xlarge, ml.m5.4xlarge, ml.m5.xlarge, ml.p3.2xlarge
Autogluon
Chronos-Bolt (Small)
autogluon-forecasting-chronos-bolt-smallTime Series Forecasting
apache-2.0
No
ml.c5.2xlarge, ml.c5.4xlarge, ml.c5.xlarge, ml.g4dn.2xlarge, ml.g4dn.4xlarge, ml.g4dn.xlarge, ml.g5.2xlarge, ml.g5.4xlarge, ml.g5.xlarge, ml.m5.2xlarge, ml.m5.4xlarge, ml.m5.xlarge, ml.p3.2xlarge
Autogluon
Chronos-T5 (Base)
autogluon-forecasting-chronos-t5-baseTime Series Forecasting
apache-2.0
No
ml.g4dn.2xlarge, ml.g4dn.4xlarge, ml.g4dn.xlarge, ml.g5.2xlarge, ml.g5.4xlarge, ml.g5.xlarge, ml.p3.2xlarge
Autogluon
Chronos-T5 (Large)
autogluon-forecasting-chronos-t5-largeTime Series Forecasting
apache-2.0
No
ml.g4dn.2xlarge, ml.g4dn.4xlarge, ml.g4dn.xlarge, ml.g5.2xlarge, ml.g5.4xlarge, ml.g5.xlarge, ml.p3.2xlarge
Autogluon
Chronos-T5 (Small)
autogluon-forecasting-chronos-t5-smallTime Series Forecasting
apache-2.0
No
ml.g4dn.2xlarge, ml.g4dn.4xlarge, ml.g4dn.xlarge, ml.g5.2xlarge, ml.g5.4xlarge, ml.g5.xlarge, ml.p3.2xlarge
Baai
BAAI bge-base-en-v1.5
huggingface-textembedding-bge-base-en-v1-5Text Embedding
MIT
No
ml.g5.12xlarge, ml.g5.24xlarge, ml.g5.2xlarge, ml.g5.48xlarge, ml.g5.4xlarge, ml.g5.8xlarge, ml.g5.xlarge, ml.g6.12xlarge, ml.g6.24xlarge, ml.g6.2xlarge, ml.g6.48xlarge, ml.g6.4xlarge, ml.g6.8xlarge, ml.g6.xlarge, ml.p4d.24xlarge
Baai
BGE Base En
huggingface-sentencesimilarity-bge-base-enSentence Similarity
mit
Yes
ml.c6i.xlarge, ml.g4dn.2xlarge, ml.g5.2xlarge, ml.g5.4xlarge, ml.g5.8xlarge, ml.p3.2xlarge
Baai
BGE Base En V1.5
huggingface-sentencesimilarity-bge-base-en-v1-5Sentence Similarity
mit
Yes
ml.c6i.xlarge, ml.g4dn.2xlarge, ml.g5.2xlarge, ml.g5.4xlarge, ml.g5.8xlarge, ml.p3.2xlarge
Baai
BGE Large En
huggingface-sentencesimilarity-bge-large-enSentence Similarity
mit
Yes
ml.c6i.xlarge, ml.g4dn.2xlarge, ml.g5.2xlarge, ml.g5.4xlarge, ml.g5.8xlarge, ml.p3.2xlarge
Baai
BGE Large En V1.5
huggingface-sentencesimilarity-bge-large-en-v1-5Sentence Similarity
mit
Yes
ml.c6i.xlarge, ml.g4dn.2xlarge, ml.g5.2xlarge, ml.g5.4xlarge, ml.g5.8xlarge, ml.p3.2xlarge
Baai
BGE M3
huggingface-sentencesimilarity-bge-m3Sentence Similarity
mit
Yes
ml.c6i.xlarge, ml.g4dn.2xlarge, ml.g5.2xlarge, ml.g5.4xlarge, ml.g5.8xlarge, ml.p3.2xlarge
Baai
BGE Small En
huggingface-sentencesimilarity-bge-small-enSentence Similarity
mit
Yes
ml.c6i.xlarge, ml.g4dn.2xlarge, ml.g5.2xlarge, ml.g5.4xlarge, ml.g5.8xlarge, ml.p3.2xlarge
Baai
BGE Small En V1.5
huggingface-sentencesimilarity-bge-small-en-v1-5Sentence Similarity
mit
Yes
ml.c6i.xlarge, ml.g4dn.2xlarge, ml.g5.2xlarge, ml.g5.4xlarge, ml.g5.8xlarge, ml.p3.2xlarge
Berkeley-Nest
Starling LM 7B alpha
huggingface-llm-berkeley-nest-starling-lm-7b-alphaText Generation
apache-2.0
No
ml.g5.12xlarge, ml.g5.24xlarge, ml.g5.2xlarge, ml.g5.48xlarge
Bharatgenai
Param2-17B-A2.4B-Thinking
huggingface-llm-param2-17b-a2-4b-thinkingText Generation
No
ml.g7e.2xlarge, ml.g7e.4xlarge, ml.g7e.8xlarge
Bigcode
StarCoder
huggingface-llm-starcoderText Generation
bigcode-openrail-m
No
ml.g5.12xlarge, ml.g5.24xlarge, ml.g5.48xlarge, ml.p4d.24xlarge
Bigcode
StarCoderBase
huggingface-llm-starcoderbaseText Generation
bigcode-openrail-m
No
ml.g5.12xlarge, ml.g5.24xlarge, ml.g5.48xlarge, ml.p4d.24xlarge
Bigscience
Bloom 1b1
huggingface-textgeneration-bloom-1b1Text Generation
bigscience-bloom-rail-1.0
No
ml.g5.16xlarge, ml.g5.2xlarge, ml.g5.4xlarge, ml.g5.8xlarge
Bigscience
Bloom 3B
huggingface-textgeneration1-bloom-3bText Generation
bigscience-bloom-rail-1.0
Yes
ml.g5.16xlarge, ml.g5.2xlarge, ml.g5.4xlarge, ml.g5.8xlarge
Bigscience
Bloom 7B1
huggingface-textgeneration1-bloom-7b1Text Generation
bigscience-bloom-rail-1.0
Yes
ml.g5.16xlarge, ml.g5.2xlarge, ml.g5.4xlarge, ml.g5.8xlarge
Bigscience
Bloomz 1b7
huggingface-textgeneration-bloomz-1b7Text Generation
bigscience-bloom-rail-1.0
No
ml.g5.16xlarge, ml.g5.2xlarge, ml.g5.4xlarge, ml.g5.8xlarge
Bigscience
BloomZ 7B1 FP16
huggingface-textgeneration1-bloomz-7b1-fp16Text Generation
bigscience-bloom-rail-1.0
Yes
ml.g5.16xlarge, ml.g5.2xlarge, ml.g5.4xlarge, ml.g5.8xlarge
Bingsu
Bingsu my korean stable diffusion v1 5
huggingface-txt2img-bingsu-my-korean-stable-diffusion-v1-5Text-to-Image
creativeml-openrail-m
No
ml.g4dn.2xlarge, ml.g4dn.xlarge, ml.p3.2xlarge
Black-Forest-Labs
Black Forest Labs FLUX.1 [schnell]
huggingface-txt2img-black-forest-labs-flux-1-schnellText-to-Image
apache-2.0
No
ml.g5.12xlarge, ml.g5.24xlarge, ml.g5.48xlarge, ml.p4d.24xlarge, ml.p5.48xlarge
Cognitivecomputations
Dolphin 2.2.1 Mistral 7B
huggingface-llm-dolphin-2-2-1-mistral-7bText Generation
apache-2.0
No
ml.g5.12xlarge, ml.g5.24xlarge, ml.g5.2xlarge, ml.g5.48xlarge
Cognitivecomputations
Dolphin 2.5 Mixtral 8 7B
huggingface-llm-dolphin-2-5-mixtral-8x7bText Generation
apache-2.0
No
ml.g5.48xlarge
Cognitivecomputations
Dolphin 2.7 Mixtral 8 7B
huggingface-llm-dolphin-2-7-mixtral-8x7bText Generation
apache-2.0
No
ml.g5.48xlarge
Cognitivecomputations
Dolphin 2.9 Llama 3 8b
huggingface-llm-cognitive-dolphin-29-llama3-8bText Generation
other
No
ml.g5.16xlarge, ml.g5.2xlarge, ml.g5.4xlarge, ml.g5.8xlarge
Cohereforai
Aya 101
huggingface-llm-aya-101Text Generation
apache-2.0
No
ml.g5.12xlarge, ml.g5.24xlarge, ml.g5.48xlarge, ml.p4d.24xlarge
Cross-Encoder
Cross Encoder Nli Deberta Base
huggingface-zstc-cross-encoder-nli-deberta-baseZero-Shot Classification
apache-2.0
No
ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge
Cross-Encoder
Cross Encoder Nli Distilroberta Base
huggingface-zstc-cross-encoder-nli-distilroberta-baseZero-Shot Classification
apache-2.0
No
ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge
Cross-Encoder
Cross Encoder Nli MiniLM2 L6 H768
huggingface-zstc-cross-encoder-nli-minilm2-l6-h768Zero-Shot Classification
apache-2.0
No
ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge
Cross-Encoder
Cross Encoder Nli Roberta Base
huggingface-zstc-cross-encoder-nli-roberta-baseZero-Shot Classification
apache-2.0
No
ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge
Cyberagent
CyberAgentLM2-7B-Chat (CALM2-7B-Chat)
huggingface-llm-calm2-7b-chat-bf16Text Generation
apache-2.0
Yes
ml.g5.12xlarge, ml.g5.24xlarge, ml.g5.2xlarge, ml.g5.4xlarge, ml.g5.xlarge
Databricks
DBRX Base
huggingface-llm-dbrx-baseText Generation
databricks-open-model
No
ml.p4d.24xlarge, ml.p4de.24xlarge, ml.p5.48xlarge
Databricks
DBRX Instruct
huggingface-llm-dbrx-instructText Generation
databricks-open-model
No
ml.p4d.24xlarge, ml.p4de.24xlarge, ml.p5.48xlarge
Databricks
Dolly V2 12b BF16
huggingface-textgeneration-dolly-v2-12b-bf16Text Generation
apache-2.0
No
ml.g5.12xlarge, ml.g5.24xlarge, ml.g5.48xlarge, ml.p4d.24xlarge
Databricks
Dolly V2 3b BF16
huggingface-textgeneration-dolly-v2-3b-bf16Text Generation
apache-2.0
No
ml.g5.16xlarge, ml.g5.2xlarge, ml.g5.4xlarge, ml.g5.8xlarge
Deepseek-Ai
DeepSeek-OCR
deepseek-vlm-deepseek-ocrImage-Text-to-Text
mit
No
ml.g4dn.16xlarge, ml.g4dn.2xlarge, ml.g4dn.4xlarge, ml.g4dn.8xlarge, ml.g4dn.xlarge, ml.g5.16xlarge, ml.g5.2xlarge, ml.g5.4xlarge, ml.g5.8xlarge, ml.g5.xlarge, ml.g6.16xlarge, ml.g6.2xlarge, ml.g6.4xlarge, ml.g6.8xlarge, ml.g6.xlarge, ml.g6e.16xlarge, ml.g6e.2xlarge, ml.g6e.4xlarge, ml.g6e.8xlarge, ml.g6e.xlarge, ml.p3.2xlarge
Deepseek-Ai
DeepSeek-R1
deepseek-llm-r1Text Generation
mit
No
ml.p5en.48xlarge
Deepseek-Ai
DeepSeek-R1-0528
deepseek-llm-r1-0528Text Generation
mit
No
ml.p5en.48xlarge
Deepseek-Ai
DeepSeek-R1-Distill-Llama-70B
deepseek-llm-r1-distill-llama-70bText Generation
mit
Yes
ml.g5.48xlarge, ml.g6.48xlarge, ml.p4d.24xlarge
Deepseek-Ai
DeepSeek-R1-Distill-Llama-8B
deepseek-llm-r1-distill-llama-8bText Generation
mit
Yes
ml.g5.12xlarge, ml.g5.24xlarge, ml.g5.48xlarge, ml.g6.12xlarge, ml.g6.24xlarge, ml.g6.48xlarge
Deepseek-Ai
DeepSeek-R1-Distill-Qwen-1.5B
deepseek-llm-r1-distill-qwen-1-5bText Generation
mit
Yes
ml.g5.12xlarge, ml.g5.16xlarge, ml.g5.24xlarge, ml.g5.2xlarge, ml.g5.48xlarge, ml.g5.4xlarge, ml.g5.8xlarge, ml.g6.12xlarge, ml.g6.16xlarge, ml.g6.24xlarge, ml.g6.2xlarge, ml.g6.48xlarge, ml.g6.4xlarge, ml.g6.8xlarge
Deepseek-Ai
DeepSeek-R1-Distill-Qwen-14B
deepseek-llm-r1-distill-qwen-14bText Generation
mit
Yes
ml.g5.12xlarge, ml.g5.24xlarge, ml.g5.48xlarge, ml.g6.12xlarge, ml.g6.24xlarge, ml.g6.48xlarge
Deepseek-Ai
DeepSeek-R1-Distill-Qwen-32B
deepseek-llm-r1-distill-qwen-32bText Generation
mit
Yes
ml.g5.48xlarge, ml.g6.48xlarge, ml.p4d.24xlarge
Deepseek-Ai
DeepSeek-R1-Distill-Qwen-7B
deepseek-llm-r1-distill-qwen-7bText Generation
mit
Yes
ml.g5.12xlarge, ml.g5.24xlarge, ml.g5.48xlarge, ml.g6.12xlarge, ml.g6.24xlarge, ml.g6.48xlarge
Deepseek-Ai
DeepSeek-V3.1
deepseek-llm-deepseek-v3-1Text Generation
mit
No
ml.p5en.48xlarge
Deepseek-Ai
DeepSeek-V3.2
deepseek-llm-deepseek-v3-2Text Generation
mit
No
ml.p5en.48xlarge
Distilbert
DistilBERT Base Cased
huggingface-tc-distilbert-base-casedText Classification
apache-2.0
Yes
ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge
Distilbert
DistilBERT Base Cased
huggingface-spc-distilbert-base-casedFill-Mask
apache-2.0
Yes
ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge
Distilbert
DistilBERT Base Cased
huggingface-eqa-distilbert-base-casedQuestion Answering
apache-2.0
Yes
ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge
Distilbert
DistilBERT Base Multilingual Cased
huggingface-tc-distilbert-base-multilingual-casedText Classification
apache-2.0
Yes
ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge
Distilbert
DistilBERT Base Multilingual Cased
huggingface-spc-distilbert-base-multilingual-casedFill-Mask
apache-2.0
Yes
ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge
Distilbert
DistilBERT Base Multilingual Cased
huggingface-eqa-distilbert-base-multilingual-casedQuestion Answering
apache-2.0
Yes
ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge
Distilbert
DistilGPT 2
huggingface-textgeneration-distilgpt2Text Generation
apache-2.0
No
ml.g5.16xlarge, ml.g5.2xlarge, ml.g5.4xlarge, ml.g5.8xlarge
Distilbert
DistilRoBERTa Base
huggingface-tc-distilroberta-baseText Classification
apache-2.0
Yes
ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge
Distilbert
DistilRoBERTa Base
huggingface-eqa-distilroberta-baseQuestion Answering
apache-2.0
Yes
ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge
Distilbert
Question Answering
huggingface-eqa-distilbert-base-uncasedQuestion Answering
apache-2.0
Yes
ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge
Distilbert
Sentence Pair Classification
huggingface-spc-distilbert-base-uncasedFill-Mask
apache-2.0
Yes
ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge
Distilbert
Text Classification
huggingface-tc-distilbert-base-uncasedText Classification
apache-2.0
Yes
ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge
Elastic
Distilbert Base Uncased
huggingface-ner-distilbert-base-uncased-finetuned-conll03-engToken Classification
apache-2.0
No
ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge
Elastic
Named Entity Recognition
huggingface-ner-distilbert-base-cased-finetuned-conll03-englishToken Classification
apache-2.0
No
ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge
Elastic
Named Entity Recognition
huggingface-ner-distilbert-base-cased-finetuned-conll03-engToken Classification
apache-2.0
No
ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge
Eleutherai
GPT-J 6B
huggingface-textgeneration1-gpt-j-6bText Generation
apache-2.0
Yes
ml.g5.12xlarge, ml.g5.2xlarge, ml.g5.4xlarge, ml.g5.8xlarge
Eleutherai
GPT-Neo 125M
huggingface-textgeneration1-gpt-neo-125mText Generation
mit
Yes
ml.g5.16xlarge, ml.g5.2xlarge, ml.g5.4xlarge, ml.g5.8xlarge
Eleutherai
gpt-neox-20b
huggingface-textgeneration2-gpt-neox-20b-fp16Text Generation
Apache-2.0
No
ml.g4dn.12xlarge, ml.g5.12xlarge, ml.g5.24xlarge, ml.g5.48xlarge, ml.g6.12xlarge, ml.g6.24xlarge, ml.g6.48xlarge, ml.g6e.12xlarge, ml.g6e.24xlarge, ml.p4d.24xlarge
Eleutherai
Pythia 160m Deduped
huggingface-llm-eleutherai-pythia-160m-dedupedText Generation
apache-2.0
No
ml.g5.12xlarge, ml.g5.24xlarge, ml.g5.2xlarge
Eleutherai
Pythia 7m Deduped
huggingface-llm-eleutherai-pythia-70m-dedupedText Generation
apache-2.0
No
ml.g5.12xlarge, ml.g5.24xlarge, ml.g5.2xlarge, ml.g5.48xlarge
Elyza
ELYZA-japanese-Llama-2-13b-chat
huggingface-llm-elyza-japanese-llama-2-13b-chatText Generation
llama2
Yes
ml.g5.12xlarge, ml.g5.24xlarge, ml.g5.48xlarge, ml.p4d.24xlarge
Elyza
ELYZA-japanese-Llama-2-13b-fast-chat
huggingface-llm-elyza-japanese-llama-2-13b-fast-chatText Generation
llama2
Yes
ml.g5.12xlarge, ml.g5.24xlarge, ml.g5.48xlarge, ml.p4d.24xlarge
Elyza
ELYZA-japanese-Llama-2-7b-chat
huggingface-llm-elyza-japanese-llama-2-7b-chat-bf16Text Generation
llama2
Yes
ml.g5.12xlarge, ml.g5.24xlarge, ml.g5.2xlarge, ml.g5.48xlarge, ml.g5.4xlarge, ml.g5.xlarge
Elyza
ELYZA-japanese-Llama-2-7b-fast-chat
huggingface-llm-elyza-japanese-llama-2-7b-fast-chat-bf16Text Generation
llama2
Yes
ml.g5.12xlarge, ml.g5.24xlarge, ml.g5.2xlarge, ml.g5.48xlarge, ml.g5.4xlarge, ml.g5.xlarge
Emrecan
Emrecan Bert Base Turkish Cased Allnli Tr
huggingface-zstc-emrecan-bert-base-turkish-cased-allnli-trZero-Shot Classification
mit
No
ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge
Facebook
Zero-Shot Text Classification
huggingface-zstc-facebook-bart-large-mnliZero-Shot Classification
mit
No
ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge
Facebookai
RoBERTa Base
huggingface-tc-roberta-baseText Classification
mit
Yes
ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge
Facebookai
RoBERTa Base
huggingface-spc-roberta-baseFill-Mask
mit
Yes
ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge
Facebookai
RoBERTa Base
huggingface-eqa-roberta-baseQuestion Answering
mit
Yes
ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge
Facebookai
RoBERTa Large
huggingface-tc-roberta-largeText Classification
mit
Yes
ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge
Facebookai
RoBERTa Large
huggingface-spc-roberta-largeFill-Mask
mit
Yes
ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge
Facebookai
RoBERTa Large
huggingface-eqa-roberta-largeQuestion Answering
mit
Yes
ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge
Facebookai
XLM CLM English-German
huggingface-tc-xlm-clm-ende-1024Text Classification
mit
Yes
ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge
Facebookai
XLM CLM English-German
huggingface-spc-xlm-clm-ende-1024Fill-Mask
mit
Yes
ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge
Facebookai
XLM MLM 15 XNLI Languages
huggingface-spc-xlm-mlm-xnli15-1024Fill-Mask
cc-by-nc-4.0
Yes
ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge
Facebookai
XLM MLM English-German
huggingface-tc-xlm-mlm-ende-1024Text Classification
cc-by-nc-4.0
Yes
ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge
Facebookai
XLM MLM English-German
huggingface-spc-xlm-mlm-ende-1024Fill-Mask
cc-by-nc-4.0
Yes
ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge
Facebookai
XLM MLM English-Romanian
huggingface-tc-xlm-mlm-enro-1024Text Classification
cc-by-nc-4.0
Yes
ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge
Facebookai
XLM MLM English-Romanian
huggingface-spc-xlm-mlm-enro-1024Fill-Mask
cc-by-nc-4.0
Yes
ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge
Facebookai
XLM MLM TLM 15 XNLI Languages
huggingface-tc-xlm-mlm-tlm-xnli15-1024Text Classification
cc-by-nc-4.0
Yes
ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge
Facebookai
XLM MLM TLM 15 XNLI Languages
huggingface-spc-xlm-mlm-tlm-xnli15-1024Fill-Mask
cc-by-nc-4.0
Yes
ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge
Fictiverse
Fictiverse Stable Diffusion BalloonArt Model
huggingface-txt2img-fictiverse-stable-diffusion-balloonartText-to-Image
creativeml-openrail-m
No
ml.g4dn.4xlarge, ml.g4dn.8xlarge, ml.g5.2xlarge, ml.g5.4xlarge, ml.g5.8xlarge, ml.p3.2xlarge
Fictiverse
Fictiverse Stable Diffusion Microscopic model
huggingface-txt2img-fictiverse-stable-diffusion-micro-modelText-to-Image
openrail
No
ml.g4dn.2xlarge, ml.g4dn.xlarge, ml.p3.2xlarge
Fictiverse
Fictiverse Stable Diffusion PaperCut Model
huggingface-txt2img-fictiverse-stable-diffusion-papercut-modelText-to-Image
creativeml-openrail-m
No
ml.g4dn.2xlarge, ml.g4dn.xlarge, ml.p3.2xlarge
Fictiverse
Fictiverse Stable Diffusion VoxelArt Model
huggingface-txt2img-fictiverse-stable-diffusion-voxelart-modelText-to-Image
creativeml-openrail-m
No
ml.g4dn.4xlarge, ml.g4dn.8xlarge, ml.g5.2xlarge, ml.g5.4xlarge, ml.g5.8xlarge, ml.p3.2xlarge
Google
Bigbird Pegasus Large Arxiv
huggingface-summarization-bigbird-pegasus-large-arxivText Summarization
apache-2.0
No
ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge
Google
Bigbird Pegasus Large Pubmed
huggingface-summarization-bigbird-pegasus-large-pubmedText Summarization
apache-2.0
No
ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge
Google
Flan-T5 Base
huggingface-text2text-flan-t5-baseText2Text Generation
apache-2.0
Yes
ml.g5.16xlarge, ml.g5.2xlarge, ml.g5.4xlarge, ml.g5.8xlarge
Google
Flan-T5 Large
huggingface-text2text-flan-t5-largeText2Text Generation
apache-2.0
Yes
ml.g5.16xlarge, ml.g5.2xlarge, ml.g5.4xlarge, ml.g5.8xlarge
Google
Flan-T5 Small
huggingface-text2text-flan-t5-smallText2Text Generation
apache-2.0
Yes
ml.g5.16xlarge, ml.g5.2xlarge, ml.g5.4xlarge, ml.g5.8xlarge
Google
Flan-T5 XL
huggingface-text2text-flan-t5-xlText2Text Generation
apache-2.0
Yes
ml.g5.16xlarge, ml.g5.2xlarge, ml.g5.4xlarge, ml.g5.8xlarge
Google
Flan-T5 XXL
huggingface-text2text-flan-t5-xxlText2Text Generation
apache-2.0
Yes
ml.g5.12xlarge, ml.g5.24xlarge, ml.g5.48xlarge, ml.p4d.24xlarge
Google
Gemma 2 27B
huggingface-llm-gemma-2-27bText Generation
gemma
No
ml.g5.48xlarge, ml.p4d.24xlarge
Google
Gemma 2 27B Instruct
huggingface-llm-gemma-2-27b-instructText Generation
gemma
No
ml.g5.48xlarge, ml.p4d.24xlarge
Google
Gemma 2 2B
huggingface-llm-gemma-2-2bText Generation
gemma
No
ml.g5.12xlarge, ml.g5.16xlarge, ml.g5.24xlarge, ml.g5.2xlarge, ml.g5.48xlarge, ml.g5.4xlarge, ml.g5.8xlarge, ml.g5.xlarge, ml.p4de.24xlarge, ml.p5.48xlarge
Google
Gemma 2 2B Instruct
huggingface-llm-gemma-2-2b-instructText Generation
gemma
No
ml.g5.12xlarge, ml.g5.16xlarge, ml.g5.24xlarge, ml.g5.2xlarge, ml.g5.48xlarge, ml.g5.4xlarge, ml.g5.8xlarge, ml.g5.xlarge, ml.p4de.24xlarge, ml.p5.48xlarge
Google
Gemma 2 9B
huggingface-llm-gemma-2-9bText Generation
gemma
No
ml.g5.12xlarge, ml.g5.24xlarge, ml.g5.48xlarge, ml.p4d.24xlarge
Google
Gemma 2 9B Instruct
huggingface-llm-gemma-2-9b-instructText Generation
gemma
No
ml.g5.12xlarge, ml.g5.24xlarge, ml.g5.48xlarge, ml.p4d.24xlarge
Google
Gemma 2B Instruct
huggingface-llm-gemma-2b-instructText Generation
gemma
Yes
ml.g5.12xlarge, ml.g5.16xlarge, ml.g5.24xlarge, ml.g5.2xlarge, ml.g5.48xlarge, ml.g5.4xlarge, ml.g5.8xlarge, ml.g5.xlarge, ml.p4de.24xlarge, ml.p5.48xlarge
Google
Gemma 3 1B Instruct
huggingface-llm-gemma-3-1b-instructText Generation
gemma
No
ml.g5.16xlarge, ml.g5.2xlarge, ml.g5.4xlarge, ml.g5.8xlarge
Google
Gemma 3 27B Instruct
huggingface-vlm-gemma-3-27b-instructImage-Text-to-Text
gemma
No
ml.g5.48xlarge, ml.g6.48xlarge, ml.p4d.24xlarge, ml.p5.48xlarge
Google
Gemma 3 4B Instruct
huggingface-vlm-gemma-3-4b-instructImage-Text-to-Text
gemma
No
ml.g5.16xlarge, ml.g5.2xlarge, ml.g5.4xlarge, ml.g5.8xlarge
Google
gemma-2b
huggingface-llm-gemma-2bText Generation
gemma
Yes
ml.g4dn.xlarge, ml.g5.xlarge, ml.g6.xlarge, ml.g6e.xlarge, ml.p3.2xlarge, ml.p4d.24xlarge
Google
gemma-4-26b-a4b-it
huggingface-vlm-gemma-4-26b-a4b-itText Generation
apache-2.0
No
ml.g6.12xlarge, ml.g6.24xlarge, ml.g6e.12xlarge, ml.g6e.24xlarge, ml.p5.48xlarge, ml.p5en.48xlarge
Google
gemma-4-31b-it
huggingface-vlm-gemma-4-31b-itImage-Text-to-Text
Apache-2.0
No
ml.g6.24xlarge
Google
gemma-4-e4b-it
huggingface-vlm-gemma-4-e4b-itText Generation
apache-2.0
No
ml.g6.12xlarge, ml.g6e.2xlarge, ml.g6e.4xlarge, ml.g6e.xlarge, ml.p5.48xlarge, ml.p5en.48xlarge
Google
gemma-7b
huggingface-llm-gemma-7bText Generation
gemma
Yes
ml.g6e.48xlarge, ml.p4d.24xlarge, ml.p5en.48xlarge
Google
gemma-7b-it
huggingface-llm-gemma-7b-instructText Generation
gemma
Yes
ml.g5.48xlarge, ml.g6.48xlarge, ml.g6e.12xlarge, ml.g6e.24xlarge, ml.p4d.24xlarge, ml.p5en.48xlarge
Google-Bert
BERT Base Cased
huggingface-tc-bert-base-casedText Classification
apache-2.0
Yes
ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge
Google-Bert
BERT Base Cased
huggingface-spc-bert-base-casedFill-Mask
apache-2.0
Yes
ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge
Google-Bert
BERT Base Cased
huggingface-eqa-bert-base-casedQuestion Answering
apache-2.0
Yes
ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge
Google-Bert
BERT Base Multilingual Cased
huggingface-tc-bert-base-multilingual-casedText Classification
apache-2.0
Yes
ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge
Google-Bert
BERT Base Multilingual Cased
huggingface-spc-bert-base-multilingual-casedFill-Mask
apache-2.0
Yes
ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge
Google-Bert
BERT Base Multilingual Cased
huggingface-eqa-bert-base-multilingual-casedQuestion Answering
apache-2.0
Yes
ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge
Google-Bert
BERT Base Multilingual Uncased
huggingface-tc-bert-base-multilingual-uncasedText Classification
apache-2.0
Yes
ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge
Google-Bert
BERT Base Multilingual Uncased
huggingface-spc-bert-base-multilingual-uncasedFill-Mask
apache-2.0
Yes
ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge
Google-Bert
BERT Base Multilingual Uncased
huggingface-eqa-bert-base-multilingual-uncasedQuestion Answering
apache-2.0
Yes
ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge
Google-Bert
BERT Base Uncased
huggingface-tc-bert-base-uncasedText Classification
apache-2.0
Yes
ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge
Google-Bert
BERT Base Uncased
huggingface-spc-bert-base-uncasedFill-Mask
apache-2.0
Yes
ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge
Google-Bert
BERT Base Uncased
huggingface-eqa-bert-base-uncasedQuestion Answering
apache-2.0
Yes
ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge
Google-Bert
BERT Large Cased
huggingface-tc-bert-large-casedText Classification
apache-2.0
Yes
ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge
Google-Bert
BERT Large Cased
huggingface-spc-bert-large-casedFill-Mask
apache-2.0
Yes
ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge
Google-Bert
BERT Large Cased
huggingface-eqa-bert-large-casedQuestion Answering
apache-2.0
Yes
ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge
Google-Bert
BERT Large Cased Whole Word Masking
huggingface-spc-bert-large-cased-whole-word-maskingFill-Mask
apache-2.0
Yes
ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge
Google-Bert
BERT Large Cased Whole Word Masking
huggingface-eqa-bert-large-cased-whole-word-maskingQuestion Answering
apache-2.0
Yes
ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge
Google-Bert
BERT Large Uncased
huggingface-tc-bert-large-uncasedText Classification
apache-2.0
Yes
ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge
Google-Bert
BERT Large Uncased
huggingface-spc-bert-large-uncasedFill-Mask
apache-2.0
Yes
ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge
Google-Bert
BERT Large Uncased Whole Word Masking
huggingface-spc-bert-large-uncased-whole-word-maskingFill-Mask
apache-2.0
Yes
ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge
Google-Bert
Fill Mask
huggingface-fillmask-bert-base-uncasedFill-Mask
apache-2.0
Yes
ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge
Google-T5
Machine Translation
huggingface-translation-t5-smallTranslation
apache-2.0
No
ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge
Google-T5
t5 Base en de
huggingface-translation-t5-baseTranslation
apache-2.0
No
ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge
Google-T5
t5 Large en de
huggingface-translation-t5-largeTranslation
apache-2.0
No
ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge
Gradientai
Llama-3 8B Gradient Instruct 1048k
huggingface-llm-llama-3-8b-instruct-gradientText Generation
llama3
No
ml.g5.12xlarge, ml.g5.16xlarge, ml.g5.24xlarge, ml.g5.2xlarge, ml.g5.48xlarge, ml.g5.4xlarge, ml.g5.8xlarge, ml.p4d.24xlarge, ml.p4de.24xlarge, ml.p5.48xlarge
Gradientai
Llama-3 8B Gradient Instruct 262k
huggingface-llm-gradientai-llama-3-8B-instruct-262kText Generation
llama3
No
ml.g5.12xlarge, ml.g5.16xlarge, ml.g5.24xlarge, ml.g5.2xlarge, ml.g5.48xlarge, ml.g5.4xlarge, ml.g5.8xlarge, ml.p4d.24xlarge, ml.p4de.24xlarge, ml.p5.48xlarge
Helsinki-Nlp
Helsinki opus en es
huggingface-translation-opus-mt-en-esTranslation
apache-2.0
No
ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge
Helsinki-Nlp
Helsinki opus en vi
huggingface-translation-opus-mt-en-viTranslation
apache-2.0
No
ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge
Helsinki-Nlp
Helsinki opus mul en
huggingface-translation-opus-mt-mul-enTranslation
apache-2.0
No
ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge
Huggingfaceh4
HuggingFaceH4 zephyr-orpo-141b-A35b-v0.1
huggingface-llm-huggingfaceh4-zephyr-orpo-141b-a35b-v01Text Generation
apache-2.0
No
ml.g5.48xlarge, ml.p4d.24xlarge
Huggingfaceh4
Star Chat Alpha
huggingface-llm-huggingfaceh4-starchat-alphaText Generation
bigcode-openrail-m
No
ml.g5.12xlarge, ml.g5.24xlarge, ml.g5.48xlarge
Huggingfaceh4
Star Chat Beta
huggingface-llm-huggingfaceh4-starchat-betaText Generation
bigcode-openrail-m
No
ml.g5.12xlarge, ml.g5.24xlarge, ml.g5.48xlarge, ml.p4d.24xlarge
Huggingfaceh4
Zephyr 7B Beta
huggingface-llm-huggingfaceh4-zephyr-7b-betaText Generation
mit
No
ml.g5.12xlarge, ml.g5.24xlarge, ml.g5.2xlarge, ml.g5.48xlarge, ml.p4d.24xlarge
Ibm
Quality Controlled Paraphrase Generation
huggingface-text2text-qcpg-sentencesText2Text Generation
apache-2.0
No
ml.g5.16xlarge, ml.g5.2xlarge, ml.g5.4xlarge, ml.g5.8xlarge
Idea-Ccnl
IDEA CCNL Taiyi Stable Diffusion 1B Chinese EN v0.1
huggingface-txt2img-idea-ccnl-taiyi-1b-chinese-en-v01Text-to-Image
creativeml-openrail-m
No
ml.g4dn.4xlarge, ml.g4dn.8xlarge, ml.g5.2xlarge, ml.g5.4xlarge, ml.g5.8xlarge, ml.p3.2xlarge
Idea-Ccnl
IDEA CCNL Taiyi Stable Diffusion 1B Chinese v0.1
huggingface-txt2img-idea-ccnl-taiyi-1b-chinese-v0-1Text-to-Image
creativeml-openrail-m
No
ml.g4dn.4xlarge, ml.g4dn.8xlarge, ml.g5.2xlarge, ml.g5.4xlarge, ml.g5.8xlarge, ml.p3.2xlarge
Intfloat
E5 Base
huggingface-sentencesimilarity-e5-baseSentence Similarity
mit
Yes
ml.c6i.xlarge, ml.g4dn.2xlarge, ml.g5.2xlarge, ml.g5.4xlarge, ml.g5.8xlarge, ml.p3.2xlarge
Intfloat
E5 Base V2
huggingface-sentencesimilarity-e5-base-v2Sentence Similarity
mit
Yes
ml.c6i.xlarge, ml.g4dn.2xlarge, ml.g5.2xlarge, ml.g5.4xlarge, ml.g5.8xlarge, ml.p3.2xlarge
Intfloat
E5 Large V2
huggingface-sentencesimilarity-e5-large-v2Sentence Similarity
mit
Yes
ml.c6i.xlarge, ml.g4dn.2xlarge, ml.g5.2xlarge, ml.g5.4xlarge, ml.g5.8xlarge, ml.p3.2xlarge
Intfloat
Multilingual E5 Base
huggingface-sentencesimilarity-multilingual-e5-baseSentence Similarity
mit
Yes
ml.c6i.xlarge, ml.g4dn.2xlarge, ml.g5.2xlarge, ml.g5.4xlarge, ml.g5.8xlarge, ml.p3.2xlarge
Intfloat
Multilingual E5 Large
huggingface-sentencesimilarity-multilingual-e5-largeSentence Similarity
mit
Yes
ml.c6i.xlarge, ml.g4dn.2xlarge, ml.g5.2xlarge, ml.g5.4xlarge, ml.g5.8xlarge, ml.p3.2xlarge
Langboat
Langboat Guohua Diffusion
huggingface-txt2img-langboat-guohua-diffusionText-to-Image
creativeml-openrail-m
No
ml.g4dn.2xlarge, ml.g4dn.xlarge, ml.p3.2xlarge
Lighteternal
Lighteternal Nli Xlm R Greek
huggingface-zstc-lighteternal-nli-xlm-r-greekZero-Shot Classification
apache-2.0
No
ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge
Meta
Llama-2-7b-hf
meta-textgeneration-llama-2-7bText Generation
llama2
Yes
ml.g4dn.12xlarge, ml.g5.12xlarge, ml.g6.12xlarge, ml.g6e.xlarge, ml.p4d.24xlarge, ml.p5.48xlarge
Meta
Llama-3.1-70B-Instruct
meta-textgeneration-llama-3-1-70b-instructText Generation
llama3.1
Yes
ml.g6e.48xlarge, ml.p5en.48xlarge
Meta
Llama-3.1-8B-Instruct
meta-textgeneration-llama-3-1-8b-instructText Generation
llama3.1
Yes
ml.g5.48xlarge, ml.g6.48xlarge, ml.g6e.12xlarge, ml.p3.16xlarge, ml.p4d.24xlarge, ml.p5.48xlarge
Meta
Llama-3.2-1B
meta-textgeneration-llama-3-2-1bText Generation
llama3.2
Yes
ml.g4dn.12xlarge, ml.g5.48xlarge, ml.g6.12xlarge, ml.g6e.2xlarge, ml.p3.16xlarge, ml.p4d.24xlarge
Meta
Llama-3.2-1B-Instruct
meta-textgeneration-llama-3-2-1b-instructText Generation
llama3.2
Yes
ml.g4dn.12xlarge, ml.g5.24xlarge, ml.g6.48xlarge, ml.g6e.12xlarge, ml.p4d.24xlarge, ml.p5.48xlarge
Meta
Llama-3.2-3B
meta-textgeneration-llama-3-2-3bText Generation
llama3.2
Yes
ml.g5.48xlarge, ml.g6.48xlarge, ml.g6e.24xlarge, ml.g6e.48xlarge, ml.p3.16xlarge, ml.p5en.48xlarge
Meta
Llama-3.2-3B-Instruct
meta-textgeneration-llama-3-2-3b-instructText Generation
llama3.2
Yes
ml.g5.24xlarge, ml.g6.24xlarge, ml.g6e.48xlarge, ml.p3.16xlarge, ml.p4d.24xlarge, ml.p5.48xlarge
Meta
Llama-Guard-3-1B
meta-textgeneration-llama-guard-3-1bText Generation
llama3.2
No
ml.g4dn.12xlarge, ml.g5.12xlarge, ml.g5.24xlarge, ml.g5.48xlarge, ml.g5.8xlarge, ml.g5.xlarge, ml.g6.12xlarge, ml.g6.24xlarge, ml.g6.48xlarge, ml.g6e.12xlarge, ml.g6e.16xlarge, ml.g6e.24xlarge, ml.g6e.2xlarge, ml.g6e.4xlarge, ml.g6e.8xlarge, ml.g6e.xlarge, ml.p4d.24xlarge
Meta
Llama-Guard-3-8B
meta-textgeneration-llama-guard-3-8bText Generation
llama3.1
No
ml.g5.12xlarge, ml.g5.24xlarge, ml.g5.48xlarge, ml.g6.12xlarge, ml.g6.24xlarge, ml.g6.48xlarge, ml.g6e.12xlarge, ml.g6e.24xlarge, ml.p4d.24xlarge
Meta
Meta Code Llama 13B
meta-textgeneration-llama-codellama-13bText Generation
llama2
Yes
ml.g5.12xlarge, ml.g5.24xlarge, ml.g5.48xlarge, ml.g6.12xlarge, ml.p4d.24xlarge
Meta
Meta Code Llama 13B Instruct
meta-textgeneration-llama-codellama-13b-instructText Generation
llama2
No
ml.g5.12xlarge, ml.g5.24xlarge, ml.g5.48xlarge, ml.g6.12xlarge, ml.p4d.24xlarge
Meta
Meta Code Llama 13B Python
meta-textgeneration-llama-codellama-13b-pythonText Generation
llama2
Yes
ml.g5.12xlarge, ml.g5.24xlarge, ml.g5.48xlarge, ml.g6.12xlarge, ml.p4d.24xlarge
Meta
Meta Code Llama 34B
meta-textgeneration-llama-codellama-34bText Generation
llama2
Yes
ml.g5.48xlarge, ml.p4d.24xlarge
Meta
Meta Code Llama 34B Instruct
meta-textgeneration-llama-codellama-34b-instructText Generation
llama2
No
ml.g5.48xlarge, ml.p4d.24xlarge
Meta
Meta Code Llama 34B Python
meta-textgeneration-llama-codellama-34b-pythonText Generation
llama2
Yes
ml.g5.48xlarge, ml.p4d.24xlarge
Meta
Meta Code Llama 70B
meta-textgeneration-llama-codellama-70bText Generation
llama2
Yes
ml.g5.48xlarge, ml.p4d.24xlarge
Meta
Meta Code Llama 70B Instruct
meta-textgeneration-llama-codellama-70b-instructText Generation
llama2
No
ml.g5.48xlarge, ml.p4d.24xlarge
Meta
Meta Code Llama 70B Neuron
meta-textgenerationneuron-llama-codellama-70bText Generation
llama2
No
ml.inf2.48xlarge
Meta
Meta Code Llama 70B Python
meta-textgeneration-llama-codellama-70b-pythonText Generation
llama2
Yes
ml.g5.48xlarge, ml.p4d.24xlarge
Meta
Meta Code Llama 7B
meta-textgeneration-llama-codellama-7bText Generation
llama2
Yes
ml.g5.12xlarge, ml.g5.24xlarge, ml.g5.2xlarge, ml.g5.48xlarge, ml.g5.4xlarge, ml.g5.8xlarge, ml.g6.2xlarge, ml.p4d.24xlarge
Meta
Meta Code Llama 7B Instruct
meta-textgeneration-llama-codellama-7b-instructText Generation
llama2
No
ml.g5.12xlarge, ml.g5.24xlarge, ml.g5.2xlarge, ml.g5.48xlarge, ml.g5.4xlarge, ml.g5.8xlarge, ml.g6.12xlarge, ml.g6.2xlarge, ml.p4d.24xlarge
Meta
Meta Code Llama 7B Neuron
meta-textgenerationneuron-llama-codellama-7bText Generation
llama2
No
ml.inf2.8xlarge, ml.inf2.xlarge
Meta
Meta Code Llama 7B Python
meta-textgeneration-llama-codellama-7b-pythonText Generation
llama2
Yes
ml.g5.12xlarge, ml.g5.24xlarge, ml.g5.2xlarge, ml.g5.48xlarge, ml.g5.4xlarge, ml.g5.8xlarge, ml.g6.12xlarge, ml.p4d.24xlarge
Meta
Meta Code Llama 7B Python Neuron
meta-textgenerationneuron-llama-codellama-7b-pythonText Generation
llama2
No
ml.inf2.8xlarge, ml.inf2.xlarge
Meta
Meta Llama 2 13B
meta-textgeneration-llama-2-13bText Generation
llama2
Yes
ml.g5.12xlarge, ml.g5.24xlarge, ml.g5.48xlarge, ml.g6.12xlarge, ml.p4d.24xlarge, ml.p5.48xlarge
Meta
Meta Llama 2 13B Chat
meta-textgeneration-llama-2-13b-fText Generation
llama2
Yes
ml.g5.12xlarge, ml.g5.24xlarge, ml.g5.48xlarge, ml.g6.12xlarge, ml.p4d.24xlarge, ml.p5.48xlarge
Meta
Meta Llama 2 13B Chat Neuron
meta-textgenerationneuron-llama-2-13b-fText Generation
llama2
No
ml.inf2.24xlarge, ml.inf2.48xlarge, ml.inf2.8xlarge
Meta
Meta Llama 2 13B Neuron
meta-textgenerationneuron-llama-2-13bText Generation
llama2
Yes
ml.inf2.24xlarge, ml.inf2.48xlarge, ml.inf2.8xlarge
Meta
Meta Llama 2 70B
meta-textgeneration-llama-2-70bText Generation
llama2
Yes
ml.g5.48xlarge, ml.p4d.24xlarge
Meta
Meta Llama 2 70B Chat
meta-textgeneration-llama-2-70b-fText Generation
llama2
Yes
ml.g5.48xlarge, ml.p4d.24xlarge
Meta
Meta Llama 2 70B Chat Neuron
meta-textgenerationneuron-llama-2-70b-fText Generation
llama2
No
ml.inf2.48xlarge
Meta
Meta Llama 2 70B Neuron
meta-textgenerationneuron-llama-2-70bText Generation
llama2
No
ml.inf2.48xlarge
Meta
Meta Llama 2 7B Chat
meta-textgeneration-llama-2-7b-fText Generation
llama2
Yes
ml.g5.12xlarge, ml.g5.24xlarge, ml.g5.2xlarge, ml.g5.48xlarge, ml.g5.4xlarge, ml.g5.8xlarge, ml.g6.12xlarge, ml.p4d.24xlarge
Meta
Meta Llama 2 7B Chat Neuron
meta-textgenerationneuron-llama-2-7b-fText Generation
llama2
No
ml.inf2.24xlarge, ml.inf2.48xlarge, ml.inf2.8xlarge, ml.inf2.xlarge
Meta
Meta Llama 2 7B Neuron
meta-textgenerationneuron-llama-2-7bText Generation
llama2
Yes
ml.inf2.24xlarge, ml.inf2.48xlarge, ml.inf2.8xlarge, ml.inf2.xlarge
Meta
Meta Llama 3 1 8B Neuron
meta-textgenerationneuron-llama-3-1-8bText Generation
llama3.1
No
ml.inf2.24xlarge, ml.inf2.48xlarge, ml.inf2.8xlarge, ml.trn1.2xlarge, ml.trn1.32xlarge, ml.trn1n.32xlarge
Meta
Meta Llama 3 70B
meta-textgeneration-llama-3-70bText Generation
llama3
Yes
ml.g5.48xlarge, ml.g6.48xlarge, ml.p4d.24xlarge, ml.p5.48xlarge
Meta
Meta Llama 3 70B Instruct
meta-textgeneration-llama-3-70b-instructText Generation
llama3
Yes
ml.g5.48xlarge, ml.g6.48xlarge, ml.p4d.24xlarge, ml.p5.48xlarge
Meta
Meta Llama 3 70B Instruct Neuron
meta-textgenerationneuron-llama-3-70b-instructText Generation
llama3
No
ml.trn1.32xlarge
Meta
Meta Llama 3 70B Neuron
meta-textgenerationneuron-llama-3-70bText Generation
llama3
No
ml.trn1.32xlarge
Meta
Meta Llama 3 8B Instruct
meta-textgeneration-llama-3-8b-instructText Generation
llama3
Yes
ml.g5.12xlarge, ml.g5.24xlarge, ml.g5.2xlarge, ml.g5.48xlarge, ml.g5.4xlarge, ml.g5.8xlarge, ml.g6.12xlarge, ml.g6.2xlarge, ml.p4d.24xlarge, ml.p5.48xlarge
Meta
Meta Llama 3 8B Instruct Neuron
meta-textgenerationneuron-llama-3-8b-instructText Generation
llama3
No
ml.inf2.24xlarge, ml.inf2.48xlarge, ml.inf2.8xlarge
Meta
Meta Llama 3 8B Neuron
meta-textgenerationneuron-llama-3-8bText Generation
llama3
No
ml.inf2.24xlarge, ml.inf2.48xlarge, ml.inf2.8xlarge
Meta
Meta Llama 3.1 405B FP8
meta-textgeneration-llama-3-1-405b-fp8Text Generation
llama3.1
Yes
ml.p4de.24xlarge, ml.p5.48xlarge
Meta
Meta Llama 3.1 405B Instruct FP8
meta-textgeneration-llama-3-1-405b-instruct-fp8Text Generation
llama3.1
Yes
ml.p4de.24xlarge, ml.p5.48xlarge
Meta
Meta Llama 3.1 70B
meta-textgeneration-llama-3-1-70bText Generation
llama3.1
Yes
ml.g5.48xlarge, ml.g6.48xlarge, ml.p4d.24xlarge, ml.p5.48xlarge
Meta
Meta Llama 3.1 70B Instruct Neuron
meta-textgenerationneuron-llama-3-1-70b-instructText Generation
llama3.1
No
ml.inf2.48xlarge, ml.trn1.32xlarge, ml.trn1n.32xlarge
Meta
Meta Llama 3.1 70B Neuron
meta-textgenerationneuron-llama-3-1-70bText Generation
llama3.1
No
ml.inf2.48xlarge, ml.trn1.32xlarge, ml.trn1n.32xlarge
Meta
Meta Llama 3.1 8B
meta-textgeneration-llama-3-1-8bText Generation
llama3.1
Yes
ml.g5.12xlarge, ml.g5.24xlarge, ml.g5.2xlarge, ml.g5.48xlarge, ml.g5.4xlarge, ml.g5.8xlarge, ml.g6.12xlarge, ml.g6.24xlarge, ml.g6.2xlarge, ml.g6.48xlarge, ml.g6.4xlarge, ml.g6.8xlarge, ml.p4d.24xlarge, ml.p5.48xlarge
Meta
Meta Llama 3.1 8B Instruct Neuron
meta-textgenerationneuron-llama-3-1-8b-instructText Generation
llama3.1
No
ml.inf2.24xlarge, ml.inf2.48xlarge, ml.inf2.8xlarge, ml.trn1.2xlarge, ml.trn1.32xlarge, ml.trn1n.32xlarge
Meta
Meta Llama 3.2 11B Vision
meta-vlm-llama-3-2-11b-visionImage-Text-to-Text
llama3.2
Yes
ml.g6.12xlarge, ml.g6.24xlarge, ml.g6.48xlarge, ml.p4d.24xlarge, ml.p4de.24xlarge, ml.p5.48xlarge
Meta
Meta Llama 3.2 11B Vision Instruct
meta-vlm-llama-3-2-11b-vision-instructImage-Text-to-Text
llama3.2
Yes
ml.g6.12xlarge, ml.g6.24xlarge, ml.g6.48xlarge, ml.p4d.24xlarge, ml.p4de.24xlarge, ml.p5.48xlarge
Meta
Meta Llama 3.2 1B Instruct Neuron
meta-textgenerationneuron-llama-3-2-1b-instructText Generation
llama3.2
No
ml.inf2.24xlarge, ml.inf2.48xlarge, ml.inf2.8xlarge, ml.inf2.xlarge, ml.trn1.2xlarge, ml.trn1.32xlarge, ml.trn1n.32xlarge
Meta
Meta Llama 3.2 1B Neuron
meta-textgenerationneuron-llama-3-2-1bText Generation
llama3.2
No
ml.inf2.24xlarge, ml.inf2.48xlarge, ml.inf2.8xlarge, ml.inf2.xlarge, ml.trn1.2xlarge, ml.trn1.32xlarge, ml.trn1n.32xlarge
Meta
Meta Llama 3.2 3B Instruct Neuron
meta-textgenerationneuron-llama-3-2-3b-instructText Generation
llama3.2
No
ml.inf2.24xlarge, ml.inf2.48xlarge, ml.inf2.8xlarge, ml.inf2.xlarge, ml.trn1.2xlarge, ml.trn1.32xlarge, ml.trn1n.32xlarge
Meta
Meta Llama 3.2 3B Neuron
meta-textgenerationneuron-llama-3-2-3bText Generation
llama3.2
No
ml.inf2.24xlarge, ml.inf2.48xlarge, ml.inf2.8xlarge, ml.inf2.xlarge, ml.trn1.2xlarge, ml.trn1.32xlarge, ml.trn1n.32xlarge
Meta
Meta Llama 3.2 90B Vision
meta-vlm-llama-3-2-90b-visionImage-Text-to-Text
llama3.2
Yes
ml.p4d.24xlarge, ml.p4de.24xlarge, ml.p5.48xlarge
Meta
Meta Llama 3.2 90B Vision Instruct
meta-vlm-llama-3-2-90b-vision-instructImage-Text-to-Text
llama3.2
Yes
ml.p4d.24xlarge, ml.p4de.24xlarge, ml.p5.48xlarge
Meta
Meta Llama 3.3 70B Instruct
meta-textgeneration-llama-3-3-70b-instructText Generation
llama3.3
Yes
ml.g5.48xlarge, ml.g6.48xlarge, ml.p4d.24xlarge, ml.p5.48xlarge
Meta
Meta Llama 4 Maverick 17B 128E FP8
meta-vlm-llama-4-maverick-17b-128e-instruct-fp8Image-Text-to-Text
llama4
No
ml.p5en.48xlarge
Meta
Meta Llama 4 Maverick 17B 128E Instruct
meta-vlm-llama-4-maverick-17b-128e-instructImage-Text-to-Text
llama4
No
ml.p5en.48xlarge
Meta
Meta Llama 4 Scout 17B 16E Instruct
meta-vlm-llama-4-scout-17b-16e-instructImage-Text-to-Text
llama4
Yes
ml.g6e.48xlarge, ml.p5.48xlarge, ml.p5en.48xlarge
Meta
Meta Llama Guard 3 11B Vision
meta-vlm-llama-guard-3-11b-visionImage-Text-to-Text
llama3.2
No
ml.g6.12xlarge, ml.g6.24xlarge, ml.g6.48xlarge, ml.p4d.24xlarge, ml.p4de.24xlarge, ml.p5.48xlarge
Meta
Meta Llama Guard 3 1B Neuron
meta-textgenerationneuron-llama-guard-3-1bText Generation
llama3.2
No
ml.inf2.24xlarge, ml.inf2.48xlarge, ml.inf2.8xlarge, ml.inf2.xlarge, ml.trn1.2xlarge, ml.trn1.32xlarge, ml.trn1n.32xlarge
Meta
Meta Llama Guard 3 8B Neuron
meta-textgenerationneuron-llama-guard-3-8bText Generation
llama3.1
No
ml.inf2.24xlarge, ml.inf2.48xlarge, ml.inf2.8xlarge, ml.trn1.2xlarge, ml.trn1.32xlarge, ml.trn1n.32xlarge
Meta
Meta Llama Guard 7B
meta-textgeneration-llama-guard-7bText Generation
llama2
No
ml.g5.12xlarge, ml.g5.24xlarge, ml.g5.2xlarge, ml.g5.48xlarge, ml.g5.4xlarge, ml.g5.8xlarge, ml.p4d.24xlarge
Meta
Meta Llama Prompt Guard 86M
meta-tc-llama-prompt-guard-86mText Classification
llama3.1
No
ml.g5.12xlarge, ml.g5.24xlarge, ml.g5.2xlarge, ml.g5.48xlarge, ml.g5.4xlarge, ml.g5.8xlarge, ml.g5.xlarge, ml.g6.12xlarge, ml.g6.24xlarge, ml.g6.2xlarge, ml.g6.48xlarge, ml.g6.4xlarge, ml.g6.8xlarge, ml.g6.xlarge, ml.p4d.24xlarge
Meta
Meta SAM 2.1 Hiera Base Plus
meta-vs-sam-2-1-hiera-base-plusImage Segmentation
apache-2.0
No
ml.g5.24xlarge, ml.g5.48xlarge, ml.g6.24xlarge, ml.g6.48xlarge, ml.p4d.24xlarge, ml.p4de.24xlarge
Meta
Meta SAM 2.1 Hiera Large
meta-vs-sam-2-1-hiera-largeImage Segmentation
apache-2.0
No
ml.g5.24xlarge, ml.g5.48xlarge, ml.g6.24xlarge, ml.g6.48xlarge, ml.p4d.24xlarge, ml.p4de.24xlarge
Meta
Meta SAM 2.1 Hiera Small
meta-vs-sam-2-1-hiera-smallImage Segmentation
apache-2.0
No
ml.g5.24xlarge, ml.g5.48xlarge, ml.g6.24xlarge, ml.g6.48xlarge, ml.p4d.24xlarge, ml.p4de.24xlarge
Meta
Meta SAM 2.1 Hiera Tiny
meta-vs-sam-2-1-hiera-tinyImage Segmentation
apache-2.0
No
ml.g5.24xlarge, ml.g5.48xlarge, ml.g6.24xlarge, ml.g6.48xlarge, ml.p4d.24xlarge, ml.p4de.24xlarge
Meta
Meta-Llama-3-8B
meta-textgeneration-llama-3-8bText Generation
llama3
Yes
ml.g4dn.12xlarge, ml.g5.12xlarge, ml.g6.12xlarge, ml.g6e.xlarge, ml.p3.8xlarge, ml.p4d.24xlarge
Microsoft
Phi-2
huggingface-llm-phi-2Text Generation
mit
No
ml.g5.12xlarge, ml.g5.16xlarge, ml.g5.24xlarge, ml.g5.2xlarge, ml.g5.48xlarge, ml.g5.4xlarge, ml.g5.8xlarge, ml.g5.xlarge, ml.p4de.24xlarge, ml.p5.48xlarge
Microsoft
Phi-3-Mini-128K-Instruct
huggingface-llm-phi-3-mini-128k-instructText Generation
mit
No
ml.g5.12xlarge, ml.g5.16xlarge, ml.g5.24xlarge, ml.g5.2xlarge, ml.g5.48xlarge, ml.g5.4xlarge, ml.g5.8xlarge, ml.p4d.24xlarge, ml.p4de.24xlarge, ml.p5.48xlarge
Microsoft
Phi-3-mini-4k-instruct
huggingface-llm-phi-3-mini-4k-instructText Generation
mit
No
ml.g4dn.12xlarge, ml.g5.12xlarge, ml.g6.12xlarge, ml.g6e.xlarge, ml.p4d.24xlarge, ml.p5en.48xlarge
Microsoft
Phi-3.5-mini-instruct
huggingface-llm-phi-3-5-mini-instructText Generation
mit
No
ml.g5.48xlarge, ml.p4d.24xlarge
Microsoft
Table Transformer Detection
huggingface-od-microsoft-table-transformer-detectionObject Detection
mit
No
ml.g5.2xlarge, ml.g5.4xlarge, ml.g5.xlarge, ml.g6.2xlarge, ml.g6.4xlarge, ml.g6.xlarge, ml.g6e.2xlarge, ml.g6e.4xlarge, ml.g6e.xlarge
Minimaxai
MiniMax-M2
huggingface-llm-minimax-m2Text Generation
other
No
ml.p4de.24xlarge
Minimaxai
MiniMax-M2.1
huggingface-llm-minimax-m2-1Text Generation
other
No
ml.p4de.24xlarge, ml.p5.48xlarge
Minimaxai
MiniMax-M2.5
huggingface-llm-minimax-m2-5Text Generation
other
No
ml.p5.48xlarge
Minimaxai
MiniMax-M2.7
huggingface-llm-minimax-m2-7Text Generation
other
No
ml.p5en.48xlarge
Mistralai
Mistral 7B Instruct Neuron
huggingface-llmneuron-mistral-7b-instructText Generation
apache-2.0
No
ml.inf2.8xlarge, ml.inf2.xlarge
Mistralai
Mistral 7B Neuron
huggingface-llmneuron-mistral-7bText Generation
apache-2.0
No
ml.inf2.8xlarge, ml.inf2.xlarge
Mistralai
Mistral Nemo Base 2407
huggingface-llm-mistral-nemo-base-2407Text Generation
apache-2.0
No
ml.g5.12xlarge, ml.g5.24xlarge, ml.g5.48xlarge, ml.g6.12xlarge, ml.g6.24xlarge, ml.g6.48xlarge, ml.p4d.24xlarge
Mistralai
Mistral Nemo Instruct 2407
huggingface-llm-mistral-nemo-instruct-2407Text Generation
apache-2.0
No
ml.g5.12xlarge, ml.g5.24xlarge, ml.g5.48xlarge, ml.g6.12xlarge, ml.g6.24xlarge, ml.g6.48xlarge, ml.p4d.24xlarge
Mistralai
Mistral Pixtral-12B-2409
huggingface-vlm-mistral-pixtral-12b-2409Image-Text-to-Text
apache-2.0
No
ml.g6.12xlarge, ml.g6.24xlarge, ml.g6.48xlarge, ml.p4d.24xlarge, ml.p4de.24xlarge, ml.p5.48xlarge
Mistralai
Mistral-7B-Instruct-v0.2
huggingface-llm-mistral-7b-instructText Generation
apache-2.0
No
ml.g5.12xlarge, ml.g6.24xlarge, ml.g6e.2xlarge, ml.p3.16xlarge, ml.p4d.24xlarge, ml.p5.48xlarge
Mistralai
Mistral-7B-Instruct-v0.3
huggingface-llm-mistral-7b-v3Text Generation
apache-2.0
No
ml.g4dn.12xlarge, ml.g5.12xlarge, ml.g6.12xlarge, ml.g6e.2xlarge, ml.p4d.24xlarge, ml.p5.48xlarge
Mistralai
Mistral-7B-Instruct-v0.3
huggingface-llm-mistral-7b-instruct-v3Text Generation
apache-2.0
No
ml.g4dn.12xlarge, ml.g5.12xlarge, ml.g6.12xlarge, ml.g6e.2xlarge, ml.p4d.24xlarge, ml.p5.48xlarge
Mistralai
Mistral-7B-v0.1
huggingface-llm-mistral-7bText Generation
apache-2.0
Yes
ml.g5.12xlarge, ml.g6.12xlarge, ml.g6e.12xlarge, ml.p3.16xlarge, ml.p4d.24xlarge, ml.p5.48xlarge
Mistralai
Mistral-Small-24B-Instruct-2501
huggingface-llm-mistral-small-24B-Instruct-2501Text Generation
apache-2.0
No
ml.g5.48xlarge, ml.g6.48xlarge, ml.g6e.12xlarge, ml.g6e.24xlarge, ml.p4d.24xlarge, ml.p5en.48xlarge
Mistralai
Mistral-Small-3.1-24B-Base-2503
huggingface-vlm-mistral-small-3-1-24b-base-2503Image-Text-to-Text
apache-2.0
No
ml.g6.12xlarge, ml.g6.16xlarge, ml.g6.24xlarge, ml.g6.48xlarge, ml.g6e.12xlarge
Mistralai
Mistral-Small-3.2-24B-Instruct-2506
huggingface-vlm-mistral-small-3-2-24b-instruct-2506Image-Text-to-Text
apache-2.0
No
ml.g6.12xlarge, ml.g6.16xlarge, ml.g6.24xlarge, ml.g6.48xlarge, ml.g6e.12xlarge
Mistralai
Mixtral 8x7B
huggingface-llm-mixtral-8x7bText Generation
apache-2.0
Yes
ml.g5.48xlarge, ml.g6.48xlarge, ml.p4d.24xlarge, ml.p5.48xlarge
Mistralai
Mixtral-8x22B V1
huggingface-llm-mixtral-8x22BText Generation
apache-2.0
No
ml.p4d.24xlarge, ml.p4de.24xlarge
Mistralai
Mixtral-8x22B-Instruct-v0.1
huggingface-llm-mistralai-mixtral-8x22B-instruct-v0-1Text Generation
apache-2.0
No
ml.p4de.24xlarge, ml.p5.48xlarge
Mistralai
Mixtral-8x7B-Instruct-v0.1
huggingface-llm-mixtral-8x7b-instructText Generation
apache-2.0
Yes
ml.g5.48xlarge, ml.g6.48xlarge, ml.g6e.12xlarge, ml.p4d.24xlarge, ml.p5.48xlarge, ml.p5en.48xlarge
Moonshotai
Kimi-K2.5
huggingface-llm-kimi-k2-5Text Generation
other
No
ml.p5en.48xlarge
Moritzlaurer
MoritzLaurer MDeBERTa V3 Base Xnli Multilingual Nli 2mil7
huggingface-zstc-moritzlaurer-mdeberta3base-xnli-mling-nli-2m7Zero-Shot Classification
mit
No
ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge
Mosaicml
MPT 7B BF16
huggingface-textgeneration1-mpt-7b-bf16Text Generation
Apache-2.0
No
ml.g5.16xlarge, ml.g5.2xlarge, ml.g5.4xlarge, ml.g5.8xlarge
Mosaicml
MPT 7B Instruct BF16
huggingface-textgeneration1-mpt-7b-instruct-bf16Text Generation
CC-BY-SA-3.0
No
ml.g5.16xlarge, ml.g5.2xlarge, ml.g5.4xlarge, ml.g5.8xlarge
Mrm8488
Bert Small2bert
huggingface-summarization-bert-small2bert-cnn-dailymail-summText Summarization
apache-2.0
No
ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge
Mxnet
Financial Text Embedding
mxnet-tcembedding-robertafin-base-uncasedText Embedding
apache-2.0
No
ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge
Mxnet
RoBERTa-SEC-Large
mxnet-tcembedding-robertafin-large-uncasedText Embedding
apache-2.0
No
ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge
Mxnet
RoBERTa-SEC-WIKI-Base
mxnet-tcembedding-robertafin-base-wiki-uncasedText Embedding
apache-2.0
No
ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge
Mxnet
RoBERTa-SEC-WIKI-Large
mxnet-tcembedding-robertafin-large-wiki-uncasedText Embedding
apache-2.0
No
ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge
Naclbit
Naclbit Trinart Stable Diffusion V2
huggingface-txt2img-naclbit-trinart-stable-diffusion-v2Text-to-Image
creativeml-openrail-m
No
ml.g4dn.4xlarge, ml.g4dn.8xlarge, ml.g5.2xlarge, ml.g5.4xlarge, ml.g5.8xlarge, ml.p3.2xlarge
Narsil
Narsil Deberta Large Mnli Zero Cls
huggingface-zstc-narsil-deberta-large-mnli-zero-clsZero-Shot Classification
mit
No
ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge
Nexaai
NexaAI Octopus-v2
huggingface-llm-nexaaidev-octopus-v2Text Generation
cc-by-nc-4.0
No
ml.g5.16xlarge, ml.g5.2xlarge, ml.g5.4xlarge, ml.g5.8xlarge
Nexusflow
Nexusflow Starling-LM-7B-beta
huggingface-llm-nexusflow-starling-lm-7b-betaText Generation
apache-2.0
No
ml.g5.16xlarge, ml.g5.2xlarge, ml.g5.4xlarge, ml.g5.8xlarge
Nousresearch
Hermes 2 Pro-Llama-3 8B
huggingface-llm-nousresearch-hermes-2-pro-llama-3-8BText Generation
llama3
No
ml.g5.12xlarge, ml.g5.16xlarge, ml.g5.24xlarge, ml.g5.2xlarge, ml.g5.48xlarge, ml.g5.4xlarge, ml.g5.8xlarge, ml.p4d.24xlarge, ml.p4de.24xlarge, ml.p5.48xlarge
Nousresearch
Nous Hermes 2 SOLAR 10.7B
huggingface-llm-nousresearch-nous-hermes-2-solar-10-7bText Generation
apache-2.0
No
ml.g5.12xlarge, ml.g5.24xlarge, ml.g5.48xlarge
Nova
Nova 2.0 Lite
nova-textgeneration-lite-v2Text Generation
AWS Service Terms
Yes
Nova
Nova Lite
nova-textgeneration-liteText Generation
AWS Service Terms
Yes
Nova
Nova Micro
nova-textgeneration-microText Generation
AWS Service Terms
Yes
Nova
Nova Pro
nova-textgeneration-proText Generation
AWS Service Terms
Yes
Nvidia
Nvidia Llama3-ChatQA-1.5 8B
huggingface-llm-nvidia-llama3-chatqa-1-5-8BText Generation
llama3
No
ml.g5.12xlarge, ml.g5.16xlarge, ml.g5.24xlarge, ml.g5.2xlarge, ml.g5.48xlarge, ml.g5.4xlarge, ml.g5.8xlarge, ml.p4d.24xlarge, ml.p4de.24xlarge, ml.p5.48xlarge
Nvidia
NVIDIA-Nemotron-3-Nano-30B-A3B-BF16
huggingface-reasoning-nvidia-nemotron-3-nano-30b-a3b-bf16Reasoning
other
No
ml.g6e.24xlarge, ml.g7e.12xlarge
Nvidia
NVIDIA-Nemotron-3-Super-120B-A12B-FP8
huggingface-reasoning-nvidia-nemotron-3-super-120b-a12b-fp8Reasoning
other
No
ml.g7e.24xlarge
Nvidia
nvidia-nemotron3-nano-omni-30ba3b-reasoning-fp8
huggingface-vlm-nvidia-nemotron3-nano-omni-30ba3b-reasoning-fp8Image-Text-to-Text
other
No
ml.g6.12xlarge, ml.g6.24xlarge, ml.g6e.12xlarge, ml.g6e.24xlarge
Openai
gpt-oss-120b
openai-reasoning-gpt-oss-120bReasoning
apache-2.0
Yes
ml.g6e.48xlarge, ml.p4d.24xlarge, ml.p4de.24xlarge, ml.p5.48xlarge, ml.p5en.48xlarge
Openai
gpt-oss-20b
openai-reasoning-gpt-oss-20bReasoning
apache-2.0
Yes
ml.g5.12xlarge, ml.g5.24xlarge, ml.g6.12xlarge, ml.g6.24xlarge, ml.p5.48xlarge, ml.p5en.48xlarge
Openai
Whisper Base
huggingface-asr-whisper-baseAutomatic Speech Recognition
apache-2.0
No
ml.g4dn.2xlarge, ml.g5.2xlarge, ml.g5.4xlarge, ml.g5.8xlarge, ml.p3.2xlarge
Openai
Whisper Large
huggingface-asr-whisper-largeAutomatic Speech Recognition
apache-2.0
No
ml.g4dn.2xlarge, ml.g5.2xlarge, ml.g5.4xlarge, ml.g5.8xlarge, ml.p3.2xlarge
Openai
Whisper Large V2
huggingface-asr-whisper-large-v2Automatic Speech Recognition
apache-2.0
No
ml.g4dn.2xlarge, ml.g5.2xlarge, ml.g5.4xlarge, ml.g5.8xlarge, ml.p3.2xlarge
Openai
Whisper Large V3
huggingface-asr-whisper-large-v3Automatic Speech Recognition
apache-2.0
No
ml.g4dn.2xlarge, ml.g5.2xlarge, ml.g5.4xlarge, ml.g5.8xlarge, ml.p3.2xlarge
Openai
Whisper Large V3 Turbo
huggingface-asr-whisper-large-v3-turboAutomatic Speech Recognition
mit
No
ml.g4dn.2xlarge, ml.g5.2xlarge, ml.g5.4xlarge, ml.g5.8xlarge, ml.p3.2xlarge
Openai
Whisper Medium
huggingface-asr-whisper-mediumAutomatic Speech Recognition
apache-2.0
No
ml.g4dn.2xlarge, ml.g5.2xlarge, ml.g5.4xlarge, ml.g5.8xlarge, ml.p3.2xlarge
Openai
Whisper Small
huggingface-asr-whisper-smallAutomatic Speech Recognition
apache-2.0
No
ml.g4dn.2xlarge, ml.g5.2xlarge, ml.g5.4xlarge, ml.g5.8xlarge, ml.p3.2xlarge
Openai
Whisper Tiny
huggingface-asr-whisper-tinyAutomatic Speech Recognition
apache-2.0
No
ml.g4dn.2xlarge, ml.g5.2xlarge, ml.g5.4xlarge, ml.g5.8xlarge, ml.p3.2xlarge
Openai-Community
GPT 2
huggingface-textgeneration-gpt2Text Generation
mit
No
ml.g5.16xlarge, ml.g5.2xlarge, ml.g5.4xlarge, ml.g5.8xlarge
Openai-Community
GPT-2 XL
huggingface-textgeneration1-gpt-2-xlText Generation
mit
Yes
ml.g5.16xlarge, ml.g5.2xlarge, ml.g5.4xlarge, ml.g5.8xlarge
Openai-Community
RoBERTa Base OpenAI Detector
huggingface-tc-roberta-base-openai-detectorText Classification
mit
Yes
ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge
Openai-Community
RoBERTa Base OpenAI Detector
huggingface-spc-roberta-base-openai-detectorFill-Mask
mit
Yes
ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge
Openai-Community
RoBERTa Base OpenAI Detector
huggingface-eqa-roberta-base-openai-detectorQuestion Answering
mit
Yes
ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge
Openai-Community
RoBERTa Large OpenAI Detector
huggingface-tc-roberta-large-openai-detectorText Classification
mit
Yes
ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge
Openai-Community
RoBERTa Large OpenAI Detector
huggingface-spc-roberta-large-openai-detectorFill-Mask
mit
Yes
ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge
Openlm-Research
Open Llama 7B V2
huggingface-llm-openlm-research-open-llama-7b-v2Text Generation
apache-2.0
No
ml.g5.12xlarge, ml.g5.24xlarge, ml.g5.2xlarge, ml.g5.48xlarge
Philschmid
Bart Large CNN samsum
huggingface-summarization-bart-large-cnn-samsumText Summarization
mit
No
ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge
Philschmid
Flan-T5 Base Model Fine-tuned on the Samsum Dataset
huggingface-text2text-flan-t5-base-samsumText2Text Generation
apache-2.0
No
ml.g5.16xlarge, ml.g5.2xlarge, ml.g5.4xlarge, ml.g5.8xlarge
Pytorch
Alexa TM 20B
pytorch-textgeneration1-alexa20bText Generation
amazon-software-license
No
ml.g4dn.12xlarge, ml.g5.16xlarge, ml.p3.16xlarge, ml.p3.8xlarge
Qwen
QVQ 72B Preview
huggingface-vlm-qvq-72b-previewImage-Text-to-Text
other
No
ml.g5.48xlarge, ml.g6.48xlarge, ml.p4d.24xlarge
Qwen
Qwen2 0.5B
huggingface-llm-qwen2-0-5bText Generation
apache-2.0
No
ml.g5.12xlarge, ml.g5.16xlarge, ml.g5.24xlarge, ml.g5.2xlarge, ml.g5.48xlarge, ml.g5.4xlarge, ml.g5.8xlarge, ml.p4d.24xlarge
Qwen
Qwen2 0.5B Instruct
huggingface-llm-qwen2-0-5b-instructText Generation
apache-2.0
No
ml.g5.12xlarge, ml.g5.16xlarge, ml.g5.24xlarge, ml.g5.2xlarge, ml.g5.48xlarge, ml.g5.4xlarge, ml.g5.8xlarge, ml.p4d.24xlarge
Qwen
Qwen2-1.5B
huggingface-llm-qwen2-1-5bText Generation
apache-2.0
No
ml.g4dn.12xlarge, ml.g5.12xlarge, ml.g6.12xlarge, ml.g6e.2xlarge, ml.g6e.4xlarge, ml.g6e.xlarge
Qwen
Qwen2-1.5B-Instruct
huggingface-llm-qwen2-1-5b-instructText Generation
apache-2.0
No
ml.g4dn.xlarge, ml.g5.2xlarge, ml.g5.xlarge, ml.g6.2xlarge, ml.g6e.xlarge, ml.p3.2xlarge
Qwen
Qwen2-7B
huggingface-llm-qwen2-7bText Generation
apache-2.0
No
ml.g4dn.12xlarge, ml.g5.12xlarge, ml.g5.24xlarge, ml.g6.12xlarge, ml.g6.24xlarge, ml.g6e.12xlarge, ml.g6e.16xlarge, ml.g6e.24xlarge, ml.g6e.2xlarge, ml.g6e.4xlarge, ml.g6e.8xlarge, ml.g6e.xlarge
Qwen
Qwen2-7B-Instruct
huggingface-llm-qwen2-7b-instructText Generation
apache-2.0
No
ml.g4dn.12xlarge, ml.g5.12xlarge, ml.g5.24xlarge, ml.g6.12xlarge, ml.g6.24xlarge, ml.g6e.12xlarge, ml.g6e.16xlarge, ml.g6e.24xlarge, ml.g6e.4xlarge, ml.g6e.8xlarge
Qwen
Qwen2-VL-7B-Instruct
huggingface-vlm-qwen2-vl-7b-instructText Generation
apache-2.0
No
ml.g5.12xlarge, ml.g6.12xlarge, ml.g6e.12xlarge, ml.g6e.4xlarge, ml.g6e.8xlarge, ml.g6e.xlarge
Qwen
Qwen2.5 14B Instruct
huggingface-llm-qwen2-5-14b-instructText Generation
apache-2.0
Yes
ml.g5.24xlarge, ml.g5.48xlarge, ml.g6.24xlarge, ml.g6.48xlarge, ml.p4d.24xlarge
Qwen
Qwen2.5 32B Instruct
huggingface-llm-qwen2-5-32b-instructText Generation
apache-2.0
Yes
ml.g5.48xlarge, ml.g6.48xlarge, ml.p4d.24xlarge
Qwen
Qwen2.5 Coder 32B Instruct
huggingface-llm-qwen2-5-coder-32b-instructText Generation
apache-2.0
No
ml.g5.24xlarge, ml.g5.48xlarge, ml.g6.24xlarge, ml.g6.48xlarge, ml.p4d.24xlarge
Qwen
Qwen2.5 Coder 7B Instruct
huggingface-llm-qwen2-5-coder-7b-instructText Generation
apache-2.0
No
ml.g5.16xlarge, ml.g5.2xlarge, ml.g5.4xlarge, ml.g5.8xlarge, ml.g6.16xlarge, ml.g6.4xlarge, ml.g6.8xlarge, ml.p4d.24xlarge
Qwen
Qwen2.5-72B-Instruct
huggingface-llm-qwen2-5-72b-instructText Generation
other
Yes
ml.g6e.48xlarge, ml.p4d.24xlarge, ml.p5en.48xlarge
Qwen
Qwen2.5-7B-Instruct
huggingface-llm-qwen2-5-7b-instructText Generation
apache-2.0
Yes
ml.g4dn.12xlarge, ml.g5.12xlarge, ml.g6.12xlarge, ml.g6e.2xlarge, ml.g6e.4xlarge, ml.g6e.xlarge
Qwen
Qwen3 14B
huggingface-reasoning-qwen3-14bReasoning
apache-2.0
Yes
ml.g5.12xlarge, ml.g5.24xlarge, ml.g5.48xlarge, ml.g6.12xlarge, ml.g6.24xlarge, ml.g6.48xlarge, ml.p4d.24xlarge
Qwen
Qwen3-0.6B
huggingface-reasoning-qwen3-06bReasoning
apache-2.0
Yes
ml.g4dn.12xlarge, ml.g5.12xlarge, ml.g6.12xlarge, ml.g6e.2xlarge, ml.p4d.24xlarge, ml.p5en.48xlarge
Qwen
Qwen3-1.7B
huggingface-reasoning-qwen3-1-7bReasoning
apache-2.0
Yes
ml.g4dn.12xlarge, ml.g4dn.16xlarge, ml.g4dn.2xlarge, ml.g4dn.4xlarge, ml.g4dn.8xlarge, ml.g5.12xlarge, ml.g5.24xlarge, ml.g5.48xlarge, ml.g5.4xlarge, ml.g6.12xlarge, ml.g6.24xlarge, ml.g6.48xlarge, ml.g6.8xlarge, ml.g6.xlarge, ml.g6e.12xlarge, ml.g6e.16xlarge, ml.g6e.24xlarge, ml.g6e.2xlarge, ml.g6e.4xlarge, ml.g6e.8xlarge, ml.g6e.xlarge, ml.p3.2xlarge, ml.p4d.24xlarge
Qwen
Qwen3-30B-A3B
huggingface-reasoning-qwen3-30b-a3bReasoning
apache-2.0
No
ml.g5.24xlarge
Qwen
Qwen3-30B-A3B-Instruct-2507
huggingface-reasoning-qwen3-30b-a3b-instruct-2507Text Generation
apache-2.0
No
ml.g5.48xlarge, ml.g6.48xlarge, ml.g6e.12xlarge, ml.p4de.24xlarge
Qwen
Qwen3-30B-A3B-Thinking-2507
huggingface-reasoning-qwen3-30b-a3b-thinking-2507Reasoning
apache-2.0
No
ml.g6e.12xlarge
Qwen
Qwen3-32B
huggingface-reasoning-qwen3-32bReasoning
apache-2.0
Yes
ml.g5.48xlarge, ml.g6.48xlarge, ml.g6e.12xlarge, ml.p4d.24xlarge, ml.p5.48xlarge, ml.p5en.48xlarge
Qwen
Qwen3-4B
huggingface-reasoning-qwen3-4bReasoning
apache-2.0
Yes
ml.g4dn.12xlarge, ml.g5.12xlarge, ml.g5.16xlarge, ml.g5.24xlarge, ml.g5.2xlarge, ml.g5.48xlarge, ml.g5.4xlarge, ml.g5.8xlarge, ml.g5.xlarge, ml.g6.12xlarge, ml.g6.16xlarge, ml.g6.24xlarge, ml.g6.2xlarge, ml.g6.48xlarge, ml.g6.4xlarge, ml.g6.8xlarge, ml.g6.xlarge, ml.g6e.12xlarge, ml.g6e.16xlarge, ml.g6e.24xlarge, ml.g6e.2xlarge, ml.g6e.4xlarge, ml.g6e.8xlarge, ml.g6e.xlarge, ml.p4d.24xlarge
Qwen
Qwen3-4B-Instruct-2507
huggingface-reasoning-qwen3-4b-instruct-2507Reasoning
apache-2.0
No
ml.g4dn.12xlarge, ml.g5.12xlarge, ml.g5.24xlarge, ml.g5.48xlarge, ml.g6.12xlarge, ml.g6.24xlarge, ml.g6.48xlarge, ml.g6e.12xlarge, ml.g6e.24xlarge, ml.p4d.24xlarge
Qwen
Qwen3-8B
huggingface-reasoning-qwen3-8bReasoning
apache-2.0
Yes
ml.g4dn.12xlarge, ml.g5.12xlarge, ml.g6.12xlarge, ml.g6e.12xlarge, ml.p4d.24xlarge, ml.p5.48xlarge
Qwen
Qwen3-Coder-30B-A3B-Instruct
huggingface-reasoning-qwen3-coder-30b-a3b-instructReasoning
apache-2.0
No
ml.g6e.24xlarge
Qwen
Qwen3-Coder-Next
huggingface-reasoning-qwen3-coder-nextReasoning
apache-2.0
No
ml.g7e.24xlarge, ml.g7e.48xlarge, ml.p4d.24xlarge, ml.p5en.48xlarge
Qwen
Qwen3-Next-80B-A3B-Instruct
huggingface-reasoning-qwen3-next-80b-a3b-instructText Generation
apache-2.0
No
ml.g6e.48xlarge, ml.p4de.24xlarge
Qwen
Qwen3-VL-8B-Instruct
huggingface-vlm-qwen3-vl-8b-instructText Generation
apache-2.0
No
ml.g6e.24xlarge, ml.g6e.48xlarge, ml.p4d.24xlarge
Qwen
Qwen3.5-0.8B
huggingface-vlm-qwen3-5-0-8bText Generation
apache-2.0
No
ml.g6.4xlarge, ml.g6e.12xlarge, ml.g6e.2xlarge, ml.g6e.4xlarge, ml.g6e.xlarge, ml.g7e.4xlarge
Qwen
Qwen3.5-27b
huggingface-vlm-qwen3-5-27bText Generation
apache-2.0
Yes
ml.g6.48xlarge
Qwen
Qwen3.5-27B-FP8
huggingface-vlm-qwen3-5-27b-fp8Image-Text-to-Text
apache-2.0
No
ml.g6.24xlarge
Qwen
Qwen3.5-2B
huggingface-vlm-qwen3-5-2bText Generation
apache-2.0
No
ml.g6.12xlarge, ml.g6e.12xlarge, ml.g6e.2xlarge, ml.g6e.4xlarge, ml.g6e.8xlarge, ml.g6e.xlarge
Qwen
Qwen3.5-4B
huggingface-vlm-qwen3-5-4bText Generation
apache-2.0
Yes
ml.g6.12xlarge, ml.g6e.24xlarge, ml.g7e.12xlarge, ml.g7e.2xlarge, ml.g7e.4xlarge, ml.g7e.8xlarge
Qwen
Qwen3.5-9b
huggingface-vlm-qwen3-5-9bText Generation
apache-2.0
Yes
ml.g6.24xlarge, ml.g6.48xlarge
Qwen
Qwen3.6-27B
huggingface-vlm-qwen3-6-27bImage-Text-to-Text
apache-2.0
No
ml.g7e.12xlarge, ml.g7e.24xlarge, ml.g7e.48xlarge, ml.p5en.48xlarge
Qwen
Qwen3.6-35B-A3B
huggingface-vlm-qwen3-6-35b-a3bText Generation
apache-2.0
No
ml.g6e.24xlarge, ml.g6e.48xlarge, ml.p5.48xlarge, ml.p5en.48xlarge
Qwen
QwQ 32B
huggingface-llm-qwq-32bText Generation
apache-2.0
No
ml.g5.24xlarge, ml.g5.48xlarge, ml.g6.24xlarge, ml.g6.48xlarge, ml.p4d.24xlarge
Recognai
Recognai Bert Base Spanish Wwm Cased Xnli
huggingface-zstc-recognai-bert-base-spanish-wwm-cased-xnliZero-Shot Classification
mit
No
ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge
Recognai
Recognai Zeroshot Selectra Medium
huggingface-zstc-recognai-zeroshot-selectra-mediumZero-Shot Classification
apache-2.0
No
ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge
Rinna
Rinna Japanese GPT NeoX 3.6B Instruction PPO
huggingface-llm-rinna-3-6b-instruction-ppo-bf16Text Generation
mit
No
ml.g5.2xlarge, ml.g5.4xlarge, ml.g5.xlarge
Salesforce
Salesforce SFR-Embedding-2_R
huggingface-textembedding-sfr-embedding-2-rText Embedding
CC-BY-NC-4.0
No
ml.g5.12xlarge, ml.g5.24xlarge, ml.g5.2xlarge, ml.g5.48xlarge, ml.g5.4xlarge, ml.g5.8xlarge, ml.g5.xlarge, ml.g6.12xlarge, ml.g6.24xlarge, ml.g6.2xlarge, ml.g6.48xlarge, ml.g6.4xlarge, ml.g6.8xlarge, ml.g6.xlarge, ml.p4d.24xlarge
Salesforce
Salesforce SFR-Embedding-Mistral
huggingface-textembedding-sfr-embedding-mistralText Embedding
CC-BY-NC-4.0
No
ml.g5.12xlarge, ml.g5.24xlarge, ml.g5.2xlarge, ml.g5.48xlarge, ml.g5.4xlarge, ml.g5.8xlarge, ml.g5.xlarge, ml.g6.12xlarge, ml.g6.24xlarge, ml.g6.2xlarge, ml.g6.48xlarge, ml.g6.4xlarge, ml.g6.8xlarge, ml.g6.xlarge, ml.p4d.24xlarge
Sentence-Transformers
All MiniLM L6 v2
huggingface-textembedding-all-MiniLM-L6-v2Text Embedding
Apache-2.0
No
ml.g5.12xlarge, ml.g5.24xlarge, ml.g5.2xlarge, ml.g5.48xlarge, ml.g5.4xlarge, ml.g5.8xlarge, ml.g5.xlarge, ml.g6.12xlarge, ml.g6.24xlarge, ml.g6.2xlarge, ml.g6.48xlarge, ml.g6.4xlarge, ml.g6.8xlarge, ml.g6.xlarge, ml.p4d.24xlarge
Sentence-Transformers
All MiniLM L6 v2
huggingface-sentencesimilarity-all-MiniLM-L6-v2Sentence Similarity
apache-2.0
Yes
ml.c6i.xlarge, ml.g4dn.2xlarge, ml.g5.2xlarge, ml.g5.4xlarge, ml.g5.8xlarge, ml.p3.2xlarge
Sentence-Transformers
Paraphrase Multilingual MiniLM L12 v2
huggingface-textembedding-paraphrase-multilingual-MiniLM-L12-v2Text Embedding
Apache-2.0
No
ml.g5.xlarge, ml.g6.xlarge, ml.g6e.xlarge
Shenzhi-Wang
Llama3 8B Chinese Chat
huggingface-llm-shenzhi-wang-llama3-8B-chinese-chatText Generation
llama3
No
ml.g5.16xlarge, ml.g5.2xlarge, ml.g5.4xlarge, ml.g5.8xlarge
Snowflake
Snowflake Arctic Instruct Vllm
huggingface-llm-snowflake-arctic-instruct-vllmText Generation
apache-2.0
No
ml.p5.48xlarge
Speakleash
Bielik-11B-v3.0-Instruct
huggingface-llm-bielik-11b-v3-0-instructText Generation
apache-2.0
No
ml.g5.12xlarge, ml.g6.12xlarge, ml.g6e.12xlarge, ml.g7e.2xlarge, ml.p4d.24xlarge, ml.p5en.48xlarge
Sshleifer
Distilbart CNN 12-6
huggingface-summarization-distilbart-cnn-12-6Text Summarization
apache-2.0
No
ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge
Sshleifer
Distilbart CNN 6-6
huggingface-summarization-distilbart-cnn-6-6Text Summarization
apache-2.0
No
ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge
Sshleifer
Text Summarization
huggingface-summarization-distilbart-xsum-1-1Text Summarization
apache-2.0
No
ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge
Stabilityai
Japanese StableLM Instruct Alpha 7B v2
model-textgenerationjp-japanese-stablelm-instruct-alpha-7b-v2Text Generation
Apache-2.0
No
ml.g5.12xlarge, ml.g5.16xlarge, ml.g5.24xlarge, ml.g5.2xlarge, ml.g5.4xlarge, ml.g5.8xlarge, ml.p4d.24xlarge, ml.p4de.24xlarge
Stabilityai
Stable Diffusion 2
model-txt2img-stabilityai-stable-diffusion-v2Text-to-Image
creativeml-openrail++-m
No
ml.g4dn.4xlarge, ml.g4dn.8xlarge, ml.g5.2xlarge, ml.g5.4xlarge, ml.g5.8xlarge, ml.p3.2xlarge
Stabilityai
Stable Diffusion 2 Depth FP16
model-depth2img-stable-diffusion-2-depth-fp16Image-to-Image
creativeml-openrail++-m
No
ml.g4dn.16xlarge, ml.g4dn.2xlarge, ml.g4dn.4xlarge, ml.g4dn.8xlarge, ml.g4dn.xlarge, ml.g5.16xlarge, ml.g5.2xlarge, ml.g5.4xlarge, ml.g5.8xlarge, ml.g5.xlarge, ml.p3.2xlarge
Stabilityai
Stable Diffusion 2 FP16
model-txt2img-stabilityai-stable-diffusion-v2-fp16Text-to-Image
creativeml-openrail++-m
No
ml.g4dn.4xlarge, ml.g4dn.8xlarge, ml.g5.2xlarge, ml.g5.4xlarge, ml.g5.8xlarge, ml.p3.2xlarge
Stabilityai
Stable Diffusion 2 Inpainting
model-inpainting-stabilityai-stable-diffusion-2-inpaintingImage-to-Image
creativeml-openrail++-m
No
ml.g4dn.4xlarge, ml.g4dn.8xlarge, ml.g5.2xlarge, ml.g5.4xlarge, ml.g5.8xlarge, ml.p3.2xlarge
Stabilityai
Stable Diffusion 2 Inpainting FP16
model-inpainting-stabilityai-stable-diffusion2-inpainting-fp16Image-to-Image
creativeml-openrail++-m
No
ml.g4dn.xlarge, ml.g5.2xlarge, ml.p3.2xlarge
Stabilityai
Stable Diffusion 2 Inpainting FP16
model-inpainting-stabilityai-stable-diffusion-2-inpainting-fp16Image-to-Image
creativeml-openrail++-m
No
ml.g4dn.xlarge, ml.g5.2xlarge, ml.p3.2xlarge
Stabilityai
Stable Diffusion 2.1
model-txt2img-stabilityai-stable-diffusion-v2-1-baseText-to-Image
creativeml-openrail++-m
Yes
ml.g4dn.4xlarge, ml.g4dn.8xlarge, ml.g5.2xlarge, ml.g5.4xlarge, ml.g5.8xlarge, ml.p3.2xlarge
Stabilityai
Stable Diffusion 2.1 Neuron
huggingface-txt2imgneuron-stabilityai-stable-diffusion-v2-1Text-to-Image
creativeml-openrail++-m
No
ml.inf2.24xlarge, ml.inf2.48xlarge, ml.inf2.8xlarge, ml.inf2.xlarge
Stabilityai
Stable Diffusion x4 upscaler FP16
model-upscaling-stabilityai-stable-diffusion-x4-upscaler-fp16Image-to-Image
openrail++
No
ml.g4dn.4xlarge, ml.g4dn.8xlarge, ml.g5.2xlarge, ml.g5.4xlarge, ml.g5.8xlarge, ml.p3.2xlarge
Stabilityai
Stable Diffusion XL 1.0 (open-source)
model-imagegeneration-stabilityai-stable-diffusion-xl-base-1-0Text-to-Image
openrail++
No
ml.g5.16xlarge, ml.g5.4xlarge, ml.g5.8xlarge
Stabilityai
Stable Diffusion XL Base 1.0 Neuron
huggingface-txt2imgneuron-stabilityai-stable-diffusion-xlbase1Text-to-Image
openrail++
No
ml.inf2.24xlarge, ml.inf2.48xlarge, ml.inf2.8xlarge, ml.inf2.xlarge
Swiss-Ai
Apertus 70B 2509
huggingface-llm-apertus-70b-2509Text Generation
apache-2.0
No
ml.g5.48xlarge, ml.g6.48xlarge, ml.g6e.48xlarge, ml.p4d.24xlarge, ml.p4de.24xlarge, ml.p5.48xlarge, ml.p5en.48xlarge
Swiss-Ai
Apertus 70B Instruct 2509
huggingface-llm-apertus-70b-instruct-2509Text Generation
apache-2.0
No
ml.g5.48xlarge, ml.g6.48xlarge, ml.g6e.48xlarge, ml.p4d.24xlarge, ml.p4de.24xlarge, ml.p5.48xlarge, ml.p5en.48xlarge
Swiss-Ai
Apertus 8B Instruct 2509
huggingface-llm-apertus-8b-instruct-2509Text Generation
apache-2.0
No
ml.g5.12xlarge, ml.g5.16xlarge, ml.g5.24xlarge, ml.g5.2xlarge, ml.g5.48xlarge, ml.g5.4xlarge, ml.g5.8xlarge, ml.g5.xlarge, ml.g6.12xlarge, ml.g6.16xlarge, ml.g6.24xlarge, ml.g6.2xlarge, ml.g6.48xlarge, ml.g6.4xlarge, ml.g6.8xlarge, ml.g6.xlarge, ml.g6e.12xlarge, ml.g6e.16xlarge, ml.g6e.24xlarge, ml.g6e.2xlarge, ml.g6e.48xlarge, ml.g6e.4xlarge, ml.g6e.8xlarge, ml.g6e.xlarge, ml.p4d.24xlarge, ml.p4de.24xlarge, ml.p5.48xlarge, ml.p5en.48xlarge
Thebloke
Mistral 7B OpenOrca AWQ
huggingface-llm-thebloke-mistral-7b-openorca-awqText Generation
apache-2.0
No
ml.g5.12xlarge, ml.g5.24xlarge, ml.g5.2xlarge, ml.g5.48xlarge
Thebloke
Mistral 7B OpenOrca GPTQ
huggingface-llm-mistral-7b-openorca-gptqText Generation
apache-2.0
No
ml.g5.12xlarge, ml.g5.16xlarge, ml.g5.24xlarge, ml.g5.2xlarge, ml.g5.48xlarge, ml.g5.4xlarge, ml.g5.8xlarge, ml.g5.xlarge, ml.p4d.24xlarge, ml.p4de.24xlarge, ml.p5.48xlarge
Thebloke
Mixtral 8x7B Instruct GPTQ
huggingface-llm-mixtral-8x7b-instruct-gptqText Generation
apache-2.0
No
ml.g5.12xlarge, ml.g5.24xlarge, ml.g5.48xlarge, ml.p4d.24xlarge
Thenlper
GTE Large
huggingface-sentencesimilarity-gte-largeSentence Similarity
mit
Yes
ml.c6i.xlarge, ml.g4dn.2xlarge, ml.g5.2xlarge, ml.g5.4xlarge, ml.g5.8xlarge, ml.p3.2xlarge
Thenlper
GTE Small
huggingface-sentencesimilarity-gte-smallSentence Similarity
mit
Yes
ml.c6i.xlarge, ml.g4dn.2xlarge, ml.g5.2xlarge, ml.g5.4xlarge, ml.g5.8xlarge, ml.p3.2xlarge
Tiiuae
Falcon 180B BF16
huggingface-llm-falcon-180b-bf16Text Generation
unknown
No
ml.p4de.24xlarge, ml.p5.48xlarge
Tiiuae
Falcon 180B Chat BF16
huggingface-llm-falcon-180b-chat-bf16Text Generation
unknown
No
ml.p4de.24xlarge, ml.p5.48xlarge
Tiiuae
Falcon 40B BF16
huggingface-llm-falcon-40b-bf16Text Generation
apache-2.0
Yes
ml.g5.12xlarge, ml.g5.24xlarge, ml.g5.48xlarge, ml.p4d.24xlarge
Tiiuae
Falcon 40B Instruct BF16
huggingface-llm-falcon-40b-instruct-bf16Text Generation
apache-2.0
Yes
ml.g5.12xlarge, ml.g5.24xlarge, ml.g5.48xlarge, ml.p4d.24xlarge
Tiiuae
Falcon 7B BF16
huggingface-llm-falcon-7b-bf16Text Generation
apache-2.0
Yes
ml.g5.16xlarge, ml.g5.2xlarge, ml.g5.4xlarge, ml.g5.8xlarge
Tiiuae
Falcon 7B Instruct BF16
huggingface-llm-falcon-7b-instruct-bf16Text Generation
apache-2.0
Yes
ml.g5.16xlarge, ml.g5.2xlarge, ml.g5.4xlarge, ml.g5.8xlarge
Tiiuae
Falcon RW 1B
huggingface-llm-tiiuae-falcon-rw-1bText Generation
apache-2.0
No
ml.g5.2xlarge
Tiiuae
Falcon-H1-0.5B-Instruct
huggingface-llm-falcon-h1-0-5b-instructText Generation
other
No
ml.g4dn.12xlarge, ml.g4dn.16xlarge, ml.g4dn.2xlarge, ml.g4dn.4xlarge, ml.g4dn.8xlarge, ml.g4dn.xlarge, ml.g5.12xlarge, ml.g5.16xlarge, ml.g5.24xlarge, ml.g5.2xlarge, ml.g5.48xlarge, ml.g5.4xlarge, ml.g5.8xlarge, ml.g5.xlarge, ml.g6.12xlarge, ml.g6.16xlarge, ml.g6.24xlarge, ml.g6.2xlarge, ml.g6.48xlarge, ml.g6.4xlarge, ml.g6.8xlarge, ml.g6.xlarge, ml.g6e.12xlarge, ml.g6e.16xlarge, ml.g6e.24xlarge, ml.g6e.2xlarge, ml.g6e.4xlarge, ml.g6e.8xlarge, ml.g6e.xlarge, ml.p3.2xlarge, ml.p4d.24xlarge
Tiiuae
Falcon-H1-1.5B-Deep-Instruct
huggingface-llm-falcon-h1-1-5b-deep-instructText Generation
other
No
ml.g5.16xlarge, ml.g5.4xlarge, ml.g5.8xlarge, ml.g6.16xlarge, ml.g6.2xlarge, ml.g6.4xlarge, ml.g6.8xlarge, ml.g6.xlarge, ml.g6e.16xlarge, ml.g6e.2xlarge, ml.g6e.4xlarge, ml.g6e.8xlarge, ml.g6e.xlarge
Tiiuae
Falcon-H1-1.5B-Instruct
huggingface-llm-falcon-h1-1-5b-instructText Generation
other
No
ml.g6.12xlarge, ml.g6.16xlarge, ml.g6.24xlarge, ml.g6.2xlarge, ml.g6.48xlarge, ml.g6.4xlarge, ml.g6.8xlarge, ml.g6.xlarge, ml.g6e.12xlarge, ml.g6e.16xlarge, ml.g6e.24xlarge, ml.g6e.2xlarge, ml.g6e.48xlarge, ml.g6e.4xlarge, ml.g6e.8xlarge, ml.g6e.xlarge
Tiiuae
Falcon-H1-34B-Instruct
huggingface-llm-falcon-h1-34b-instructText Generation
other
No
ml.g6e.12xlarge, ml.g6e.24xlarge, ml.g6e.48xlarge
Tiiuae
Falcon-H1-3B-Instruct
huggingface-llm-falcon-h1-3b-instructText Generation
other
No
ml.g6e.16xlarge, ml.g6e.2xlarge, ml.g6e.4xlarge, ml.g6e.8xlarge, ml.g6e.xlarge
Tiiuae
Falcon-H1-7B-Instruct
huggingface-llm-falcon-h1-7b-instructText Generation
other
No
ml.g6e.12xlarge, ml.g6e.24xlarge
Tiiuae
Falcon2-11B
huggingface-llm-falcon2-11bText Generation
unknown
No
ml.g5.12xlarge, ml.g5.24xlarge, ml.g5.48xlarge, ml.p4d.24xlarge
Tiiuae
Falcon3-10B-Base
huggingface-llm-falcon-3-10B-baseText Generation
other
No
ml.g5.12xlarge, ml.g5.24xlarge, ml.g5.48xlarge, ml.g6.12xlarge, ml.g6.24xlarge, ml.g6.48xlarge, ml.p4d.24xlarge, ml.p5.48xlarge
Tiiuae
Falcon3-10B-Instruct
huggingface-llm-falcon-3-10B-InstructText Generation
other
No
ml.g4dn.12xlarge, ml.g5.12xlarge, ml.g5.24xlarge, ml.g6.12xlarge, ml.g6.24xlarge, ml.g6e.12xlarge, ml.g6e.16xlarge, ml.g6e.24xlarge, ml.g6e.2xlarge, ml.g6e.4xlarge, ml.g6e.8xlarge, ml.g6e.xlarge
Tiiuae
Falcon3-1B-Instruct
huggingface-llm-falcon-3-1B-InstructText Generation
other
No
ml.g4dn.xlarge, ml.g5.xlarge, ml.g6.xlarge, ml.g6e.xlarge, ml.p3.2xlarge, ml.p4d.24xlarge
Tiiuae
Falcon3-3B-Base
huggingface-llm-falcon-3-3B-baseText Generation
other
No
ml.g5.12xlarge, ml.g5.16xlarge, ml.g5.24xlarge, ml.g5.2xlarge, ml.g5.48xlarge, ml.g5.4xlarge, ml.g5.8xlarge, ml.g5.xlarge, ml.g6.12xlarge, ml.g6.16xlarge, ml.g6.24xlarge, ml.g6.2xlarge, ml.g6.48xlarge, ml.g6.4xlarge, ml.g6.8xlarge, ml.g6.xlarge, ml.p4d.24xlarge, ml.p5.48xlarge
Tiiuae
Falcon3-3B-Instruct
huggingface-llm-falcon-3-3B-InstructText Generation
other
No
ml.g4dn.12xlarge, ml.g4dn.16xlarge, ml.g4dn.4xlarge, ml.g5.12xlarge, ml.g5.16xlarge, ml.g5.24xlarge, ml.g5.2xlarge, ml.g5.4xlarge, ml.g5.xlarge, ml.g6.12xlarge, ml.g6.16xlarge, ml.g6.24xlarge, ml.g6.2xlarge, ml.g6.4xlarge, ml.g6.8xlarge, ml.g6e.12xlarge, ml.g6e.16xlarge, ml.g6e.24xlarge, ml.g6e.2xlarge, ml.g6e.8xlarge, ml.g6e.xlarge, ml.p3.2xlarge
Tiiuae
Falcon3-7B-Base
huggingface-llm-falcon-3-7B-baseText Generation
other
No
ml.g5.12xlarge, ml.g5.24xlarge, ml.g5.48xlarge, ml.g6.12xlarge, ml.g6.24xlarge, ml.g6.48xlarge, ml.p4d.24xlarge, ml.p5.48xlarge
Tiiuae
Falcon3-7B-Instruct
huggingface-llm-falcon-3-7B-InstructText Generation
other
No
ml.g4dn.12xlarge, ml.g5.12xlarge, ml.g5.24xlarge, ml.g6.12xlarge, ml.g6.24xlarge, ml.g6e.12xlarge, ml.g6e.24xlarge, ml.g6e.2xlarge, ml.g6e.4xlarge, ml.g6e.8xlarge, ml.g6e.xlarge
Tinyllama
Tiny Llama 1.1B
huggingface-llm-tinyllama-1-1b-intermediate-step-1431k-3Text Generation
apache-2.0
No
ml.g5.12xlarge, ml.g5.24xlarge, ml.g5.2xlarge
Tinyllama
Tiny Llama 1.1B Chat V0.6
huggingface-llm-tinyllama-tinyllama-1-1b-chat-v0-6Text Generation
apache-2.0
No
ml.g5.12xlarge, ml.g5.24xlarge, ml.g5.2xlarge
Tinyllama
Tiny Llama 1.1B Chat V1
huggingface-llm-tinyllama-tinyllama-1-1b-chat-v1-0Text Generation
apache-2.0
No
ml.g5.12xlarge, ml.g5.24xlarge, ml.g5.2xlarge
Togethercomputer
RedPajama INCITE Chat 3B V1
huggingface-textgeneration1-redpajama-incite-chat-3B-v1-fp16Text Generation
apache-2.0
Yes
ml.g5.16xlarge, ml.g5.2xlarge, ml.g5.4xlarge, ml.g5.8xlarge
Togethercomputer
RedPajama INCITE Chat 7B V1
huggingface-textgeneration1-redpajama-incite-chat-7B-v1-fp16Text Generation
apache-2.0
Yes
ml.g5.16xlarge, ml.g5.2xlarge, ml.g5.4xlarge, ml.g5.8xlarge
Togethercomputer
RedPajama INCITE Instruct 3B V1
huggingface-textgeneration1-redpajama-incite-instruct-3Bv1fp16Text Generation
apache-2.0
Yes
ml.g5.16xlarge, ml.g5.2xlarge, ml.g5.4xlarge, ml.g5.8xlarge
Togethercomputer
RedPajama INCITE Instruct 7B V1
huggingface-textgeneration1-redpajama-incite-instruct-7B1fp16Text Generation
apache-2.0
Yes
ml.g5.16xlarge, ml.g5.2xlarge, ml.g5.4xlarge, ml.g5.8xlarge
Tokyotech-Llm
Swallow-7b-instruct-hf
huggingface-llm-swallow-7b-instruct-hfText Generation
llama2
No
ml.g4dn.12xlarge, ml.g5.12xlarge, ml.g5.16xlarge, ml.g5.24xlarge, ml.g5.2xlarge, ml.g5.48xlarge, ml.g5.4xlarge, ml.g5.8xlarge, ml.g5.xlarge, ml.g6.12xlarge, ml.g6.16xlarge, ml.g6.24xlarge, ml.g6.2xlarge, ml.g6.48xlarge, ml.g6.4xlarge, ml.g6.8xlarge, ml.g6.xlarge, ml.g6e.12xlarge, ml.g6e.16xlarge, ml.g6e.4xlarge, ml.g6e.8xlarge, ml.p3.16xlarge, ml.p3.8xlarge
Volrath50
volrath50 fantasy card diffusion
huggingface-txt2img-volrath50-fantasy-card-diffusionText-to-Image
creativeml-openrail-m
No
ml.g4dn.2xlarge, ml.g4dn.xlarge, ml.p3.2xlarge
Writer
Writer Palmyra Small
huggingface-llm-writer-palmyra-smallText Generation
apache-2.0
No
ml.g5.2xlarge
- Proprietary Models (121)
-
Proprietary foundation models are offered by third-party providers through Amazon SageMaker JumpStart. To use a proprietary foundation model, you must first subscribe to the model in AWS Marketplace. After subscribing, you can locate and deploy the foundation model in Amazon SageMaker Studio.
Model Name Model ID Task Fine-tunable Supported Inference Instance Types A.X 4.0 Light
sk-telecom-ax4-lightText Generation
No
ml.g5.2xlarge
AI21 Contextual Answers
ai21-contextual-answersText Generation
No
ml.g5.12xlarge, ml.g5.48xlarge, ml.p4d.24xlarge, ml.g4dn.12xlarge
AI21 Jurassic-2 Light
ai21-jurassic-2-lightText Generation
No
ml.g4dn.12xlarge, ml.g5.12xlarge, ml.g5.48xlarge, ml.p4d.24xlarge
AI21 Jurassic-2 Mid
ai21-jurassic-2-grande-instructText Generation
No
ml.g5.12xlarge, ml.g5.48xlarge, ml.p4d.24xlarge, ml.g4dn.12xlarge
AI21 Jurassic-2 Ultra
ai21-jurassic-2-jumbo-instructText Generation
No
ml.g5.48xlarge, ml.p4d.24xlarge, ml.p4de.24xlarge
AI21 Paraphrase
ai21-paraphraseText Generation
No
ml.g4dn.2xlarge
AI21 Summarize
ai21-summarizationText Generation
No
ml.g4dn.4xlarge, ml.g4dn.12xlarge, ml.g5.xlarge
Arcee Lite
arcee-liteText Generation
No
ml.g5.xlarge, ml.g5.2xlarge, ml.g5.4xlarge, ml.g5.8xlarge, ml.g5.16xlarge, ml.g6.xlarge, ml.g6.2xlarge, ml.g6.4xlarge, ml.g6.8xlarge, ml.g6.16xlarge
Arcee SuperNova
arcee-supernovaText Generation
No
ml.p4d.24xlarge, ml.p4de.24xlarge, ml.p5.48xlarge
Arcee Virtuoso Small
arcee-virtuoso-smallText Generation
No
ml.g5.12xlarge, ml.g5.24xlarge, ml.g6.12xlarge, ml.g6.24xlarge, ml.g6e.12xlarge, ml.g6e.24xlarge, ml.g5.48xlarge, ml.g6.48xlarge, ml.g6e.48xlarge, ml.p4d.24xlarge, ml.p4de.24xlarge, ml.p5.48xlarge
Bria 2.2HD Commercial Text-to-image
bria-ai-2-2-hd-commercialReRank
No
ml.g5.2xlarge, ml.g5.12xlarge, ml.g5.48xlarge, ml.p4d.24xlarge, ml.p4de.24xlarge
Bria 2.3 Commercial Text-to-image
bria-ai-2-3-commercialReRank
No
ml.g5.2xlarge, ml.g5.12xlarge, ml.g5.48xlarge, ml.p4d.24xlarge, ml.p4de.24xlarge
Bria 2.3Fast Commercial Text-to-image
bria-ai-2-3-fast-commercialReRank
No
ml.g5.2xlarge, ml.g5.12xlarge, ml.g5.48xlarge, ml.p4d.24xlarge, ml.p4de.24xlarge
Cohere Command R (A100)
cohere-command-r-a100Text Generation
No
ml.p4de.24xlarge
Cohere Command R (H100)
cohere-command-r-h100Text Generation
No
ml.p5.48xlarge
Cohere Command R+ (A100)
cohere-command-r-plus-a100Text Generation
No
ml.p4de.24xlarge
Cohere Command R+ (H100)
cohere-command-r-plus-h100Text Generation
No
ml.p5.48xlarge
Cohere Embed 4
cohere-embed-v4-0Text Embedding
No
ml.g5.xlarge, ml.g5.2xlarge, ml.g6.xlarge, ml.g6.2xlarge
Cohere Embed Light Model v3 - English
cohere-embed-light-englishText Embedding
No
ml.p3.2xlarge, ml.g4dn.xlarge, ml.g4dn.2xlarge, ml.g5.xlarge, ml.g5.2xlarge
Cohere Embed Light v3 - Multilingual
cohere-embed-light-multilingualText Embedding
No
ml.p3.2xlarge, ml.g4dn.xlarge, ml.g4dn.2xlarge, ml.g5.xlarge, ml.g5.2xlarge
Cohere Embed Model 3 - Multilingual
cohere-embed-multilingualText Embedding
No
ml.p3.2xlarge, ml.g4dn.xlarge, ml.g4dn.2xlarge, ml.g5.xlarge, ml.g5.2xlarge
Cohere Embed Model v3 - English
cohere-embed-englishText Embedding
No
ml.p3.2xlarge, ml.g4dn.xlarge, ml.g4dn.2xlarge, ml.g5.xlarge, ml.g5.2xlarge
Cohere Rerank 2 Model - English
cohere-rerank-english-v2ReRank
No
ml.p3.2xlarge, ml.g4dn.xlarge, ml.g4dn.2xlarge, ml.g5.xlarge, ml.g5.2xlarge
Cohere Rerank 2 Model - Multilingual
cohere-rerank-multilingual-v2ReRank
No
ml.p3.2xlarge, ml.g4dn.xlarge, ml.g4dn.2xlarge, ml.g5.xlarge, ml.g5.2xlarge
Cohere Rerank 3 Model - English
cohere-rerank-v3-englishReRank
No
ml.g5.xlarge, ml.g5.2xlarge
Cohere Rerank 3 Model - Multilingual
cohere-rerank-v3-multilingualReRank
No
ml.g5.xlarge, ml.g5.2xlarge
Cohere Rerank 3 Nimble Model - English
cohere-rerank-nimble-englishReRank
No
ml.g5.xlarge, ml.g5.2xlarge
Cohere Rerank 3 Nimble Model - Multi
cohere-rerank-nimble-multiReRank
No
ml.g5.xlarge, ml.g5.2xlarge
Cohere Rerank v3.5
cohere-rerank-v3-5Text Embedding
No
ml.g6.xlarge, ml.g6.2xlarge, ml.g5.xlarge, ml.g5.2xlarge
Cohere Rerank v4.0 Fast
cohere-rerank-v4-0-fastText Embedding
No
ml.g5.xlarge, ml.g5.2xlarge, ml.p5.4xlarge
Cohere Rerank v4.0 Pro
cohere-rerank-v4-0-proText Embedding
No
ml.g5.xlarge, ml.g5.2xlarge, ml.p5.4xlarge
Command A (A100)
cohere-command-a-a100Text Generation
No
ml.p4de.24xlarge
Command A (H100)
cohere-command-a-h100Text Generation
No
ml.p5.48xlarge
Command R 08-2024 (H100)
cohere-command-r-08-2024-h100Text Generation
No
ml.p5.48xlarge
Command R+ 08-2024 (H100)
cohere-command-r-plus-08-2024-h100Text Generation
No
ml.p5.48xlarge
Document OCR
upstage-document-ocrOptical Character Recognition
No
ml.g5.2xlarge
Document Parse
upstage-document-layout-analysisOptical Character Recognition
No
ml.g5.2xlarge, ml.g6.2xlarge, ml.p3.2xlarge
Embed
upstage-solar-embedding-largeText Embedding
No
ml.g5.2xlarge
ESM3-open
evolutionary-scale-esm3Multimodal Generation
No
ml.g5.4xlarge, ml.g5.8xlarge
Evo2-NIM
nvidia-evo2-nimText Generation
No
ml.g6e.12xlarge, ml.g6e.24xlarge, ml.g6e.48xlarge, ml.p4d.24xlarge, ml.p4de.24xlarge, ml.p5.48xlarge, ml.p5en.48xlarge
EXAONE Atelier - Image to Text
lgresearch-exaoneImage2Text Generation
No
ml.g5.xlarge, ml.g5.12xlarge, ml.g5.48xlarge, ml.p4d.24xlarge
Gretel Navigator Tabular
gretel-navigator-tabularText Generation
No
ml.g5.xlarge, ml.g5.2xlarge
H-Optimus-0
bioptimus-h-optimus-0Feature Extraction
No
ml.g5.xlarge
IBM Granite 20B Code Instruct - 8K
ibm-granite-20b-code-instruct-8kText Generation
No
ml.g5.12xlarge, ml.g5.24xlarge, ml.g5.48xlarge
IBM Granite 3.0 2B Instruct
granite-3-0-2b-instructText Generation
No
ml.p4d.24xlarge, ml.p5.48xlarge
IBM Granite 3.0 8B Instruct
granite-3-0-8b-instructText Generation
No
ml.p4d.24xlarge, ml.p5.48xlarge
IBM Granite 3.2 Instruct 2B
ibm-granite-3-2-2b-instructText Generation
No
ml.g6e.2xlarge, ml.g6e.4xlarge, ml.g6e.12xlarge, ml.g6e.16xlarge, ml.g6e.24xlarge, ml.g6e.48xlarge, ml.p4d.24xlarge, ml.p5.48xlarge, ml.g6.12xlarge, ml.g6.16xlarge, ml.g6.24xlarge, ml.g6.48xlarge
IBM Granite 3.2 Instruct 8B
ibm-granite-3-2-8b-instructText Generation
No
ml.g6.12xlarge, ml.g6.16xlarge, ml.g6.24xlarge, ml.g6.48xlarge, ml.g6e.2xlarge, ml.g6e.4xlarge, ml.g6e.12xlarge, ml.g6e.16xlarge, ml.g6e.24xlarge, ml.g6e.48xlarge, ml.p4d.24xlarge, ml.p5.48xlarge
IBM Granite 34B Code Instruct - 8K
ibm-granite-34b-code-instruct-8kText Generation
No
ml.g5.12xlarge, ml.g5.24xlarge, ml.g5.48xlarge
IBM Granite 3B Code Instruct - 128K
ibm-granite-3b-code-instruct-128kText Generation
No
ml.g5.4xlarge, ml.g5.8xlarge, ml.g5.16xlarge, ml.g5.12xlarge, ml.g5.24xlarge, ml.g5.48xlarge
IBM Granite 4.0 h-micro
ibm-granite-4-0-h-microText Generation
No
ml.g6e.2xlarge, ml.g6e.4xlarge, ml.g6e.12xlarge, ml.g6e.16xlarge, ml.g6e.24xlarge, ml.g6e.48xlarge, ml.p4d.24xlarge, ml.p5.48xlarge
IBM Granite 4.0 h-small
ibm-granite-4-0-h-smallText Generation
No
ml.g6e.2xlarge, ml.g6e.4xlarge, ml.g6e.12xlarge, ml.g6e.16xlarge, ml.g6e.24xlarge, ml.g6e.48xlarge, ml.p4d.24xlarge, ml.p5.48xlarge
IBM Granite 4.0 h-tiny
ibm-granite-4-0-h-tinyText Generation
No
ml.g6e.2xlarge, ml.g6e.4xlarge, ml.g6e.12xlarge, ml.g6e.16xlarge, ml.g6e.24xlarge, ml.g6e.48xlarge, ml.p4d.24xlarge, ml.p5.48xlarge
IBM Granite 8B Code Instruct - 128K
ibm-granite-8b-code-instruct-128kText Generation
No
ml.g5.4xlarge, ml.g5.8xlarge, ml.g5.16xlarge, ml.g5.12xlarge, ml.g5.24xlarge, ml.g5.48xlarge
ixi-GEN-Fin-7.8B
lg-ixi-genText Generation
No
ml.g5.4xlarge
JetBrains AI Mellum All
jbai-mellum-alltext2text generation
No
ml.g5.2xlarge, ml.g6.2xlarge, ml.g6e.xlarge, ml.p4d.24xlarge, ml.p4de.24xlarge, ml.p5.48xlarge
JetBrains AI Mellum Kotlin
jbai-mellum-kotlintext2text generation
No
ml.g5.2xlarge, ml.g6.2xlarge, ml.g6e.xlarge, ml.p4d.24xlarge, ml.p4de.24xlarge, ml.p5.48xlarge
JetBrains AI Mellum Python
jbai-mellum-pythontext2text generation
No
ml.g5.2xlarge, ml.g6.2xlarge, ml.g6e.xlarge, ml.p4d.24xlarge, ml.p4de.24xlarge, ml.p5.48xlarge
Jina Embeddings v2 Base - en
jinaai-embeddings-v2-base-enText Embedding
No
ml.p2.xlarge, ml.p2.8xlarge, ml.p2.16xlarge, ml.p3.2xlarge, ml.p3.8xlarge, ml.p3.16xlarge, ml.g4dn.xlarge, ml.g4dn.2xlarge, ml.g4dn.4xlarge, ml.g4dn.8xlarge, ml.g4dn.12xlarge, ml.g4dn.16xlarge, ml.g5.xlarge, ml.g5.2xlarge, ml.g5.4xlarge, ml.g5.8xlarge, ml.g5.12xlarge, ml.g5.16xlarge, ml.g5.24xlarge, ml.g5.48xlarge
KARAKURI LM 8x7b instruct
karakuri-lm-8x7b-instructText Generation
No
ml.g5.48xlarge, ml.g6.48xlarge
LightOn Lyra-Fr 10B
lighton-lyra-frText Generation
No
ml.p4d.24xlarge
LightOn Mini-instruct 40B
lighton-mini-instruct40bText Generation
No
ml.p4d.24xlarge
Liquid LFM 40B (A100)
liquid-lfm-40b-a100Text Generation
No
ml.p4d.24xlarge
Liquid LFM 40B (H100)
liquid-lfm-40b-h100Text Generation
No
ml.p5.48xlarge
Liquid LFM 40B (L40S)
liquid-lfm-40b-l40sText Generation
No
ml.g6e.12xlarge
Liquid LFM 7B (L40S)
liquid-lfm-7b-l40sText Generation
No
ml.g6e.xlarge, ml.g6e.2xlarge, ml.g6e.4xlarge, ml.g6e.8xlarge, ml.g6e.16xlarge
Llama 3.1 Nemotron Nano 8B V1
nvidia-nemotron-nano-8b-nimText Generation
No
ml.g5.12xlarge, ml.g5.24xlarge, ml.g5.48xlarge, ml.g6e.2xlarge, ml.g6e.4xlarge, ml.g6e.8xlarge, ml.g6e.16xlarge, ml.g6e.12xlarge, ml.g6e.24xlarge, ml.g6e.48xlarge, ml.p4d.24xlarge, ml.p4de.24xlarge, ml.p5.48xlarge, ml.p5en.48xlarge
Llama 3.1 SuperNova Lite
arcee-llama-3-1-supernova-liteText Generation
No
ml.g5.2xlarge, ml.g5.4xlarge, ml.g5.8xlarge, ml.g5.16xlarge, ml.g6.2xlarge, ml.g6.4xlarge, ml.g6.8xlarge, ml.g6.16xlarge
Llama 3.2 NVRerankQA1B NIM microservice
nvidia-llama3-2-nv-rerankqa-1b-v2-nimText Generation
No
ml.g5.xlarge, ml.g5.2xlarge, ml.g5.4xlarge, ml.g5.8xlarge, ml.g5.12xlarge, ml.g5.16xlarge, ml.g5.24xlarge, ml.g5.48xlarge, ml.g6.xlarge, ml.g6.2xlarge, ml.g6.4xlarge, ml.g6.8xlarge, ml.g6.12xlarge, ml.g6.16xlarge, ml.g6.24xlarge, ml.g6.48xlarge, ml.p4d.24xlarge, ml.p5.48xlarge, ml.g6e.xlarge, ml.g6e.2xlarge, ml.g6e.4xlarge, ml.g6e.8xlarge, ml.g6e.12xlarge, ml.g6e.16xlarge, ml.g6e.24xlarge, ml.g6e.48xlarge, ml.p4de.24xlarge
Llama 3.3 Nemotron Super 49B V1
nvidia-nemotron-super-49b-nimText Generation
No
ml.g5.48xlarge, ml.g6e.12xlarge, ml.g6e.24xlarge, ml.g6e.48xlarge, ml.p4d.24xlarge, ml.p4de.24xlarge, ml.p5.48xlarge, ml.p5en.48xlarge
Llama 3.3 Nemotron Super 49B V1.5
nvidia-nemotron-super-49b-nim-1-5Text Generation
No
ml.g5.48xlarge, ml.g6e.12xlarge, ml.g6e.24xlarge, ml.g6e.48xlarge, ml.p4d.24xlarge, ml.p4de.24xlarge, ml.p5.48xlarge, ml.p5en.48xlarge
Llama Spark
arcee-llama-sparkText Generation
No
ml.g5.2xlarge, ml.g5.4xlarge, ml.g5.8xlarge, ml.g5.16xlarge, ml.g6.2xlarge, ml.g6.4xlarge, ml.g6.8xlarge, ml.g6.16xlarge
Llama-3-Varco-Offsetbias-8B
ncsoft-llama-3-varco-offsetbias-8bText Generation
No
ml.g5.12xlarge
Llama-3.1-8B-Instruct with Stained Glass Transform Support
protopia-llama-3-1-8b-instructText Generation
No
ml.g4dn.12xlarge, ml.g5.4xlarge, ml.g5.16xlarge
Llama-3.2-NV-EmbedQA-1B-v2
nvidia-llama3-2-nv-embedqa-1b-v2-nimText Embedding
No
ml.g5.xlarge, ml.g5.2xlarge, ml.g5.4xlarge, ml.g5.8xlarge, ml.g5.12xlarge, ml.g5.16xlarge, ml.g5.24xlarge, ml.g5.48xlarge, ml.g6.xlarge, ml.g6.2xlarge, ml.g6.4xlarge, ml.g6.8xlarge, ml.g6.12xlarge, ml.g6.16xlarge, ml.g6.24xlarge, ml.g6.48xlarge, ml.p4d.24xlarge, ml.p5.48xlarge, ml.g6e.xlarge, ml.g6e.2xlarge, ml.g6e.4xlarge, ml.g6e.8xlarge, ml.g6e.12xlarge, ml.g6e.16xlarge, ml.g6e.24xlarge, ml.g6e.48xlarge, ml.p4de.24xlarge
MARS6
cambai-mars6Text To Audio
No
ml.g4dn.2xlarge, ml.g6.2xlarge
Medical LLM - Medium
john-snow-labs-medical-summarization-qa-8bText Summarization
No
ml.g5.2xlarge, ml.g4dn.12xlarge
Medical LLM - Small
john-snow-labs-summarization-qaText Summarization
No
ml.g5.12xlarge
Medical Text Translation (EN-ES)
john-snow-labs-medical-translation-en-esTranslation
No
ml.g5.2xlarge
Mercury
inception-mercuryText Generation
No
ml.p5.48xlarge
Mercury Coder
inception-mercury-coderText Generation
No
ml.p5.48xlarge
msa-search
nvidia-nim-msa-search-v2-1Text Generation
No
ml.g6e.12xlarge, ml.g6e.24xlarge, ml.g6e.48xlarge, ml.p4d.24xlarge, ml.p5.48xlarge, ml.p4de.24xlarge, ml.p5en.48xlarge
Nemotron nano 9b v2
nvidia-nemotron-nano-9b-v2Text Generation
No
ml.g5.2xlarge, ml.g5.4xlarge, ml.g5.8xlarge, ml.g5.16xlarge, ml.g5.12xlarge, ml.g5.24xlarge, ml.g5.48xlarge, ml.g6e.xlarge, ml.g6e.2xlarge, ml.g6e.4xlarge, ml.g6e.8xlarge, ml.g6e.16xlarge, ml.g6e.12xlarge, ml.g6e.24xlarge, ml.g6e.48xlarge, ml.p4d.24xlarge, ml.p4de.24xlarge, ml.p5.4xlarge, ml.p5.48xlarge, ml.p5en.48xlarge
nemotron-parse
nvidia-nemotron-parseText Generation
No
ml.g5.xlarge, ml.g5.2xlarge, ml.g5.4xlarge, ml.g5.8xlarge, ml.g5.16xlarge, ml.g5.12xlarge, ml.g5.24xlarge, ml.g5.48xlarge, ml.g6e.xlarge, ml.g6e.2xlarge, ml.g6e.4xlarge, ml.g6e.8xlarge, ml.g6e.16xlarge, ml.g6e.12xlarge, ml.g6e.24xlarge, ml.g6e.48xlarge, ml.p4d.24xlarge, ml.p4de.24xlarge, ml.p5.4xlarge, ml.p5.48xlarge, ml.p5en.48xlarge
NEXUS by Fundamental
fundamental-technologies-nexusClassification
No
ml.g4dn.8xlarge, ml.p5en.48xlarge
Nomic Embed Text v1.5
nomic-embed-textText Embedding
No
ml.p3.2xlarge, ml.p3.8xlarge, ml.p3.16xlarge, ml.g4dn.xlarge, ml.g4dn.2xlarge, ml.g4dn.4xlarge, ml.g4dn.8xlarge, ml.g4dn.12xlarge, ml.g4dn.16xlarge, ml.g5.xlarge, ml.g5.2xlarge, ml.g5.4xlarge, ml.g5.8xlarge, ml.g5.12xlarge, ml.g5.16xlarge, ml.g5.24xlarge, ml.g5.48xlarge
Nomic Embed Vision v1.5
nomic-embed-imageText Embedding
No
ml.p3.2xlarge, ml.p3.8xlarge, ml.p3.16xlarge, ml.g4dn.xlarge, ml.g4dn.2xlarge, ml.g4dn.4xlarge, ml.g4dn.8xlarge, ml.g4dn.12xlarge, ml.g4dn.16xlarge, ml.g5.xlarge, ml.g5.2xlarge, ml.g5.4xlarge, ml.g5.8xlarge, ml.g5.12xlarge, ml.g5.16xlarge, ml.g5.24xlarge, ml.g5.48xlarge
NVIDIA Cosmos Reason-1-7B
nvidia-nim-cosmos-reason1-7bText Generation
No
ml.g5.12xlarge, ml.g5.24xlarge, ml.g5.48xlarge, ml.g6e.xlarge, ml.g6e.2xlarge, ml.g6e.4xlarge, ml.g6e.8xlarge, ml.g6e.16xlarge, ml.g6e.12xlarge, ml.g6e.24xlarge, ml.g6e.48xlarge, ml.p4d.24xlarge, ml.p4de.24xlarge, ml.p5.4xlarge, ml.p5.48xlarge, ml.p5en.48xlarge
NVIDIA Nemotron-4 15B NIM Microservice
nvidia-nemotron-4-15b-nimText Generation
No
ml.g5.12xlarge, ml.g5.24xlarge
NVIDIA ParakeetvTDT 0.6B v2
nvidia-parakeetvtdt-0-6b-v2Audio2Text Generation
No
ml.g6e.2xlarge, ml.g6e.4xlarge, ml.g6e.8xlarge, ml.g6e.12xlarge, ml.g6e.16xlarge, ml.g6e.24xlarge, ml.g6e.48xlarge, ml.g6.2xlarge, ml.g6.4xlarge, ml.g6.8xlarge, ml.g6.12xlarge, ml.g6.16xlarge, ml.g6.24xlarge, ml.g6.48xlarge
NVIDIA-Parakeet-1-1b-CTC-EN-US-ASR
nvidia-parakeet-1-1b-ctc-en-us-asrAudio2Text Generation
No
ml.g5.2xlarge, ml.g5.4xlarge, ml.g5.8xlarge, ml.g5.16xlarge, ml.g5.12xlarge, ml.g5.24xlarge, ml.g5.48xlarge, ml.g6.2xlarge, ml.g6.4xlarge, ml.g6.8xlarge, ml.g6.16xlarge, ml.g6.12xlarge, ml.g6.24xlarge, ml.g6.48xlarge, ml.g6e.2xlarge, ml.g6e.4xlarge, ml.g6e.8xlarge, ml.g6e.16xlarge, ml.g6e.12xlarge, ml.g6e.24xlarge, ml.g6e.48xlarge, ml.p4d.24xlarge, ml.p4de.24xlarge
Orb
orbital-materials-orbScientific Modelling
No
ml.p3.2xlarge, ml.p3.8xlarge, ml.p3.16xlarge, ml.p4d.24xlarge, ml.g4dn.xlarge, ml.g4dn.2xlarge, ml.g4dn.4xlarge, ml.g4dn.8xlarge, ml.g4dn.12xlarge, ml.g4dn.16xlarge
PLaMo Private Deployment
preferred-networks-plamo-apiText Generation
No
ml.p4d.24xlarge, ml.g5.12xlarge, ml.g5.24xlarge, ml.g5.48xlarge
ProteinMPNN-NIM
nvidia-nim-proteinmpnn-v1-0-2Text Generation
No
ml.g6e.12xlarge, ml.g6e.24xlarge, ml.g6e.48xlarge, ml.p4d.24xlarge, ml.p4de.24xlarge, ml.p5.48xlarge, ml.p5en.48xlarge
rerank-lite-1 Reranker
voyage-rerank-lite-1-rerankerReRank
No
ml.g5.xlarge, ml.g5.2xlarge, ml.g5.4xlarge, ml.g5.8xlarge, ml.g5.16xlarge
Solar Mini - Quant
upstage-solar-mini-chat-quantText Generation
No
ml.g5.2xlarge
Solar Pro - Quant
upstage-solar-pro-quantizedText Generation
No
ml.g5.12xlarge
Solar Pro 2
upstage-solar-proText Generation
No
ml.p4d.24xlarge, ml.p5.48xlarge
Sonic 3 SageMaker
cartesia-sonic-3-sagemakerText To Audio
No
ml.g6e.xlarge
Stable Diffusion 3.5 Large
stabilityai-stable-diffusion-3-5-largeText To Image
No
ml.p5.48xlarge
Stable Diffusion XL 1.0
stabilityai-sdxl-1-0Text To Image
No
ml.g5.2xlarge, ml.p4d.24xlarge, ml.p4de.24xlarge
Stable Diffusion XL Beta 0.8
stabilityai-sdxl-beta-0-8Text To Image
No
ml.g5.xlarge
Stockmark-LLM-13b
stockmark-llm-13bText Generation
No
ml.g5.2xlarge
VARCO LLM KO-1.3B-IST
ncsoft-ko-1-3b-istText Generation
No
ml.g5.2xlarge, ml.g5.4xlarge, ml.g4dn.2xlarge
VARCO LLM KO-6.4B-IST
ncsoft-ko-6-4b-istText Generation
No
ml.g5.4xlarge, ml.g5.8xlarge, ml.g5.12xlarge, ml.g4dn.4xlarge, ml.g4dn.8xlarge
VARCO LLM KO/EN-13B-IST
ncsoft-ko-13b-istText Generation
No
ml.g5.12xlarge, ml.g4dn.12xlarge
voyage-2 Embedding Model
voyage-2-embeddingText Embedding
No
ml.g5.xlarge, ml.g5.2xlarge, ml.g5.4xlarge, ml.g5.8xlarge, ml.g5.16xlarge
voyage-3 Embedding Model
voyage-3-embeddingText Embedding
No
ml.g5.xlarge, ml.g5.2xlarge, ml.g5.4xlarge, ml.g5.8xlarge, ml.g5.16xlarge, ml.g6.xlarge, ml.g6.2xlarge, ml.g6.4xlarge, ml.g6.8xlarge, ml.g6.16xlarge
voyage-3 Large Embedding Model
voyage-3-large-embeddingText Embedding
No
ml.g6.2xlarge, ml.g5.xlarge, ml.g5.8xlarge, ml.g6.4xlarge, ml.g5.2xlarge, ml.g5.4xlarge, ml.g6.8xlarge, ml.g6.xlarge
voyage-3.5 Embedding Model
voyage-3-5-embeddingText Embedding
No
ml.g5.16xlarge, ml.g6.16xlarge, ml.g6.2xlarge, ml.g5.xlarge, ml.g5.8xlarge, ml.g6.4xlarge, ml.g5.2xlarge, ml.g5.4xlarge, ml.g6.8xlarge, ml.g6.xlarge
voyage-3.5-lite Embedding Model
voyage-3-5-lite-embeddingText Embedding
No
ml.g6.16xlarge, ml.g6.2xlarge, ml.g5.xlarge, ml.g5.8xlarge, ml.g6.4xlarge, ml.g5.2xlarge, ml.g5.4xlarge, ml.g6.8xlarge, ml.g5.16xlarge, ml.g6.xlarge
voyage-code-2 Embedding Model
voyage-code-2-embeddingText Embedding
No
ml.g5.xlarge, ml.g5.2xlarge, ml.g5.4xlarge, ml.g5.8xlarge, ml.g5.16xlarge, ml.g6.xlarge, ml.g6.2xlarge, ml.g6.4xlarge, ml.g6.8xlarge, ml.g6.16xlarge
voyage-code-3 Embedding Model
voyage-code-3-embeddingText Embedding
No
ml.g6.16xlarge, ml.g6.2xlarge, ml.g5.xlarge, ml.g5.8xlarge, ml.g6.4xlarge, ml.g5.2xlarge, ml.g5.4xlarge, ml.g6.8xlarge, ml.g5.16xlarge, ml.g6.xlarge
voyage-large-2 Embedding Model
voyage-large-2-embeddingText Embedding
No
ml.g5.xlarge, ml.g5.2xlarge, ml.g5.4xlarge, ml.g5.8xlarge, ml.g5.16xlarge, ml.g6.xlarge, ml.g6.2xlarge, ml.g6.4xlarge, ml.g6.8xlarge, ml.g6.16xlarge
Widn Tower Anthill
widn-tower-anthillTranslation
No
ml.g5.xlarge
Widn Tower Sugarloaf
widn-tower-sugarloafTranslation
No
ml.g5.12xlarge
Widn Tower Vesuvius
widn-llama3-tower-vesuviusTranslation
No
ml.g5.48xlarge, ml.p4d.24xlarge, ml.p4de.24xlarge
Woven City AI Vision Engine
wovenbytoyota-woven-city-ai-vision-engineMultimodal Generation
No
ml.g5.8xlarge, ml.g5.12xlarge, ml.g5.24xlarge, ml.g5.48xlarge
Writer Palmyra-Fin-70B-32K
writer-palmyra-fin-70b-32kText Generation
No
ml.p4d.24xlarge
Writer Palmyra-Med-70B-32K
writer-palmyra-med-70b-32kText Generation
No
ml.p4d.24xlarge
Writer Palmyra-X-004
writer-palmyra-x-004Text Generation
No
ml.p5.48xlarge
- Built-in Algorithms (135)
-
Amazon SageMaker JumpStart provides pre-trained, task-specific models for common computer vision tasks such as image classification, object detection, and image segmentation. These models are built on established deep learning frameworks (PyTorch, TensorFlow, MXNet) and are ready to deploy or fine-tune on your own data.
Provider Model Name Model ID Task License Fine-tunable Supported Inference Instance Types Mxnet
Faster RCNN ResNet 101 V1d
mxnet-od-faster-rcnn-resnet101-v1d-cocoObject Detection
apache-2.0
No
ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge
Mxnet
Faster RCNN ResNet 101 V1d
mxnet-od-faster-rcnn-fpn-resnet101-v1d-cocoObject Detection
apache-2.0
No
ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge
Mxnet
Faster RCNN ResNet 50 V1b
mxnet-od-faster-rcnn-resnet50-v1b-vocObject Detection
apache-2.0
No
ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge
Mxnet
Faster RCNN ResNet 50 V1b
mxnet-od-faster-rcnn-resnet50-v1b-cocoObject Detection
apache-2.0
No
ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge
Mxnet
Faster RCNN ResNet 50 V1b
mxnet-od-faster-rcnn-fpn-resnet50-v1b-cocoObject Detection
apache-2.0
No
ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge
Mxnet
FCN ResNet 101 ADE20K
mxnet-semseg-fcn-resnet101-adeImage Segmentation
apache-2.0
Yes
ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge
Mxnet
FCN ResNet 101 Pascal VOC
mxnet-semseg-fcn-resnet101-vocImage Segmentation
apache-2.0
Yes
ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge
Mxnet
FCN ResNet 50 ADE20K
mxnet-semseg-fcn-resnet50-adeImage Segmentation
apache-2.0
Yes
ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge
Mxnet
Instance Segmentation
mxnet-is-mask-rcnn-fpn-resnet101-v1d-cocoImage Segmentation
apache-2.0
No
ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge
Mxnet
MASK RCNN FPN RESNET18 COCO
mxnet-is-mask-rcnn-fpn-resnet18-v1b-cocoImage Segmentation
apache-2.0
No
ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge
Mxnet
MASK RCNN FPN RESNET50 COCO
mxnet-is-mask-rcnn-fpn-resnet50-v1b-cocoImage Segmentation
apache-2.0
No
ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge
Mxnet
MASK RCNN RESNET18 COCO
mxnet-is-mask-rcnn-resnet18-v1b-cocoImage Segmentation
apache-2.0
No
ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge
Mxnet
Object Detection
mxnet-od-ssd-512-mobilenet1-0-cocoObject Detection
apache-2.0
Yes
ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge
Mxnet
Semantic Segmentation
mxnet-semseg-fcn-resnet101-cocoImage Segmentation
apache-2.0
Yes
ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge
Mxnet
SSD 512 ResNet 50 V1
mxnet-od-ssd-512-resnet50-v1-cocoObject Detection
apache-2.0
Yes
ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge
Mxnet
SSD MobileNet 1.0
mxnet-od-ssd-512-mobilenet1-0-vocObject Detection
apache-2.0
Yes
ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge
Mxnet
SSD ResNet 50 V1
mxnet-od-ssd-512-resnet50-v1-vocObject Detection
apache-2.0
Yes
ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge
Mxnet
SSD VGG 16 Atrous 300
mxnet-od-ssd-300-vgg16-atrous-vocObject Detection
apache-2.0
Yes
ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge
Mxnet
SSD VGG 16 Atrous 512
mxnet-od-ssd-512-vgg16-atrous-cocoObject Detection
apache-2.0
Yes
ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge
Mxnet
SSD VGG16 Atrous 300
mxnet-od-ssd-300-vgg16-atrous-cocoObject Detection
apache-2.0
Yes
ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge
Mxnet
SSD VGG16 Atrous 512
mxnet-od-ssd-512-vgg16-atrous-vocObject Detection
apache-2.0
Yes
ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge
Mxnet
YOLO V3 DarkNet 53
mxnet-od-yolo3-darknet53-vocObject Detection
apache-2.0
No
ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge
Mxnet
YOLO V3 DarkNet 53
mxnet-od-yolo3-darknet53-cocoObject Detection
apache-2.0
No
ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge
Mxnet
YOLO V3 MobileNet 1.0
mxnet-od-yolo3-mobilenet1-0-vocObject Detection
apache-2.0
No
ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge
Mxnet
YOLO V3 MobileNet 1.0
mxnet-od-yolo3-mobilenet1-0-cocoObject Detection
apache-2.0
No
ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge
Pytorch
SqueezeNet 1
pytorch-ic-squeezenet1-1Image Classification
bsd-3-clause
Yes
ml.c5.2xlarge, ml.c5.xlarge, ml.m4.xlarge, ml.m5.large, ml.m5.xlarge
Pytorch
AlexNet
pytorch-ic-alexnetImage Classification
bsd-3-clause
Yes
ml.c5.2xlarge, ml.c5.xlarge, ml.m4.xlarge, ml.m5.large, ml.m5.xlarge
Pytorch
DenseNet 121
pytorch-ic-densenet121Image Classification
bsd-3-clause
Yes
ml.c5.2xlarge, ml.c5.xlarge, ml.m4.xlarge, ml.m5.large, ml.m5.xlarge
Pytorch
DenseNet 169
pytorch-ic-densenet169Image Classification
bsd-3-clause
Yes
ml.c5.2xlarge, ml.c5.xlarge, ml.m4.xlarge, ml.m5.large, ml.m5.xlarge
Pytorch
FRCNN MobileNet V3 large 320 FPN
pytorch-od1-fasterrcnn-mobilenet-v3-large-320-fpnObject Detection
bsd-3-clause
No
ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge
Pytorch
FRCNN MobileNet V3 large FPN
pytorch-od1-fasterrcnn-mobilenet-v3-large-fpnObject Detection
bsd-3-clause
No
ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge
Pytorch
GoogLeNet
pytorch-ic-googlenetImage Classification
bsd-3-clause
Yes
ml.c5.2xlarge, ml.c5.xlarge, ml.m4.xlarge, ml.m5.large, ml.m5.xlarge
Pytorch
Image Classification
pytorch-ic-mobilenet-v2Image Classification
bsd-3-clause
Yes
ml.c5.2xlarge, ml.c5.xlarge, ml.m4.xlarge, ml.m5.large, ml.m5.xlarge
Pytorch
Object Detection
pytorch-od1-fasterrcnn-resnet50-fpnObject Detection
bsd-3-clause
Yes
ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge
Pytorch
ResNet 101
pytorch-ic-resnet101Image Classification
bsd-3-clause
Yes
ml.c5.2xlarge, ml.c5.xlarge, ml.m4.xlarge, ml.m5.large, ml.m5.xlarge
Pytorch
ResNet 152
pytorch-ic-resnet152Image Classification
bsd-3-clause
Yes
ml.c5.2xlarge, ml.c5.xlarge, ml.m4.xlarge, ml.m5.large, ml.m5.xlarge
Pytorch
ResNet 18
pytorch-ic-resnet18Image Classification
bsd-3-clause
Yes
ml.c5.2xlarge, ml.c5.xlarge, ml.m4.xlarge, ml.m5.large, ml.m5.xlarge
Pytorch
ResNet 34
pytorch-ic-resnet34Image Classification
bsd-3-clause
Yes
ml.c5.2xlarge, ml.c5.xlarge, ml.m4.xlarge, ml.m5.large, ml.m5.xlarge
Pytorch
ResNet 50
pytorch-ic-resnet50Image Classification
bsd-3-clause
Yes
ml.c5.2xlarge, ml.c5.xlarge, ml.m4.xlarge, ml.m5.large, ml.m5.xlarge
Pytorch
Resnext 50
pytorch-ic-resnext50-32x4dImage Classification
bsd-3-clause
Yes
ml.c5.2xlarge, ml.c5.xlarge, ml.m4.xlarge, ml.m5.large, ml.m5.xlarge
Pytorch
ShuffleNet V2
pytorch-ic-shufflenet-v2-x1-0Image Classification
bsd-3-clause
Yes
ml.c5.2xlarge, ml.c5.xlarge, ml.m4.xlarge, ml.m5.large, ml.m5.xlarge
Pytorch
SSD
pytorch-od-nvidia-ssdObject Detection
bsd-3-clause
No
ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge
Pytorch
VGG 11
pytorch-ic-vgg11Image Classification
bsd-3-clause
Yes
ml.c5.2xlarge, ml.c5.xlarge, ml.m4.xlarge, ml.m5.large, ml.m5.xlarge
Pytorch
VGG 11-BN
pytorch-ic-vgg11-bnImage Classification
bsd-3-clause
Yes
ml.c5.2xlarge, ml.c5.xlarge, ml.m4.xlarge, ml.m5.large, ml.m5.xlarge
Pytorch
VGG 13-BN
pytorch-ic-vgg13-bnImage Classification
bsd-3-clause
Yes
ml.c5.2xlarge, ml.c5.xlarge, ml.m4.xlarge, ml.m5.large, ml.m5.xlarge
Pytorch
VGG 16
pytorch-ic-vgg16Image Classification
bsd-3-clause
Yes
ml.c5.2xlarge, ml.c5.xlarge, ml.m4.xlarge, ml.m5.large, ml.m5.xlarge
Pytorch
VGG 16-BN
pytorch-ic-vgg16-bnImage Classification
bsd-3-clause
Yes
ml.c5.2xlarge, ml.c5.xlarge, ml.m4.xlarge, ml.m5.large, ml.m5.xlarge
Pytorch
VGG 19
pytorch-ic-vgg19Image Classification
bsd-3-clause
Yes
ml.c5.2xlarge, ml.c5.xlarge, ml.m4.xlarge, ml.m5.large, ml.m5.xlarge
Pytorch
VGG-13
pytorch-ic-vgg13Image Classification
bsd-3-clause
Yes
ml.c5.2xlarge, ml.c5.xlarge, ml.m4.xlarge, ml.m5.large, ml.m5.xlarge
Pytorch
Wide ResNet 101
pytorch-ic-wide-resnet101-2Image Classification
bsd-3-clause
Yes
ml.c5.2xlarge, ml.c5.xlarge, ml.m4.xlarge, ml.m5.large, ml.m5.xlarge
Tensorflow
ALBERT Base
tensorflow-tc-albert-en-baseText Classification
apache-2.0
Yes
ml.c5.18xlarge, ml.c5.2xlarge, ml.c5.4xlarge, ml.c5.9xlarge, ml.g4dn.16xlarge, ml.g4dn.2xlarge, ml.g4dn.4xlarge, ml.g4dn.8xlarge, ml.g4dn.xlarge, ml.g5.xlarge, ml.m5.12xlarge, ml.m5.24xlarge, ml.m5.2xlarge, ml.m5.4xlarge, ml.m5.xlarge, ml.p3.16xlarge, ml.p3.2xlarge, ml.p3.8xlarge
Tensorflow
BERT Base Cased
tensorflow-tc-bert-en-cased-L-12-H-768-A-12-2Text Classification
apache-2.0
Yes
ml.c5.18xlarge, ml.c5.2xlarge, ml.c5.4xlarge, ml.c5.9xlarge, ml.g4dn.16xlarge, ml.g4dn.2xlarge, ml.g4dn.4xlarge, ml.g4dn.8xlarge, ml.g4dn.xlarge, ml.g5.xlarge, ml.m5.12xlarge, ml.m5.24xlarge, ml.m5.2xlarge, ml.m5.4xlarge, ml.m5.xlarge, ml.p3.16xlarge, ml.p3.2xlarge, ml.p3.8xlarge
Tensorflow
BERT Base MEDLINE/PubMed
tensorflow-tc-experts-bert-pubmed-1Text Classification
apache-2.0
Yes
ml.c5.18xlarge, ml.c5.2xlarge, ml.c5.4xlarge, ml.c5.9xlarge, ml.g4dn.16xlarge, ml.g4dn.2xlarge, ml.g4dn.4xlarge, ml.g4dn.8xlarge, ml.g4dn.xlarge, ml.g5.xlarge, ml.m5.12xlarge, ml.m5.24xlarge, ml.m5.2xlarge, ml.m5.4xlarge, ml.m5.xlarge, ml.p3.16xlarge, ml.p3.2xlarge, ml.p3.8xlarge
Tensorflow
BERT Base Multilingual Cased
tensorflow-tc-bert-multi-cased-L-12-H-768-A-12-2Text Classification
apache-2.0
Yes
ml.c5.18xlarge, ml.c5.2xlarge, ml.c5.4xlarge, ml.c5.9xlarge, ml.g4dn.16xlarge, ml.g4dn.2xlarge, ml.g4dn.4xlarge, ml.g4dn.8xlarge, ml.g4dn.xlarge, ml.g5.xlarge, ml.m5.12xlarge, ml.m5.24xlarge, ml.m5.2xlarge, ml.m5.4xlarge, ml.m5.xlarge, ml.p3.16xlarge, ml.p3.2xlarge, ml.p3.8xlarge
Tensorflow
BERT Large Uncased
tensorflow-tc-bert-en-uncased-L-24-H-1024-A-16-2Text Classification
apache-2.0
Yes
ml.c5.18xlarge, ml.c5.2xlarge, ml.c5.4xlarge, ml.c5.9xlarge, ml.g4dn.16xlarge, ml.g4dn.2xlarge, ml.g4dn.4xlarge, ml.g4dn.8xlarge, ml.g4dn.xlarge, ml.g5.xlarge, ml.m5.12xlarge, ml.m5.24xlarge, ml.m5.2xlarge, ml.m5.4xlarge, ml.m5.xlarge, ml.p3.16xlarge, ml.p3.2xlarge, ml.p3.8xlarge
Tensorflow
BERT Small Uncased L-10 H-128
tensorflow-tcembedding-bert-en-uncased-L-10-H-128-A-2-2Text Embedding
apache-2.0
No
ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge
Tensorflow
BERT Small Uncased L-12 H-256
tensorflow-tcembedding-bert-en-uncased-L-12-H-256-A-4Text Embedding
apache-2.0
No
ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge
Tensorflow
BERT Small Uncased L-12 H-768
tensorflow-tcembedding-bert-en-uncased-L-12-H-768-A-12-2Text Embedding
apache-2.0
No
ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge
Tensorflow
BERT Small Uncased L-4 H-768
tensorflow-tcembedding-bert-en-uncased-L-4-H-768-A-12-2Text Embedding
apache-2.0
No
ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge
Tensorflow
BiT-M R101x1
tensorflow-ic-bit-m-r101x1-ilsvrc2012-classification-1Image Classification
Apache-2.0
Yes
ml.c5.2xlarge, ml.c5.xlarge, ml.c6i.large, ml.c6i.xlarge, ml.g4dn.xlarge, ml.m4.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge, ml.r5.large, ml.r5.xlarge
Tensorflow
BiT-M R101x1 Feature Vector
tensorflow-icembedding-bit-m-r101x1-ilsvrc2012-featurevector-1Image Classification
apache-2.0
No
ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge
Tensorflow
BiT-M R50x1 ImageNet-21k
tensorflow-ic-bit-m-r50x1-imagenet21k-classification-1Image Classification
Apache-2.0
Yes
ml.c5.2xlarge, ml.c5.xlarge, ml.c6i.large, ml.c6i.xlarge, ml.g4dn.xlarge, ml.m4.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge, ml.r5.large, ml.r5.xlarge
Tensorflow
CaiT S 24 224
tensorflow-ic-cait-s24-224Image Classification
MIT
Yes
ml.c5.2xlarge, ml.c5.xlarge, ml.c6i.large, ml.c6i.xlarge, ml.g4dn.xlarge, ml.m4.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge, ml.r5.large, ml.r5.xlarge
Tensorflow
CenterNet ResNet-v1-101
tensorflow-od-centernet-resnet101v1-fpn-512x512-1Object Detection
apache-2.0
No
ml.g4dn.xlarge, ml.g5.xlarge, ml.p3.2xlarge
Tensorflow
CenterNet ResNet-v1-50
tensorflow-od-centernet-resnet50v1-fpn-512x512-1Object Detection
apache-2.0
No
ml.g4dn.xlarge, ml.g5.xlarge, ml.p3.2xlarge
Tensorflow
DeiT Tiny Distilled Patch 16 224
tensorflow-ic-deit-tiny-distilled-patch16-224Image Classification
MIT
Yes
ml.c5.2xlarge, ml.c5.xlarge, ml.c6i.large, ml.c6i.xlarge, ml.g4dn.xlarge, ml.m4.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge, ml.r5.large, ml.r5.xlarge
Tensorflow
EfficientNet B0
tensorflow-ic-efficientnet-b0-classification-1Image Classification
Apache-2.0
Yes
ml.c5.2xlarge, ml.c5.xlarge, ml.c6i.large, ml.c6i.xlarge, ml.g4dn.xlarge, ml.m4.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge, ml.r5.large, ml.r5.xlarge
Tensorflow
EfficientNet B1
tensorflow-ic-efficientnet-b1-classification-1Image Classification
Apache-2.0
Yes
ml.c5.2xlarge, ml.c5.xlarge, ml.c6i.large, ml.c6i.xlarge, ml.g4dn.xlarge, ml.m4.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge, ml.r5.large, ml.r5.xlarge
Tensorflow
EfficientNet B1 Feature Vector
tensorflow-icembedding-efficientnet-b1-featurevector-1Image Classification
apache-2.0
No
ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge
Tensorflow
EfficientNet B3
tensorflow-ic-efficientnet-b3-classification-1Image Classification
Apache-2.0
Yes
ml.c5.2xlarge, ml.c5.xlarge, ml.c6i.large, ml.c6i.xlarge, ml.g4dn.xlarge, ml.m4.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge, ml.r5.large, ml.r5.xlarge
Tensorflow
EfficientNet B7
tensorflow-ic-efficientnet-b7-classification-1Image Classification
Apache-2.0
Yes
ml.c5.9xlarge, ml.g4dn.8xlarge, ml.p3.8xlarge
Tensorflow
EfficientNet V2 ImageNet-1k Small
tensorflow-ic-efficientnet-v2-imagenet1k-sImage Classification
Apache-2.0
Yes
ml.c5.2xlarge, ml.c5.xlarge, ml.c6i.large, ml.c6i.xlarge, ml.g4dn.xlarge, ml.m4.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge, ml.r5.large, ml.r5.xlarge
Tensorflow
EfficientNet V2 ImageNet-21k FT1k M
tensorflow-ic-efficientnet-v2-imagenet21k-ft1k-mImage Classification
Apache-2.0
Yes
ml.c5.2xlarge, ml.c5.xlarge, ml.c6i.large, ml.c6i.xlarge, ml.g4dn.xlarge, ml.m4.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge, ml.r5.large, ml.r5.xlarge
Tensorflow
Faster R-CNN Resnet V2 1024x1024
tensorflow-od-faster-rcnn-inception-resnet-v2-1024x1024-1Object Detection
apache-2.0
No
ml.g4dn.xlarge, ml.g5.xlarge, ml.p3.2xlarge
Tensorflow
Faster R-CNN Resnet-101 V1 640x640
tensorflow-od-faster-rcnn-resnet101-v1-640x640-1Object Detection
apache-2.0
No
ml.g4dn.xlarge, ml.g5.xlarge, ml.p3.2xlarge
Tensorflow
Faster R-CNN Resnet-50 V1 1024x1024
tensorflow-od-faster-rcnn-resnet50-v1-1024x1024-1Object Detection
apache-2.0
No
ml.g4dn.xlarge, ml.g5.xlarge, ml.p3.2xlarge
Tensorflow
Faster R-CNN Resnet-50 V1 640x640
tensorflow-od-faster-rcnn-resnet50-v1-640x640-1Object Detection
apache-2.0
No
ml.g4dn.xlarge, ml.g5.xlarge, ml.p3.2xlarge
Tensorflow
Faster R-CNN Resnet-50 V1 800x1333
tensorflow-od-faster-rcnn-resnet50-v1-800x1333-1Object Detection
apache-2.0
No
ml.g4dn.xlarge, ml.g5.xlarge, ml.p3.2xlarge
Tensorflow
FRILL
tensorflow-audioembedding-frill-1Audio Classification
apache-2.0
No
ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge
Tensorflow
Image Classification
tensorflow-ic-swin-base-patch4-window7-224Image Classification
MIT
Yes
ml.c5.2xlarge, ml.c5.xlarge, ml.c6i.large, ml.c6i.xlarge, ml.g4dn.xlarge, ml.m4.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge, ml.r5.large, ml.r5.xlarge
Tensorflow
Image Embedding
tensorflow-icembedding-imagenet-mobilenet-v2-100-224-fv-4Image Classification
apache-2.0
No
ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge
Tensorflow
Inception V3
tensorflow-ic-imagenet-inception-v3-classification-4Image Classification
Apache-2.0
Yes
ml.c5.2xlarge, ml.c5.xlarge, ml.c6i.large, ml.c6i.xlarge, ml.g4dn.xlarge, ml.m4.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge, ml.r5.large, ml.r5.xlarge
Tensorflow
Inception V3 Preview Feature Vector
tensorflow-icembedding-tf2-preview-inception-v3-fv-4Image Classification
apache-2.0
No
ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge
Tensorflow
MobileNet V2
tensorflow-ic-tf2-preview-mobilenet-v2-classification-4Image Classification
Apache-2.0
Yes
ml.c5.2xlarge, ml.c5.xlarge, ml.c6i.large, ml.c6i.xlarge, ml.g4dn.xlarge, ml.m4.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge, ml.r5.large, ml.r5.xlarge
Tensorflow
MobileNet V2 0.35 128
tensorflow-ic-imagenet-mobilenet-v2-035-128Image Classification
Apache-2.0
Yes
ml.c5.2xlarge, ml.c5.xlarge, ml.c6i.large, ml.c6i.xlarge, ml.g4dn.xlarge, ml.m4.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge, ml.r5.large, ml.r5.xlarge
Tensorflow
MobileNet V2 0.50 224
tensorflow-icembedding-imagenet-mobilenet-v2-050-224-fv-4Image Classification
apache-2.0
No
ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge
Tensorflow
MobileNet V2 1.00 224
tensorflow-ic-imagenet-mobilenet-v2-100-224-classification-4Image Classification
Apache-2.0
Yes
ml.c5.2xlarge, ml.c5.xlarge, ml.c6i.large, ml.c6i.xlarge, ml.g4dn.xlarge, ml.m4.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge, ml.r5.large, ml.r5.xlarge
Tensorflow
MobileNet V2 1.40 224
tensorflow-ic-imagenet-mobilenet-v2-140-224-classification-4Image Classification
Apache-2.0
Yes
ml.c5.2xlarge, ml.c5.xlarge, ml.c6i.large, ml.c6i.xlarge, ml.g4dn.xlarge, ml.m4.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge, ml.r5.large, ml.r5.xlarge
Tensorflow
MobileNet V2 Feature Vector
tensorflow-icembedding-tf2-preview-mobilenet-v2-fv-4Image Classification
apache-2.0
No
ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge
Tensorflow
MobileNet V2 Feature Vector
tensorflow-icembedding-tf2-preview-mobilenet-v2-featurevector-4Image Classification
apache-2.0
No
ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge
Tensorflow
MobileNet V3 Large 1.00 224
tensorflow-ic-imagenet-mobilenet-v3-large-100-224Image Classification
Apache-2.0
Yes
ml.c5.2xlarge, ml.c5.xlarge, ml.c6i.large, ml.c6i.xlarge, ml.g4dn.xlarge, ml.m4.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge, ml.r5.large, ml.r5.xlarge
Tensorflow
MobileNet V3 Small 0.75 224
tensorflow-ic-imagenet-mobilenet-v3-small-075-224Image Classification
Apache-2.0
Yes
ml.c5.2xlarge, ml.c5.xlarge, ml.c6i.large, ml.c6i.xlarge, ml.g4dn.xlarge, ml.m4.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge, ml.r5.large, ml.r5.xlarge
Tensorflow
MobileNet V3 Small 1.00 224
tensorflow-ic-imagenet-mobilenet-v3-small-100-224Image Classification
Apache-2.0
Yes
ml.c5.2xlarge, ml.c5.xlarge, ml.c6i.large, ml.c6i.xlarge, ml.g4dn.xlarge, ml.m4.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge, ml.r5.large, ml.r5.xlarge
Tensorflow
Object Detection
tensorflow-od1-ssd-resnet50-v1-fpn-640x640-coco17-tpu-8Object Detection
apache-2.0
Yes
ml.g4dn.xlarge, ml.g5.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge
Tensorflow
ResNet 50
tensorflow-ic-resnet-50-classification-1Image Classification
Apache-2.0
Yes
ml.c5.9xlarge, ml.g4dn.8xlarge, ml.p3.8xlarge
Tensorflow
ResNet 50 Feature Vector
tensorflow-icembedding-resnet-50-featurevector-1Image Classification
apache-2.0
No
ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge
Tensorflow
ResNet V1 50
tensorflow-ic-imagenet-resnet-v1-50-classification-4Image Classification
Apache-2.0
Yes
ml.c5.2xlarge, ml.c5.xlarge, ml.c6i.large, ml.c6i.xlarge, ml.g4dn.xlarge, ml.m4.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge, ml.r5.large, ml.r5.xlarge
Tensorflow
ResNet V2 152 Feature Vector
tensorflow-icembedding-imagenet-resnet-v2-152-featurevector-4Image Classification
apache-2.0
No
ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge
Tensorflow
ResNet V2 50
tensorflow-ic-imagenet-resnet-v2-50-classification-4Image Classification
Apache-2.0
Yes
ml.c5.2xlarge, ml.c5.xlarge, ml.c6i.large, ml.c6i.xlarge, ml.g4dn.xlarge, ml.m4.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge, ml.r5.large, ml.r5.xlarge
Tensorflow
ResNet V2 50 Feature Vector
tensorflow-icembedding-imagenet-resnet-v2-50-featurevector-4Image Classification
apache-2.0
No
ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge
Tensorflow
Retinanet SSD Resnet-101 640x640
tensorflow-od-retinanet-resnet101-v1-fpn-640x640-1Object Detection
apache-2.0
No
ml.g4dn.xlarge, ml.g5.xlarge, ml.p3.2xlarge
Tensorflow
Small BERT L-10_H-512_A-8
tensorflow-tc-small-bert-bert-en-uncased-L-10-H-512-A-8Text Classification
apache-2.0
Yes
ml.c5.18xlarge, ml.c5.2xlarge, ml.c5.4xlarge, ml.c5.9xlarge, ml.g4dn.16xlarge, ml.g4dn.2xlarge, ml.g4dn.4xlarge, ml.g4dn.8xlarge, ml.g4dn.xlarge, ml.g5.xlarge, ml.m5.12xlarge, ml.m5.24xlarge, ml.m5.2xlarge, ml.m5.4xlarge, ml.m5.xlarge, ml.p3.16xlarge, ml.p3.2xlarge, ml.p3.8xlarge
Tensorflow
Small BERT L-12_H-256_A-4
tensorflow-tc-small-bert-bert-en-uncased-L-12-H-256-A-4Text Classification
apache-2.0
Yes
ml.c5.18xlarge, ml.c5.2xlarge, ml.c5.4xlarge, ml.c5.9xlarge, ml.g4dn.16xlarge, ml.g4dn.2xlarge, ml.g4dn.4xlarge, ml.g4dn.8xlarge, ml.g4dn.xlarge, ml.g5.xlarge, ml.m5.12xlarge, ml.m5.24xlarge, ml.m5.2xlarge, ml.m5.4xlarge, ml.m5.xlarge, ml.p3.16xlarge, ml.p3.2xlarge, ml.p3.8xlarge
Tensorflow
Small BERT L-12_H-768_A-12
tensorflow-tc-small-bert-bert-en-uncased-L-12-H-768-A-12Text Classification
apache-2.0
Yes
ml.c5.18xlarge, ml.c5.2xlarge, ml.c5.4xlarge, ml.c5.9xlarge, ml.g4dn.16xlarge, ml.g4dn.2xlarge, ml.g4dn.4xlarge, ml.g4dn.8xlarge, ml.g4dn.xlarge, ml.g5.xlarge, ml.m5.12xlarge, ml.m5.24xlarge, ml.m5.2xlarge, ml.m5.4xlarge, ml.m5.xlarge, ml.p3.16xlarge, ml.p3.2xlarge, ml.p3.8xlarge
Tensorflow
Small BERT L-2_H-128_A-2
tensorflow-tc-small-bert-bert-en-uncased-L-2-H-128-A-2Text Classification
apache-2.0
Yes
ml.c5.18xlarge, ml.c5.2xlarge, ml.c5.4xlarge, ml.c5.9xlarge, ml.g4dn.16xlarge, ml.g4dn.2xlarge, ml.g4dn.4xlarge, ml.g4dn.8xlarge, ml.g4dn.xlarge, ml.g5.xlarge, ml.m5.12xlarge, ml.m5.24xlarge, ml.m5.2xlarge, ml.m5.4xlarge, ml.m5.xlarge, ml.p3.16xlarge, ml.p3.2xlarge, ml.p3.8xlarge
Tensorflow
Small BERT L-4_H-512_A-8
tensorflow-tc-small-bert-bert-en-uncased-L-4-H-512-A-8Text Classification
apache-2.0
Yes
ml.c5.18xlarge, ml.c5.2xlarge, ml.c5.4xlarge, ml.c5.9xlarge, ml.g4dn.16xlarge, ml.g4dn.2xlarge, ml.g4dn.4xlarge, ml.g4dn.8xlarge, ml.g4dn.xlarge, ml.g5.xlarge, ml.m5.12xlarge, ml.m5.24xlarge, ml.m5.2xlarge, ml.m5.4xlarge, ml.m5.xlarge, ml.p3.16xlarge, ml.p3.2xlarge, ml.p3.8xlarge
Tensorflow
Small BERT L-6_H-768_A-12
tensorflow-tc-small-bert-bert-en-uncased-L-6-H-768-A-12Text Classification
apache-2.0
Yes
ml.c5.18xlarge, ml.c5.2xlarge, ml.c5.4xlarge, ml.c5.9xlarge, ml.g4dn.16xlarge, ml.g4dn.2xlarge, ml.g4dn.4xlarge, ml.g4dn.8xlarge, ml.g4dn.xlarge, ml.g5.xlarge, ml.m5.12xlarge, ml.m5.24xlarge, ml.m5.2xlarge, ml.m5.4xlarge, ml.m5.xlarge, ml.p3.16xlarge, ml.p3.2xlarge, ml.p3.8xlarge
Tensorflow
Small BERT L-8_H-768_A-12
tensorflow-tc-small-bert-bert-en-uncased-L-8-H-768-A-12Text Classification
apache-2.0
Yes
ml.c5.18xlarge, ml.c5.2xlarge, ml.c5.4xlarge, ml.c5.9xlarge, ml.g4dn.16xlarge, ml.g4dn.2xlarge, ml.g4dn.4xlarge, ml.g4dn.8xlarge, ml.g4dn.xlarge, ml.g5.xlarge, ml.m5.12xlarge, ml.m5.24xlarge, ml.m5.2xlarge, ml.m5.4xlarge, ml.m5.xlarge, ml.p3.16xlarge, ml.p3.2xlarge, ml.p3.8xlarge
Tensorflow
SSD ResNet 152 V1 FPN 1024x1024 COCO '17
tensorflow-od1-ssd-resnet152-v1-fpn-1024x1024-coco17-tpu-8Object Detection
apache-2.0
Yes
ml.g4dn.xlarge, ml.g5.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge
Tensorflow
SSD EfficientDet D0
tensorflow-od-efficientdet-d0-1Object Detection
apache-2.0
No
ml.g4dn.xlarge, ml.g5.xlarge, ml.p3.2xlarge
Tensorflow
SSD Efficientdet D0 512x512 COCO '17
tensorflow-od1-ssd-efficientdet-d0-512x512-coco17-tpu-8Object Detection
apache-2.0
Yes
ml.g4dn.xlarge, ml.g5.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge
Tensorflow
SSD Efficientdet D1 640x640 COCO '17
tensorflow-od1-ssd-efficientdet-d1-640x640-coco17-tpu-8Object Detection
apache-2.0
Yes
ml.g4dn.xlarge, ml.g5.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge
Tensorflow
SSD EfficientDet D2
tensorflow-od-efficientdet-d2-1Object Detection
apache-2.0
No
ml.g4dn.xlarge, ml.g5.xlarge, ml.p3.2xlarge
Tensorflow
SSD Efficientdet D2 768x768 COCO '17
tensorflow-od1-ssd-efficientdet-d2-768x768-coco17-tpu-8Object Detection
apache-2.0
Yes
ml.g4dn.xlarge, ml.g5.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge
Tensorflow
SSD EfficientDet D3
tensorflow-od-efficientdet-d3-1Object Detection
apache-2.0
No
ml.g4dn.xlarge, ml.g5.xlarge, ml.p3.2xlarge
Tensorflow
SSD Efficientdet D3 896x896 COCO '17
tensorflow-od1-ssd-efficientdet-d3-896x896-coco17-tpu-32Object Detection
apache-2.0
Yes
ml.g4dn.xlarge, ml.g5.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge
Tensorflow
SSD EfficientDet D5
tensorflow-od-efficientdet-d5-1Object Detection
apache-2.0
No
ml.g4dn.xlarge, ml.g5.xlarge, ml.p3.2xlarge
Tensorflow
SSD Mobilenet V1 FPN 640x640 COCO '17
tensorflow-od1-ssd-mobilenet-v1-fpn-640x640-coco17-tpu-8Object Detection
apache-2.0
Yes
ml.g4dn.xlarge, ml.g5.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge
Tensorflow
SSD Mobilenet V2
tensorflow-od-ssd-mobilenet-v2-2Object Detection
apache-2.0
No
ml.g4dn.xlarge, ml.g5.xlarge, ml.p3.2xlarge
Tensorflow
SSD Mobilenet V2 640x640
tensorflow-od-ssd-mobilenet-v2-fpnlite-640x640-1Object Detection
apache-2.0
No
ml.g4dn.xlarge, ml.g5.xlarge, ml.p3.2xlarge
Tensorflow
SSD Mobilenet V2 FPN-Lite 320x320 COCO '17
tensorflow-od1-ssd-mobilenet-v2-fpnlite-320x320-coco17-tpu-8Object Detection
apache-2.0
Yes
ml.g4dn.xlarge, ml.g5.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge
Tensorflow
SSD Mobilenet V2 FPN-lite 640x640 COCO '17
tensorflow-od1-ssd-mobilenet-v2-fpnlite-640x640-coco17-tpu-8Object Detection
apache-2.0
Yes
ml.g4dn.xlarge, ml.g5.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge
Tensorflow
SSD ResNet 101 V1 FPN 1024x1024 COCO '17
tensorflow-od1-ssd-resnet101-v1-fpn-1024x1024-coco17-tpu-8Object Detection
apache-2.0
Yes
ml.g4dn.xlarge, ml.g5.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge
Tensorflow
SSD ResNet 101 V1 FPN 640x640 COCO '17
tensorflow-od1-ssd-resnet101-v1-fpn-640x640-coco17-tpu-8Object Detection
apache-2.0
Yes
ml.g4dn.xlarge, ml.g5.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge
Tensorflow
SSD ResNet 152 V1 FPN 640x640 COCO '17
tensorflow-od1-ssd-resnet152-v1-fpn-640x640-coco17-tpu-8Object Detection
apache-2.0
Yes
ml.g4dn.xlarge, ml.g5.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge
Tensorflow
SSD ResNet 50 V1 FPN 1024x1024 COCO '17
tensorflow-od1-ssd-resnet50-v1-fpn-1024x1024-coco17-tpu-8Object Detection
apache-2.0
Yes
ml.g4dn.xlarge, ml.g5.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge
Tensorflow
Swin Base 4 12 384
tensorflow-ic-swin-base-patch4-window12-384Image Classification
MIT
Yes
ml.c5.2xlarge, ml.c5.xlarge, ml.c6i.large, ml.c6i.xlarge, ml.g4dn.xlarge, ml.m4.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge, ml.r5.large, ml.r5.xlarge
Tensorflow
Talking Heads Large
tensorflow-tc-talking-heads-largeText Classification
apache-2.0
Yes
ml.c5.18xlarge, ml.c5.2xlarge, ml.c5.4xlarge, ml.c5.9xlarge, ml.g4dn.16xlarge, ml.g4dn.2xlarge, ml.g4dn.4xlarge, ml.g4dn.8xlarge, ml.g4dn.xlarge, ml.g5.xlarge, ml.m5.12xlarge, ml.m5.24xlarge, ml.m5.2xlarge, ml.m5.4xlarge, ml.m5.xlarge, ml.p3.16xlarge, ml.p3.2xlarge, ml.p3.8xlarge
Tensorflow
Text Classification
tensorflow-tc-bert-en-uncased-L-12-H-768-A-12-2Text Classification
apache-2.0
Yes
ml.c5.18xlarge, ml.c5.2xlarge, ml.c5.4xlarge, ml.c5.9xlarge, ml.g4dn.16xlarge, ml.g4dn.2xlarge, ml.g4dn.4xlarge, ml.g4dn.8xlarge, ml.g4dn.xlarge, ml.g5.xlarge, ml.m5.12xlarge, ml.m5.24xlarge, ml.m5.2xlarge, ml.m5.4xlarge, ml.m5.xlarge, ml.p3.16xlarge, ml.p3.2xlarge, ml.p3.8xlarge
Tensorflow
Text Embedding
tensorflow-tcembedding-universal-sentc-encoder-cmlm-en-large-1Text Embedding
apache-2.0
No
ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge
Tensorflow
TRILL Distilled
tensorflow-audioembedding-trill-distilled-3Audio Classification
apache-2.0
No
ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge
Tensorflow
TRILLsson 1
tensorflow-audioembedding-trillsson1-1Audio Classification
apache-2.0
No
ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge
Tensorflow
TRILLsson 2
tensorflow-audioembedding-trillsson2-1Audio Classification
apache-2.0
No
ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge
Tensorflow
TRILLsson 3
tensorflow-audioembedding-trillsson3-1Audio Classification
apache-2.0
No
ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge
Tensorflow
Universal Sentence Encoder CMLM Base
tensorflow-tcembedding-universal-sentc-encoder-cmlm-en-base-1Text Embedding
apache-2.0
No
ml.c5.2xlarge, ml.c5.xlarge, ml.g4dn.xlarge, ml.m5.large, ml.m5.xlarge, ml.p3.2xlarge
- Classic ML Algorithms (11)
-
Amazon SageMaker JumpStart provides ready-to-use implementations of classical machine learning algorithms for tabular data tasks such as classification, regression, and time series forecasting. These algorithms (XGBoost, LightGBM, CatBoost, scikit-learn, AutoGluon) are optimized for structured data and can be trained from scratch on your own datasets.
Provider Model Name Model ID Task License Fine-tunable Supported Inference Instance Types Autogluon
AutoGluon Tabular Classification
autogluon-classification-ensembleTabular Classification
apache-2.0
Yes
ml.c5.18xlarge, ml.c5.2xlarge, ml.c5.4xlarge, ml.c5.9xlarge, ml.g4dn.xlarge, ml.m5.12xlarge, ml.m5.24xlarge, ml.m5.2xlarge, ml.m5.4xlarge, ml.p3.2xlarge
Autogluon
AutoGluon Tabular Regression
autogluon-regression-ensembleTabular Regression
apache-2.0
Yes
ml.c5.18xlarge, ml.c5.2xlarge, ml.c5.4xlarge, ml.c5.9xlarge, ml.g4dn.xlarge, ml.m5.12xlarge, ml.m5.24xlarge, ml.m5.2xlarge, ml.m5.4xlarge, ml.p3.2xlarge
Catboost
CatBoost Classification
catboost-classification-modelTabular Classification
apache-2.0
Yes
ml.c5.18xlarge, ml.c5.2xlarge, ml.c5.4xlarge, ml.c5.9xlarge, ml.m4.16xlarge, ml.m5.12xlarge, ml.m5.24xlarge, ml.m5.2xlarge, ml.m5.4xlarge
Catboost
CatBoost Regression
catboost-regression-modelTabular Regression
apache-2.0
Yes
ml.c4.8xlarge, ml.c5.18xlarge, ml.c5.2xlarge, ml.c5.4xlarge, ml.c5.9xlarge, ml.m5.12xlarge, ml.m5.24xlarge, ml.m5.2xlarge, ml.m5.4xlarge
Lightgbm
LightGBM Classification
lightgbm-classification-modelTabular Classification
mit
Yes
ml.c5.18xlarge, ml.c5.2xlarge, ml.c5.4xlarge, ml.c5.9xlarge, ml.m4.16xlarge, ml.m5.12xlarge, ml.m5.24xlarge, ml.m5.2xlarge, ml.m5.4xlarge
Lightgbm
LightGBM Regression
lightgbm-regression-modelTabular Regression
mit
Yes
ml.c4.8xlarge, ml.c5.18xlarge, ml.c5.2xlarge, ml.c5.4xlarge, ml.c5.9xlarge, ml.m5.12xlarge, ml.m5.24xlarge, ml.m5.2xlarge, ml.m5.4xlarge
Pytorch
TabTransformer Classification
pytorch-tabtransformerclassification-modelTabular Classification
apache-2.0
Yes
ml.c5.18xlarge, ml.c5.2xlarge, ml.c5.4xlarge, ml.c5.9xlarge, ml.g4dn.xlarge, ml.m5.12xlarge, ml.m5.24xlarge, ml.m5.2xlarge, ml.m5.4xlarge, ml.p3.2xlarge
Pytorch
TabTransformer Regression
pytorch-tabtransformerregression-modelTabular Regression
apache-2.0
Yes
ml.c5.18xlarge, ml.c5.2xlarge, ml.c5.4xlarge, ml.c5.9xlarge, ml.g4dn.xlarge, ml.m5.12xlarge, ml.m5.24xlarge, ml.m5.2xlarge, ml.m5.4xlarge, ml.p3.2xlarge
Sklearn
Linear Classification
sklearn-classification-linearTabular Classification
bsd-3-clause
Yes
ml.c4.8xlarge, ml.c5.18xlarge, ml.c5.2xlarge, ml.c5.4xlarge, ml.c5.9xlarge, ml.m5.12xlarge, ml.m5.24xlarge, ml.m5.2xlarge, ml.m5.4xlarge, ml.m5.xlarge
Xgboost
XGBoost Classification
xgboost-classification-modelTabular Classification
apache-2.0
Yes
ml.c4.8xlarge, ml.c5.18xlarge, ml.c5.2xlarge, ml.c5.4xlarge, ml.c5.9xlarge, ml.m5.12xlarge, ml.m5.24xlarge, ml.m5.2xlarge, ml.m5.4xlarge, ml.m5.xlarge
Xgboost
XGBoost Regression
xgboost-regression-modelTabular Regression
apache-2.0
Yes
ml.c4.8xlarge, ml.c5.18xlarge, ml.c5.2xlarge, ml.c5.4xlarge, ml.c5.9xlarge, ml.m5.12xlarge, ml.m5.24xlarge, ml.m5.2xlarge, ml.m5.4xlarge, ml.m5.xlarge
Note
For the most accurate and up-to-date licensing information, see the model detail page in SageMaker Studio or the model's Hugging Face page.