AI Service

Choosing an AWS generative AI service

Determine which AWS generative AI services are the best fit for your organization.

Overview of AWS generative AI services

Generative AI is a set of artificial intelligence (AI) systems and models designed to generate content such as code, text, images, music, or other forms of data. These systems can produce new content based on patterns and knowledge learned from existing data.

This twelve-minute video discusses building generative AI applications on AWS, part one of a four-part series. View part two , part three , and part four .
Increasingly, organizations and businesses are using generative AI to:
  • Automate creative workflows: Use generative AI services to automate the workflows of time-consuming creative processes such as writing, image or video creation, and graphic design.
  • Customize and personalize content: Generate targeted content, product recommendations, and customized offerings for an audience-specific context.
  • Augment data: Synthesize large training datasets for other ML models to unlock scenarios where human-labeled data is scarce.
  • Reduce cost: Potentially lower costs by using synthesized data, content, and digital assets.
  • Faster experimentation: Test and iterate on more content variations and creative concepts than would be possible manually.

Key concepts

Amazon offers a range of generative AI services, applications, tools, and supporting infrastructure. Which of these you use depends largely on what you’re trying to accomplish, how much choice you need in foundation models, the degree of customization you need in your generative AI applications, and the expertise within your organization.

Foundation models and types

Foundation models (FMs) are the backbone of generative AI. These pre-trained AI models can be customized for specific tasks, and come in various types:

  • Text models: Process and generate natural language
  • Image models: Work with visual data for tasks like image generation or analysis
  • Multimodal models: Handle multiple types of data simultaneously

The capabilities of an FM are often related to its size, measured in parameters. Larger models can capture more complex patterns, but require more computational resources.

After you've decided on a generative AI service, choose the foundation model (FM) that gives you the best results for your use case. Amazon Bedrock has a model evaluation capability that can assist in evaluating, comparing, and selecting the best FMs for your use case.

Use cases

Transform business processes and content creation with AI-powered automation and insights

Enterprise knowledge management and support
Software development and DevOps
Content creation & customizations
Business process automation

Compare services

The AWS generative AI stack provides options at every layer of implementation, from ready-to-use applications to foundational infrastructure. This diagram shows how these services relate to each other.
Diagram showing the AWS generative AI stack. This diagram shows the infrastructure to build and train AI models at the bottom of the stack, models and tools to build generative AI apps in the middle, and applications that use LLMs and other FMs to boost productivity, at the top.

As you choose services, consider your team’s technical expertise, development timeline, and specific use case requirements. The following table helps you match your business needs to the appropriate AWS service.

If you need to... To get started
Generate code and get answers to business questions across your enterprise data

Amazon Q Business

Amazon Q Developer in chat applications

Choose from multiple foundation models, customize them with your data, and build generative AI applications

Amazon Bedrock

Amazon Nova

Amazon Titan

Build, train, and deploy machine learning models at scale Amazon SageMaker AI
Maximize price-performance for foundation model training and inference

AWS Trainium

AWS Inferentia

Start building

Now that we've covered the criteria you need to apply in choosing an AWS generative AI service, you can select which services are optimized for your needs and explore how you might get started using each of them.
Amazon Q Business
What is Amazon Q Business?

Get an overview of Amazon Q Business, with explanations of what it is, how it works, and how to get started using it.

Create a sample Amazon Q Business application

Learn how to create your first Amazon Q Business application in either the AWS Management Console or using the command line interface (CLI).

Combine Amazon Q Business and AWS IAM Identity Center to build generative AI

Build private and secure enterprise generative AI apps with Amazon Q Business and AWS IAM Identity Center.

Amazon Q Developer
What is Amazon Q Developer?

Get an overview of Amazon Q Developer, with explanations of what it is, how it works, and how to get started using it.

Understanding tiers of service for Amazon Q Developer

Review the following information to understand the tiers of service for Amazon Q Developer, including Amazon Q Developer Pro and Amazon Q Developer at the free tier.

Working with Amazon Q Developer

Use the Amazon Q Developer Center for fast access to key Amazon Q Developer articles, blog posts, videos, and tips.

Amazon Bedrock
What is Amazon Bedrock?

Learn how to use this fully managed service to make foundation models (FMs) from Amazon and third parties available for your use through a unified API.

Frequently asked questions about Amazon Bedrock

Get answers to the most commonly-asked questions about Amazon Bedrock, including how to use agents, security considerations, details on Amazon Bedrock software development kits (SDKs), Retrieval Augmented Generation (RAG), how to use model evaluation, and how billing works.

Guidance for generating product descriptions with Amazon Bedrock

Learn how to use Amazon Bedrock as part of a solution to automate your product review and approval process for an ecommerce marketplace or retail website.

Amazon Bedrock IDE

Amazon Bedrock Studio, renamed to Amazon Bedrock IDE, is now available in Amazon SageMaker Unified Studio

What is Amazon Bedrock IDE?

Use Amazon Bedrock IDE to discover Bedrock models, and build generative AI apps that use Bedrock models and features.

Build generative AI applications with Amazon Bedrock IDE

This blog post describes how you can build applications using a wide array of top performing models. It then explains how to evaluate and share your generative AI apps with Amazon Bedrock IDE.

Building a chat app with Amazon Bedrock IDE

Build an Amazon Bedrock IDE chat agent app that allows users to chat with an Amazon Bedrock model through a conversational interface.

Amazon SageMaker AI
What is Amazon SageMaker AI?

Learn how you can use this fully managed machine learning (ML) service to build, train, and deploy ML models into a production-ready hosted environment.

Get started with Amazon SageMaker AI

Learn how to join an Amazon SageMaker AI domain, giving you access to SageMaker Studio and RStudio on SageMaker.

Get started with Amazon SageMaker JumpStart

Explore Amazon SageMaker JumpStart solution templates that set up infrastructure for common use cases, and executable example notebooks for machine learning with SageMaker.

Amazon SageMaker Unified Studio
What is Amazon SageMaker Unified Studio?

Learn how you can use Amazon SageMaker Unified Studio to build, deploy, execute, and monitor workflows from a single interface.

An integrated experience for all your data and AI with Amazon SageMaker Unified Studio

Learn about the new Amazon SageMaker Unified Studio and explore how it unifies your analytic workloads.

Get started with Amazon SageMaker Unified Studio

Learn how to gain access to Amazon SageMaker Unified Studio, create a project, and then add members to the project and use the sample JupyterLab notebook to begin building with a variety of tools and resources.

Amazon Nova
What is Amazon Nova?

Explore Amazon Nova, a new generation of foundation models available on Amazon Bedrock.

Introducing Amazon Nova foundation models: Frontier intelligence and industry leading price performance

Learn about the new Amazon Nova models, including examples of using Amazon Nova models to analyze complex documents and videos, understand charts and diagrams, generate engaging video content, and build sophisticated AI agents.

Get started with Amazon Nova in the Amazon Bedrock console

Learn how to use the playgrounds in the console to submit a text prompt to Amazon Nova models and generate a text or image response

Amazon Titan
Amazon Titan in Amazon Bedrock overview

Get an overview of Amazon Titan foundation models (FMs) to support your use cases.

Cost-effective document classification using the Amazon Titan Multimodal Embeddings Model

Learn how you can use this model to categorize and extract insights from high volumes of documents of different formats. This blog explores how you can use it to help determine the next set of actions to take, depending on the type of document.

Build generative AI applications with Amazon Titan Text Premier, Amazon Bedrock, and AWS CDK

Explore building and deploying two sample applications powered by Amazon Titan Text Premier in this blog post.

AWS Trainium
Overview of AWS Trainium

Learn about AWS Trainium, the second-generation machine learning (ML) accelerator that AWS purpose built for deep learning training of 100B+ parameter models. Each EC2 Trn1 instance deploys up to 16 AWS Trainium accelerators to deliver a high-performance, low-cost solution for deep learning (DL) training in the cloud.

Recommended Trainium Instances

Explore how AWS Trainium instances are designed to provide high performance and cost efficiency for deep learning model inference workloads.

Scaling distributed training with AWS Trainium and Amazon EKS

If you're deploying your deep learning (DL) workloads using Amazon Elastic Kubernetes Service (EKS), learn how you can benefit from the general availability of EC2 Trn1 instances powered by Trainium—a purpose-built ML accelerator optimized to provide a high-performance, cost-effective, and massively scalable platform for training DL models in the cloud.

AWS Inferentia
Overview of Inferentia

Understand how AWS designs accelerators to deliver high performance at the lowest cost for your deep learning (DL) inference applications.

AWS Inferentia2 builds on AWS Inferentia1 by delivering 4x higher throughput and 10x lower latency

Understand what Inferentia2 is optimized for and how it was designed to deliver higher performance, while lowering the cost of LLMs and generative AI inference.

Machine learning inference using Inferentia

Learn how to create an Amazon EKS cluster with nodes running EC2 Inf1 instances and optionally deploy a sample application. EC2 Inf1 instances are powered by AWS Inferentia chips, which are custom built by AWS to provide high-performance and low-cost inference in the cloud.

Resources