Overview of AWS generative AI services

Generative AI is a set of artificial intelligence (AI) systems and models designed to generate content such as code, text, images, music, or other forms of data. These systems can produce new content based on patterns and knowledge learned from existing data.

This twelve-minute video discusses building generative AI applications on AWS, part one of a four-part series. View part two , part three , and part four .

Increasingly, organizations and businesses are using generative AI to:

Automate creative workflows: Use generative AI services to automate the workflows of time-consuming creative processes such as writing, image or video creation, and graphic design.
Customize and personalize content: Generate targeted content, product recommendations, and customized offerings for an audience-specific context.
Augment data: Synthesize large training datasets for other ML models to unlock scenarios where human-labeled data is scarce.
Reduce cost: Potentially lower costs by using synthesized data, content, and digital assets.
Faster experimentation: Test and iterate on more content variations and creative concepts than would be possible manually.

Key concepts

Amazon offers a range of generative AI services, applications, tools, and supporting infrastructure. Which of these you use depends largely on what you’re trying to accomplish, how much choice you need in foundation models, the degree of customization you need in your generative AI applications, and the expertise within your organization.

Foundation models and types

Foundation models (FMs) are the backbone of generative AI. These pre-trained AI models can be customized for specific tasks, and come in various types:

Text models: Process and generate natural language
Image models: Work with visual data for tasks like image generation or analysis
Multimodal models: Handle multiple types of data simultaneously

The capabilities of an FM are often related to its size, measured in parameters. Larger models can capture more complex patterns, but require more computational resources.

After you've decided on a generative AI service, choose the foundation model (FM) that gives you the best results for your use case. Amazon Bedrock has a model evaluation capability that can assist in evaluating, comparing, and selecting the best FMs for your use case.

Public foundation models

Amazon Bedrock provides access to a variety of foundation models from different companies. These include Anthropic Claude , Cohere Command & Embed , AI21 Labs Jurassic , Meta Llama , Mistral AI , Stable Diffusion XL , Amazon Nova , and Amazon Titan . With this multi-model approach, you can select and compare different models based on your specific use case requirements. Using Amazon Bedrock and Amazon SageMaker AI, you can experiment with a variety of foundation models, and privately customize them with your data.

Learn more

Model customization and enhancement

Adapting FMs to specific use cases is crucial for optimal performance. Key customization techniques include:

Fine-tuning: Further training the model on domain-specific data
Continuous pre-training: Extending the model's knowledge with new data over time
RAG ( retrieval augmented generation) : Enhancing responses by retrieving external information

Prompt engineering is another vital skill, allowing users to craft inputs that guide the model to produce desired outputs effectively.

Get started

Performance considerations

When you evaluate generative AI model performance, consider two factors:

Inference latency: The speed at which a model generates responses
Context window: Amount of text a model can consider at once

Get started

Data considerations

The quality and quantity of data significantly impact model performance. Key factors include:

Proper data handling: Ensuring data security and privacy, effective annotation and labeling processes, and the right balance between data quantity with quality
Data relevance: How well the data matches the intended use case
Data accuracy and consistency
Bias and fairness: Ensuring training data doesn't perpetuate harmful biases

Learn more

Use cases

Transform business processes and content creation with AI-powered automation and insights

Enterprise knowledge management and support

Querying and analyzing company data across multiple enterprise systems
Generating summaries and insights from internal documents
Creating automated responses for customer service
Accessing and synthesizing information from various business data sources like Salesforce, SharePoint, and Confluence

Learn more: New Amazon Bedrock capabilities enhance data processing and retrieval

Software development and DevOps

Code generation and optimization
Application testing and debugging
Infrastructure management and AWS resource optimization
Automated code upgrades and security fixes
Technical documentation creation and maintenance

Get started: How generative AI is transforming developer workflows at Amazon

Content creation & customizations

Automated writing and content generation
Image and multimedia content creation
Personalized content and recommendations for specific audiences
Document classification and analysis
Converting data into visualizations and reports

Get started: Experiment with Amazon Nova foundation models

Business process automation

Automating creative workflows and time-consuming manual tasks
Building conversational AI applications and chatbots
Data analysis and pattern recognition
Creating synthetic training data for ML models
Process automation through multi-step reasoning and planning

Get started: Amazon Q Business is adding new workflow automation capability and 50+ action integrations

Compare services

The AWS generative AI stack provides options at every layer of implementation, from ready-to-use applications to foundational infrastructure. This diagram shows how these services relate to each other.

Diagram showing the AWS generative AI stack. This diagram shows the infrastructure to build and train AI models at the bottom of the stack, models and tools to build generative AI apps in the middle, and applications that use LLMs and other FMs to boost productivity, at the top.

As you choose services, consider your team’s technical expertise, development timeline, and specific use case requirements. The following table helps you match your business needs to the appropriate AWS service.

If you need to...	To get started
Generate code and get answers to business questions across your enterprise data	Amazon Q Business Amazon Q Developer in chat applications
Choose from multiple foundation models, customize them with your data, and build generative AI applications	Amazon Bedrock Amazon Nova Amazon Titan
Build, train, and deploy machine learning models at scale	Amazon SageMaker AI
Maximize price-performance for foundation model training and inference	AWS Trainium AWS Inferentia

Start building

Now that we've covered the criteria you need to apply in choosing an AWS generative AI service, you can select which services are optimized for your needs and explore how you might get started using each of them.

Amazon Q Business

What is Amazon Q Business?

Get an overview of Amazon Q Business, with explanations of what it is, how it works, and how to get started using it.

Create a sample Amazon Q Business application

Learn how to create your first Amazon Q Business application in either the AWS Management Console or using the command line interface (CLI).

Combine Amazon Q Business and AWS IAM Identity Center to build generative AI

Build private and secure enterprise generative AI apps with Amazon Q Business and AWS IAM Identity Center.

Amazon Q Developer

What is Amazon Q Developer?

Get an overview of Amazon Q Developer, with explanations of what it is, how it works, and how to get started using it.

Understanding tiers of service for Amazon Q Developer

Review the following information to understand the tiers of service for Amazon Q Developer, including Amazon Q Developer Pro and Amazon Q Developer at the free tier.

Working with Amazon Q Developer

Use the Amazon Q Developer Center for fast access to key Amazon Q Developer articles, blog posts, videos, and tips.

Amazon Bedrock

What is Amazon Bedrock?

Learn how to use this fully managed service to make foundation models (FMs) from Amazon and third parties available for your use through a unified API.

Frequently asked questions about Amazon Bedrock

Get answers to the most commonly-asked questions about Amazon Bedrock, including how to use agents, security considerations, details on Amazon Bedrock software development kits (SDKs), Retrieval Augmented Generation (RAG), how to use model evaluation, and how billing works.

Guidance for generating product descriptions with Amazon Bedrock

Learn how to use Amazon Bedrock as part of a solution to automate your product review and approval process for an ecommerce marketplace or retail website.

Amazon Bedrock IDE

Amazon Bedrock Studio, renamed to Amazon Bedrock IDE, is now available in Amazon SageMaker Unified Studio

What is Amazon Bedrock IDE?

Use Amazon Bedrock IDE to discover Bedrock models, and build generative AI apps that use Bedrock models and features.

Build generative AI applications with Amazon Bedrock IDE

This blog post describes how you can build applications using a wide array of top performing models. It then explains how to evaluate and share your generative AI apps with Amazon Bedrock IDE.

Building a chat app with Amazon Bedrock IDE

Build an Amazon Bedrock IDE chat agent app that allows users to chat with an Amazon Bedrock model through a conversational interface.

Amazon SageMaker AI

What is Amazon SageMaker AI?

Learn how you can use this fully managed machine learning (ML) service to build, train, and deploy ML models into a production-ready hosted environment.

Get started with Amazon SageMaker AI

Learn how to join an Amazon SageMaker AI domain, giving you access to SageMaker Studio and RStudio on SageMaker.

Get started with Amazon SageMaker JumpStart

Explore Amazon SageMaker JumpStart solution templates that set up infrastructure for common use cases, and executable example notebooks for machine learning with SageMaker.

Amazon SageMaker Unified Studio

What is Amazon SageMaker Unified Studio?

Learn how you can use Amazon SageMaker Unified Studio to build, deploy, execute, and monitor workflows from a single interface.

An integrated experience for all your data and AI with Amazon SageMaker Unified Studio

Learn about the new Amazon SageMaker Unified Studio and explore how it unifies your analytic workloads.

Get started with Amazon SageMaker Unified Studio

Learn how to gain access to Amazon SageMaker Unified Studio, create a project, and then add members to the project and use the sample JupyterLab notebook to begin building with a variety of tools and resources.

Amazon Nova

What is Amazon Nova?

Explore Amazon Nova, a new generation of foundation models available on Amazon Bedrock.

Introducing Amazon Nova foundation models: Frontier intelligence and industry leading price performance

Learn about the new Amazon Nova models, including examples of using Amazon Nova models to analyze complex documents and videos, understand charts and diagrams, generate engaging video content, and build sophisticated AI agents.

Get started with Amazon Nova in the Amazon Bedrock console

Learn how to use the playgrounds in the console to submit a text prompt to Amazon Nova models and generate a text or image response

Amazon Titan

Amazon Titan in Amazon Bedrock overview

Get an overview of Amazon Titan foundation models (FMs) to support your use cases.

Cost-effective document classification using the Amazon Titan Multimodal Embeddings Model

Learn how you can use this model to categorize and extract insights from high volumes of documents of different formats. This blog explores how you can use it to help determine the next set of actions to take, depending on the type of document.

Build generative AI applications with Amazon Titan Text Premier, Amazon Bedrock, and AWS CDK

Explore building and deploying two sample applications powered by Amazon Titan Text Premier in this blog post.

AWS Trainium

Overview of AWS Trainium

Learn about AWS Trainium, the second-generation machine learning (ML) accelerator that AWS purpose built for deep learning training of 100B+ parameter models. Each EC2 Trn1 instance deploys up to 16 AWS Trainium accelerators to deliver a high-performance, low-cost solution for deep learning (DL) training in the cloud.

Recommended Trainium Instances

Explore how AWS Trainium instances are designed to provide high performance and cost efficiency for deep learning model inference workloads.

Scaling distributed training with AWS Trainium and Amazon EKS

If you're deploying your deep learning (DL) workloads using Amazon Elastic Kubernetes Service (EKS), learn how you can benefit from the general availability of EC2 Trn1 instances powered by Trainium—a purpose-built ML accelerator optimized to provide a high-performance, cost-effective, and massively scalable platform for training DL models in the cloud.

AWS Inferentia

Overview of Inferentia

Understand how AWS designs accelerators to deliver high performance at the lowest cost for your deep learning (DL) inference applications.

AWS Inferentia2 builds on AWS Inferentia1 by delivering 4x higher throughput and 10x lower latency

Understand what Inferentia2 is optimized for and how it was designed to deliver higher performance, while lowering the cost of LLMs and generative AI inference.

Machine learning inference using Inferentia

Learn how to create an Amazon EKS cluster with nodes running EC2 Inf1 instances and optionally deploy a sample application. EC2 Inf1 instances are powered by AWS Inferentia chips, which are custom built by AWS to provide high-performance and low-cost inference in the cloud.

Resources