Public foundation models
Amazon Bedrock provides access to a variety of foundation models from different
companies. These include Anthropic
Claude
Generative AI is a set of artificial intelligence (AI) systems and models designed to generate content such as code, text, images, music, or other forms of data. These systems can produce new content based on patterns and knowledge learned from existing data.
Amazon offers a range of generative AI services, applications, tools, and supporting infrastructure. Which of these you use depends largely on what you’re trying to accomplish, how much choice you need in foundation models, the degree of customization you need in your generative AI applications, and the expertise within your organization.
Foundation models and types
Foundation models (FMs) are the backbone of generative AI. These pre-trained AI models can be customized for specific tasks, and come in various types:
The capabilities of an FM are often related to its size, measured in parameters. Larger models can capture more complex patterns, but require more computational resources.
After you've decided on a generative AI service, choose the foundation model (FM) that gives you the best results for your use case. Amazon Bedrock has a model evaluation capability that can assist in evaluating, comparing, and selecting the best FMs for your use case.
Amazon Bedrock provides access to a variety of foundation models from different
companies. These include Anthropic
Claude
Adapting FMs to specific use cases is crucial for optimal performance. Key customization techniques include:
Prompt engineering is another vital skill, allowing users to craft inputs that guide the model to produce desired outputs effectively.
When you evaluate generative AI model performance, consider two factors:
The quality and quantity of data significantly impact model performance. Key factors include:
Transform business processes and content creation with AI-powered automation and insights
Learn more:
New Amazon Bedrock capabilities enhance data
processing and retrieval
Get started:
How generative AI is transforming developer workflows at Amazon
Get started:
Experiment with Amazon Nova foundation models
Get started:
Amazon Q Business is adding new workflow automation capability and 50+ action
integrations
As you choose services, consider your team’s technical expertise, development timeline, and specific use case requirements. The following table helps you match your business needs to the appropriate AWS service.
| If you need to... | To get started |
|---|---|
| Generate code and get answers to business questions across your enterprise data | |
| Choose from multiple foundation models, customize them with your data, and build generative AI applications | |
| Build, train, and deploy machine learning models at scale | Amazon SageMaker AI |
| Maximize price-performance for foundation model training and inference |
Get an overview of Amazon Q Business, with explanations of what it is, how it works, and how to get started using it.
Create a sample Amazon Q Business applicationLearn how to create your first Amazon Q Business application in either the AWS Management Console or using the command line interface (CLI).
Combine Amazon Q Business and AWS IAM Identity Center to build generative AIBuild private and secure enterprise generative AI apps with Amazon Q Business and AWS IAM Identity Center.
Get an overview of Amazon Q Developer, with explanations of what it is, how it works, and how to get started using it.
Understanding tiers of service for Amazon Q DeveloperReview the following information to understand the tiers of service for Amazon Q Developer, including Amazon Q Developer Pro and Amazon Q Developer at the free tier.
Working with Amazon Q DeveloperUse the Amazon Q Developer Center for fast access to key Amazon Q Developer articles, blog posts, videos, and tips.
Learn how to use this fully managed service to make foundation models (FMs) from Amazon and third parties available for your use through a unified API.
Frequently asked questions about Amazon BedrockGet answers to the most commonly-asked questions about Amazon Bedrock, including how to use agents, security considerations, details on Amazon Bedrock software development kits (SDKs), Retrieval Augmented Generation (RAG), how to use model evaluation, and how billing works.
Guidance for generating product descriptions with Amazon BedrockLearn how to use Amazon Bedrock as part of a solution to automate your product review and approval process for an ecommerce marketplace or retail website.
Amazon Bedrock Studio, renamed to Amazon Bedrock IDE, is now available in Amazon SageMaker Unified Studio
What is Amazon Bedrock IDE?Use Amazon Bedrock IDE to discover Bedrock models, and build generative AI apps that use Bedrock models and features.
Build generative AI applications with Amazon Bedrock IDEThis blog post describes how you can build applications using a wide array of top performing models. It then explains how to evaluate and share your generative AI apps with Amazon Bedrock IDE.
Building a chat app with Amazon Bedrock IDEBuild an Amazon Bedrock IDE chat agent app that allows users to chat with an Amazon Bedrock model through a conversational interface.
Learn how you can use this fully managed machine learning (ML) service to build, train, and deploy ML models into a production-ready hosted environment.
Get started with Amazon SageMaker AILearn how to join an Amazon SageMaker AI domain, giving you access to SageMaker Studio and RStudio on SageMaker.
Get started with Amazon SageMaker JumpStartExplore Amazon SageMaker JumpStart solution templates that set up infrastructure for common use cases, and executable example notebooks for machine learning with SageMaker.
Learn how you can use Amazon SageMaker Unified Studio to build, deploy, execute, and monitor workflows from a single interface.
An integrated experience for all your data and AI with Amazon SageMaker Unified StudioLearn about the new Amazon SageMaker Unified Studio and explore how it unifies your analytic workloads.
Get started with Amazon SageMaker Unified StudioLearn how to gain access to Amazon SageMaker Unified Studio, create a project, and then add members to the project and use the sample JupyterLab notebook to begin building with a variety of tools and resources.
Explore Amazon Nova, a new generation of foundation models available on Amazon Bedrock.
Introducing Amazon Nova foundation models: Frontier intelligence and industry leading price performanceLearn about the new Amazon Nova models, including examples of using Amazon Nova models to analyze complex documents and videos, understand charts and diagrams, generate engaging video content, and build sophisticated AI agents.
Get started with Amazon Nova in the Amazon Bedrock consoleLearn how to use the playgrounds in the console to submit a text prompt to Amazon Nova models and generate a text or image response
Get an overview of Amazon Titan foundation models (FMs) to support your use cases.
Cost-effective document classification using the Amazon Titan Multimodal Embeddings ModelLearn how you can use this model to categorize and extract insights from high volumes of documents of different formats. This blog explores how you can use it to help determine the next set of actions to take, depending on the type of document.
Build generative AI applications with Amazon Titan Text Premier, Amazon Bedrock, and AWS CDKExplore building and deploying two sample applications powered by Amazon Titan Text Premier in this blog post.
Learn about AWS Trainium, the second-generation machine learning (ML) accelerator that AWS purpose built for deep learning training of 100B+ parameter models. Each EC2 Trn1 instance deploys up to 16 AWS Trainium accelerators to deliver a high-performance, low-cost solution for deep learning (DL) training in the cloud.
Recommended Trainium InstancesExplore how AWS Trainium instances are designed to provide high performance and cost efficiency for deep learning model inference workloads.
Scaling distributed training with AWS Trainium and Amazon EKSIf you're deploying your deep learning (DL) workloads using Amazon Elastic Kubernetes Service (EKS), learn how you can benefit from the general availability of EC2 Trn1 instances powered by Trainium—a purpose-built ML accelerator optimized to provide a high-performance, cost-effective, and massively scalable platform for training DL models in the cloud.
Understand how AWS designs accelerators to deliver high performance at the lowest cost for your deep learning (DL) inference applications.
AWS Inferentia2 builds on AWS Inferentia1 by delivering 4x higher throughput and 10x lower latencyUnderstand what Inferentia2 is optimized for and how it was designed to deliver higher performance, while lowering the cost of LLMs and generative AI inference.
Machine learning inference using InferentiaLearn how to create an Amazon EKS cluster with nodes running EC2 Inf1 instances and optionally deploy a sample application. EC2 Inf1 instances are powered by AWS Inferentia chips, which are custom built by AWS to provide high-performance and low-cost inference in the cloud.