Amazon Bedrock FAQs

General

Amazon Bedrock is a fully managed service that offers a choice of industry leading foundation models (FMs) along with a broad set of capabilities that you need to build generative AI applications, simplifying development with security, privacy, and responsible AI. With the comprehensive capabilities of Amazon Bedrock, you can experiment with a variety of top FMs, customize them privately with your data using techniques such as fine-tuning and retrieval-augmented generation (RAG), and create managed agents that execute complex business tasks—from booking travel and processing insurance claims to creating ad campaigns and managing inventory—all without writing any code. Since Amazon Bedrock is serverless, you don't have to manage any infrastructure, and you can securely integrate and deploy generative AI capabilities into your applications using the AWS services you are already familiar with.

Amazon Bedrock customers can choose from some of the most cutting-edge FMs available today. Currently, we offer 47 models, including language and embeddings models from:

AI21: Jamba 1.5 Large, Jamba 1.5 Mini, Jamba-Instruct, Jurassic-2 Mid, Jurassic-2 Ultra
Anthropic: Claude 3.5 Sonnet, Claude 3.5 Haiku, Claude 3 Opus, Claude 3 Haiku, Claude 3 Sonnet, Claude 2.1, Claude 2.0, Claude Instant
Cohere: Command R+, Command R, Command, Command Light, Embed - English, Embed – Multilingual
Meta: Llama 3.2 90B, Llama 3.2 11B, Llama 3.2 3B, Llama 3.2 1B, Llama 3.1 8B, Llama 3.1 70B, Llama 3.1 405B, Llama 3 8B, Llama 3 70B, Llama 2 13B, Llama 2 70B
Mistral AI: Mistral Large 2 (24.07), Mistral Large (24.02), Mistral Small (24.02), Mixtral 8x7B, Mistral 7B
Stability AI: Stable Image Ultra, Stable Diffusion 3 Large, Stable Image Core, Stable Diffusion XL 1.0
Amazon: Amazon Titan Text Premier, Amazon Titan Text Express, Amazon Titan Text Lite, Amazon Titan Text Embeddings, Amazon Titan Text Embeddings V2, Amazon Titan Multimodal Embeddings, Amazon Titan Image Generator, Amazon Titan Image Generator v2

There are five reasons to use Amazon Bedrock for building generative AI applications.

  • Choice of leading FMs: Amazon Bedrock offers an easy-to-use developer experience to work with a broad range of high-performing FMs from Amazon and leading AI companies like AI21 Labs, Anthropic, Cohere, Meta, Mistral AI, and Stability AI. You can quickly experiment with a variety of FMs in the playground, and use a single API for inference regardless of the models you choose, giving you the flexibility to use FMs from different providers and keep up to date with the latest model versions with minimal code changes.
  • Easy model customization with your data: Privately customize FMs with your own data through a visual interface without writing any code. Simply select the training and validation data sets stored in Amazon Simple Storage Service (Amazon S3) and, if required, adjust the hyperparameters to achieve the best possible model performance.
  • Fully managed agents that can invoke APIs dynamically to execute tasks: Build agents that execute complex business tasks—from booking travel and processing insurance claims to creating ad campaigns, preparing tax filings, and managing your inventory—by dynamically calling your company systems and APIs. Fully managed agents for Amazon Bedrock extend the reasoning capabilities of FMs to break down tasks, create an orchestration plan, and execute it.
  • Native support for RAG to extend the power of FMs with proprietary data: With Amazon Bedrock Knowledge Bases, you can securely connect FMs to your data sources for retrieval augmentation—from within the managed service—extending the FM’s already powerful capabilities and making it more knowledgeable about your specific domain and organization.
  • Data security and compliance certifications: Amazon Bedrock offers several capabilities to support security and privacy requirements. Amazon Bedrock is in scope for common compliance standards such as System and Organization Controls (SOC) and International Organization for Standardization (ISO), is Health Insurance Portability and Accountability Act (HIPAA) eligible, and customers can use Amazon Bedrock in compliance with the General Data Protection Regulation (GDPR). Amazon Bedrock is CSA Security Trust Assurance and Risk (STAR) Level 2 certified, which validates the use of best practices and the security posture of AWS cloud offerings. With Amazon Bedrock, your content is not used to improve the base models and is not shared with any model providers. Your data in Amazon Bedrock is always encrypted in transit and at rest, and you can optionally encrypt the data using your own keys. You can use AWS PrivateLink with Amazon Bedrock to establish private connectivity between your FMs and your Amazon Virtual Private Cloud (Amazon VPC) without exposing your traffic to the Internet.

With the serverless experience of Amazon Bedrock, you can quickly get started. Navigate to Amazon Bedrock in the AWS Management Console and try out the FMs in the playground. You can also create an agent and test it in the console. Once you’ve identified your use case, you can easily integrate the FMs into your applications using AWS tools without having to manage any infrastructure.
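Once you have model access, a first inference call from your own application can be as small as the following sketch. It assumes boto3 with configured AWS credentials, and the model ID shown is just one example:

```python
# Minimal sketch of a first Amazon Bedrock inference call using boto3.
# Assumes AWS credentials are configured and you have access to the model.
import boto3

client = boto3.client("bedrock-runtime", region_name="us-east-1")

response = client.converse(
    modelId="anthropic.claude-3-5-sonnet-20240620-v1:0",  # example model ID
    messages=[{"role": "user", "content": [{"text": "What is Amazon Bedrock?"}]}],
    inferenceConfig={"maxTokens": 256, "temperature": 0.5},
)

print(response["output"]["message"]["content"][0]["text"])
```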
See the Amazon Bedrock getting started course and the Amazon Bedrock user guide.

You can quickly get started with use cases:

  • Create new pieces of original content, such as short stories, essays, social media posts, and web page copy.
  • Search, find, and synthesize information to answer questions from a large corpus of data.
  • Create realistic and artistic images of various subjects, environments, and scenes from language prompts.
  • Help customers find what they’re looking for with more relevant and contextual product recommendations than word matching.
  • Get a summary of textual content such as articles, blog posts, books, and documents to get the gist without having to read the full content.
  • Suggest products that match shopper preferences and past purchases.

Explore more generative AI use cases.

Amazon Bedrock offers a playground that allows you to experiment with various FMs using a conversational chat interface. You can supply a prompt through a web interface inside the console and use the pretrained models to generate text or images, or alternatively use a fine-tuned model that has been adapted for your use case.

For a list of AWS Regions where Amazon Bedrock is available, see Amazon Bedrock endpoints and quotas in the Amazon Bedrock Reference Guide.

You can easily fine-tune FMs on Amazon Bedrock using labeled data, or use the continued pre-training feature to customize a model using unlabeled data. To get started, provide the training and validation datasets, configure the hyperparameters (epochs, batch size, learning rate, warmup steps), and submit the job. Within a couple of hours, your fine-tuned model can be accessed with the same API (InvokeModel).
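For illustration, a fine-tuning job can also be submitted programmatically, as in the following hedged sketch; the role ARN, S3 URIs, names, and hyperparameter values are placeholders you would replace with your own:

```python
# Hedged sketch of submitting a fine-tuning job with boto3; all ARNs, S3
# URIs, and names below are placeholders, not real resources.
import boto3

bedrock = boto3.client("bedrock", region_name="us-east-1")

bedrock.create_model_customization_job(
    jobName="my-finetune-job",                     # placeholder name
    customModelName="my-finetuned-model",          # placeholder name
    roleArn="arn:aws:iam::123456789012:role/BedrockCustomizationRole",  # placeholder
    baseModelIdentifier="amazon.titan-text-express-v1",
    customizationType="FINE_TUNING",
    trainingDataConfig={"s3Uri": "s3://my-bucket/train.jsonl"},          # placeholder
    validationDataConfig={"validators": [{"s3Uri": "s3://my-bucket/val.jsonl"}]},
    outputDataConfig={"s3Uri": "s3://my-bucket/output/"},
    hyperParameters={"epochCount": "2", "batchSize": "1", "learningRate": "0.00001"},
)
```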

Yes, you can train select publicly available models and import them into Amazon Bedrock using the Custom Model Import feature. Currently, this feature only supports the Llama 2/3, Mistral, and Flan architectures. For additional information, please refer to the documentation.
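A hedged sketch of what an import might look like with boto3 follows; the role ARN, names, and S3 URI are placeholders:

```python
# Hedged sketch of importing externally trained weights via Custom Model
# Import; assumes the weights use a supported architecture (Llama, Mistral, Flan).
import boto3

bedrock = boto3.client("bedrock", region_name="us-east-1")

bedrock.create_model_import_job(
    jobName="my-import-job",                 # placeholder name
    importedModelName="my-imported-llama",   # placeholder name
    roleArn="arn:aws:iam::123456789012:role/BedrockImportRole",  # placeholder
    modelDataSource={"s3DataSource": {"s3Uri": "s3://my-bucket/llama-weights/"}},
)
```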

Available in public preview, latency-optimized inference in Amazon Bedrock offers reduced latency without compromising accuracy. As verified by Anthropic, with latency-optimized inference on Amazon Bedrock, Claude 3.5 Haiku runs faster on AWS than anywhere else. Additionally, with latency-optimized inference in Bedrock, Llama 3.1 70B and 405B run faster on AWS than on any other major cloud provider. Using purpose-built AI chips like AWS Trainium2 and advanced software optimizations in Amazon Bedrock, customers can access more options to optimize their inference for a particular use case.

Key Features:

  • Reduces response times for foundation model interactions
  • Maintains accuracy while improving speed
  • Requires no additional setup or model fine-tuning

Supported Models: Anthropic's Claude 3.5 Haiku and Meta's Llama 3.1 405B and 70B

Availability: The US East (Ohio) Region via cross-region inference

To get started, visit the Amazon Bedrock console. For more information, visit the Amazon Bedrock documentation.

Accessing latency-optimized inference in Amazon Bedrock requires no additional setup or model fine-tuning, allowing for immediate enhancement of existing generative AI applications with faster response times. You can toggle on the “Latency optimized” parameter while invoking the Bedrock inference API.
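In the Python SDK, this corresponds to a performance configuration on the inference call, as in this hedged sketch; the inference profile ID is an example for Claude 3.5 Haiku via cross-region inference:

```python
# Hedged sketch of requesting latency-optimized inference via the Converse API.
import boto3

client = boto3.client("bedrock-runtime", region_name="us-east-2")

response = client.converse(
    modelId="us.anthropic.claude-3-5-haiku-20241022-v1:0",  # example inference profile
    messages=[{"role": "user", "content": [{"text": "Give me a one-line status update."}]}],
    performanceConfig={"latency": "optimized"},  # "standard" is the default
)

print(response["output"]["message"]["content"][0]["text"])
```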

 


Agents

Amazon Bedrock Agents are fully managed capabilities that make it easier for developers to create generative AI–based applications that can complete complex tasks for a wide range of use cases and deliver up-to-date answers based on proprietary knowledge sources. In just a few short steps, Amazon Bedrock Agents automatically break down tasks and create an orchestration plan, without any manual coding. The agent securely connects to company data through an API, automatically converts the data into a machine-readable format, and augments the request with relevant information to generate the most accurate response. Agents can then automatically call APIs to fulfill a user's request. For example, a manufacturing company might want to develop a generative AI application that automates tracking of inventory levels, sales data, and supply chain information, and that can recommend optimal reorder points and quantities to maximize efficiency. As fully managed capabilities, Amazon Bedrock Agents remove the undifferentiated heavy lifting of managing system integration and infrastructure provisioning, allowing developers to use generative AI to its full extent throughout their organization.

You can securely connect FMs to your company data sources using Amazon Bedrock Agents. With a knowledge base, you can use agents to give FMs in Amazon Bedrock access to additional data that helps the model generate more relevant, context-specific, and accurate responses without continually retraining the FM. Based on user input, agents identify the appropriate knowledge base, retrieve the relevant information, and add the information to the input prompt, giving the model more context information to generate a completion.
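For illustration, invoking an existing agent from Python might look like the following hedged sketch; the agent ID, alias ID, and session ID are placeholders:

```python
# Hedged sketch of invoking a Bedrock agent; IDs below are placeholders.
import boto3

runtime = boto3.client("bedrock-agent-runtime", region_name="us-east-1")

response = runtime.invoke_agent(
    agentId="AGENT1234",        # placeholder
    agentAliasId="ALIAS1234",   # placeholder
    sessionId="session-001",    # reuse the same ID to keep conversation context
    inputText="What is the current inventory level for SKU-42?",
)

# The response is an event stream; completion chunks carry the answer text.
for event in response["completion"]:
    if "chunk" in event:
        print(event["chunk"]["bytes"].decode("utf-8"), end="")
```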

Amazon Bedrock Agents can help you increase productivity, improve your customer service experience, and automate workflows (such as processing insurance claims).

With agents, developers have seamless support for monitoring, encryption, user permissions, versioning, and API invocation management without writing custom code. Amazon Bedrock Agents automate the prompt engineering and orchestration of user-requested tasks. Developers can use the agent-created prompt template as a baseline and refine it further for an enhanced user experience. They can update the user input, orchestration plan, and the FM response. With access to the prompt template, developers have better control over the agent orchestration.

With fully managed agents, you don’t have to worry about provisioning or managing infrastructure and can take applications to production faster.

Security

Any customer content processed by Amazon Bedrock is encrypted and stored at rest in the AWS Region where you are using Amazon Bedrock.

No. Users' inputs and model outputs are not shared with any model providers.

Amazon Bedrock offers several capabilities to support security and privacy requirements. Amazon Bedrock is in scope for common compliance standards such as FedRAMP Moderate, System and Organization Controls (SOC), and International Organization for Standardization (ISO), is Health Insurance Portability and Accountability Act (HIPAA) eligible, and customers can use Bedrock in compliance with the General Data Protection Regulation (GDPR). Amazon Bedrock is included in the scope of the SOC 1, 2, and 3 reports, allowing customers to gain insights into our security controls. We demonstrate compliance through extensive third-party audits of our AWS controls. Amazon Bedrock is one of the AWS services under ISO compliance for the ISO 9001, ISO 27001, ISO 27017, ISO 27018, ISO 27701, ISO 22301, and ISO 20000 standards. Amazon Bedrock is CSA Security Trust Assurance and Risk (STAR) Level 2 certified, which validates the use of best practices and the security posture of AWS cloud offerings. With Amazon Bedrock, your content is not used to improve the base models and is not shared with any model providers. You can use AWS PrivateLink to establish private connectivity from your Amazon VPC to Amazon Bedrock, without having to expose your data to internet traffic.

 

No, AWS and the third-party model providers will not use any inputs to or outputs from Amazon Bedrock to train Amazon Titan or any third-party models.

SDK

Amazon Bedrock supports SDKs for runtime services. The iOS and Android SDKs, as well as the Java, JavaScript, Python, CLI, .NET, Ruby, PHP, Go, and C++ SDKs, support both text and speech input.

Streaming is supported on all the SDKs.
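As a hedged Python sketch, streaming with the Converse API looks like the following; other SDKs expose equivalent streaming operations, and the model ID is just an example:

```python
# Hedged sketch of streaming a model response token by token with boto3.
import boto3

client = boto3.client("bedrock-runtime", region_name="us-east-1")

stream = client.converse_stream(
    modelId="anthropic.claude-3-5-sonnet-20240620-v1:0",  # example model ID
    messages=[{"role": "user", "content": [{"text": "Write a haiku about clouds."}]}],
)

# Print text deltas as they arrive instead of waiting for the full response.
for event in stream["stream"]:
    if "contentBlockDelta" in event:
        print(event["contentBlockDelta"]["delta"]["text"], end="")
```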

Billing and support

Please see the Amazon Bedrock pricing page for current pricing information.

Depending on your AWS Support contract, Amazon Bedrock is supported under Developer Support, Business Support and Enterprise Support plans.

You can use Amazon CloudWatch metrics to track input and output tokens.
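For example, here is a hedged sketch of pulling hourly input-token counts from CloudWatch; it assumes the AWS/Bedrock namespace and the InputTokenCount metric with a ModelId dimension:

```python
# Hedged sketch of reading Bedrock token metrics from Amazon CloudWatch.
from datetime import datetime, timedelta, timezone
import boto3

cloudwatch = boto3.client("cloudwatch", region_name="us-east-1")

now = datetime.now(timezone.utc)
stats = cloudwatch.get_metric_statistics(
    Namespace="AWS/Bedrock",          # assumed namespace for Bedrock runtime metrics
    MetricName="InputTokenCount",     # use OutputTokenCount for output tokens
    Dimensions=[{"Name": "ModelId", "Value": "anthropic.claude-3-5-sonnet-20240620-v1:0"}],
    StartTime=now - timedelta(hours=24),
    EndTime=now,
    Period=3600,
    Statistics=["Sum"],
)

for point in sorted(stats["Datapoints"], key=lambda p: p["Timestamp"]):
    print(point["Timestamp"], point["Sum"])
```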

Customization

With Amazon Bedrock, you can privately customize FMs, retaining control over how your data is used and encrypted. Amazon Bedrock makes a separate copy of the base FM and trains this private copy of the model. Your data, including prompts, information used to supplement a prompt, and FM responses, is not used to train the original base models. Customized FMs remain in the Region where the API call is processed.

When you’re fine-tuning a model, your data is never exposed to the public internet, never leaves the AWS network, is securely transferred through your VPC, and is encrypted in transit and at rest. Amazon Bedrock also enforces the same AWS access controls that you have with any of our other services.

We launched continued pretraining for Amazon Titan Text Express and Amazon Titan models on Amazon Bedrock. Continued pretraining allows you to continue the pretraining on an Amazon Titan base model using large amounts of unlabeled data. This type of training will adapt the model from a general domain corpus to a more specific domain corpus such as medical, law, finance, and so on, while still preserving most of the capabilities of the Amazon Titan base model. 

Enterprises may want to build models for tasks in a specific domain. The base models may not be trained on the technical jargon used in that specific domain. Thus, directly fine-tuning the base model requires large amounts of labeled training records and a long training duration to get accurate results. To ease this burden, the customer can instead provide large amounts of unlabeled data for a continued pretraining job. This job will adapt the Amazon Titan base model to the new domain. Then the customer may fine-tune the newly pretrained custom model to downstream tasks, using significantly fewer labeled training records and with a shorter training duration. 

Amazon Bedrock continued pretraining and fine-tuning have very similar requirements. For this reason, we are choosing to create unified APIs that support both continued pretraining and fine-tuning. Unification of the APIs reduces the learning curve and will help customers use standard features such as Amazon EventBridge to track long running jobs, Amazon S3 integration for fetching training data, resource tags, and model encryption. 

Continued pretraining helps you adapt the Amazon Titan models to your domain specific data while still preserving the base functionality of the Amazon Titan models. To create a continued pretraining job, navigate to the Amazon Bedrock console and click on "Custom Models." You will navigate to the custom model page that has two tabs: Models and Training jobs. Both tabs provide a “Customize Model” drop-down menu on the right. Select “Continued Pretraining” from the drop-down menu to navigate to “Create Continued Pretraining Job." You will provide the source model, name, model encryption, input data, hyper-parameters and output data. Additionally, you can provide tags, along with details about AWS Identity and Access Management (IAM) roles and resource policies for the job.
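Because the APIs are unified, the same job-submission call used for fine-tuning also covers continued pretraining. A hedged sketch follows, with placeholder names, role ARN, and S3 URIs:

```python
# Hedged sketch of a continued pretraining job on an unlabeled corpus;
# all names, ARNs, and S3 URIs are placeholders.
import boto3

bedrock = boto3.client("bedrock", region_name="us-east-1")

bedrock.create_model_customization_job(
    jobName="my-cpt-job",                        # placeholder name
    customModelName="my-domain-adapted-titan",   # placeholder name
    roleArn="arn:aws:iam::123456789012:role/BedrockCustomizationRole",  # placeholder
    baseModelIdentifier="amazon.titan-text-express-v1",
    customizationType="CONTINUED_PRE_TRAINING",  # vs. FINE_TUNING for labeled data
    trainingDataConfig={"s3Uri": "s3://my-bucket/unlabeled-corpus.jsonl"},  # placeholder
    outputDataConfig={"s3Uri": "s3://my-bucket/cpt-output/"},
    hyperParameters={"epochCount": "1", "batchSize": "1", "learningRate": "0.00001"},
)
```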

Amazon Titan

Exclusive to Amazon Bedrock, the Amazon Titan family of models incorporates 25 years of Amazon experience innovating with AI and machine learning across the business. Amazon Titan FMs provide customers with a breadth of high-performing image, multimodal, and text model choices through a fully managed API. Amazon Titan models are created by AWS and pretrained on large datasets, making them powerful, general-purpose models built to support a variety of use cases, while also supporting the responsible use of AI. Use them as is or privately customize them with your own data. Learn more about Amazon Titan.

To learn more about data processed to develop and train Amazon Titan FMs, visit the Amazon Titan Model Training and Privacy page.

Knowledge Bases / RAG

You can ingest content from various sources, including the web, Amazon Simple Storage Service (Amazon S3), Confluence (preview), Salesforce (preview), and SharePoint (preview). You can also programmatically ingest streaming data or data from unsupported sources. In addition, you can connect to structured data sources such as your Amazon Redshift data warehouse and the AWS Glue Data Catalog.

Amazon Bedrock Knowledge Bases provides a managed natural-language-to-SQL capability to convert natural language into actionable SQL queries and retrieve data, allowing you to build applications using data from these sources.

Yes, session context management is built-in, allowing your applications to maintain context across multiple interactions, which is essential for supporting multi-turn conversations.

Yes, all information retrieved includes citations, improving transparency and minimizing the risk of hallucinations in the generated responses.
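As a hedged sketch, a retrieval-augmented query against a knowledge base might look like this in Python; the knowledge base ID and model ARN are placeholders, and the returned citations back each generated answer:

```python
# Hedged sketch of querying a knowledge base with retrieval-augmented
# generation; the knowledge base ID and model ARN are placeholders.
import boto3

runtime = boto3.client("bedrock-agent-runtime", region_name="us-east-1")

response = runtime.retrieve_and_generate(
    input={"text": "What is our refund policy?"},
    retrieveAndGenerateConfiguration={
        "type": "KNOWLEDGE_BASE",
        "knowledgeBaseConfiguration": {
            "knowledgeBaseId": "KB123456",  # placeholder
            "modelArn": "arn:aws:bedrock:us-east-1::foundation-model/"
                        "anthropic.claude-3-5-sonnet-20240620-v1:0",
        },
    },
    # Pass sessionId from a previous response to keep multi-turn context.
)

print(response["output"]["text"])
for citation in response["citations"]:       # sources backing the answer
    for ref in citation["retrievedReferences"]:
        print(ref["location"])
```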

Amazon Bedrock Knowledge Bases supports multi-modal data processing, allowing developers to build generative AI applications that analyze both text and visual data, including images, charts, diagrams, and tables. Model responses can leverage insights from visual elements in addition to text, providing more accurate and contextually relevant answers. Additionally, source attribution for responses includes visual elements, enhancing transparency and trust in the responses.

Amazon Bedrock Knowledge Bases can process visually rich documents in PDF format, which may contain images, tables, charts, and diagrams. For image-only data, Bedrock Knowledge Bases supports standard image formats like JPEG and PNG, enabling search capabilities where users can retrieve relevant images based on text-based queries.

Customers have three parsing options for Bedrock Knowledge Bases. For text-only processing, the built-in default Bedrock parser is available at no additional cost, ideal for cases where multimodal data processing is not required. Amazon Bedrock Data Automation (BDA) or foundation models can be used to parse multimodal data. For more information, refer to the product documentation.

Amazon Bedrock Knowledge Bases handles various workflow complexities such as content comparison, failure handling, throughput control, and encryption, ensuring that your data is securely processed and managed according to AWS's stringent security standards.

Model evaluation

Model Evaluation on Amazon Bedrock allows you to evaluate, compare, and select the best FM for your use case in just a few short steps. Amazon Bedrock offers a choice of automatic evaluation and human evaluation. You can use automatic evaluation with predefined metrics such as accuracy, robustness, and toxicity. You can use human evaluation workflows for subjective or custom metrics such as friendliness, style, and alignment to brand voice. For human evaluation, you can use your in-house employees or an AWS-managed team as reviewers. Model Evaluation on Amazon Bedrock provides built-in curated datasets or you can bring your own datasets.

You can evaluate a variety of predefined metrics such as accuracy, robustness, and toxicity using automatic evaluations. You can also use human evaluation workflows for subjective or custom metrics, such as friendliness, relevance, style, and alignment to brand voice.

Automatic evaluations allow you to quickly narrow down the list of available FMs against standard criteria (such as accuracy, toxicity and robustness). Human-based evaluations are often used to evaluate more nuanced or subjective criteria that require human judgment and where automatic evaluations might not exist (such as brand voice, creative intent, friendliness).

You can quickly evaluate Amazon Bedrock models for metrics such as accuracy, robustness, and toxicity by using curated built-in data sets or by bringing your own prompt datasets. After your prompt datasets are sent to Amazon Bedrock models for inference, the model responses are scored with evaluation algorithms for each dimension. The backend engine aggregates individual prompt response scores into summary scores and presents them through easy-to-understand visual reports.
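A hedged sketch of starting an automatic evaluation job programmatically follows; the role ARN, S3 URIs, and dataset name are placeholders, and the Builtin.* metric names follow the convention the service uses:

```python
# Hedged sketch of an automatic model evaluation job; ARNs, URIs, and the
# dataset name are placeholders.
import boto3

bedrock = boto3.client("bedrock", region_name="us-east-1")

bedrock.create_evaluation_job(
    jobName="my-eval-job",  # placeholder name
    roleArn="arn:aws:iam::123456789012:role/BedrockEvalRole",  # placeholder
    evaluationConfig={
        "automated": {
            "datasetMetricConfigs": [{
                "taskType": "Summarization",
                "dataset": {
                    "name": "my-prompts",  # placeholder; built-in datasets also exist
                    "datasetLocation": {"s3Uri": "s3://my-bucket/prompts.jsonl"},
                },
                "metricNames": ["Builtin.Accuracy", "Builtin.Robustness", "Builtin.Toxicity"],
            }]
        }
    },
    inferenceConfig={
        "models": [{"bedrockModel": {"modelIdentifier": "anthropic.claude-3-5-sonnet-20240620-v1:0"}}]
    },
    outputDataConfig={"s3Uri": "s3://my-bucket/eval-results/"},
)
```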

Amazon Bedrock allows you to set up human review workflows in a few short steps and bring your in-house employees, or use an expert team managed by AWS, to evaluate models. Through Amazon Bedrock’s intuitive interface, humans can review and give feedback on model responses by clicking thumbs up or down, rating on a scale of 1-5, choosing the best of multiple responses, or ranking prompts. For example, a team member can be shown how two models respond to the same prompt, and then be asked to select the model that shows more accurate, relevant, or stylistic outputs. You can specify the evaluation criteria that matter to you by customizing the instructions and buttons to appear on the evaluation UI for your team. You can also provide detailed instructions with examples and the overall goal of model evaluation, so users can align their work accordingly. This method is useful to evaluate subjective criteria that require human judgement or more nuanced subject matter expertise and that cannot be easily judged by automatic evaluations.

Responsible AI

Amazon Bedrock Guardrails help you implement safeguards for your generative AI applications based on your use cases and responsible AI policies. Guardrails helps control the interaction between users and FMs by filtering undesirable and harmful content and will soon redact personally identifiable information (PII), enhancing content safety and privacy in generative AI applications. You can create multiple guardrails with different configurations tailored to specific use cases. Additionally, with the guardrails you can continually monitor and analyze user inputs and FM responses that might violate customer-defined policies.

Guardrails help you define a set of policies to safeguard your generative AI applications. You can configure the following policies in a guardrail.

  • Contextual grounding checks: help detect and filter hallucinations when responses are not grounded in the source information (for example, factually inaccurate or new information) or are irrelevant to the user's query or instruction.
  • Automated Reasoning checks: help detect factual inaccuracies in generated content, suggest corrections, and explain why responses are accurate by checking against a structured, mathematical representation of knowledge called an Automated Reasoning Policy.
  • Content filters: help you configure thresholds to detect and filter harmful text content across categories such as hate, insults, sexual, violence, misconduct, and prompt attacks. Additionally, content filters can detect and filter harmful image content across these categories thereby helping build safe multimodal applications.
  • Denied topics: help you define a set of topics that are undesirable in the context of your application. For example, an online banking assistant can be designed to refrain from providing investment advice.
  • Word filters: help you define a set of words to block in user inputs and FM–generated responses.
  • Sensitive information filter: helps you redact sensitive information, such as a set of PII, in FM–generated responses. Based on the use case, Guardrails can also help you block a user input if it contains PII.

Amazon Bedrock Guardrails works with a wide range of models, including FMs supported in Amazon Bedrock, fine-tuned models, and self-hosted models outside Amazon Bedrock. User inputs and model outputs can be evaluated independently for third-party and self-hosted models using the ApplyGuardrail API. Amazon Bedrock Guardrails can also be integrated with Amazon Bedrock Agents and Amazon Bedrock Knowledge Bases to build safe and secure generative AI applications aligned with responsible AI policies.
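For a self-hosted or third-party model, a hedged sketch of evaluating text independently with the ApplyGuardrail API looks like this; the guardrail ID and version are placeholders:

```python
# Hedged sketch of checking text against a guardrail without invoking a
# Bedrock model; guardrail ID and version are placeholders.
import boto3

runtime = boto3.client("bedrock-runtime", region_name="us-east-1")

result = runtime.apply_guardrail(
    guardrailIdentifier="gr-abc123",  # placeholder
    guardrailVersion="1",
    source="INPUT",                   # use "OUTPUT" to check a model response
    content=[{"text": {"text": "Can you give me investment advice?"}}],
)

print(result["action"])  # e.g., "GUARDRAIL_INTERVENED" or "NONE"
```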

There are five guardrail policies, each with different off-the-shelf protections; a configuration sketch follows the list.

  • Content filters – These have six off-the-shelf categories (hate, insults, sexual, violence, misconduct (including criminal activity), and prompt attack (jailbreak and prompt injection)). Each category can have further customized thresholds in terms of aggressiveness of filtering (low/medium/high) for both text and image content.
  • Denied topics – These are customized topics that customers can define using a simple natural language description.
  • Sensitive information filters – These come with 30+ off-the-shelf PII types. They can be further customized by adding the customer's proprietary information that is sensitive.
  • Word filters – These come with off-the-shelf profanity filtering and can be further customized with custom words.
  • Contextual grounding checks – These can help detect hallucinations for RAG, summarization, and conversational applications, where source information can be used as a reference to validate the model response.
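Here is a hedged configuration sketch combining several of these policy types in one guardrail; the name, thresholds, topic, and messages are illustrative only:

```python
# Hedged sketch of creating a guardrail that combines content filters, a
# denied topic, and sensitive information filters; values are examples only.
import boto3

bedrock = boto3.client("bedrock", region_name="us-east-1")

bedrock.create_guardrail(
    name="banking-assistant-guardrail",  # example name
    contentPolicyConfig={"filtersConfig": [
        {"type": "HATE", "inputStrength": "HIGH", "outputStrength": "HIGH"},
        {"type": "PROMPT_ATTACK", "inputStrength": "HIGH", "outputStrength": "NONE"},
    ]},
    topicPolicyConfig={"topicsConfig": [{
        "name": "Investment advice",
        "definition": "Guidance on investing money, for example in stocks or funds.",
        "type": "DENY",
    }]},
    sensitiveInformationPolicyConfig={"piiEntitiesConfig": [
        {"type": "US_SOCIAL_SECURITY_NUMBER", "action": "BLOCK"},
        {"type": "PHONE", "action": "ANONYMIZE"},
    ]},
    blockedInputMessaging="Sorry, I can't help with that request.",
    blockedOutputsMessaging="Sorry, I can't share that response.",
)
```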

Foundation models have native safeguards, and these are the default protections associated with each model. These native safeguards are NOT part of Amazon Bedrock Guardrails. Amazon Bedrock Guardrails is an added layer of customized safeguards that can be optionally applied by the customer based on their application requirements and responsible AI policies.


As part of Amazon Bedrock Guardrails, SSN and phone number detection are included among the 30+ off-the-shelf PII types. Full list here.

There is a separate cost for using Amazon Bedrock Guardrails, which can be applied to both inputs and outputs. Pricing is listed at the bottom of the Amazon Bedrock pricing page. The pricing for image support with content filters (currently in public preview) will be announced at general availability (GA).

Yes, Amazon Bedrock Guardrails APIs help customers run automated tests. A “test case builder” may be something you want to use before deploying guardrails in production; there is no native test case builder yet. For ongoing monitoring of production traffic, guardrails provide detailed logs of all violations for each input and output, so customers can granularly monitor each and every input coming into and going out of their generative AI application. These logs can be stored in CloudWatch or Amazon S3 and can be used to create custom dashboards based on customers' requirements.

Using an Automated Reasoning Policy, Automated Reasoning checks can point out both accurate claims and factual inaccuracies in content. For both accurate and inaccurate statements, Automated Reasoning checks provide verifiable, logical explanations for their output. Automated Reasoning checks require upfront involvement from a domain expert to create a Policy and only support content that defines rules. On the other hand, contextual grounding checks in Bedrock Guardrails use machine learning techniques to ensure the generated content closely follows the documents that were provided as input from a knowledge base, without requiring any additional upfront work. Both Automated Reasoning checks and contextual grounding checks provide their feedback in the Guardrail API output. You can use the feedback to update the generated content.

Marketplace

Amazon Bedrock Marketplace offers customers over 100 popular, emerging, or specialized models, in addition to the serverless FMs of Amazon Bedrock so customers can easily build and optimize their generative AI applications. Within the Amazon Bedrock console, customers will be able to discover a broad catalog of FMs offered by various providers. You can then deploy these models onto fully managed endpoints, where you can choose your desired number of instances and instance types. Once the models are deployed, the models can be accessed through Amazon Bedrock’s Invoke API. For chat-tuned, text-to-text models, customers can use our new Converse API, a unified API that abstracts FM differences and enables model switching with a single parameter change. Where applicable, the models can be used with Amazon Bedrock Playground, Agents, Knowledge Bases, Prompt Management, Prompt Flows, Guardrails, and Model Evaluation.
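The single-parameter model switch the Converse API enables can be illustrated with a hedged sketch like the following, where the same request shape is reused across two example serverless models; a compatible Marketplace endpoint would be referenced the same way, via its endpoint identifier as the modelId:

```python
# Hedged sketch of switching models with a single parameter change using the
# Converse API; the two model IDs are serverless examples.
import boto3

client = boto3.client("bedrock-runtime", region_name="us-east-1")
messages = [{"role": "user", "content": [{"text": "Name three uses of RAG."}]}]

for model_id in [
    "anthropic.claude-3-5-sonnet-20240620-v1:0",
    "meta.llama3-1-70b-instruct-v1:0",
]:
    reply = client.converse(modelId=model_id, messages=messages)
    print(model_id, "->", reply["output"]["message"]["content"][0]["text"][:80])
```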

You should use Amazon Bedrock Marketplace to benefit from the powerful models that are emerging rapidly as the generative AI industry continues to innovate. You can quickly access and deploy popular, emerging, and specialized models tailored to your unique requirements, which can accelerate time-to-market, improve the accuracy, or reduce the cost of your generative AI workflows. You can access the models through Bedrock's unified APIs and, if they are compatible with Bedrock's Converse API, use them natively with Bedrock tools such as Agents, Knowledge Bases, and Guardrails. You can easily connect Amazon Bedrock Marketplace models to Amazon Bedrock's serverless models, all from a single place.
 

Simply navigate to the Amazon Bedrock Model Catalog page in the Bedrock console, where you can search for Amazon Bedrock Marketplace model listings along with the serverless Amazon Bedrock models. After you have selected the Amazon Bedrock Marketplace model you want to use, you can subscribe to the model through the Model Detail page, accepting the EULA and price(s) set by the provider. Once the subscription is complete, which typically takes a few minutes, you can deploy the model to a fully managed SageMaker endpoint by clicking Deploy on the Model Detail page or by using APIs. In the deployment step, you can select your desired number of instances and instance types to meet your workload. Once the endpoint is set up, which typically takes 10-15 minutes, you can start making inference calls to the endpoint and use the model in Bedrock's advanced tools, provided the model is compatible with Bedrock's Converse API.

Models with architectures supported by Custom Model Import (Mistral, Mixtral, Flan, and Llama 2/3/3.1/3.2) can be fine-tuned in SageMaker and made available in Amazon Bedrock via Custom Model Import. Models that are not supported by Custom Model Import can still be fine-tuned in SageMaker; however, the fine-tuned versions of these models cannot be used in Amazon Bedrock.

Data Automation

Amazon Bedrock Data Automation is a generative AI-powered capability of Bedrock that streamlines the development of generative AI applications and automates workflows involving documents, images, audio, and videos. By leveraging Bedrock Data Automation, developers can reduce development time and effort, making it easier to build intelligent document processing, media analysis, and other multimodal data-centric automation solutions.

Bedrock Data Automation offers industry-leading accuracy at lower cost than alternative solutions, along with features such as visual grounding with confidence scores for explainability and built-in hallucination mitigation. This ensures trustworthy and accurate insights from unstructured, multimodal data sources. Customers can easily customize Bedrock Data Automation output to generate specific insights in consistent formats required by their systems and applications.

Developers get started with Bedrock Data Automation on the Amazon Bedrock console, where they can configure and customize output using their sample data. They can then integrate Bedrock Data Automation's unified multimodal inference API into their applications to process their unstructured content at production scale with high accuracy and consistency. Bedrock Data Automation is also integrated with Bedrock Knowledge Bases, making it easier for developers to generate meaningful information from their unstructured multimodal content to provide more relevant responses for retrieval augmented generation (RAG).

Bedrock Data Automation makes it easy to transform unstructured enterprise data into application-specific output formats that can be utilized by gen AI applications and ETL workflows. Customers no longer need to spend time and effort managing and orchestrating multiple models, engineering prompts, implementing safety guardrails, or stitching together outputs to align to downstream system requirements. Bedrock Data Automation delivers highly accurate, consistent, and cost-effective processing of unstructured data. Bedrock Data Automation is built with responsible AI in mind, providing customers with key features such as visual grounding and confidence scores, that make it easy to integrate Bedrock Data Automation within enterprise workflows.

Bedrock Data Automation capabilities are available via a fully managed API that customers can easily integrate into their applications. Customers do not need to worry about scaling underlying compute resources, selecting and orchestrating models, or managing prompts for FMs.

A blueprint is a feature that customers use to specify their output requirements using natural language or a schema editor. It includes a list of fields that they desire to extract, a data format for each field, and natural language instructions for each field. For example, developers can type, “Create a blueprint for invoices with the following fields: tax, dueDate, ReceiptDate” or “Confirm the invoice total matches the sum of line items.” They reference blueprints as part of the inference API calls so that the system returns information in the format described in the blueprint.
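Purely as a hypothetical illustration of what a blueprint captures (the actual blueprint format is defined by the service), the invoice example above could be represented along these lines:

```python
# Hypothetical, hedged representation of a blueprint's contents: a list of
# fields, a data format for each field, and per-field natural language
# instructions. This is illustrative, not the service's actual schema.
invoice_blueprint = {
    "name": "invoices",
    "fields": [
        {"name": "tax", "format": "number", "instruction": "Total tax on the invoice."},
        {"name": "dueDate", "format": "date", "instruction": "Date payment is due."},
        {"name": "ReceiptDate", "format": "date", "instruction": "Date the invoice was received."},
        {
            "name": "totalMatchesLineItems",
            "format": "boolean",
            "instruction": "Confirm the invoice total matches the sum of line items.",
        },
    ],
}
```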

Documents

Bedrock Data Automation supports both standard output and custom output for documents.

  • Standard output will provide extraction of text from documents and generative output such as document summary and captions for tables/figures/diagrams. Output is returned in reading order and can optionally be grouped by layout element, which will include headers/footers/titles/tables/figures/diagrams. Standard output will be used for BDA integration with Bedrock Knowledge Bases.
  • Custom Output leverages blueprints, which specify output requirements using natural language or a schema editor. Blueprints include a list of fields to extract and a data format for each field.

Bedrock Data Automation will support PDF, PNG, JPG, TIFF, a max of 100 pages, and a max file size of 500MB per API request. BDA will support a max concurrency of 5 document packages and throughput of 1 page per second per customer.

Images

Bedrock Data Automation supports both standard output and custom output for images.

  • Standard output will provide summarization, detected explicit content, detected text, and Interactive Advertising Bureau (IAB) ad taxonomy for images. Standard output will be used for BDA integration with Bedrock Knowledge Bases.
  • Custom Output leverages blueprints, which specify output requirements using natural language or a schema editor. Blueprints include a list of fields to extract and a data format for each field.

Bedrock Data Automation will support JPG, PNG, a max resolution of 4K, and a max file size of 5 MB per API request. BDA will support a max concurrency of 100 images at 1 image per second per customer.

Videos

Bedrock Data Automation supports standard output for videos.

  • Standard output will provide full video summary, scene segmentation, scene summary, full audio transcription, speaker identification, detected explicit content, detected text, and Interactive Advertising Bureau (IAB) taxonomy for videos. Full video summary is optimized for content with descriptive dialogue such as product overviews, trainings, news casts, and documentaries.

Bedrock Data Automation will support MOV and MKV with H.264, a max video duration of 4 hours, and a max file size of 2 GB per API request. BDA will support a max concurrency of 25 videos at 20 video minutes per minute per customer.

Audio

Bedrock Data Automation supports standard output for audio.

  • Standard output will provide summarization, full transcription, and detected explicit content for audio files.

Bedrock Data Automation will support FLAC, M4A, MP3, MP4, Ogg, WebM, and WAV, a max audio duration of 4 hours, and a max file size of 2 GB per API request.

Amazon Bedrock Data Automation is currently available in the US West (Oregon) Region.

IDE

Amazon Bedrock IDE (preview) is a governed collaborative environment integrated within Amazon SageMaker Unified Studio (preview) that enables developers to quickly build and iterate on generative AI applications using high-performing foundation models (FMs). It provides an intuitive interface to experiment with these models, collaborate on projects, and streamline access to various Bedrock tools and resources in order to build generative AI applications quickly.

To access Amazon Bedrock IDE within Amazon SageMaker Unified Studio, developers and their admins will need to follow these steps:

  1. Create a new domain in Amazon SageMaker Unified Studio.
  2. Enable the Gen AI application development project profile.
  3. Access Amazon Bedrock IDE using their company's single sign-on (SSO) credentials within Amazon SageMaker Unified Studio.

Amazon Bedrock IDE, now integrated into Amazon SageMaker Unified Studio, builds upon Amazon Bedrock Studio (preview) with several key improvements. It provides access to advanced AI models from leading companies, tools for creating and testing AI prompts, and seamless integration with Bedrock Knowledge Bases, Guardrails, Flows, and Agents. Teams can collaborate in a shared workspace to build custom AI applications tailored to their needs.

New features in Bedrock IDE include a model hub for side-by-side AI model comparison, an expanded playground supporting chat, image, and video interactions, and improved Knowledge Base creation with web crawling. It introduces Agent creation for more complex chat applications and simplifies sharing of AI apps and prompts within organizations. Bedrock IDE also offers access to underlying application code and the ability to export chat apps as CloudFormation templates. By managing AWS infrastructure details, it enables users of various skill levels to create AI applications more efficiently, making it a more versatile and powerful tool than its predecessor.

Amazon Bedrock IDE enables collaboration among teams by providing a governed development environment within Amazon SageMaker Unified Studio. Teams can create projects, invite colleagues, and collaboratively build generative AI applications. They can receive quick feedback on their prototypes and share the applications with anyone in Amazon SageMaker Unified Studio or with specific users in the domain. Robust access controls and governance features allow only authorized members to access project resources such as data or the generative AI applications, supporting data privacy and compliance, and thus fostering secure cross-functional collaboration and sharing. In addition, generative AI applications can be shared from a builder to specific users in the Amazon SageMaker Unified Studio domain, or with specific individuals, allowing for proper access rights, controls, and governance of such assets.

Amazon Bedrock IDE's integration into Amazon SageMaker Unified Studio represents AWS's move to simplify and streamline generative AI development. This integration creates a comprehensive environment that breaks down barriers between data, tools, and developers, enabling efficient building and deployment of generative AI applications.

The unified environment allows seamless collaboration among developers of various skill levels throughout the development lifecycle - from data preparation to model development and generative AI application building. Teams can access integrated tools for knowledge base creation, model fine-tuning, and high-performing generative AI application development, all within a secure and governed framework.

Within Amazon SageMaker Unified Studio, developers can effortlessly switch between different tools based on their needs, combining analytics, machine learning, and generative AI capabilities in a single workspace. This consolidated approach reduces development complexity and accelerates time-to-value for generative AI projects.

By bringing Amazon Bedrock IDE into Amazon SageMaker Unified Studio, AWS lowers the barriers to entry for generative AI development while maintaining enterprise-grade security and governance, ultimately enabling organizations to innovate faster and more effectively with generative AI.
 

Previously, Amazon Bedrock Studio was available as a preview feature accessed through the AWS Management Console. Amazon Bedrock Studio has now been renamed Amazon Bedrock IDE and is in preview within Amazon SageMaker Unified Studio, providing a dedicated environment for building, evaluating, and sharing generative AI applications with advanced capabilities like Knowledge Bases, Guardrails, Agents, Flows, and prompt engineering tools. This integration into Amazon SageMaker Unified Studio offers a more feature-rich, governed, and collaborative development experience compared to the previous preview version in the AWS Management Console.

All of Amazon Bedrock Studio's functionality is now part of Amazon SageMaker Unified Studio under Amazon Bedrock IDE. The Generative AI Playground, available in the 'Discover' section of Amazon SageMaker Unified Studio, allows you to experiment with foundation models (FMs) and any generative AI applications shared by your colleagues through a conversational interface. Amazon Bedrock IDE, the full generative AI application environment, is located in the 'Build' section of Amazon SageMaker Unified Studio and can be accessed through projects.

Regarding when to use each offering:

  • Existing Amazon Bedrock Studio in the AWS Management Console: You can continue using the existing Amazon Bedrock Studio in the AWS Console for ongoing projects until 02/28/2025, after which support will end. To access Amazon Bedrock IDE within Amazon SageMaker's governed environment, you will need to set up a new Amazon SageMaker domain that includes it.
  • Generative AI Playground in Amazon SageMaker Unified Studio (Discover section): Use the chat, image and video playgrounds for initial experimentation with FMs, testing different models and configurations before building applications in Amazon Bedrock IDE.
  • Amazon Bedrock IDE in Amazon SageMaker Unified Studio (Build section): Utilize Amazon Bedrock IDE, available in the Build section, to take advantage of advanced capabilities for building production-ready generative AI applications. These include integrated governance, secure collaboration, Knowledge Bases, Agents, Flows, Guardrails, and prompt engineering tools.

Amazon Bedrock IDE is a governed collaborative environment focused on building generative AI applications using foundation models (FMs). Integrated within Amazon SageMaker Unified Studio, it provides an intuitive interface to access and experiment with Bedrock's high-performing FMs, as well as tools for customization like Knowledge Bases, Guardrails, Agents, and Flows.

Within Amazon SageMaker Unified Studio, Amazon Bedrock IDE seamlessly integrates with Amazon SageMaker's analytics, machine learning (ML), and generative AI capabilities. Users can leverage analytics services to generate insights from their data, build ML models using Amazon SageMaker AI's training and deployment tools, and combine these components with generative AI applications created in Amazon Bedrock IDE. This unified environment enables end-to-end development of data-driven applications that combine analytics, ML, and generative AI capabilities. Users can build and deploy ML and generative AI models, create and share generative AI applications tailored with proprietary data and customizations, and streamline collaboration, all within the same governed Amazon SageMaker Unified Studio environment.
 

Existing Amazon Bedrock Studio users who have been accessing the service through the AWS Management Console cannot directly migrate their projects to Amazon SageMaker Unified Studio. To access Amazon Bedrock IDE within Amazon SageMaker's governed environment, developers and their admins will need to create a new domain in Amazon SageMaker Unified Studio, enable the Gen AI application development project profile and access Amazon Bedrock IDE using their company's single sign-on (SSO) credentials within Amazon SageMaker Unified Studio.

However, existing users can continue to access Amazon Bedrock Studio (Preview) through the AWS Management Console until 02/28/2025. After this date, they will need to transition to the new Amazon Bedrock IDE experience within Amazon SageMaker Unified Studio.

Amazon Bedrock IDE within Amazon SageMaker Unified Studio is bound by the account limits and quotas defined for the platform and the underlying Amazon Bedrock resources, such as foundation models (FMs), Knowledge Bases, Agents, Flows, and Guardrails.

Amazon Bedrock IDE comes at no extra cost, and users only pay for the usage of the underlying resources required by the generative AI applications they build. For example, customers will only pay for the associated model, Guardrails, and Knowledge Bases that they use in their generative AI application. For more information, please visit the Amazon Bedrock pricing page.

Amazon Bedrock IDE within Amazon SageMaker Unified Studio is bound by the same SLAs as Amazon Bedrock. For more information, visit the Amazon Bedrock Service Level Agreement page.

To facilitate a smooth onboarding experience with Amazon Bedrock IDE in Amazon SageMaker Unified Studio, you can find detailed documentation on the Amazon Bedrock IDE User Guide. If you have any additional questions or need further assistance, please don't hesitate to reach out to your AWS account team.