How to Use Amazon Bedrock: Complete Beginner's Guide (2026)
Last verified: May 14, 2026 · Format: Guide · Est. time: 20-25 min
By the end of this guide, you will have made your first API call to a foundation model on Amazon Bedrock, connected a document store through Knowledge Bases for retrieval-augmented generation, and configured Guardrails to filter harmful content before it reaches your users. Every step includes a verification checkpoint so you know it worked before moving on.
Bedrock is not a model. It is a fully managed, serverless platform that gives you API access to over 100 foundation models from providers including Anthropic (Claude), Meta (Llama), Mistral AI, Amazon (Nova), and others. You never provision a GPU, manage an endpoint, or download model weights. If your infrastructure already runs on AWS, Bedrock turns generative AI into a configuration choice rather than an infrastructure project. For a deeper look at what the platform offers, see our What Is Amazon Bedrock breakdown.
What You Need Before Starting
Amazon Bedrock runs inside the AWS Management Console. Unlike consumer AI chatbots, you need an AWS account with billing enabled and appropriate IAM permissions before you can make your first request. The service itself has no minimum spend or upfront commitment on the Standard tier. You pay per token (or per image, per second of audio) only when you use it.
AmazonBedrockFullAccess managed policy attached to your IAM user or role.us-east-1 (N. Virginia) or us-west-2 (Oregon) for the widest model selection.aws configure for programmatic access. Alternatively, use the Console Playground for your first test.- ✓Step 1: Access the Bedrock Console
- ✓Step 2: Request Model Access
- ✓Step 3: Make Your First API Call
- ✓Step 4: Set Up Knowledge Bases
- ✓Step 5: Build a Bedrock Agent
- ✓Step 6: Configure Guardrails
- ✓Step 7: Optimize for Production
Step 1: Accessing the Bedrock Console
The Bedrock console is your command center for model management, testing, and monitoring. Here is how to get there.
- Sign in to the AWS Management Console with your IAM credentials.
- In the search bar at the top, type Bedrock and select Amazon Bedrock from the results.
- Confirm your region in the top-right dropdown. Select
us-east-1(N. Virginia) orus-west-2(Oregon) for the broadest model catalog. - You will land on the Bedrock dashboard showing model providers, usage metrics, and quick-start links.
The left navigation panel organizes Bedrock's features: Model catalog (browse and enable models), Playgrounds (test models interactively), Knowledge bases, Agents, Guardrails, Flows (visual pipeline builder), and Custom models (fine-tuning and distillation).
Verification: You should see the Bedrock dashboard with a left sidebar listing "Model catalog," "Playgrounds," "Knowledge bases," and "Agents." If you see an "access denied" error, check that your IAM user or role has the AmazonBedrockFullAccess policy attached. Go to IAM > Users > your user > Permissions to verify.
Step 2: Requesting Model Access
Bedrock does not enable all models by default. Each model provider requires you to request access before you can invoke it. This is a one-time step per model, per region.
- In the Bedrock console, click Model catalog in the left sidebar.
- Browse available providers: Amazon (Nova family), Anthropic (Claude), Meta (Llama), Mistral AI, AI21 Labs, Cohere, DeepSeek, Stability AI, and others.
- Click on the model you want to enable. For this guide, start with Amazon Nova Micro (cheapest at $0.035/1M input tokens) and Anthropic Claude 3.5 Haiku (fast, cost-effective at $0.25/1M input tokens).
- Click Request model access. Most Amazon and Meta models are approved instantly. Anthropic models may take a few minutes.
- Repeat for any additional models you want to test.
The Bedrock Marketplace (accessible from the model catalog) adds another 100+ models beyond the core catalog, including specialized models for code generation, medical text, and domain-specific tasks.
Verification: Navigate to Model catalog and filter by "Access granted." You should see at least one model with a green "Access granted" status. If a model shows "In progress," wait 5-10 minutes and refresh. If it stays pending beyond 30 minutes, check your account's service quotas in the AWS Service Quotas console.
Step 3: Making Your First API Call
Bedrock offers two paths for your first interaction: the Console Playground (no code required) and the AWS SDK (programmatic access). Start with the Playground to verify everything works, then move to code.
Option A: Console Playground (Fastest)
- In the left sidebar, click Playgrounds > Chat.
- Select a model from the dropdown. Choose Amazon Nova Micro for a quick, cheap test.
- Type a prompt: "Explain Amazon Bedrock in three sentences for a developer who has never used it."
- Click Run. The response appears in the output panel within seconds.
- Experiment with the parameters panel on the right: Temperature (0 = deterministic, 1 = creative), Max tokens (response length), and Top P (nucleus sampling).
Option B: AWS CLI
With the AWS CLI v2 installed and configured, run this command to invoke the Converse API (AWS's unified format that works across all Bedrock models):
aws bedrock-runtime converse --model-id amazon.nova-micro-v1:0 --messages '[{"role":"user","content":[{"text":"What is Amazon Bedrock?"}]}]' --region us-east-1
The response includes the model's output text, token usage counts, and a stop reason. You pay only for the tokens consumed by this call.
Option C: Python SDK (boto3)
Bedrock supports five API formats: Converse API (unified, recommended), Invoke API (provider-native), Messages API (Anthropic-native for Claude), Chat Completions API (OpenAI-compatible), and Responses API (newer OpenAI patterns). The Converse API is the best starting point because it works with every model without format changes.
Verification: Whether you used the Playground, CLI, or SDK, you should have received a coherent text response. Check the response metadata for inputTokens and outputTokens counts. Your first call with Nova Micro should cost less than $0.001. Confirm billing is tracking by checking AWS Cost Explorer > filter by "Amazon Bedrock" (costs may take 24 hours to appear).
Step 4: Setting Up Knowledge Bases (Managed RAG)
Knowledge Bases give your models access to your own data through retrieval-augmented generation (RAG). Instead of the model relying only on its training data, it searches your documents and grounds its answers in your specific content. Every response includes source attribution so you can trace answers back to the original document.
- Navigate to Knowledge bases in the left sidebar and click Create knowledge base.
- Name your knowledge base and select an IAM role (Bedrock can create one for you).
- Choose a data source. Bedrock supports S3 buckets, Confluence, Salesforce, SharePoint, and web crawlers.
- For a quick start, create an S3 bucket, upload 5-10 PDF or text documents, and select that bucket as your data source.
- Configure the chunking strategy. The default (fixed-size chunks) works for most use cases. Advanced options include semantic chunking and hierarchical chunking.
- Select a vector store. Bedrock can create a managed OpenSearch Serverless collection automatically, or you can connect Aurora PostgreSQL, MongoDB Atlas, Pinecone, or Redis.
- Click Create and wait for the sync to complete. Bedrock handles chunking, embedding, and vector storage automatically.
- Once synced, test it in the Knowledge Base Playground: ask a question about your documents and verify the response cites specific sources.
Verification: In the Knowledge Base Playground, ask a question that can only be answered from your uploaded documents (not from general knowledge). The response should include a "Sources" section listing the specific document and page where the answer was found. If the response says "I don't have information about that," check that the sync completed successfully and your documents contain relevant content.
Step 5: Building a Bedrock Agent
Bedrock Agents handle multi-step task automation with tool use, code interpretation, and memory retention across sessions. An agent can break down a complex request into sub-tasks, call external APIs, query Knowledge Bases, and synthesize a final answer.
- Navigate to Agents in the left sidebar and click Create Agent.
- Name your agent and write clear instructions describing its role and behavior. Example: "You are a customer support agent for [company]. Answer questions about our products using the product knowledge base. If the customer needs to place an order, collect their email and product selection."
- Select a foundation model. Claude 3.5 Haiku ($0.25/1M input tokens) is a strong default for agents that need fast responses. Nova Pro ($0.80/1M input tokens) works well for agents that need multimodal understanding.
- Attach your Knowledge Base from Step 4 so the agent can search your documents.
- Define action groups: these are the tools your agent can use. Each action group maps to an API schema (OpenAPI format) or a Lambda function. For a basic agent, start with just the Knowledge Base. Add action groups later as you need external integrations.
- Enable Code Interpreter if your agent needs to run calculations, generate charts, or process data files.
- Click Prepare to compile the agent, then test it in the Agent Playground.
For complex workflows, multi-agent collaboration lets you chain specialized agents together. AgentCore, which reached general availability in late 2025, extends this further with its own runtime, gateway, memory, identity management, and policy engine. AgentCore works with any framework (LangChain, CrewAI, custom) and any model, not just Bedrock models.
Verification: In the Agent Playground, ask your agent a question that requires searching the Knowledge Base. The agent should show its "thought process" (the chain of reasoning) before delivering the answer. Check the trace to confirm it queried the Knowledge Base and retrieved relevant documents. If the agent gives generic answers without citing your documents, verify the Knowledge Base is correctly attached in the agent configuration.
Agents follow their system instructions literally. If you write "help users," expect generic responses. Be specific: define the agent's role, data sources, tone, and what it should refuse to answer.
An agent without a Knowledge Base attached can only use its training data. Always connect your document store before testing - otherwise the agent hallucinates answers instead of citing your sources.
Agents can generate any content the underlying model supports. Attach Guardrails (Step 6) before exposing an agent to end users. Content filtering and PII redaction are not enabled by default.
Step 6: Configuring Guardrails
Guardrails add a safety layer between your users and the model. AWS reports that Bedrock Guardrails block 88% of harmful content (AWS documentation). Six configurable safety policies let you control exactly what gets through.
- Navigate to Guardrails in the left sidebar and click Create guardrail.
- Name your guardrail and add a description.
- Configure Content filters: set sensitivity thresholds (None, Low, Medium, High) for hate speech, insults, sexual content, violence, and misconduct. Start with "High" filtering across all categories.
- Add Denied topics: describe topics your application should refuse to discuss. Example: "Investment advice," "Medical diagnosis," "Competitor product recommendations."
- Enable PII detection and redaction: Bedrock can automatically detect and mask personal information including names, email addresses, phone numbers, Social Security numbers, and credit card numbers.
- Configure Contextual grounding checks: these verify that the model's response is grounded in the provided context (from Knowledge Bases) and is relevant to the user's question. This reduces hallucination.
- Optionally enable Automated Reasoning: this uses formal logic to achieve 99% accuracy on factual validation (AWS documentation). It is the strongest anti-hallucination control available.
- Click Create, then attach the guardrail to your model invocations or agents.
Two tiers are available: Standard (newer, recommended) and Classic. The ApplyGuardrail API lets you use Bedrock Guardrails with models running outside Bedrock, so you can add safety filtering even if your inference runs on SageMaker, self-hosted models, or third-party APIs. For a full walkthrough of every policy type and configuration option, see our Bedrock Guardrails deep dive.
Verification: In the Playground, invoke a model with your guardrail attached and deliberately test the boundaries. Try: "Write a hateful message about [group]." The response should be blocked with a guardrail intervention message. Then try a legitimate question to confirm the guardrail does not over-block normal usage. Check the guardrail metrics in the Bedrock console to see blocked request counts and filter trigger rates.
Step 7: Optimizing for Production
Once your prototype works, these optimizations reduce cost and latency before you go live. Robinhood achieved an 80% cost reduction after scaling from 500 million to 5 billion tokens per day on Bedrock in 6 months (AWS case study).
Prompt Caching
If your prompts share a common prefix (system instructions, few-shot examples), prompt caching with 5-minute and 1-hour TTL windows reduces costs by up to 90% and latency by 85% (AWS Bedrock Prompt Caching documentation). Enable it by adding the cacheConfig parameter to your Converse API calls.
Intelligent Prompt Routing
This feature automatically selects the most cost-effective model for each request based on complexity. AWS claims up to 30% cost reduction from routing alone (AWS Bedrock Intelligent Prompt Routing documentation). Enable it in the Bedrock console under inference settings.
Model Distillation
Compress a large model's capabilities into a smaller, cheaper one. AWS claims up to 500% faster inference and 75% lower cost with distilled models (AWS Bedrock Model Distillation documentation). Useful when you have validated that a smaller model can handle your specific use case with acceptable quality.
Batch Inference
For non-real-time workloads (document processing, bulk classification, data extraction), batch inference runs at 50% off on-demand pricing. Submit a batch job through the Bedrock console or API and retrieve results when complete.
Service Tiers
Bedrock offers four pricing tiers: Standard (pay-per-token, no commitment), Flex (50% off Standard for async-friendly workloads), Priority (75% premium for guaranteed high throughput and lowest latency), and Reserved (custom term commitment for dedicated capacity). Start on Standard, move to Flex for batch workloads, and evaluate Reserved only when your monthly spend justifies the commitment.
Verification: Enable prompt caching on a test call that reuses a system prompt. Compare the response latency and token billing between the cached and uncached call. The cached call should show a cacheReadInputTokens field in the response metadata and significantly lower latency. Check AWS Cost Explorer after 24 hours to confirm the cost difference.
Troubleshooting and FAQ
AmazonBedrockFullAccess policy. Check IAM > Users > Permissions. For more granular control, use the bedrock:InvokeModel permission scoped to specific model ARNs.s3:GetObject permission on your bucket. (2) Documents are in unsupported formats (Bedrock supports PDF, TXT, MD, HTML, DOC/DOCX, CSV, XLS/XLSX). (3) The S3 bucket is in a different region than your Knowledge Base. All resources must be in the same region.Next Step
Build a retrieval-augmented generation workflow end to end. Upload your organization's internal documentation to a Knowledge Base, attach it to a Bedrock Agent with Guardrails enabled, and test it with real questions from your team. Measure response accuracy against your documents and compare costs across Nova Micro, Claude Haiku, and Llama models to find the price-performance sweet spot for your use case. For the full platform analysis, see our What Is Amazon Bedrock breakdown.
Amazon Bedrock processes your inputs on AWS infrastructure. AWS states that your data is not used to train or improve base models, and data is not shared with model providers. Data stays within your AWS account and selected region unless cross-region inference is enabled. Bedrock supports VPC endpoints, PrivateLink, and customer-managed KMS keys for encryption. Review AWS's data privacy practices before processing sensitive information.
AI systems are tools, not decision-makers. Foundation models can produce plausible-sounding but incorrect outputs. If you or someone you know is in crisis:
- 988 Suicide & Crisis Lifeline — Call or text 988 (US)
- SAMHSA Helpline — 1-800-662-4357
- Crisis Text Line — Text HOME to 741741
For medical, legal, financial, or safety-critical decisions, always consult a qualified professional. Use Bedrock Guardrails to prevent AI systems from providing advice in restricted domains.
Under GDPR and CCPA, you have rights to access, correct, and delete your data. AWS provides data processing agreements and compliance certifications (SOC, ISO, HIPAA eligibility, FedRAMP). Tech Jacks Solutions maintains editorial independence. This article was not sponsored or reviewed by Amazon Web Services. Tech Jacks Solutions may earn referral fees from links to vendor products. These fees never influence editorial recommendations. The EU AI Act applies to high-risk AI deployments, including foundation model platforms used in regulated industries.
Amazon Web Services, AWS, Amazon Bedrock, Amazon Nova, and related marks are trademarks of Amazon.com, Inc. Anthropic and Claude are trademarks of Anthropic, PBC. Meta and Llama are trademarks of Meta Platforms, Inc. All other trademarks belong to their respective owners. This article is an independent editorial product of Tech Jacks Solutions and is not affiliated with, sponsored by, or endorsed by Amazon Web Services.