

Cloud Agent Platforms

AWS Bedrock vs. Google ADK vs. Azure AI Agent Service

01 // Context: The Platform Decision [Strategic]

Building an AI agent is one problem. Running it in production is a different one entirely. The gap between a prototype that works on your laptop and an enterprise-grade agent handling thousands of requests per day is filled with infrastructure decisions: model hosting, tool orchestration, guardrails enforcement, memory management, identity controls, and observability. If you are still getting oriented on what agentic AI is and how it differs from traditional AI, start there before diving into platform selection. Cloud platforms exist to close that gap.

AWS, Google Cloud, and Microsoft Azure have each built distinct agentic AI platforms reflecting fundamentally different philosophies. Amazon Bedrock takes a managed-service approach where agents are configured rather than coded, with guardrails and multi-agent orchestration built into the platform. Google's Agent Development Kit (ADK) leads with an open-source, framework-agnostic developer experience optimized for Gemini but compatible with any model. Azure AI Foundry Agent Service converges Semantic Kernel and AutoGen into a unified runtime that connects natively to the Microsoft 365 ecosystem.

The right choice depends less on raw feature counts and more on your existing cloud investment, your team's technical profile, and where your enterprise data already lives. A company running its stack on AWS with data in S3 and DynamoDB has a materially different optimal path than one embedded in Microsoft 365 with SharePoint and Teams. This article maps each platform's architecture, strengths, and tradeoffs so you can make that decision with full context. For the latest platform announcements, see the AI News Hub.

🌎 Market: 70K+ organizations using Azure AI Foundry
⚠️ Reality check: 95% of enterprise AI projects fail to deliver on their promises (McKinsey, State of AI)
📈 Growth: 44% projected CAGR for low-code agent platforms (Precedence Research)
02 // AWS: Amazon Bedrock Agents [GA]

Amazon Bedrock is AWS's fully managed service for building generative AI applications. Bedrock Agents extend this with a managed orchestration layer that lets you define agents through the console or API without writing orchestration code yourself. The core mental model: you declare what an agent should do and connect it to the tools and knowledge it needs. Bedrock handles the reasoning loop, prompt management, and execution.

The architecture has four primary building blocks. Action Groups define what an agent can do by connecting to AWS Lambda functions via OpenAPI schemas. When the agent decides, via tool calling, that it needs to call an external system, check a database, or trigger a workflow, it routes the request through an action group. Knowledge Bases provide the agent's memory of facts via Retrieval-Augmented Generation (RAG). You point a knowledge base at documents in S3, and Bedrock handles chunking, embedding, and vector storage with semantic search plus optional reranking. Guardrails enforce safety boundaries. Memory gives agents session persistence and long-term context retention across conversations.
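As an illustration of how these pieces are declared, the sketch below builds the request payloads for creating an agent and attaching an action group, roughly the shape boto3's `bedrock-agent` client expects. The agent name, model ID, role ARN, Lambda ARN, and bucket names are hypothetical placeholders, and field names should be checked against the current AWS API reference rather than taken as authoritative.

```python
def build_agent_request(name: str, model_id: str, instruction: str, role_arn: str) -> dict:
    """Payload in the shape of bedrock-agent's create_agent call."""
    return {
        "agentName": name,
        "foundationModel": model_id,
        "instruction": instruction,
        "agentResourceRoleArn": role_arn,
    }

def build_action_group_request(agent_id: str, lambda_arn: str, schema_s3: dict) -> dict:
    """Payload in the shape of create_agent_action_group: the OpenAPI schema
    stored in S3 tells the agent which operations the Lambda function exposes."""
    return {
        "agentId": agent_id,
        "agentVersion": "DRAFT",
        "actionGroupName": "order-lookup",
        "actionGroupExecutor": {"lambda": lambda_arn},
        "apiSchema": {"s3": schema_s3},
    }

agent_req = build_agent_request(
    "OrderAgent",  # hypothetical agent
    "anthropic.claude-3-5-sonnet-20240620-v1:0",
    "Answer customer questions about order status.",
    "arn:aws:iam::123456789012:role/BedrockAgentRole",  # placeholder role
)
action_req = build_action_group_request(
    "AGENT123",
    "arn:aws:lambda:us-east-1:123456789012:function:order-lookup",  # placeholder
    {"s3BucketName": "my-schemas", "s3ObjectKey": "order-api.yaml"},
)
# A real deployment would pass these to boto3.client("bedrock-agent")
# via create_agent(**agent_req) and create_agent_action_group(**action_req).
```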

Multi-agent collaboration, now generally available, is where Bedrock's managed approach shows its strength. A supervisor agent orchestrates specialized collaborator agents, each with its own action groups, knowledge bases, and guardrails. Bedrock offers two modes: supervisor mode, where the supervisor analyzes requests, decomposes them into subtasks, and coordinates collaborators; and supervisor with routing mode, which routes simple requests directly to the right collaborator while falling back to full orchestration for complex queries. Collaborator agents execute in parallel, each with non-overlapping responsibilities.
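The routing-versus-supervision distinction is easy to see in a toy sketch. This is plain Python with invented collaborator names, not AWS code: a request matching exactly one collaborator's domain is delegated directly, while anything broader is decomposed across collaborators.

```python
def route_or_supervise(request: str, collaborators: dict) -> list:
    """Return the (collaborator, subtask) calls a supervisor would make."""
    matches = [name for name, keywords in collaborators.items()
               if any(k in request.lower() for k in keywords)]
    if len(matches) == 1:
        # Routing mode: one matching domain -> direct delegation.
        return [(matches[0], request)]
    # Supervisor mode: decompose across every relevant collaborator
    # (or all of them when no keyword matched).
    return [(name, f"{name} subtask for: {request}") for name in (matches or collaborators)]

collaborators = {
    "billing": ["invoice", "refund"],
    "shipping": ["delivery", "tracking"],
}
print(route_or_supervise("Where is my delivery?", collaborators))
# -> [('shipping', 'Where is my delivery?')]
print(len(route_or_supervise("Refund me and fix the tracking", collaborators)))
# -> 2
```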

Action Groups
Define agent capabilities via OpenAPI schemas connected to Lambda functions. Support for APIs, database queries, and any custom business logic. The agent reasons about when and how to invoke each action.
Knowledge Bases
Managed RAG with vector indexing, semantic search with cosine similarity, hybrid keyword matching, and optional reranking. Connects to S3, Confluence, SharePoint, and web crawlers.
Guardrails
Six safeguard policies: content filters (hate, insults, sexual, violence, misconduct), denied topics, word filters, PII detection and redaction, prompt attack detection, and Automated Reasoning checks for hallucination reduction.
Multi-Agent
Supervisor + collaborators. Each agent maintains its own tools, knowledge, and guardrails. Supervisor mode for complex decomposition, routing mode for direct delegation. Parallel execution with clear domain boundaries.
Model Selection
Access to Anthropic Claude, Meta Llama, Mistral, Amazon Nova, Cohere, and AI21 through a unified API. Amazon Nova models highlighted for cost-effective agent workloads. Switch models without changing agent logic.

Bedrock Guardrails deserves specific attention. The six safeguard policies cover content filtering, denied topics, word filtering, PII detection and redaction, prompt attack detection, and hallucination reduction via Automated Reasoning checks. PII handling supports three modes: block the content entirely, anonymize it by masking detected PII, or pass it through while returning detection metadata. You can configure guardrails at the agent level, meaning each collaborator in a multi-agent system can have its own safety boundary appropriate to its domain.
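A toy version of the three PII behaviors makes the tradeoff concrete. The regex, mode names, and refusal message below are illustrative only, not the Bedrock Guardrails API.

```python
import re

EMAIL = re.compile(r"[\w.+-]+@[\w-]+\.[\w.]+")  # illustrative, not exhaustive

def apply_pii_guardrail(text: str, mode: str):
    """Return (output_text, detected_pii) under one of three modes."""
    found = EMAIL.findall(text)
    if not found:
        return text, []
    if mode == "BLOCK":
        return "Sorry, I can't process messages containing PII.", found
    if mode == "ANONYMIZE":
        return EMAIL.sub("{EMAIL}", text), found
    return text, found  # detect-only: pass through, surface metadata

masked, hits = apply_pii_guardrail("Contact jane@example.com for access", "ANONYMIZE")
print(masked)  # Contact {EMAIL} for access
print(hits)    # ['jane@example.com']
```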

Best for: Teams already invested in AWS infrastructure who want a managed, configuration-driven approach. Bedrock's strength is operational simplicity. You trade flexibility for a platform that handles orchestration, memory, and safety without requiring you to build those layers yourself. The deep integration with Lambda, S3, DynamoDB, and Step Functions means agent tools can leverage your existing AWS services directly.

03 // Google Cloud: Agent Development Kit (ADK) [GA]

Google's approach to agent infrastructure splits into two complementary layers. The Agent Development Kit (ADK) is an open-source framework for building agents, while Vertex AI Agent Builder provides the managed cloud runtime for deploying them. ADK is designed to make "agent development feel more like software development," and the Python SDK reached its stable 1.0 release in 2025 with Java and Go support following shortly after.

ADK supports three categories of agents. LLM Agents use language models for dynamic routing and decision-making. Workflow Agents provide predictable execution through sequential, parallel, and loop patterns. Custom Agents allow developers to build entirely bespoke implementations. The framework is model-agnostic by design: while optimized for Gemini, ADK works with Claude, Vertex AI hosted LLMs, and open-source models through Ollama, vLLM, and LiteLLM connectors.
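The control flow behind Workflow Agents is easy to mimic in plain Python. The sketch below imitates only the sequential and loop patterns; ADK's real classes (LlmAgent, SequentialAgent, ParallelAgent, LoopAgent) wrap model calls and session state that this toy omits.

```python
def sequential(steps, state):
    """Run sub-agents one after another, threading state through."""
    for step in steps:
        state = step(state)
    return state

def loop_until(step, state, done, max_iters=10):
    """Re-run a sub-agent until a termination check passes."""
    for _ in range(max_iters):
        if done(state):
            break
        state = step(state)
    return state

# Each lambda stands in for an agent that transforms shared state.
draft = sequential(
    [lambda s: s + " researched.", lambda s: s + " drafted."], "Topic:")
polished = loop_until(
    lambda s: s + "!", draft, done=lambda s: s.endswith("!!"), max_iters=5)
print(polished)  # Topic: researched. drafted.!!
```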

What sets Google's platform apart is the Agent-to-Agent (A2A) protocol. This open standard enables agents to communicate and collaborate regardless of which framework or vendor built them. Agents advertise their capabilities via "Agent Cards" (digital business cards) and establish secure, dynamic collaboration. A2A is framework-agnostic by specification: agents built with ADK, LangGraph, CrewAI, or any other framework can interoperate. Version 0.2 of the protocol added stateless interaction support and standardized authentication schemes based on OpenAPI-like patterns.
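A minimal sketch of A2A-style discovery: agents publish cards describing their skills, and a caller selects a peer by capability. The field names below follow the spirit of the spec rather than its exact schema, and the agent names and URLs are invented.

```python
AGENT_CARDS = [
    {"name": "invoice-agent", "url": "https://agents.example.com/invoice",
     "skills": ["create_invoice", "refund"]},
    {"name": "travel-agent", "url": "https://agents.example.com/travel",
     "skills": ["book_flight", "book_hotel"]},
]

def discover(skill: str, cards=AGENT_CARDS):
    """Return the card of the first agent advertising the requested skill."""
    return next((c for c in cards if skill in c["skills"]), None)

print(discover("book_hotel")["name"])  # travel-agent
print(discover("summarize"))           # None
```

In a real A2A deployment the cards would be fetched over HTTP from each agent's well-known endpoint rather than held in a local list, and authentication would be negotiated per the protocol's schemes.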

Agent Types
LLM Agents for dynamic reasoning, Workflow Agents for sequential/parallel/loop patterns, and Custom Agents for bespoke logic. Multi-agent hierarchies with delegation and coordination built in.
Tool Ecosystem
Pre-built tools (Search, Code Execution), custom functions, OpenAPI integration, MCP tools, Cloud API Registry for private tool governance, and the ability to use other agents as tools.
A2A Protocol
Open standard for inter-agent communication. Agent Cards advertise capabilities. Framework-agnostic: works across ADK, LangGraph, CrewAI, and others. V0.2 adds stateless mode and standardized auth.
Deployment
Vertex AI Agent Engine for fully managed production runtime. Also supports Cloud Run, GKE, and custom infrastructure. Single-command deployment from ADK to production.
Security & Governance
Agent identities tied to Cloud IAM. Model Armor blocks prompt injection attacks. Cloud API Registry acts as a private registry for tool governance. Built-in evaluation for response quality and execution trajectory assessment.

The deployment story is flexible. Vertex AI Agent Engine provides a fully managed runtime with built-in testing, release management, and global-scale reliability. But you're not locked in: ADK agents can also deploy to Cloud Run containers, Google Kubernetes Engine, or entirely custom infrastructure. Google's ADK documentation demonstrates basic agent creation in under 100 lines of Python, though production deployments require additional infrastructure.

State management received significant attention in recent releases. ADK now supports native failure recovery (restoring agent state automatically), human-in-the-loop pausing at any point in execution, and persistent session memory. Grounding capabilities connect agents to Google Search and Vertex AI Search for real-time information retrieval.
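The failure-recovery behavior can be pictured as checkpointed session state: snapshot before each step, roll back when a step fails. This is a toy sketch, not ADK's actual session API.

```python
import copy

class Session:
    """Checkpointed agent session: snapshot before each step, restore on failure."""

    def __init__(self):
        self.state = {"step": 0, "notes": []}
        self._checkpoints = []

    def checkpoint(self):
        self._checkpoints.append(copy.deepcopy(self.state))

    def restore(self):
        if self._checkpoints:
            self.state = self._checkpoints.pop()

    def run_step(self, note, fail=False):
        self.checkpoint()
        self.state["step"] += 1
        if fail:
            self.restore()  # roll back to the pre-step snapshot
            return "recovered"
        self.state["notes"].append(note)
        return "ok"

s = Session()
s.run_step("searched the web")
s.run_step("called a flaky tool", fail=True)
print(s.state)  # {'step': 1, 'notes': ['searched the web']}
```

The same checkpoint hook is the natural place to pause for a human-in-the-loop approval: persist the snapshot, wait for sign-off, then resume from the saved state.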

Best for: Developer teams that value framework flexibility and open standards. ADK's model-agnostic design and the A2A protocol make it the strongest choice for organizations building multi-vendor agent ecosystems or planning to avoid lock-in. The Gemini optimization gives a performance edge if you're using Google's models, but the ability to swap in Claude or Llama without rewriting agent logic is a genuine differentiator. Strong fit for teams already on GCP or those building agents that need to interoperate across organizational boundaries.

04 // Microsoft Azure: Azure AI Foundry Agent Service [GA]

Microsoft's agent strategy is the broadest of the three hyperscalers, spanning developer SDKs, low-code platforms, and end-user products. Azure AI Foundry Agent Service, now generally available, is the production runtime. The Microsoft Agent Framework is the open-source SDK layer. And Copilot Studio provides the low-code experience for business users. Understanding how these three pieces fit together is essential to evaluating the Azure option.

The Microsoft Agent Framework converges two previously separate projects: Semantic Kernel, Microsoft's enterprise-grade orchestration SDK, and AutoGen, a research-origin multi-agent framework. The result is a single API surface for defining, chaining, and managing both single-agent and multi-agent workflows. Semantic Kernel provides the modular, production-ready architecture (plugins, planners, memory connectors), while AutoGen contributes dynamic orchestration patterns including the Magentic-One multi-agent pattern. Production-ready SDKs ship across Python, JavaScript, Java, and .NET.

Foundry Agent Service runs on top of this framework with enterprise-grade additions. Connected Agents enable point-to-point interactions where agents call other agents as tools for specialized tasks. Multi-Agent Workflows coordinate long-running, stateful processes with automated error handling and recovery. The tool ecosystem is deep: over 1,400 Azure Logic Apps workflows are available for agent automation, plus native SharePoint integration, Microsoft Fabric connectors, and Bing Search. Third-party integrations from partners including LexisNexis, Morningstar, and Celonis provide domain-specific capabilities.
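The Connected Agents shape, one agent exposing another as a callable tool, can be sketched in a few lines. The class, agent, and tool names here are invented for illustration and do not reflect the Foundry SDK surface.

```python
class Agent:
    """Toy agent: a named handler with a dict of callable tools."""

    def __init__(self, name, tools=None, handler=None):
        self.name, self.tools = name, tools or {}
        self.handler = handler or (lambda req, tools: f"{name} handled: {req}")

    def run(self, request: str) -> str:
        return self.handler(request, self.tools)

    def as_tool(self):
        return self.run  # another agent can invoke this like any tool

specialist = Agent("contracts-agent")

def main_handler(request, tools):
    if "contract" in request:
        return tools["contracts"](request)  # delegate to the connected agent
    return f"main-agent handled: {request}"

main = Agent("main-agent", tools={"contracts": specialist.as_tool()},
             handler=main_handler)
print(main.run("review this contract"))  # contracts-agent handled: review this contract
```

In Foundry's real Connected Agents, the delegation decision is made by the model rather than a keyword check, and the point-to-point call crosses a managed runtime boundary.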

Agent Framework
Semantic Kernel + AutoGen converged. Single API surface for single-agent and multi-agent workflows. SDKs in Python, JavaScript, Java, .NET. MCP and OpenAPI support for tool connections.
Copilot Studio
Low-code agent builder for business users. One-click publishing to Microsoft Teams and Microsoft 365 Copilot. Multi-agent orchestration with enterprise compliance and governance.
Tool Ecosystem
1,400+ Logic Apps workflows. Native SharePoint, Microsoft Fabric, Bing Search integration. Partner tools from LexisNexis, Morningstar, Celonis. A2A API for cross-cloud orchestration.
Security & Governance
Responsible AI built in: task adherence monitoring, prompt shields with spotlighting against injection, PII detection. Private networking with BYO VNet. Azure RBAC integration. Content filtering on inputs, outputs, tool calls, and tool responses.
Observability
OpenTelemetry-based tracing across frameworks (including LangChain, LangGraph, OpenAI Agents SDK). Built-in evaluation metrics for tool accuracy and task adherence. Azure Monitor dashboards for response times and completion rates. Continuous red teaming capabilities.

The Copilot Studio angle is where Microsoft differentiates most sharply. While AWS and Google focus primarily on developers, Copilot Studio puts agent creation in the hands of business users with a visual, low-code interface. Agents built in Copilot Studio can be published directly to Teams and Microsoft 365 Copilot with one click. For organizations already deep in the Microsoft ecosystem, this removes significant deployment friction. It also means the same agent can be authored visually in Copilot Studio or programmatically via the Agent Framework SDK, and deployed through the same Foundry Agent Service runtime.

Observability is a notable strength. Microsoft contributed OpenTelemetry instrumentation that provides unified tracing across not just their own frameworks but also LangChain, LangGraph, and OpenAI Agents SDK. Combined with Azure Monitor integration and built-in evaluation metrics for tool accuracy and task adherence, the monitoring story is more mature than what AWS or Google currently offer natively.
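The span-based model that OpenTelemetry instrumentation applies to agents can be imitated with a tiny stdlib recorder: every reasoning step and tool call becomes a timed, attributed span nested inside its parent. This is a toy stand-in, not the OpenTelemetry SDK.

```python
import time
from contextlib import contextmanager

TRACE = []  # completed spans, innermost first (children close before parents)

@contextmanager
def span(name: str, **attrs):
    """Record a named, attributed, timed span around a block of work."""
    start = time.perf_counter()
    try:
        yield
    finally:
        TRACE.append({"name": name, "attrs": attrs,
                      "ms": (time.perf_counter() - start) * 1000})

with span("agent.invoke", model="gpt-4o"):
    with span("tool.call", tool="search"):
        time.sleep(0.01)  # stand-in for real tool latency

print([s["name"] for s in TRACE])  # ['tool.call', 'agent.invoke']
```

The real SDK adds trace/span IDs, context propagation across process boundaries, and exporters to backends like Azure Monitor; the recorded shape (named, nested, attributed, timed) is the same.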

Best for: Enterprises embedded in the Microsoft ecosystem. If your organization runs on Microsoft 365, SharePoint, and Teams, the native integrations create a path of least resistance that the other platforms cannot replicate. The dual-track approach (Copilot Studio for business users, Agent Framework for developers) also makes Azure the strongest choice for organizations that need both technical and citizen-developer agent creation. The Semantic Kernel + AutoGen convergence gives production-grade multi-agent orchestration with the widest SDK language support across the three platforms.

05 // Comparison: Head-to-Head Matrix

Feature lists are necessary but insufficient. The table below maps each platform across the dimensions that matter most for enterprise agent deployment. Use it as a reference, not a verdict. The "best" platform depends on context that no comparison table can capture: your existing cloud spend, your team's language preferences, your compliance requirements, and where your data lives.

Dimension | AWS Bedrock | Google ADK | Azure AI Foundry
Approach | Managed service, configuration-driven | Open-source framework, code-first | Converged SDK + managed runtime + low-code
SDK Languages | Python, Java, JavaScript (AWS SDK) | Python, Java, Go (TypeScript in progress) | Python, JavaScript, Java, .NET
Model Access | Claude, Llama, Mistral, Nova, Cohere, AI21 | Gemini, Claude, Llama (via Vertex AI, Ollama, vLLM) | GPT-4o, GPT-4, Llama, Mistral, Phi (via Azure OpenAI)
Multi-Agent | Supervisor + collaborator (GA) | Hierarchical composition, A2A cross-framework | Connected Agents, Multi-Agent Workflows
Guardrails | 6 safeguard policies, Automated Reasoning | Model Armor, Cloud IAM agent identities | Content filters, prompt shields, PII detection, task adherence
Tool Integration | Action Groups (Lambda + OpenAPI) | OpenAPI, MCP, Cloud API Registry, agents-as-tools | 1,400+ Logic Apps, SharePoint, Fabric, A2A
Knowledge/RAG | Managed RAG with vector search + reranking | Vertex AI Search, Google Search grounding | Azure AI Search, SharePoint, Cosmos DB
Observability | CloudWatch, step-by-step tracing | Built-in evaluation, trajectory assessment | OpenTelemetry, Azure Monitor, cross-framework tracing
Low-Code Option | Console-based agent builder | Vertex AI Agent Builder console | Copilot Studio (full visual builder)
Open Standards | OpenAPI for actions | A2A protocol, MCP, model-agnostic | MCP, OpenAPI, A2A API
Lock-In Risk | High (Action Groups tied to Lambda) | Low (open-source, multi-deployment) | Medium (framework portable, integrations Microsoft-specific)
Cross-Reference

These platforms host the agent frameworks we compared separately. LangChain and LlamaIndex run on all three. AutoGen is now part of Azure's Agent Framework. The Model Context Protocol (MCP) is supported by both Google ADK and Azure as a tool integration standard.

06 // Guidance: Decision Framework [Actionable]

Choosing a cloud agent platform is a downstream consequence of two upstream decisions: where your data lives and what your team knows. If you're starting from scratch with no cloud commitment, the technical comparison matters most. If you're an enterprise with years of investment in one cloud provider, the switching costs almost certainly outweigh any feature delta between platforms. Here's the decision logic distilled. Teams looking to build or validate cloud platform expertise should explore relevant IT certifications across AWS, Google Cloud, and Azure.

☁️
Choose AWS Bedrock
Managed Simplicity
  • Your data is in S3 and DynamoDB
  • You want configuration over code
  • You need multi-model flexibility (Claude + Llama + Nova)
  • Your workloads run on Lambda
  • Guardrails enforcement is a top priority
  • You prefer fully managed infrastructure
🔥
Choose Google ADK
Open & Interoperable
  • You want framework-agnostic, open-source tooling
  • Cross-organization agent collaboration matters (A2A)
  • Your team prefers code-first development
  • You use or plan to use Gemini models
  • Avoiding vendor lock-in is a strategic priority
  • You need multi-cloud deployment flexibility
💼
Choose Azure AI Foundry
Enterprise Ecosystem
  • Your org runs on Microsoft 365, SharePoint, Teams
  • You need both pro-code and low-code tracks
  • Your developers use .NET or TypeScript
  • Copilot Studio adoption is planned or in progress
  • Cross-framework observability is important
  • You want the broadest SDK language support

A few patterns cut across all three platforms:
  • Start with a single agent before going multi-agent. Every platform supports multi-agent orchestration, but the complexity jump is significant. Get one agent working reliably with the right tools and guardrails before introducing a supervisor.
  • Guardrails are not optional. All three platforms provide safety mechanisms, but they're only as good as the threat model informing their configuration (track emerging threats on the Security News Hub).
  • Test with production-representative load. Agent behavior under high concurrency is materially different from single-request testing, especially for multi-agent systems where coordination overhead scales non-linearly.
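A minimal load probe for the last point might look like the following sketch, where `call_agent` is a stub standing in for your real client and the latencies are simulated rather than measured against a live endpoint.

```python
import asyncio
import random

async def call_agent(prompt: str) -> float:
    """Stub for an agent call; replace with your real client invocation."""
    latency = random.uniform(0.01, 0.05)  # simulated response time (seconds)
    await asyncio.sleep(latency)
    return latency

async def load_test(concurrency: int = 50) -> dict:
    """Fire `concurrency` requests at once and report tail latency."""
    latencies = sorted(await asyncio.gather(
        *(call_agent(f"request {i}") for i in range(concurrency))))
    return {"p50": latencies[len(latencies) // 2],
            "p95": latencies[int(len(latencies) * 0.95)]}

stats = asyncio.run(load_test())
print(f"p50={stats['p50'] * 1000:.0f}ms  p95={stats['p95'] * 1000:.0f}ms")
```

Compare these percentiles against a single-request baseline; for multi-agent systems, also vary the depth of delegation, since coordination overhead grows with both concurrency and chain length.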

07 // Strategy: Multi-Cloud and Vendor Lock-In [Caution]

The multi-cloud question for agents is different from the multi-cloud question for compute. With VMs and containers, workloads are reasonably portable. With agents, portability depends on how tightly your agent's tools, knowledge, and orchestration logic are coupled to a specific platform.

Bedrock agents are the most tightly coupled. Action Groups connect to Lambda, Knowledge Bases live in S3, and the orchestration logic runs on Bedrock's managed runtime. Moving a Bedrock agent to another cloud means rewriting tool integrations, migrating knowledge stores, and rebuilding the orchestration layer. This isn't inherently bad; it's the tradeoff for a fully managed experience.

Google ADK offers the most portability by design. As an open-source framework, ADK agents can run on any infrastructure: GCP, AWS, Azure, or on-premises. The A2A protocol extends this to agent-level interoperability. You could theoretically have an ADK-built agent on GCP collaborating with a LangGraph agent on AWS and a Semantic Kernel agent on Azure, all communicating through A2A. The practical reality is more nuanced (authentication, latency, debugging), but the architectural possibility is real.

Azure sits in the middle. The Microsoft Agent Framework (Semantic Kernel + AutoGen) is open-source and runs anywhere. But the value of Azure's agent offering comes from its integrations with Microsoft 365, SharePoint, and Copilot Studio. Those integrations don't travel. An enterprise using Foundry Agent Service for its Teams integration won't easily replicate that on GCP.


Portability Layer: To build portable agents, focus on open standards at each layer. Use MCP for tool integration (supported by Google ADK and Azure). Use OpenAPI schemas for action definitions (supported by all three). Use framework-agnostic agent logic libraries like LangChain or LlamaIndex that run on any cloud. The agent's reasoning logic (prompts, tool selection, orchestration patterns) should be the most portable layer. Platform-specific pieces (deployment, monitoring, identity) are where lock-in lives.
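The define-once idea can be sketched as a single tool record emitted as both an OpenAPI operation and an MCP-style tool descriptor. The schemas below are simplified for illustration; real MCP tool definitions follow the spec's `inputSchema` shape, and the tool name is invented.

```python
TOOL = {
    "name": "get_order_status",  # hypothetical tool
    "description": "Look up the status of an order by ID.",
    "params": {"order_id": "string"},
}

def _properties(tool: dict) -> dict:
    return {k: {"type": v} for k, v in tool["params"].items()}

def to_openapi(tool: dict) -> dict:
    """Emit a simplified OpenAPI path item for the tool."""
    return {f"/{tool['name']}": {"post": {
        "operationId": tool["name"],
        "description": tool["description"],
        "requestBody": {"content": {"application/json": {"schema": {
            "type": "object", "properties": _properties(tool)}}}},
    }}}

def to_mcp(tool: dict) -> dict:
    """Emit a simplified MCP-style tool descriptor."""
    return {"name": tool["name"],
            "description": tool["description"],
            "inputSchema": {"type": "object", "properties": _properties(tool)}}

print(to_mcp(TOOL)["name"])  # get_order_status
```

Keeping one source of truth for each tool means the portable layers (tool contracts, agent logic) stay identical across clouds, while only the deployment wiring differs.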

Cross-Platform Agents: The A2A protocol is the most promising path to genuine multi-cloud agent systems. Azure AI Foundry already supports A2A API for cross-cloud orchestration with SAP Joule and Google Vertex AI. Google built A2A as an open specification from the start. The practical challenge is debugging distributed agents across cloud boundaries, where observability tools from different vendors don't naturally converge. Microsoft's OpenTelemetry contributions for cross-framework tracing are a step toward solving this.

Compliance Factors: Data sovereignty requirements may force your hand. If your agent processes data that must remain in specific geographic regions, your cloud platform choice is constrained by available regions and data residency certifications. All three platforms offer region-specific deployments, but the compliance certification coverage varies. The EU AI Act's high-risk classification (see the EU AI Act Hub for the full regulatory context) will likely apply to many enterprise agent systems. Azure's private networking with BYO VNet, AWS's VPC integration, and GCP's VPC Service Controls each provide network isolation, but the configuration and certification details differ. Build compliance requirements into your platform evaluation criteria from day one, not as an afterthought. Our Agent Governance Stack maps the NIST AI RMF, ISO, and EU AI Act controls you will need to satisfy. For broader responsible AI governance strategy that spans platform and policy, see the AI Governance Hub.

The pragmatic recommendation: pick the cloud where your data and team already live, then use open standards at the tool integration layer to preserve optionality. Running agents multi-cloud from day one adds complexity that rarely pays off early. Running tools that are accessible across clouds via MCP or OpenAPI is a more realistic and valuable form of portability.

To figure out exactly which architecture decisions fit your specific context, try our Agent Blueprint Quest. It walks through eight layers of architecture decisions, including cloud platform and deployment, and generates a personalized deployment blueprint based on your constraints.

Agent Observability: The Missing Production Layer

Choosing a cloud platform and framework gets your agent to deployment. Keeping it running reliably requires observability tooling purpose-built for agentic workloads. Traditional APM tools track request latency and error rates. Agent observability tracks reasoning chains, tool call sequences, token costs, and decision quality across multi-step workflows that may span minutes and dozens of LLM calls.

Four platforms lead the agent observability space, each with a distinct positioning:

LangSmith
LangChain's official observability platform. Zero-config tracing for LangChain and LangGraph applications with automatic instrumentation. Per-trace pricing model. Best fit when your entire stack is LangChain-native and framework coupling is acceptable.
Langfuse
Open-source (MIT license, 19,000+ GitHub stars) with self-hosting support. Covers tracing, prompt management, and evaluations with multi-turn conversation tracking. The strongest option for teams that need full data control or want to avoid vendor lock-in.
Arize
Enterprise-grade observability serving organizations like Tripadvisor and Microsoft. Offers both Arize AX (commercial) and Arize Phoenix (open-source). Deep support for structured outputs, tool use tracing, and integration with evaluation frameworks.
Braintrust
Evaluation-first platform with CI/CD integration. Automated scorers execute on every pull request, analyze statistical significance, and block merges when quality degrades. Best fit for teams that treat agent quality as a deployment gate, not a monitoring afterthought.

The practical recommendation: if you're using LangGraph, start with LangSmith for frictionless tracing. If you need vendor-neutral observability or self-hosting, Langfuse is the strongest open-source option. For enterprise scale with dedicated support, evaluate Arize AX. For teams prioritizing evaluation-driven development, Braintrust's CI/CD integration catches regressions before they reach production. Whichever you choose, instrument from day one. Debugging a multi-step agent failure without trace data is like debugging a distributed system without logs.

Ready to map your agent architecture? The Agent Blueprint Quest walks you through model, framework, memory, tools, orchestration, security, governance, and deployment decisions. Your intake answers determine which choices are optimal for your context. Or continue to the Secure pillar to understand the threat landscape that shapes how these platforms build guardrails.
