Deployments, also referred to as Agent Runtime, is Google’s managed runtime environment for operating autonomous AI agents in production. The service evolved from Vertex AI Agent Engine, introduced in 2024, and is now part of the Gemini Enterprise Agent Platform (formerly Vertex AI). It targets development teams that want to run agent frameworks such as the Agent Development Kit (ADK), LangChain, LangGraph, or LlamaIndex at scale without managing their own infrastructure.

What is Deployments (Agent Runtime)?

Deployments handles all the infrastructure surrounding the operation of LLM agents: automatic scaling, session and memory management via a Memory Bank, logging/tracing, and integration into the Google Cloud ecosystem. Developers deploy their existing agent frameworks directly onto the runtime without needing to manage containers or Kubernetes clusters themselves. The service supports tool use — the ability of agents to call external functions and APIs — and manages the associated state across multiple conversation turns.

An important distinction from the related Agent Studio (formerly Vertex AI Agent Builder): Agent Studio focuses on low-code creation of RAG applications, search systems, and chatbots via a graphical interface. Deployments/Agent Runtime, on the other hand, is the programmatic runtime environment for code-first agents that execute complex workflows, orchestrate multiple tools, and integrate into existing systems.

The service offers resource controls (CPU, memory, concurrency limits), custom service accounts and agent identities, and monitoring via traces and logs. Integration with other Google Cloud services lets agents access enterprise data directly. Security features such as VPC Service Controls and IAM-based access control make the service production-ready for enterprise applications.

Integration with innFactory

As a certified Google Cloud partner, innFactory supports you in designing and operating AI agent architectures on Deployments/Agent Runtime — from choosing the right agent framework and integrating tools to production-ready deployment on the Gemini Enterprise Agent Platform.

Deployments (formerly Agent Engine) - AI Agent Runtime

What is Deployments (Agent Runtime)?

Integration with innFactory

Typical Use Cases

Quick Links

Google Cloud Partner

Similar Products from Other Clouds

Amazon Augmented AI (A2I) - Human Review for ML

Amazon Bedrock AgentCore - AI Agent Runtime

Amazon Bedrock Agents (Classic): Status and Alternative

Amazon Bedrock Data Automation - Structure Data

Amazon Bedrock Guardrails - Safety for Generative AI

Amazon Bedrock Knowledge Bases: Managed RAG

Ready to start with Deployments (formerly Agent Engine) - AI Agent Runtime?