How Uber Runs 60,000 AI Agent Tasks Per Week With MCP
This Agentic AI Foundation blog post summarizes a talk by Meghana Somasundara and Rush Tehrani at the MCP Dev Summit North America 2026 about Uber's production-scale MCP deployment. Uber runs 60,000 AI agent executions per week, with over 1,500 active agents monthly and more than 90% of Uber's 5,000-plus engineers using AI tooling every month. The infrastructure is built on MCP, which the authors state "are what make AI usable at Uber." Before MCP, every agent team built bespoke integrations to Uber's 10,000-plus internal services, resulting in hundreds of non-reusable parallel integrations. The solution was a control plane consisting of the MCP Gateway and Registry, which automatically translates Uber's service interface definitions (proto and thrift files) into MCP tool descriptions using an LLM. Everything runs through code as pull requests with security scanning before deployment. Security layers include authentication on by default for sensitive data, a PII redactor service, continuous code scanning, and guardrails blocking write operations to critical services. Three consumption surfaces are supported: Uber Agent Builder (no-code platform for thousands of internal agents), Uber Agent SDK (code-first for production-grade agents like grocery assistant and customer support), and coding agents (Claude Code, Cursor, and Uber's Minions which produces roughly 1,800 code changes per week used by 95% of engineers). The roadmap includes evaluation metrics and service SLAs in the registry, a tool-search tool that finds other MCPs on demand, and a Skills Registry for reusable workflow knowledge. The authors present the MCP Gateway and Registry pattern as a reproducible model for production-grade MCP deployment at scale.
https://aaif.io/blog/how-uber-runs-60000-ai-agent-tasks-per-week-with-mcp/
Comments
Post a Comment