AI agents are moving from demo to deployment. Startups across ANZ are building production-grade assistants using open-source orchestration frameworks, fine-tuned foundation models, and GPU-accelerated inference on AWS and NVIDIA infrastructure. This panel explores what it actually takes to ship agentic use casesfrom choosing the right models and frameworks to managing latency, cost, and reliability at scale. We'll hear from AirTree VC on where the investment thesis is heading, from NVIDIA on how accelerated compute is shaping the agent stack, and from Heidi Health building and scaling these systems in production. Whether it's vertical agents for healthcare, customer support, or code generation, we'll focus on what's working, what's hype, and where the real startup opportunities lie in the agent ecosystem.
What this session is about
Playbook
Editorial commentary · what to actually do about this on Monday
Independent editorial perspective — not an official AWS or speaker statement. Designed for executives evaluating what to brief their teams on next.
Live updates related to this session LIVE
Sourced via Parallel AI Monitor — continuous web watch on 21 topical streams. Updated .
- gruve.ai high confidence Scaling infra for agent workloads
FAQs
AgentBudget was identified as an open-source Python SDK that provides real-time cost enforcement for AI agents, allowing developers to set a hard dollar limit on any single AI agent session to prevent runaway expenses.
- insights.reinventing.ai high confidence Open-source agent infrastructure
Open-Source AI Agent Infrastructure Reaches Production Maturity
Galileo released Agent Control, an open-source (Apache 2.0) control plane designed for the centralized governance, real-time policy enforcement, and safety of AI agents. It allows developers to integrate governance hooks using a @control() decorator, decoupling policy management
- ampcome.com high confidence Enterprise AI agent deployments
Enterprise AI Agents 2026: Mid-Year Report on What's Working
NVIDIA GTC 2026 reports that Fortune 500 enterprises have scaled from a few to 50-200 production agentic workflows per company. Key drivers include a 10x drop in inference costs via Blackwell Ultra/Rubin hardware and the adoption of the Model Context Protocol (MCP) and NVIDIA NIM
- rackspace.com high confidence Enterprise AI agent deployments
Agentic AI at Scale: The Rackspace Story
NVIDIA GTC 2026 reports that Fortune 500 enterprises have scaled from a few to 50-200 production agentic workflows per company. Key drivers include a 10x drop in inference costs via Blackwell Ultra/Rubin hardware and the adoption of the Model Context Protocol (MCP) and NVIDIA NIM
- cloud.google.com high confidence Scaling infra for agent workloads
What’s new in compute at Next ‘26 | Google Cloud Blog
AgentBudget was identified as an open-source Python SDK that provides real-time cost enforcement for AI agents, allowing developers to set a hard dollar limit on any single AI agent session to prevent runaway expenses.
External links matched to this session via topic relevance. The KB does not endorse third-party content; verify before citing.