his session explores how Canva leverages Karpenter to scale and optimize diverse workloads on Amazon EKS. Learn how Canva manages AI workloads using On-Demand Capacity Reservations (ODCRs) and EC2 Capacity Blocks for ML, while maximizing resource utilization by intelligently co-locating CPU and GPU workloads on GPU nodes. We will dive into NodePool management strategies for efficient scheduling of AI workloads and examine how Canva uses a range of Amazon EC2 instance types to operate a multi-tenant container orchestration platform for all workloads, optimizing for cost-effectiveness and resource efficiency. Ideal for platform engineers and Kubernetes operators looking to optimize their EKS clusters for both AI and general workloads at scale.
What this session is about
Playbook
Editorial commentary · what to actually do about this on Monday
Independent editorial perspective — not an official AWS or speaker statement. Designed for executives evaluating what to brief their teams on next.
Live updates related to this session LIVE
Sourced via Parallel AI Monitor — continuous web watch on 21 topical streams. Updated .
- cloud.google.com high confidence Scaling infra for agent workloads
What’s new in compute at Next ‘26 | Google Cloud Blog
AgentBudget was identified as an open-source Python SDK that provides real-time cost enforcement for AI agents, allowing developers to set a hard dollar limit on any single AI agent session to prevent runaway expenses.
- virtualizationreview.com high confidence Scaling infra for agent workloads
How to Scale Backend Infrastructure for the Age of Agentic AI
Waxell provides a governance layer for infrastructure-layer budget enforcement that wraps LLM requests and tool calls, synchronously terminating sessions before an API call is placed once a per-session or fleet-wide token/cost ceiling is reached, preventing runaway loop scenarios
- newsroom.ibm.com high confidence Scaling infra for agent workloads
IBM Consulting Expands AI Capabilities to Accelerate Enterprise Transformation
IBM announced an expansion of its AI capabilities through 'IBM Enterprise Advantage' and 'IBM Consulting Advantage,' including the 'Agent2Agent (A2A)' interoperability standard to allow multi-agent orchestration across enterprise ecosystems (e.g., watsonx Orchestrate and SAP's Jo
- gruve.ai high confidence Scaling infra for agent workloads
FAQs
AgentBudget was identified as an open-source Python SDK that provides real-time cost enforcement for AI agents, allowing developers to set a hard dollar limit on any single AI agent session to prevent runaway expenses.
- insights.reinventing.ai high confidence Scaling infra for agent workloads
Multi-Agent Orchestration Patterns Drive Enterprise ROI in 2026
Waxell published a detailed framework on AI Agent Circuit Breakers, proposing automated circuit breakers implemented at the governance plane (outside agent code) to prevent runaway loops, monitor cost velocity, handle consecutive failures, and stop scope violations.
External links matched to this session via topic relevance. The KB does not endorse third-party content; verify before citing.