Overview
RAG combines a retrieval system (usually a vector database) with a generative model so the model can answer questions about private or up-to-date data it was never trained on. Documents are chunked, embedded into vectors, and stored. At query time, the most similar chunks are retrieved and inserted into the model's context. AWS offers Bedrock Knowledge Bases as a managed RAG pipeline, with Amazon OpenSearch Serverless, Aurora pgvector, MemoryDB, and S3 Vectors as vector store options.
Key concepts
- Embeddings and vector similarity (cosine, dot product)
- Chunking strategies — fixed, semantic, hierarchical
- Hybrid search — combining keyword (BM25) and vector search
- Reranking with cross-encoders for precision
- Graph RAG — using knowledge graphs alongside vectors
- Evaluation: faithfulness, answer relevance, context recall
Key AWS services
- Bedrock Knowledge Bases
- Amazon OpenSearch Serverless
- Aurora PostgreSQL (pgvector)
- Amazon MemoryDB
- Amazon S3 Vectors
Learn more — curated resources
Hand-picked official docs, foundational papers, and the best community guides for going deeper on this topic.
Sessions on this topic
51 sessions from the Summit covered this topic. Each is a self-contained mini-lesson.
- AIM201Intermediate
From demo to deployment: solving agentic AI's toughest challenges
Most AI agent projects stall when moving from prototype to production. This session tackles the top challenges builders face when deploying agentic AI at scale. You'll learn how to answer the fundamental question of whether to build custom agents or leverage pre-built agents for DevOps, security, development, and business productivity use cases. Then you'll discover how to address the critical production challenges of reliability, observability, cost management, security, and evaluation. Drawing from real customer deployments and AWS's portfolio of agentic AI capabilities, you'll gain actionable approaches for building agents that don't just demo well but ship and scale.
- ANT301Advanced
A practitioners guide to data for agentic AI
In this session, gain the skills needed to deploy end-to-end agentic AI applications using your most valuable data. This session focuses on data management using processes like Model Context Protocol (MCP) and Retrieval Augmented Generation (RAG), and provides concepts that apply to other methods of customizing agentic AI applications. Discover best practice architectures using AWS database services like Amazon Aurora and OpenSearch Service, along with analytical, data processing and streaming experiences found in SageMaker Unified Studio. Learn data lake, governance, and data quality concepts and how Amazon Bedrock AgentCore and Bedrock Knowledge Bases, and other features tie solution components together.
- DAT304Advanced
AI-Native by Design: How Deputy Rewired Its Operating Model on AWS
Ciaran Hale shares how Deputy moved from fragmented AI experiments to a centralised AI and data foundation on AWS. By standardising infrastructure, introducing governance guardrails, and building reusable components, Deputy enabled teams to scale AI with confidence. This shift also transformed internal adoptionembedding AI into everyday workflowsand unlocked new customer value through Deputy AI, delivering smarter automation and insights in-product. A practical blueprint for building an AI-native organisation.
- MAM306Advanced
Adding Agentic AI to legacy apps with Amazon Bedrock AgentCore
In this code-first session, we demonstrate how to add agentic AI capabilities and augment a legacy application using Amazon Bedrock AgentCore and the Amazon Strands Agents SDK. We will explore how to build AI-powered features for a legacy application without modifying the existing backend code. We will showcase how to leverage existing APIs and Lambda functions as the backbone for your agentic AI experience. You'll learn how to execute code in isolated sandbox environments, ensuring security while accessing internal data sources with Amazon Bedrock AgentCore Code Interpreter.
- ISV302Advanced
Architecting Scalable AI Agents using Amazon Bedrock AgentCore
Discover how to build powerful AI agents using Amazon Bedrock's suite of tools, with a focus on Amazon Bedrock AgentCore. This session explores how Parrot Analytics leveraged the modular components of Amazon Bedrock AgentCore and Amazon Nova foundational models to achieve 10x the processing speed of manual classification across 2M+ entities. We will dive into prompt and context engineering, knowledge bases, and observability for production agentic workloads.
- STP205Intermediate
How Dovetail powers Multi-Tenant Agents with Vector Indexing at Scale
When you're building multi-tenant vector search but can't control what customers throw at you, every indexing decision — isolation, embedding, chunking, partitioning — becomes a bet you're making blind, and this talk gives you the framework to make the right ones.
- DEV207Intermediate
Data Observability Without the Pain - Lessons from a Production System
Modern IoT platforms are inherently data platforms. Events flow through APIs, queues, AWS Lambda Serverless functions, storage systems, and device networks before becoming meaningful data. When something goes wrong, tracing a single event across these distributed components quickly becomes painfuland the question shifts from _what happened_ to _where do I even start looking Ill walk through three practical observability patterns drawn from building and operating a production, event-driven IoT healthcare platform on AWS that processes tens of thousands of device events daily. Using OpenTelemetry, AWS X-Ray and Honeycomb, well explore techniques for gaining visibility into asynchronous event pipelines, correlating activity across services, and tracing events as they move through distributed systems. Youll leave with three concrete patterns you can apply immediately to your own event-driven data systems.
- AIM303Advanced
AWS Security Agent: Proactive AppSec from Design to Deployment
Application security teams face an impossible challenge: scale security expertise across growing application portfolios while maintaining development velocity. Traditional approaches force organizations to choose between speed and security. In this session, discover how AWS Security Agent transforms application security from reactive to proactive through AI-powered automation. Learn how this frontier agent conducts automated security reviews customized to your organizational requirements and delivers on-demand penetration testing tailored to your applications. Join us to see how you can scale security coverage and prevent vulnerabilities early in the development lifecycle while maintaining the speed of modern development.
- ARC305Advanced
Transforming from SaaS to multi-tenant agentic SaaS
Existing SaaS providers must determine how and where agents best fit into their offerings. Getting there requires organizations to transform existing IP and functionality into agent-powered experiences. This breakout will dig into the details of this transformation, examining the patterns, strategies, and techniques that can be used to introduce agents into an existing multi-tenant system. Well focus heavily on identifying the target agents, digging into how/where theyre built and introduced, how theyre integrated, and so on. Well also dig into how multi-tenancy lands in new agents, integrating with MCP servers, using RAG, applying tenant isolation, supporting onboarding, and on on.
- MAM303Advanced
Digital transformation excellence using agentic AI
Discover how customers are leveraging AWS AI-driven solutions to accelerate their cloud transformation journey, moving beyond traditional migration and modernization to achieve digital transformation excellence. This session showcases real-world experiences where organizations have integrated AI-powered accelerators with AWS cloud services to achieve 3x faster migration timelines and drive unprecedented business value. Learn actionable strategies and see demonstrations of how a large transformation project can become a digital enterprise enabler for your organization,
- PRT201-SIntermediate
Postman and the Future of AI-Driven API Development in 2026
Software development has fundamentally changed in 2026, driven by vibe coding, AI agents, and RAG/MCP. APIs are the interface layer for AI systems to perform meaningful work. For this to succeed, your APIs must be discoverable, consistent, and usable by both developers and agents. Postman is now central to designing, managing, and iterating on your APIs to be sustainable in this new era.
- PRT207-SIntermediate
Charting the CX Frontier: A Cohesive, AI-Enabled Engagement Platform
Geopolitical instability, rising CX demands, rapid tech shifts, and escalating cyber threats converge faster than manual processes can handle. Join our expert panel as they leverage AWS and AI to build customer solutions, elevate engagement, and neutralise cyber threats. We'll share real deployments, proven governance, and measurable gains in efficiency, resilience, and customer impact.
- PRT209-SIntermediate
How Auto & General leverage observability foundations for AI
As one of Australia's leading general insurers, Auto & General knew AI would play an important part in their future IT operations. To ensure success, the team embarked on a comprehensive observability maturity journey to build solid foundations, governance, and structure. Learn how A&G worked with New Relic to successfully lay observability foundations for the AI age.
- PRT213-SIntermediate
How NAB is Conquering Multi-Cloud to Secure the Enterprise
For the National Australia Bank (NAB), operating across multiple cloud environments and delivery platforms is a necessity but results in a critical challenge - fragmented visibility and the costly reality of "design once, build thrice" security controls. Learn how NAB moved beyond compliance to empower every engineering team with a unified view of risk, achieving security at speed and scale.
- PRT216-SIntermediate
Postman and the Future of AI-Driven API Development in 2026
Postman and the Future of AI-Driven API Development in 2026 (sponsored by Postman, Inc)Software development has fundamentally changed in 2026, driven by vibe coding, AI agents, and RAG/MCP. APIs are the interface layer for AI systems to perform meaningful work. For this to succeed, your APIs must be discoverable, consistent, and usable by both developers and agents. Postman is now central to designing, managing, and iterating on your APIs to be sustainable in this new era.
- PRT301-SAdvanced
Unite Teams, Tools, and AI to Drive Transformation at Scale
Today's leaders face pressure to prove AI ROI, yet many struggle to realise value on fragmented foundations. Miro on AWS solves this by uniting teams, tools, and collaborative AI workflows in one workspace — amplifying the way teams actually work with enterprise scale and security. In this fireside chat with Culture Amp, hear how turning AI ambition into real outcomes works in practice.
- DEV202Intermediate
AI Native Development: Strategies and Impact across Amazon and AWS
AI Native Development: Strategies and Impact across Amazon and AWSAmazon and AWS have evolved beyond AI-assisted development to embrace AI Native practices, integrating AI as a partner throughout the software development lifecycle. Learn how their teams leverage AWS foundational tools including Kiro, and Amazon Bedrock. Discover effective Prompt Driven Development methodologies and grassroots adoption strategies from early champions. See how Amazon enables teams to provide AI with right context through strategic use of MCP, RAG and custom models trained on Amazon technical knowledge. Understand the culture transformation required across multi-thousand person organizations, where every role must evolve. Gain actionable insights to accelerate your AI Native journey.
- DEV314Advanced
AI Native Development: Strategies and Impact across Amazon and AWS
Amazon and AWS have evolved beyond AI-assisted development to embrace AI Native practices, integrating AI as a partner throughout the software development lifecycle. Learn how their teams leverage AWS foundational tools including Kiro, and Amazon Bedrock. Discover effective Prompt Driven Development methodologies and grassroots adoption strategies from early champions. See how Amazon enables teams to provide AI with right context through strategic use of MCP, RAG and custom models trained on Amazon technical knowledge. Understand the culture transformation required across multi-thousand person organizations, where every role must evolve. Gain actionable insights to accelerate your AI Native journey.
- SEC305Advanced
Advanced AI Security: Architecting Defense-in-Depth for AI Workloads
Dive deep into advanced security architectures for AI workloads, exploring how to protect your workload against sophisticated attack vectors. Through technical examples, we'll implement secure architectures for AI workloads, covering identity, fine-grained access policies, and secure foundation model deployment patterns. Learn how to harden generative and agentic AI applications using AWS security capabilities, implementing least-privilege controls, and building secure architectures at scale.
- TNC202Intermediate
Accelerate Your Cloud Journey with AWS Transform
Embark on a faster, smoother cloud transformation with agentic AI and integrated solutions. This session reveals how AWS Transform accelerates your cloud journey, addressing migration and modernization challenges through intelligent automation. Through real-world examples, discover how to leverage this powerful integration to fast-track your cloud adoption and transformation efforts. With the specialized AI agents of AWS Transform, customers can migrate VMware workloads to Amazon EC2, modernize .NET applications to cross-platform .NET, and modernize IBM z/OS mainframe applications, delivering transformation projects up to 4x faster.
- ISV301Advanced
Rolling to Scale: Roller's Multi-Tenant SaaS platform on AWS
Learn how Roller Software grew from an Australian startup into a global venue management platform serving 3,000 venues across 30 countries and delivering 120 million experiences annually. Using AWS multi-tenant architecture, Roller maintains 99.99% uptime while processing $4 billion in transactions each year through their modern monolith application. This session covers practical strategies for tenant isolation, infrastructure scaling, and enterprise-grade security. Youll discover how to leverage AWSs native multi-tenant capabilities and get a proven roadmap for scaling your SaaS business from startup to enterprise while keeping costs efficient and operations excellent.
- DEV201Intermediate
How Flybuys Built AI Governance to Accelerate Adoption at Scale
Scaling AI successfully isnt just about moving fast — its about building the right foundations first. In this session, learn how Flybuys focused early on AI governance, steering documents, and engineering standards to enable smooth, secure AI adoption at scale. Well explore how upfront investment in guardrails, training, and approval processes allowed teams to deploy AI capabilities faster and with confidence. Youll hear how Flybuys is embedding governance and security expectations into engineering workflows using Kiro, including standardised steering patterns, approval pathways, and controlled rollout of AI capabilities such as Powers. Attendees will gain practical insights into how slowing down early can unlock faster, safer AI delivery across the organisation.
- STP208Intermediate
NextAI's LegalScout: A Data Foundation for Private Legal AI
LegalScout helps Australian SME law firms turn Generative AI into a competitive advantage by securely leveraging their own client data and confidential matters to work smarter, not harder. Built with Australian lawyers on AWS using Amazon Bedrock for inference and Amazon S3Vectors for secure document searches, it automates repetitive work, streamlines workflows, and improves drafting, contract review, and research to boost productivity, reduce costs, and lift accuracy while maintaining strict privacy and compliance.
- SEC401Expert
Advanced AI Security: Architecting Defense-in-Depth for AI Workloads
Dive deep into advanced security architectures for AI workloads, exploring how to protect your workload against sophisticated attack vectors. Through technical examples, we'll implement secure architectures for AI workloads, covering identity, fine-grained access policies, and secure foundation model deployment patterns. Learn how to harden generative and agentic AI applications using AWS security capabilities, implementing least-privilege controls, and building secure architectures at scale.
- DEV311Advanced
Serverless Developer Experience: Day in a life of builder
What does it mean to be a serverless developer in the era of GenAI What disciplines do you need to master to build cloud-native, serverless solutions today In this session, we'll walk through a day in the life of a serverless developer and explore the core principles, architecture patterns, frameworks, and how to leverage GenAI tools to build your next-generation serverless application.
- ISV206Intermediate
Scaling RAG to Millions of Vectors: The Squiz Story
Squiz, a global Digital Experience Platform provider, is transforming how organizations deliver conversational search experiences. By adopting Amazon S3 Vectors, Squiz reimagined its ingestion pipeline — increasing data processing speed by 50% and shifting from bespoke, always-on infrastructure to a scalable serverless model. This allows Squiz to seamlessly scale from 25,000 to millions of vectors per client, while significantly reducing costs. Hear how this shift freed engineering teams to focus on RAG innovation rather than infrastructure management, and how it powers smart video search capabilities across their platform.
- ARC302Advanced
Secure Multi-tenant SaaS with AWS Lambda: A Tenant Isolation Deep Dive
In this session, learn about AWS Lambda's execution environment lifecycle, diving deep into how the service manages isolation at the function level, and understanding the security implications of environment reuse patterns. Learn about traditional patterns for compute isolation in multi-tenant environments, as well as explore Lambda's tenant isolation mode - a new powerful capability that enables tenant-level compute separation without operational overhead. Explore how to implement robust tenant isolation strategies, manage state across executions, and leverage Lambda's security boundaries effectively. Whether building new SaaS applications or enhancing existing ones, leave with practical knowledge to implement secure multi-tenant architectures at scale.
- ARC307Advanced
AI Powered Resilience Lifecycle
Not all disaster recovery strategies can address the complex, dynamic nature of modern cloud infrastructures, leading to gaps in system resilience and compliance adherence. Discover how to enhance resilience and disaster recovery on AWS empowered by AI. This approach bridges infrastructure insights and application-level testing, enabling more effective disaster recovery preparation. You will learn how to leverage Large Language Models (LLMs) with AWS Resilience Hub and AWS Systems Manager to modernize testing, analyze infrastructure, and generate targeted AWS Fault Injection Service experiments and recovery runbooks. Walk away with practical examples of automated test generation with templates and learn to design prompts.
- ARC403Expert
Secure Multi-tenant SaaS with AWS Lambda: A Tenant Isolation Deep Dive
Secure Multi-tenant SaaS with AWS Lambda: A Tenant Isolation Deep DiveIn this session, learn about AWS Lambda's execution environment lifecycle, diving deep into how the service manages isolation at the function level, and understanding the security implications of environment reuse patterns. Learn about traditional patterns for compute isolation in multi-tenant environments, as well as explore Lambda's tenant isolation mode - a new powerful capability that enables tenant-level compute separation without operational overhead. Explore how to implement robust tenant isolation strategies, manage state across executions, and leverage Lambda's security boundaries effectively. Whether building new SaaS applications or enhancing existing ones, leave with practical knowledge to implement secure multi-tenant architectures at scale.
- IDE301Advanced
Diversity In Tech - AI Literacy Skills - Rapid prototyping with Kiro
In this workshop you will be shown how to build functional prototypes using our proven techniques. Learn how to leverage Kiro to go beyond "vibe-coding", and transform ideas into fully functional prototypes while validating technical approaches. Discover effective spec-driven development and prompt engineering techniques combining generative AI capabilities with AWS services for rapid iteration and refinement. If participants would like to follow along please bring your own laptop.
- ISV201Intermediate
MCP on EKS: Xero's AI-Driven Developer Experience
AI coding agents are transforming how developers build and operate modern cloud-native applications. With tools such as Kiro CLI, Kiro IDE, or any MCP-compatible AI coding assistant, developers are embracing AI to move faster and scale smarter. This session explores how MCP servers help developers streamline code generation, deployment, and debugging by embedding infrastructure awareness directly into the AI assistant. Learn how Xero is leveraging MCP to speed up development, simplify operations, and deliver more reliable containerized apps at scale. Xero will also share their success story using Kiro CLI, Prometheus MCP, EKS MCP, and AWS Knowledge Base MCP to identify and resolve Prometheus cost spikesslashing costs by 40%.
- PRT210-SIntermediate
Charting the CX Frontier: A Cohesive, AI-Enabled Engagement Platform
A forward-looking view of how enterprises are transforming customer experience through unified, AI-driven architecture. By moving away from fragmented solutions to an integrated platform spanning AI-powered engagement, automation, and orchestration, organisations can deliver more consistent, scalable, and outcome-focused experiences in an AI-first era.
- SMB202Intermediate
PMY Delivers Realtime Crowd Analytics at the F1 Australian Grand Prix
Major events produce fragmented data across CCTV, sensors, ticketing, and venue systems. PMY Group will show how Optic, built on AWS, brings these sources together to create real-time crowd intelligence. Using the Australian Grand Prix as a case study, this session explores how operators gained live visibility into movement and congestion to support faster operational decisions. It also highlights how the same AWS foundation can support scalable analytics and broader unified data outcomes across venues and events.
- TNC203Intermediate
Structured Approach to AI coding with Spec-Driven Development on Kiro
This session demonstrates how Kiro brings discipline and clarity to AI-assisted software development, ensuring generated code aligns with intended functionality and architecture. Explore Kiro's innovative spec-driven development approach for AI coding. Learn how to leverage structured specifications as a single source of truth, contrasting with unstructured 'vibe coding'. Discover how Kiro uses AI to generate detailed requirements, design, and task documents, guiding AI agents in code creation. Experience a workflow that enhances collaboration, maintainability, and documentation accuracy.
- PRT106-SFoundational
The AI Challenge You Don't Yet Know About - Software Supply Chain
Most teams have "done DevOps" but still face low platform adoption, rising cloud costs, lagging security, and fragile incident response. This session explores the "After" state: treating your platform as a product, with golden paths, built-in security, AI-driven simplicity, and reliability by defaultusing Harness as the model for a governed, cost-aware, AI-native developer experience.
- ISV202Intermediate
Architecting for growth and resilience: Cell based design deep dive
As business demands evolve, architectural patterns must evolve too. SafetyCulture and Buildkite implemented cell-based architectures driven by distinct business objectivesscaling for hypergrowth and enhancing infrastructure resilience. SafetyCulture's expansion plans required proactive architectural evolution to unlock unlimited scaling capacity. Buildkite needed to meet stringent security isolation requirements while achieving scale through repeatable deployment units. This session shares real-world experiences as both companies designed and implemented cell-based architectures for their SaaS platforms. Discover how SafetyCulture identified bottlenecks, redesigned systems for isolation and resilience, and aligned technical capabilities with business growth targets. Learn how Buildkite leveraged cell-based design to achieve both scale and security isolation. Walk away with actionable patterns for building resilient, scalable architectures.
- MAE202Intermediate
Seven's AWS Journey: Streaming Premium Content at the Speed of Innovation
Join Tim Sheridan, Director of Product & Technology at Seven West Media, as he shares how Seven is leveraging cloud and AI to maximise the return on their most valuable asset — premium live content. With marquee events like the AFL Grand Final and The Ashes cricket series, the stakes couldn't be higher: massive concurrent audiences, critical advertising revenue, and zero tolerance for failure. Tim shares how they leaned on AI-powered developer and business tools to accelerate delivery, de-risk high-profile events, and maximise the return on its premium content investments. Discover how Seven's team transformed their approach to innovation — using cloud-native architecture and AI to achieve speed to market, audience experience, and advertising revenue.
- STP213Intermediate
AI-Powered Farming: How Halter's ML Models Transform Dairy Operations
New Zealand Unicorn agritech startup Halter is revolutionizing dairy farming with AI-powered smart collars that predict critical livestock events. Their machine learning models enable heat detection, calving prediction, pasture optimization, and animal behavior classification, processing data from thousands of GPS-enabled collars across remote farms. By leveraging AWS infrastructure, Halter's engineering team built scalable ML pipelines that help farmers make data-driven decisions, reduce labor costs, and improve animal welfare. Learn how Halter developed production ML models for agriculture, overcame challenges of training on livestock data, and their journey toward managed ML services.
- TNC301Advanced
Using Tools and Agents in Generative AI applications
Join us for an engaging session on AI Agents and Tools in AWS, where well explore how to build intelligent, autonomous systems using Amazon Bedrock and open-source frameworks. Learn about function calling, ReAct patterns, and AWSs comprehensive agent platforms. Well dive into practical demonstrations using Strands and CrewAI, and discover how to leverage protocols like MCP and A2A for seamless tool integration and agent collaboration. Perfect for developers looking to create production-ready AI solutions.
- IDE101Foundational
From principles to practice: Scaling AI responsibly
Building AI applications that customers trust requires more than technical excellenceit demands a deliberate approach to managing risk across every stage of the AI lifecycle. As organizations scale their AI initiatives, the challenge of balancing innovation speed with responsible AI practices across dimensions like privacy, security, fairness, safety, and explainability becomes increasingly critical. Join our panelists for a 30-minute discussion where they will explore: Practical approaches to embedding responsible AI principles into AI application development without slowing down innovation, key considerations across privacy, security, fairness, safety, and explainability that organizations should prioritize, lessons learned from building AI applications that earn and maintain customer trust, and strategies for navigating the evolving responsible AI landscape and managing risk at scale. Whether you are a technical leader building AI solutions, a business decision-maker shaping your organization's AI strategy, or a practitioner looking to deepen your understanding of responsible AI, this session will provide actionable insights to help you build AI applications that are not only innovative but also trustworthy.
- ISV102Foundational
From documents to voice - building AI products on AWS
How Affinda leverages Amazon Bedrock (Claude), SageMaker, EKS & CloudFormation to deliver intelligent document processing at enterprise scale, cutting setup time and costs by 90% with 95%+ accuracy. This session will demonstrate how Affinda powers real-world AI product development from Affinda's Intelligfent Document Processing platform to Pathfindr's (acquired by Affinda) custom AI agents. The session will showcase the complete journey of building Honey Insurance's voice agent - Australia's first voice agent in financial services, and how the Affinda-AWS partnership enables rapid AI product development for Enterprises.
- IDE102Foundational
Power of Possibility: Leading Through Innovation and Connection
As AI reshapes every industry, professionals across all roles and backgrounds are navigating an unprecedented pace of changebringing new opportunities but also rising burnout, blurred boundaries, and pressure to continuously adapt. This moment of disruption presents a powerful opportunity to not only deliver innovation but to redesign how we lead, build culture, and sustain meaningful careers in more equitable ways. Join accomplished AWS leaders and peers for this immersive session that combines strategic leadership frameworks, emotional intelligence, and interactive roundtable discussion to accelerate your impact in tech. Together, we will explore practical strategies for claiming visible technical leadership, activating professional networks, setting sustainable boundaries in hybrid work, and championing responsible AI adoption without amplifying existing inequalities. Participants will share lived experiences, tactics that worked and lessons learned, and build meaningful connections through guided speed networking in a collaborative, supportive environment. This session empowers professionals from all backgrounds, with particular focus on amplifying diverse voices and fostering inclusive innovation. Leave with actionable strategies to strengthen your leadership presence, leverage emotional intelligence as a career accelerator, build psychologically safe and inclusive team environments, and navigate the challenges shaping your future in tech.
- INO203Intermediate
Behind the curtain: How Amazons AI innovations are powered by AWS
Discover how Amazon leaders across Zoox, Prime Video, and Amazon Stores are leveraging AI to power their next-generation innovations with AWS and create better experiences for customers. Through three key stories discover how customer needs sparked transformative experiences: fan feedback revolutionising sports broadcasting with AI-powered highlights, shopping patterns evolving Amazons mobile commerce platform, and robotics and automation delivering advances in supply chain optimisation and fulfillment. Learn the best practices Amazon has applied using AWS that can help scale innovation in your organisation.
- IND201Intermediate
Transforming software license efficiency - Human-centered AI on AWS
As Worley's software landscape expands, manual license governance struggles to keep pace with scale and complexity. While manual optimisation has delivered measurable results, a sustainable approach is needed to scale these outcomes. Software Intelligence Advisor (SIA) is Worley's agentic AI solution that enables optimal license decisions and empowers end users. Underpinned by AWS native data platforms, SIA combines deep usage intelligence with a conversational agent that meets users within existing collaboration tools. Through trusted, context-aware conversations, the agent validates usage patterns and encourages better behaviours — delivering scalable, human-centred optimisation and a pragmatic path to learning what agentic AI can deliver.
- ISV104Foundational
hipages Journey Towards an Agentic Engineering Organisation
Discover how hipages, ANZ's leading online construction marketplace, redefined their software development lifecycle by embedding Claude Code on Amazon Bedrock into daily engineering workflows. Join this session to get practical insights into maximizing Claude Code on Bedrock, understanding the decision process behind selecting this solution, and learning how hipages is pioneering an AI-first strategy that's transforming operations across their entire business.
- DEV310Advanced
Zero-Downtime Migration from Sydney to Auckland (ap-southeast-6)
With AWS ap-southeast-6 (Auckland) now open, New Zealand organizations can repatriate workloads from Sydney. This advanced session provides practical migration strategies minimizing downtime and eliminating data loss across every layer of your stack. You'll learn region-to-region migration patterns for: *Storage*: S3 replication, EBS snapshots, EFS cross-region transfers *Databases*: RDS read replicas, DynamoDB global tables, self-managed EC2 database replication *Applications*: Lambda, ECS/EKS workload migration, EC2 AMI copying Walk away with a prioritized migration playbook, realistic RTO/RPO targets, and battle-tested sequencing strategies for large-scale data transfers without extended application outages.
- IND101Foundational
Test, Learn, Iterate: Amazon Connect Success
Discover how Flybuys achieved rapid contact centre transformation through early Amazon Connect adoption using AI-powered capabilities and a disciplined Test, Learn, Iterate approach. Starting with a focused pilot, they deployed AI-driven features like intelligent routing, real-time sentiment analysis, and automated quality assurance. They progressed through Launch, Activate, and Consume phasescapturing baseline metrics, scaling through peer-led training, and continuously refining AI performance based on weekly feedback loops. The results: reduced AHT, improved CSAT, 100% AI-powered QA coverage, and measurable ROI. This demonstrates that early AI adoption delivers calculated, data-driven transformation.
- IND206Intermediate
How scalable data foundations helped TGE unlock the power of AI
In one of Australia's most operationally complex industries, Team Global Express (TGE) turned data into a strategic asset, and AI into a competitive edge. In 2025, TGE invested in data modernisation, establishing an AWS native data platform which now serves as the operational heartbeat of its logistics network. On this foundation, TGE is delivering compounding business value through rapid deployment of AI solutions across multiple domains. Join this session to learn how TGE secured board-level backing, built a lean AI team, and is scaling pragmatic, cost-effective AI — including the lessons learned along the way and whats next on their roadmap.
- ISV213Intermediate
From GRC Platform to AI-Native Risk Intelligence on AWS:Protecht Story
Protecht, a global leader in enterprise risk management software, partnered with AWS and Caylent to build Cognita AI, an embedded AI assistant purpose-built for governance, risk, and compliance (GRC). Backed by a $280M PSG investment, Protecht built Cognita on a production-grade Amazon EKS foundation, integrating Amazon Bedrock and Anthropic's Claude models with a RAG architecture grounded in Protecht's proprietary GRC content. The result is a contextual, explainable, and auditable AI assistant that guides risk professionals through complex workflows, earning high accolades at the Gartner Enterprise Risk, Audit & Compliance Conference and setting a new benchmark for investor-grade, regulator-trusted AI in months.
- ISV207Intermediate
How Canva Scales and Optimizes AI Workloads with Karpenter
his session explores how Canva leverages Karpenter to scale and optimize diverse workloads on Amazon EKS. Learn how Canva manages AI workloads using On-Demand Capacity Reservations (ODCRs) and EC2 Capacity Blocks for ML, while maximizing resource utilization by intelligently co-locating CPU and GPU workloads on GPU nodes. We will dive into NodePool management strategies for efficient scheduling of AI workloads and examine how Canva uses a range of Amazon EC2 instance types to operate a multi-tenant container orchestration platform for all workloads, optimizing for cost-effectiveness and resource efficiency. Ideal for platform engineers and Kubernetes operators looking to optimize their EKS clusters for both AI and general workloads at scale.
- FSI203Intermediate
How HBF Transformed Claims Processing From Two Weeks to Two Minutes
In this session discover how HBF revolutionized claims processing using AWS. By leveraging Amazon Bedrock and Amazon Textract, they cut claim costs from $2 to just 10 cents and reduced the processing time from two weeks to two minutes. With accuracy in the high 90s and 70,000 claims processed monthly, their end-to-end AI-powered architecture for claims processing sets a new benchmark for speed, cost, and customer satisfaction.
Live updates related to this topic LIVE
Sourced via Parallel AI Monitor — continuous web watch on 21 topical streams. Updated .
- ibm.com high confidence Agent memory & RAG architectures
IBM announcements at Think 2026 to advance the agentic era
IBM introduced 'Real-time context on watsonx.data', which provides AI agents with data that is continuously accessible as it changes. Using a Real-Time Context Engine in partnership with Confluent, the system combines streaming data with semantic enrichment and governance, allowi
- vectorize.io high confidence Agent memory & RAG architectures
Best AI Agent Memory Systems in 2026: 8 Frameworks Compared
IBM introduced 'Real-time context on watsonx.data', which provides AI agents with data that is continuously accessible as it changes. Using a Real-Time Context Engine in partnership with Confluent, the system combines streaming data with semantic enrichment and governance, allowi
- zylos.ai high confidence Agent memory & RAG architectures
Live Agent Upgrades and Cross-Runtime Session Portability (2026)
MarsDevs published the 'Agentic RAG: The 2026 Production Guide', detailing a shift from linear RAG pipelines to a state-machine control loop. This 'Agentic RAG' approach uses a planner agent to decompose queries and iteratively retrieve and evaluate information. It identifies fiv
- marsdevs.com high confidence Agent memory & RAG architectures
Agentic RAG: The 2026 Production Guide | MarsDevs
MarsDevs published the 'Agentic RAG: The 2026 Production Guide', detailing a shift from linear RAG pipelines to a state-machine control loop. This 'Agentic RAG' approach uses a planner agent to decompose queries and iteratively retrieve and evaluate information. It identifies fiv
- mem0.ai high confidence Agent memory & RAG architectures
Mem0 - The Memory Layer for your AI Apps
Mem0 introduced 'Memory Decay,' a technical approach to long-term memory management that mimics human forgetting. The system implements a ranking score for memories that is reinforced upon each single retrieval and gently decayed over time if the memory remains untouched. This pr
External links matched to this topic via topic relevance. The KB does not endorse third-party content; verify before citing.
Non-obvious insights
From the PlaybookOne sharp, contrarian insight per session — the things teams don't think of unprompted.