TL;DR I owned design and research for a unified generative AI assistant across the Amazon Ads console — a 12-month, cross-functional effort with 1 PM, 4 SDEs, and 2 data scientists. Starting from a support chatbot, I expanded scope to build a cross-page assistant capable of invoking specialized agents for image generation, reporting, keyword generation, and campaign creation. I defined the data architecture with engineering, established four design principles, navigated a VP-level org conflict, and wrote design system guidelines for pattern adoption across teams. The assistant launched with 10 skills and three interaction types — reactive, ambient, and inline — resolving 10% of support cases, eliminating 50k+ tickets, cutting average handle time from 25 minutes to 1.5 minutes, and saving nearly $600k in cost per contact.
After starting work on a generative AI assistant for the support team, I noticed other teams working on standalone agents with their own chat interfaces. This led to an opportunity to expand the scope and build a unified assistant across the Ad Console — one that could handle not only support but also invoke other agents for image generation, reporting, keyword generation, and more.
Provide advertisers with a cohesive assistant experience across the Ad Console. We defined five success targets going in:
The research surfaced four key findings that shaped the direction:
A cross-page unified assistant with the ability to ingress inline and from nudges. This experience would provide not only support but also access to recommendations, image generation, keyword generation, and more — living in the existing help panel with three distinct interaction types for advertisers.
Working with product, engineering, and data science teams to define the assistant architecture. The system is built around a Central Ads Assistant API with a multi-agent workflow handling intent extraction, skill/agent matching, response moderation, and suggested prompts — backed by a specialized agents registry, conversation history management, advertiser profile, and a responsible AI guardrails layer.
We aligned on four principles to guide every design decision:
Early 2024 — We started with a redesigned contact us experience, moving toward a conversational intent detection path. This groundwork trained our intent detection accuracy to almost 95%.
Mid 2024 — The next exploration revolved around a conflicting team working on the same solution. This escalated to a VP-level decision where we aligned on our team moving forward with our planned design and the conflicting team owning agent onboarding.
End of 2024 — Nearing the final design, we expanded the functionality by introducing two additional interaction types: proactive nudges and inline experiences contextual to each page. During this time there was also a design system refresh.
We launched three interaction types with 10 different skills and counting, focusing on quality of responses before launching each skill completely. At launch we supported: campaign moderation issues, image generation, report generation, billing issues, campaign creation, campaign optimization, advertiser suspensions, invalid clicks, promotional credits, and all education questions.
I wrote design system guidelines for the conversational AI assistant pattern in our Storm design system, providing a framework for pattern adherence and adoption across teams.
In tandem with the written guidelines I created a framework for both designers and engineers to work from — covering feedback patterns, the conversational layer, input fields, and response sources.
The assistant resolved 10% of support cases, eliminating 50k+ tickets and nearly $600k in cost per contact. Average handle time dropped from 25 minutes to 1.5 minutes. We onboarded 4 partner teams under our framework and introduced more than 10 skills.
With the framework now in place, enabling designers and product owners to seamlessly add their agents, the team is focusing on the next key skills — including AI-generated product recommendations and full campaign creation.
There's also a continued push to collect feedback at scale: are the images generated accurate? Are the goals being suggested providing the right outcomes? Feedback collection will be critical as more skills are onboarded.
We nailed our goals and more — making changes at an app level and introducing a new way for product owners to get involved. The only thing I'd tweak is how fast we moved through updates and new features. Slowing down slightly to pressure-test each iteration before moving to the next would have saved some rework down the line.