Agents spent hours listening to recordings and manually assigning categories. At 30,000 calls per month, meaningful coverage was impossible. Most calls went uncategorized or were assigned the first available label regardless of accuracy.
Error-prone + incompleteThe existing taxonomy had just two broad categories — far too coarse to reveal any actionable signal. The real question — which specific issue is driving the most repeat contacts this month? — was simply unanswerable with the existing process.
No business specificityBecause root causes were never captured, the same underlying issues recurred month after month. Customers called multiple times about the same unresolved problem. Leadership had no mechanism to identify which issues were driving repeat contacts.
Repeat contact rate risingBecause categorization was manual and weeks behind, business leaders could never see what was happening now. They were always reacting to last month's patterns. Data-driven decisions were effectively impossible — the data simply wasn't there.
Always reactive, never proactive"We knew thousands of customers were calling us every month. We had no idea what they actually needed — or which problems were sending the same customers back again and again."
— Pacific Specialty Insurance Operations Team
Amazon Connect handles all inbound calls as before, generating event metadata and audio recordings. Amazon Transcribe converts each call's audio to a timestamped text transcript automatically. No changes were made to any Connect flow, IVR, or agent configuration. The AI pipeline is purely additive — it listens to what already exists.
All call transcripts, contact metadata (Contact ID, agent ID, timestamp, queue name), and event records are stored in Amazon S3 as the system's single source of truth. S3 serves as the durable, scalable foundation for all downstream processing — nothing is processed until it is safely persisted here.
Amazon Athena enables serverless SQL queries directly over the S3 data lake. The pipeline queries transcripts by Contact ID, date range, agent, or queue — filtering only the calls relevant to the current processing batch. No data movement is required. This is how the system fetches individual transcripts for on-demand analysis or pulls batches for overnight processing runs.
calls, categories, sub_categories, and call_drivers. The system auto-creates category and driver records as new classifications emerge — so the taxonomy grows organically from what customers are actually saying, not from what someone pre-supposed they might say. Idempotency is enforced: if a call has already been analyzed, the stored result is returned immediately without re-invoking Bedrock.QuickSight connects directly to the RDS database, generating live dashboards that show call category distributions, root cause trend lines, repeat-contact patterns, and agent-level insights. Business analysts see what is happening now — not what happened last month. Leaders can drill from a top-level trend ("billing inquiries up 22% this week") to specific root causes ("automated payment portal timeout") to recommended actions — all within the same dashboard.
"Hi, I'm calling because I just got my renewal notice and my premium went up by almost $200 and no one told me about this. I have the same policy I've always had. I didn't change anything. The website said the new rate was because of a coverage addition but I never asked for that… I've been with you for 11 years. I'd like to understand why this happened and if there's something I can do about it."
AWS Generative AI Competency Partner and Amazon Connect Service Delivery Partner. P3Fusion builds AI solutions on top of existing contact centre infrastructure — adding intelligence without disrupting operations.
Want post-call intelligence on Connect without changing agent workflows?





