Claude is excellent at summaries. It's also wildly variable in cost depending on how you pipe context in.
Run the free audit →The biggest summarization cost leak: chunked pipelines that re-read the same context in each chunk. Audit reveals this fast.
Three patterns dominate the waste on this workload: (1) context that gets re-sent on every request; (2) retry loops on parse failures or validation errors; (3) agent/reviewer chains where intermediate steps re-read content they already processed. AIUsage's audit identifies which of the three is eating your bill.
Across six audited Claude workloads, the measured savings were 76–84% on the same prompts, blind A/B tested.
Audit my own summarization workload →