Follow-Up Summaries and CRM Hygiene
After this you can put a verification gate between any AI-generated field and the CRM it writes to, so auto-drafted follow-ups and enrichment land as proposals a human clears before they become shared truth.
The tempting setup is a clean loop: the call ends, AI summarizes it, extracts the next step and the budget number and the renewal date, and writes all of it straight into the CRM. Nobody retypes anything. The summary is usually good. The problem is the word "usually," because the CRM is not your scratchpad. It is the shared record every other seller, every forecast, every routing rule, and every renewal play reads from later, and they read it as fact. A wrong field you typed yourself is one person's mistake. A wrong field an AI wrote with full confidence is a mistake the whole institution now trusts.
This is why CRM hygiene stops being a tidiness chore the moment AI can write to it at volume. The same automation that de-dupes contacts and enriches stale records can just as easily flood the system with plausible-but-wrong values, and the downstream cost is asymmetric. Consider an enrichment step running at 85% field-level accuracy. That sounds like a strong tool until you sit with the other number: roughly one in seven rows is wrong, and nothing in the record tells you which ones. The 15% does not arrive flagged. It arrives looking exactly like the 85%, gets auto-actioned into a sequence or a forecast, and the next seller who opens that account inherits a confident wrong title, a decayed email, or a renewal date that was never said on the call. Bad auto-logged data does not announce itself. It compounds quietly, because every reader after the write assumes the write was checked.
The fix is not to keep AI away from the CRM. Drafting follow-ups and proposing field updates is exactly the mechanical work it should carry. The fix is to move the human review to before the write instead of after the damage, and to gate the write on confidence rather than trusting every extraction equally. A field the model is sure about — the meeting happened, the attendee's name as spoken — can write through. A field it inferred or half-heard — the deal value, the decision date, the competitor named in passing — drops below the threshold and routes to a human who confirms or corrects it in seconds. The same logic governs enrichment, which is why operators run it as a waterfall: query provider A, take the hit; on a miss fall through to B, then C, first valid result wins, because no single data source has good coverage and B2B contact data decays continuously as people change jobs. The waterfall is cheaper too — stacked providers run roughly $0.08 to $0.22 per enriched row against about $0.52 for a single premium source, a 6.5x penalty for pretending one provider is enough. But cost is the minor point. The real reason to stack and to re-fire enrichment on job-change and new-hire signals is that a CRM is a decaying asset, and the gate is what keeps the decay from being silently overwritten with confident garbage.
Where it breaks
The gate only earns its keep where the cost of a wrong write is real and downstream readers actually exist. On a solo book where you are the only person who ever reads the record, the verification step is overhead — you already know what was said, and a heavy review queue just slows you down. Push the discipline up only as far as the blast radius justifies it. The opposite failure is more common and worse: a confidence threshold set so loose it gates nothing. If the model self-reports high confidence on everything, including the figures it half-heard, the gate becomes theater and the wrong rows write through exactly as before. Model-reported confidence is a weak signal here; it is calibrated to sound sure, not to be right. So the threshold has to be tuned against fields you can check against a second source, not trusted on the model's own say-so. And the gate guards the write, not the extraction. It catches a low-confidence value before it poisons the record, but a confidently-wrong high-confidence value — the renewal date the model heard clearly and got wrong anyway — sails through, which is why high-stakes fields like contract value or close date stay on human review regardless of what confidence the model reports.
Use this as the system prompt for a post-call agent that drafts the follow-up and proposes CRM updates without writing anything itself. The last block is the gate: it forces every field to carry a confidence label and routes the uncertain ones to you before they touch shared data.
You are a post-call assistant. From the transcript and notes below, produce two things.
1) A follow-up email draft to the prospect: recap what was agreed, restate the next step and owner, and surface any open question. Plain, specific, no filler.
2) A proposed CRM update as a table, one row per field, columns:
field | proposed value | confidence (high / medium / low) | source quote from the transcript
Rules for the CRM table:
- Quote the exact transcript line that supports each value. No quote = do not propose the field.
- Mark confidence LOW for anything inferred, paraphrased, or heard once in passing
(deal value, close/renewal date, competitor names, budget, authority).
- Mark confidence HIGH only for facts stated plainly and unambiguously
(meeting occurred, attendee names as spoken, the agreed next step).
- Do not invent values to fill a field. Leave it out and say what's missing.
Write NOTHING to any system. Output the draft and the table only.
GATE before anything writes to the CRM:
- HIGH-confidence fields may write through.
- MEDIUM or LOW fields, and ALL of {deal value, close date, renewal date, authority},
hold for human review and confirmation first — regardless of stated confidence.Notice what the source-quote column forces: the model has to point at the transcript line that justifies each value, and a value with no line behind it is exactly the one you do not want auto-logged. The confidence labels are directional, not the gate on their own. The gate that actually holds is the source-quote requirement plus the always-review rule on deal value, close date, renewal date, and authority, because the model's self-reported confidence is calibrated to sound sure, not to be right. The high/low split just turns "the model is usually right" into "here is precisely which fields I would let through and which I would not."
Worked example
IllustrativeIllustrative. A constructed post-call output, not a real account.
Discovery call with a mid-market ops lead. The rep's notes are thin; the transcript runs long. The agent returns a follow-up draft and a proposed CRM table.
Follow-up draft
Hi Dana — thanks for the time today. To recap: you're evaluating a replacement for your current routing setup, and the blocker is that leads sit unassigned over weekends. Agreed next step is a technical walkthrough with your RevOps lead the week of the 16th — I'll send three slots tomorrow. Open question on your side: whether the eval needs sign-off from finance before a pilot.
Proposed CRM update
field proposed value confidence source quote Next step Technical walkthrough, week of 16th high "let's get my RevOps lead on a walkthrough that week" Pain point Weekend lead-assignment gap high "leads just sit there from Friday to Monday" Competitor (current routing tool, unnamed) low "the thing we have now just isn't keeping up" Deal value $40K ARR low "I'd guess we're in the forty-ish range, don't hold me to it" Close date End of next quarter low inferred from "before our renewal" — no date stated
Two of those fields are the trap. "Deal value: $40K" reads like a clean number a forecast would happily roll up, but the quote behind it is a hedge the prospect explicitly disowned. "Close date" has no source line at all — the model inferred it. Under the gate, both hold for review. The high-confidence rows, the next step and the pain point, write through and the follow-up goes out. What gets stopped is exactly the data that would have looked authoritative to the next seller and been wrong.
The point the example makes concrete is that "summarize my calls and update the CRM" and "verify before the write" are not in tension. The summary is fine. The discipline is deciding, field by field, which extractions have earned the right to become shared truth and which have to be confirmed by the one person who was actually in the room before anyone downstream inherits them.