Reply rate — Glossary

01

Plain English.

TLDR

Reply rate is the percentage of emails that get a human response back. That's the honest definition. The dishonest version — the one most public benchmarks quote — counts every email that triggered anything in your inbox: out-of-office replies, "please remove me," literal bounce notifications. The two numbers can differ by 3–5×.

If 100 cold emails go out and 12 things land in your inbox in response, your sequencer probably shows "12% reply rate." But if 6 of those were out-of-office auto-responders, 3 were "unsubscribe me" requests, and 1 was a hard bounce that got mis-classified, your actual reply rate is 2%. The 12% is what looks good on a screenshot. The 2% is what predicts pipeline.

This matters because reply rate is the most-quoted metric in outbound, the basis for compensation conversations, and the number most teams optimize against. Optimizing against the wrong number for a quarter is how outbound desks end up with a healthy dashboard and no pipeline.

02

What counts vs what tools count.

The gap between the honest reply rate and the dashboard reply rate isn't a software bug — it's a UX choice the sequencer vendors made because higher numbers look better in product screenshots. Side-by-side:

What lands in your inbox after a cold-email send: honest vs sequencer-counted

| Inbox event | Honest reply? | Counted as reply? |
|---|---|---|
| Genuine human reply ("interesting, let's talk") | ✓ yes | ✓ yes |
| Genuine human "no thanks" (still a real human signal) | ✓ yes | ✓ yes |
| Out-of-office auto-responder ("I'm away until Aug 12") | ✗ no | ✓ yes |
| "Please unsubscribe" / "remove me" (opt-out request) | ~ depends | ✓ yes |
| "Wrong person, try Sarah" (referral; counts as soft win) | ✓ yes | ✓ yes |
| Hard bounce (NDR), mis-classified (seq UI bug) | ✗ no | ~ sometimes |
| Mailbox-full bounce (not a reply) | ✗ no | ~ sometimes |
| Spam-filter quarantine notice (adversarial signal) | ✗ no | ~ rarely |

The honest reply-rate number for a sequence is roughly: (genuine human replies + "wrong person" referrals) ÷ sent emails. Most sequencer dashboards show: (any inbox event, including OOOs, NDRs, and unsubs) ÷ sent emails. The two are not the same number. They can diverge by 3–5×, especially in summer, when OOOs spike.
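The two formulas can be compared side by side with a minimal sketch. The event labels below (`human_reply`, `out_of_office`, and so on) are hypothetical names chosen for illustration; a real sequencer export will use its own. The sample data reuses the 100-email example from the first section, and both rates are divided by sent emails.

```python
from collections import Counter

# Hypothetical labels for events that count under the honest definition:
# genuine human replies (including a polite "no thanks") and referrals.
HONEST_REPLY_KINDS = {"human_reply", "human_no_thanks", "wrong_person_referral"}


def reply_rates(events, sent):
    """Return (dashboard_rate, honest_rate) as fractions of emails sent."""
    if sent <= 0:
        raise ValueError("sent must be positive")
    counts = Counter(events)
    # Dashboard view: every inbox event counts as a reply.
    dashboard = sum(counts.values()) / sent
    # Honest view: only human replies and referrals count.
    honest = sum(counts[kind] for kind in HONEST_REPLY_KINDS) / sent
    return dashboard, honest


# 100 emails sent, 12 inbox events: 6 OOOs, 3 unsubs, 1 bounce, 2 real replies.
events = (
    ["out_of_office"] * 6
    + ["unsubscribe"] * 3
    + ["hard_bounce"]
    + ["human_reply"] * 2
)
dashboard, honest = reply_rates(events, sent=100)
print(f"dashboard: {dashboard:.0%}, honest: {honest:.0%}")
# prints "dashboard: 12%, honest: 2%"
```

Whether an opt-out request belongs in the honest set is a judgment call (the table marks it "depends"); this sketch leaves it out.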

03

Five ways the number gets gamed.

The most common patterns in the wild — observed across the outbound corpora we have audit access to. Some are tooling defaults; some are deliberate.

Auto-responder inflation. The largest single contributor. Every OOO email counts as a "reply" in most sequencer UIs. During Q3 (summer holidays) and December, OOO volume can be 2–4× higher than baseline — inflating reported reply rates by ~ 4–8 percentage points across the same campaign.
Cherry-picked sequence selection. The "we get 18% reply rates" claim usually means "our best-performing sequence in our best month had 18% replies." Asking for the trailing-12-month median across all sequences typically returns half that number. Public benchmarks are nearly always cherry-picked.
Step-1-only reporting. Reply rates degrade sharply by step (step 1: 4–8%, step 2: 1–2%, step 3: < 1%). Quoting step-1 reply rate as "campaign reply rate" inflates the number by ~ 2×. Honest reporting uses campaign-level (all-steps) reply rate.
Excluding bounces from the denominator. Reply rate is replies ÷ sent. Some teams quietly switch the denominator to delivered, which makes the rate look better when delivery is low. Both are valid choices, but changing the denominator without noting it is a tell.
Treating "unsubscribe please" as engagement. An "unsub me" reply is a real signal — but it's a negative one. Counting it as engagement is technically correct (the human did engage) but misleading when used to justify campaign performance. Most public reply-rate claims include unsub replies in the numerator.
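The denominator switch is easy to see with a few lines of arithmetic. This is a sketch with made-up numbers (1,000 sent, 200 bounced, 40 replies), not figures from any corpus; the function name is hypothetical.

```python
def rates_by_denominator(replies, sent, bounced):
    """Return (replies / sent, replies / delivered) for the same campaign."""
    if sent <= 0 or not 0 <= bounced < sent:
        raise ValueError("need sent > 0 and 0 <= bounced < sent")
    delivered = sent - bounced
    return replies / sent, replies / delivered


# Illustrative numbers only.
by_sent, by_delivered = rates_by_denominator(replies=40, sent=1000, bounced=200)
print(f"replies / sent:      {by_sent:.1%}")       # 4.0%
print(f"replies / delivered: {by_delivered:.1%}")  # 5.0%
```

Nothing about the campaign changed between the two lines; only the denominator did. The worse the deliverability, the wider the gap.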

The clean test

When a vendor or operator quotes a reply rate, ask three questions: (1) Step-1 only or all steps? (2) OOOs and bounces included or excluded? (3) Best month or trailing-12-month median? If the answer to all three is the first option, divide the quoted number by 3 to get something closer to what it would mean in your campaigns.

04

Honest benchmarks by signal anchor.

What "good" actually looks like, measured honestly (OOOs excluded, all-steps, trailing-12-month median across our own corpus and four customer corpora). Variance comes from ICP fit, send timing, and how strong the signal anchor is.

Reply rate · honest definition · by opener type (our corpus + 4 customers · n ~ 18,000 sends)

| Opener type | Description | Reply rate | Range |
|---|---|---|---|
| Pure cold (no signal anchor) | generic ICP-templated opener | 1.2% | 1.0–2.5% |
| Lightly personalized cold | name + role mentioned | 2.4% | 2.0–3.5% |
| Signal-anchored cold | funding / tech / hiring | 5.6% | 4.5–8.5% |
| Strong signal + receipts test passed | weeks 5–8 timing, specific quote | 8.4% | 7.5–13% |
| Multi-signal stacked | 2+ signals on same account, named | 11.8% | 10–16% |

The numbers most outbound vendors quote ("our customers see 25%+ reply rates!") are nearly always cherry-picked best-month, step-1, OOO-included variants. The 10–16% range at the top of the benchmarks above is what's achievable on the honest definition with disciplined signal-anchored outbound. Above ~18%, honestly measured, you're probably looking at warm outbound or a misclassified metric.

05

How to measure it honestly.

The discipline is simple and costs nothing more than the willingness to look at a smaller number on your dashboard.

Filter OOOs out of the reply count. Every major sequencer supports auto-reply detection. Turn it on. The number drops; the number becomes useful.
Report campaign-level, not step-level. "Reply rate" should mean "of the people who entered this sequence, what % responded at any step." Step-1 is interesting but secondary.
Use replies ÷ sent, not ÷ delivered. Delivered is endogenous — your deliverability problems become your reply rate problems. Sent is the honest denominator.
Exclude unsubs from the numerator. They're engagement, but they're not the engagement you're optimizing for. Track them separately.
Report trailing-12-month median, not best month. Best month is marketing copy; median is what predicts next quarter.

Teams that adopt the honest definition typically see their reported reply rates drop by 40–60% in week 1. Don't flinch. The actual performance hasn't changed — only the measurement honesty has.
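The measurement rules above can be sketched in a few lines. Everything here is an assumption made for illustration: the record layout (one dict per prospect, with every reply received across all steps), the reply labels, and the monthly rates. The denominator is every prospect who was sent the sequence, bounced or not.

```python
from statistics import median

# Hypothetical labels for replies that count under the honest definition.
# OOOs, bounces, and unsubs are deliberately absent; track those separately.
HONEST_REPLY_KINDS = {"human_reply", "wrong_person_referral"}


def campaign_reply_rate(prospects):
    """Share of prospects sent the sequence who replied honestly at any step."""
    if not prospects:
        raise ValueError("no prospects in campaign")
    responders = sum(
        1 for p in prospects if HONEST_REPLY_KINDS & set(p["replies"])
    )
    return responders / len(prospects)


def trailing_median(monthly_rates, window=12):
    """Median of the most recent `window` monthly reply rates."""
    if not monthly_rates:
        raise ValueError("no monthly rates")
    return median(monthly_rates[-window:])


# Illustrative data only.
prospects = [
    {"id": "a", "replies": ["out_of_office"]},
    {"id": "b", "replies": ["human_reply"]},
    {"id": "c", "replies": []},
    {"id": "d", "replies": ["unsubscribe"]},
    {"id": "e", "replies": ["out_of_office", "wrong_person_referral"]},
]
monthly = [0.031, 0.028, 0.090, 0.035, 0.030, 0.027,
           0.033, 0.029, 0.032, 0.034, 0.026, 0.036]

print(f"campaign-level rate: {campaign_reply_rate(prospects):.0%}")
print(f"best month: {max(monthly):.1%}, "
      f"trailing median: {trailing_median(monthly):.2%}")
```

In the made-up monthly series, one outlier month is roughly three times the median, which is the gap between marketing copy and what predicts next quarter.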

06

How Mama measures it.

Mama doesn't send the email — your sequencer does — so we don't own the reply-rate reporting directly. But when we publish reply-rate data on this site (in playbooks, in customer stories), here's the discipline we use:

OOOs excluded from the numerator. Always.
Campaign-level reporting. Step-1 numbers are called out separately when relevant.
Trailing-12-month median where possible, or a named window for shorter datasets.
Sample size disclosed. A 14% reply rate across 40 sends is anecdote; across 2,000 sends it's evidence.
Source of data disclosed. Our outbound, customer outbound (anonymized with permission), or industry corpora — never mixed without note.
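Why sample size matters can be made concrete with a confidence interval. The sketch below uses the Wilson score interval, which is one common choice for proportions and an assumption on our part here, not a method the list above prescribes. It takes the observed rate and the number of sends.

```python
import math


def wilson_interval(rate, n, z=1.96):
    """Approximate 95% Wilson score interval for an observed rate over n sends."""
    if n <= 0 or not 0 <= rate <= 1:
        raise ValueError("need n > 0 and 0 <= rate <= 1")
    z2 = z * z
    denom = 1 + z2 / n
    centre = (rate + z2 / (2 * n)) / denom
    half = z * math.sqrt(rate * (1 - rate) / n + z2 / (4 * n * n)) / denom
    return centre - half, centre + half


lo40, hi40 = wilson_interval(0.14, 40)
lo2000, hi2000 = wilson_interval(0.14, 2000)
print(f"14% over 40 sends:    {lo40:.1%} to {hi40:.1%}")
print(f"14% over 2,000 sends: {lo2000:.1%} to {hi2000:.1%}")
```

At 40 sends the interval runs from roughly 6% to 28%, so the same "14%" is compatible with a mediocre campaign or an exceptional one. At 2,000 sends it narrows to roughly 12.5% to 15.6%.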

The numbers in our playbooks are honestly measured, which is why they look smaller than the marketing-copy versions you'll see elsewhere. The smaller honest number is the one your finance team should trust.

If you've read this far

Most reply-rate numbers you've seen are wrong. Yours doesn't have to be.

The discipline is twenty minutes of sequencer configuration. The payoff is a metric you can trust against pipeline. If you're using Mama-anchored openers, the honest numbers are in our playbooks — no inflation, no cherry-pick.

Last revised May 22, 2026. The benchmark data is updated quarterly from our own corpus and four customer corpora (anonymized, with permission). If you'd like to contribute a campaign or push back on a number, write to [email protected].
