Back to Blog

Your Retention Curve Looks Fine. Your Cohort Analysis Disagrees.

Product Akif Kartalci 17 min read
cohort retention analysisSaaS retentiongross revenue retentionNRR masking churncohort divergenceretention benchmarksSaaS churn
Your Retention Curve Looks Fine. Your Cohort Analysis Disagrees.

Most SaaS founders I know have done a version of this: they pull the retention curve, see it dropping in the first 90 days, notice it seems to be flattening out, and conclude things are manageable. Growth is still happening. Nobody’s panicking. The narrative around the numbers holds.

Then they run a proper cohort retention analysis.

The aggregate curve that looked reasonable is actually averaging together three or four completely different customer cohorts sitting on top of each other. The customers you acquired two years ago are staying at 92% annually. The customers you acquired in the last two quarters are churning at 4.2% per month. The blended rate looks fine. The underlying story isn’t.

This is the most common retention misdiagnosis I see in B2B SaaS, and it costs companies real money. By the time a deteriorating cohort trend surfaces in your top-line retention number, you’re already 6 to 12 months behind on the fix. Cohort-level trends are visible in cohort data 60 to 90 days before they show up in aggregate metrics. If you’re only watching aggregate, you’ve already lost the early warning window.

This post is about what your curve is hiding and how to find it. Not churn prevention tactics, which we’ve covered separately in the SaaS churn prevention framework. The diagnostic that has to come first.

Why aggregate retention curves lie

The mechanics of aggregate churn calculation create a structural measurement problem. It’s not noise in the data; it’s how the math works.

When you calculate an aggregate churn rate, you measure customers who were active at the start of a period and are still active at the end. That calculation seems obvious. The problem is what the denominator excludes: every customer who churned before your measurement window opened.

Dave Kellogg called this out years ago in his post on survivor bias in churn calculations, and it’s still one of the clearest explanations of why aggregate churn rates are systematically understated. Your denominator is a cohort of survivors. The customers you’re measuring against are, by definition, the ones who didn’t leave. Early churners are invisible. The selection pressure in that denominator makes your current retention rate look better than it is, and more importantly, it means you’re measuring the retention quality of your stickiest customers rather than your customer base as a whole.

The second problem is cohort mixing. An aggregate retention curve averages potentially dozens of monthly cohorts, each with a distinct retention profile. Customers who shaped your product through early feedback might retain at 90%+. Customers you acquired through a new paid channel eight months ago might be at 71%. The aggregate blends them into something that looks plausible but represents nobody’s actual experience.

The third problem is what I think of as the leaky bucket illusion. If you’re growing at 12% month over month and churning at 8%, net MRR is still rising. The curve looks fine. But you’re replacing 8% of your customer base every 30 days. That acquisition burden quietly crushes unit economics: CAC efficiency falls, payback periods lengthen, LTV forecasts get built on a foundation that’s eroding. The compound effect isn’t visible until you run the cohort numbers or notice that your CAC keeps rising while the business seems to be growing.

The four retention curve shapes

Reforge’s retention framework is the clearest taxonomy I’ve seen for reading curve shapes. Knowing which one you’re looking at tells you what question to ask next.

Shape 1: Declining to zero. The curve drops continuously with no floor. Month 3 is worse than month 1, month 9 is worse than month 6. This is a product-market fit problem. Customers are trying the product and deciding it doesn’t solve their problem well enough to stay. Onboarding tactics and CS playbooks won’t fix this curve. The product strategy has to change.

Shape 2: Declining to a flat floor. The curve drops steeply in the first 60 to 90 days, then stabilizes. This is the target state. The initial drop represents customers who were never going to activate: wrong ICP, poor onboarding match, misaligned expectations. Once they’re gone, the remaining segment has real, durable use cases. The higher the floor, the stronger the PMF signal.

Shape 3: Smiling curve. Drops initially, then rises. Rare, and almost always a network effects signal. Slack and Airbnb show this pattern. As customers use the product and bring in others, retention improves rather than decays. Most B2B SaaS products won’t see this without genuine network dynamics built in.

Shape 4: Gradual long-tail decline. This is the one that fools founders. The curve doesn’t crash. It doesn’t flatten. It slides 1 to 2 percentage points per month, every month, for 18 months. At any given point the slide looks manageable. Compounded over a year and a half, you’ve lost 25 to 30% of your customer base in what read like controlled churn.

Founders who see Shape 4 usually describe it as “healthy churn with normal attrition.” It isn’t. A curve that never develops a floor means the product isn’t generating durable habit. Customers stay until something better appears, or until a budget cycle, or until a champion leaves. There’s no retention floor because there’s no sticky behavior anchoring continued use.

Here’s a summary of the four shapes and what each one signals:

Curve ShapePatternWhat It MeansThe Fix
Declining to zeroContinuous drop, no floorNo PMF: customers aren’t getting lasting valueProduct strategy, not onboarding
Declining to flatSteep drop then stabilizesPMF found: misfit customers churn, core ICP staysRaise the floor by improving ICP fit and activation
Smiling curveDrop then risesNetwork effects: product value grows with adoptionLean into the network mechanic that’s driving re-engagement
Gradual long-tailSlow, steady monthly declineNo habit formation: no durable workflow anchors the productHabit formation audit: what do month-18 customers do that month-4 churners didn’t?

The diagnostic question for Shape 4: what specific workflow does a customer need to be running at month 18 to still be paying? If you can’t answer that with precision, you’ve identified where to look.

Five layers of hidden retention risk

Aggregate curves hide specific problems. Each of the following can exist in your business right now while the top-line retention number looks stable.

Layer 1: Cohort divergence

The most important retention analysis you can run is a cohort overlay: take your customers grouped by acquisition month and plot their retention curves on the same graph.

If cohorts from 18 months ago and cohorts from 3 months ago track similarly, your retention is stable. If recent cohorts show a steeper drop, a lower floor, or continued decline where earlier cohorts flattened out, you have cohort divergence.

OpenView’s research found that companies with improving cohort retention grow at 2x the rate of companies with declining cohort performance. The key word is “improving,” which means tracking and catching divergence early, not looking at aggregate numbers and assuming trends.

Here’s a pattern I see repeatedly: a Series B SaaS discovers through cohort analysis that customers acquired 18+ months ago churn at under 1% monthly while customers from the last two quarters churn at 4%+ monthly. The blended rate sits around 2.3%, which passes the “healthy B2B SaaS” test. In reality the business has two completely different products, from the customer’s perspective. One of them generates durable value. One doesn’t. The company has been funding the broken version for 18 months without knowing it, because the aggregate was letting it hide.

The early warning gap is what makes this so costly. Cohort divergence becomes visible in cohort data 60 to 90 days before it appears in aggregate metrics. Running cohort overlays monthly means you catch the deterioration at month 1 rather than month 4.

Layer 2: NRR masking gross churn

Net Revenue Retention is the metric most founders present to boards. It’s also the one most often misread.

NRR includes expansion: upgrades, seat additions, upsells. When expansion is strong, NRR can look excellent even with a serious gross churn problem. The board sees 118% NRR and treats retention as a strength. What they haven’t seen is GRR: Gross Revenue Retention, which counts only retained revenue without expansion, capped at 100%.

Consider a company reporting 120% NRR with 75% GRR. They’re losing 25% of their base revenue annually. The expansion engine is covering it completely, and the metric looks healthy. But the company is in a fragile position. If expansion slows for a quarter (a sales team reorg, a product delay, budget freezes across the customer base), both metrics deteriorate simultaneously, because the expansion that was covering gross churn is no longer there to cushion it.

Bessemer’s three-tier NRR framework is the standard board language: 100% is good, 110% is better, 120%+ is best. These benchmarks are useful but incomplete without companion GRR numbers. SaaS Capital puts median GRR for B2B SaaS in the 88 to 92% range. When you’re well below that while reporting healthy NRR, you have an expansion-dependency problem.

The diagnostic question: if your expansion engine stopped today, what would NRR be? If the answer is below 90%, your “strong retention” is one organizational failure away from looking like an acute churn problem.

A 10-point NRR improvement translates to a 20 to 30% valuation uplift, per m3ter’s 2026 analysis. That math works in both directions.

Layer 3: Survivorship bias in the denominator

This is a technical measurement issue, but it compounds over time in ways that matter for how you read trend data.

When you report that your churn rate this month was 3.2%, the denominator of that calculation is the count of active customers at the start of the month. That number excludes every customer who churned earlier in the year before reaching this measurement window. The denominator is biased toward your stickiest customers by construction.

The practical effect: as your customer base matures, the denominator quality increases. Early customers who are still active have survived multiple renewal cycles. Newer cohorts that drive actual churn risk are a smaller share of the denominator. Your aggregate retention rate can improve over time even when nothing structural has changed about your product’s ability to retain new customers.

The test: compare your aggregate churn rate against your first-90-day churn rate, calculated separately. The gap between those two numbers tells you how much of your apparent retention strength is a survivorship artifact rather than actual product stickiness.

Layer 4: The Quick Ratio signal

Mamoon Hamid’s Quick Ratio is one of the clearest single signals for the quality of growth hiding behind NRR:

Quick Ratio = (New MRR + Expansion MRR) / (Churned MRR + Contraction MRR)

The VC benchmark: a Quick Ratio of 4 is worth funding. Below 1 means you’re contracting. Between 1 and 4, you’re growing but burning fuel inefficiently.

What this reveals that NRR doesn’t is the quality of growth. A company with 115% NRR could have a Quick Ratio of 1.9 (massive expansion on top of massive gross churn) or a Quick Ratio of 8 (modest expansion on top of near-zero churn). Those are completely different businesses. The number that looks the same on the board deck represents entirely different underlying dynamics.

Track Quick Ratio alongside NRR and GRR. A falling Quick Ratio with a stable or rising NRR is a specific red flag: churn is accelerating and expansion is compensating. That’s the expansion-dependency dynamic getting worse, not better.

Layer 5: Behavioral cohort signals

Revenue churn lags usage churn by months. The customer who doesn’t renew in November decided in August. The decision in August was shaped by usage behavior that shifted in May. By the time the revenue impact shows up, the window for behavioral intervention is usually gone.

Behavioral cohort analysis segments customers by what they did, not when they signed up. The patterns that predict churn are almost always invisible in revenue metrics:

  • Customers who reduce their core action frequency by 40%+ in any 30-day window churn at roughly 3x the rate of the customer base overall
  • Enterprise customers who don’t schedule a QBR in the first 90 days show 8x higher churn in months 4 through 12
  • Customers who never adopt a second feature beyond the core use case churn at around twice the rate of multi-feature customers

These signals live in your product analytics, not your billing system. Revenue-only retention metrics won’t surface them.

There’s a direct connection here to activation: the customers who churn at month 4 and month 6 often have a behavioral warning that was visible in week 2. That’s the argument in the 72-hour activation window post: activation data and renewal risk connect through the same behavioral thread. Cohort retention analysis is what lets you trace that thread across the full customer lifecycle.

Running your SaaS cohort analysis: step by step

Here’s the exact diagnostic process we walk clients through when we’re running a first cohort audit.

Step 1: Choose your retention measure. Revenue retention (MRR-based) and customer retention (account-based) tell different stories. Revenue retention weights large accounts; customer retention weights frequency and breadth. Run both. They’ll diverge in ways that surface segment-level issues: if customer retention is falling faster than revenue retention, your big accounts are masking widespread churn in the long tail.

Step 2: Group by acquisition month. Pull all customers from the last 24 months. Group them into monthly cohorts by first payment date (or free-to-paid conversion if you’re tracking activation separately). You’ll have 24 cohorts.

Step 3: Build the retention table. For each cohort, calculate retention at months 1, 3, 6, and 12. Month 1 measures onboarding success. Month 3 measures habit formation. Month 6 measures product stickiness. Month 12 measures renewal dynamics.

Here’s a simplified version of what the table looks like when cohort divergence is present:

CohortMonth 1Month 3Month 6Month 12
Jan 202494%87%82%79%
Jun 202491%83%76%71%
Jan 202588%77%69%
Mar 202583%70%
May 202578%

Read this table diagonally, not horizontally. The diagonal trend is the cohort divergence signal: month 1 retention dropping from 94% to 78% across 18 months of consecutive cohorts. Each new customer group retains worse at the same time horizon than the group before it. That pattern is invisible in aggregate churn because the stable early cohorts drag the average up.

Step 4: Separate GRR from NRR by cohort. For each cohort, calculate both GRR and NRR at the 6 and 12-month marks. The gap between them is expansion contribution per cohort. If the gap is widening over time (more expansion needed to maintain NRR), the expansion-dependency dynamic is growing.

Step 5: Map cohort performance to causes. Cohort divergence almost always has a specific root cause. The most common ones: a pricing tier change that attracted smaller, less sticky customers; a new acquisition channel with worse ICP fit; a product change that reduced core feature frequency; GTM motion drift as the sales team moved upmarket or downmarket without adjusting onboarding. Once you know which cohorts are deteriorating, the question becomes what changed in product, market, or GTM at the time those cohorts were acquired.

This is where the connection to the 5 metrics that actually predict SaaS growth becomes concrete: the cohort diagnostic links directly to CAC, LTV, and payback period per cohort segment. The full unit economics impact of divergence becomes visible once you’re looking at cohorts rather than averages.

SaaS retention benchmarks by stage

What does “fine” actually mean? The data from ChartMogul, SaaS Capital, and Bessemer gives you something to anchor against.

ARR StageMedian GRRTop-Quartile GRRMedian NRRTop-Quartile NRR
Under $1M ARR78-84%88%+95-102%108%+
$1M-$10M ARR83-88%90%+98-108%112%+
$10M-$50M ARR87-91%93%+105-115%120%+
$50M-$150M ARR90-93%95%+110-120%125%+

A few things worth noting about these numbers.

GRR benchmarks are lower at early stages because smaller companies carry more exposure to ICP mismatches and onboarding failures. That’s not an excuse for early-stage churn; it’s context for prioritizing. The fix is harder to implement at scale.

The gap between “median” and “top-quartile” is real and worth understanding. Top-quartile at $10M to $50M ARR retains 93%+ of gross revenue annually. That’s what a well-calibrated retention system looks like in practice, not 110% NRR with unknown GRR underneath it.

ChartMogul’s 2023 data shows that even top-quartile companies didn’t universally reach 100%+ NRR thresholds in 2024. NRR benchmarks have compressed since 2021 to 2022. “Good” NRR in 2022 was 120%+; in 2024, achieving 107% puts you in the top quartile for your stage. The goalposts shifted.

The benchmark question for your business isn’t whether you’re above the median. It’s whether you understand why you’re where you are, and what the cohort data says about which direction you’re moving.

The 30-day retention diagnostic

If you’re not currently running cohort analysis, here’s how to build the picture in one month.

Week 1: Build the cohort table. Pull 24 months of cohort data from your billing system (Stripe, Chargebee, or your data warehouse). Build the retention table using months 1, 3, 6, and 12 by cohort. Look for the diagonal trend. Any consistent drop across successive cohorts at the same time horizon is your first signal.

Week 2: Separate GRR and NRR by cohort. Calculate GRR and NRR separately for each cohort at 6 and 12 months. Map the gap (expansion contribution) over time. A widening gap means expansion is covering an increasingly large gross churn problem.

Week 3: Add behavioral signals. Identify your three strongest behavioral predictors of churn from product analytics: the specific actions that retained customers take and churned customers don’t. Calculate the behavioral cohort retention rate for each. The customers who are behaviorally at risk but haven’t churned yet are your predictive intervention window.

Week 4: Root cause and prioritization. Map cohort performance to causation. For any cohort showing divergence, trace it back: what acquisition source, ICP segment, or product change was active when those customers signed up? Build 3 to 5 hypotheses about why recent cohorts are deteriorating. Each hypothesis should point to a testable intervention: an ICP refinement, an onboarding change, a product fix, a pricing adjustment.

This four-week process gives you the retention picture your aggregate curve has been obscuring. Once you have it, the tactical work of fixing churn has actual ground to stand on.

What to do with what you find

Cohort analysis is diagnostic. The output is a map of where your retention health stands and where it’s heading. What you do with it depends on what you find.

If you find cohort divergence: the first intervention is almost always an ICP and onboarding audit. Are recent customers a worse product fit than earlier customers? Did onboarding degrade as the team scaled? These two causes account for the majority of cohort divergence patterns I’ve seen.

If you find NRR masking gross churn: the intervention is a GRR improvement program before expansion-dependency becomes structural. That usually means tightening ICP criteria to reduce misfit acquisition, fixing onboarding so more customers reach the first value moment, and building product habits that don’t require expansion to compensate for base churn.

If you find the gradual long-tail decline (Shape 4): the root cause is almost always weak habit formation. Something is stopping customers from building a durable workflow around the product. The intervention is a habit formation audit: what are the customers who are still paying at month 18 doing that month-4 churners weren’t doing?

If you find survivorship bias inflating your metrics: stop reporting aggregate churn as a health signal. Report first-90-day churn and post-90-day churn as separate numbers, and track both over time. The two metrics represent fundamentally different problems.

There’s a compounding math point worth sitting with before closing. At 2% monthly churn, customer LTV is roughly $4,000 at $80 ARPU. At 4% monthly churn, LTV drops to $2,000. At 6%, it’s around $1,333. A 4-percentage-point swing in monthly churn is a 3x LTV difference. That’s not a rounding error; it’s the difference between a business that compounds and one that grinds.

The customers who leave at month 4 while your aggregate curve looks fine represent that math in action. The aggregate made the problem invisible. The cohort data makes it specific, timed, and fixable.

If you want to run this diagnostic on your own retention system, the Momentum Nexus growth audit includes a full cohort breakdown, GRR and NRR disaggregation, and a root-cause map of where your churn risk actually lives. Book a free session and we’ll tell you what your curve is hiding.

Ready to Scale Your Startup?

Let's discuss how we can help you implement these strategies and achieve your growth goals.

Schedule a Call