Cognition AI built Devin, the autonomous AI software engineer, growing from $1M to $73M ARR in just 9 months (September 2024 to June 2025). After acquiring Windsurf's commercial entity in July 2025 (~$82M ARR, 350+ enterprise customers), combined ARR reached an estimated ~$155M. The company raised at $10.2B in September 2025 and is reportedly in talks at ~$25B as of April 2026. The revenue model is usage-based: teams pay per Agent Compute Unit (ACU), where 1 ACU equals roughly 15 minutes of Devin work at $2–2.25 each (~$8–9/hour vs ~$150/hour for a human engineer). Tier entry points are low ($20–500/month), but actual customer spend scales with consumption intensity — active engineering teams can spend $5K–50K+/month on ACU usage. This consumption-based model is structurally closer to AWS/Snowflake than to flat-rate SaaS. Enterprise infrastructure includes VP Enterprise (David Morse, ex-CRO Hebbia), Microsoft Azure marketplace co-selling, four SI partnerships (Infosys, Cognizant, Synechron, WWT), and FedRAMP-authorized government deployment. Named enterprise customers include Goldman Sachs, Citi, Dell, Cisco, Ramp, and Palantir. The company was founded by three IOI gold medalists (Scott Wu, Steven Hao, Walden Yan) and has developed proprietary coding models (SWE-1.5/1.6) claimed to be 13x faster than Claude Sonnet. Total funding exceeds $696M with reported net burn since founding of less than $20M.

ARR

~$155M (combined Devin + Windsurf, Jul 2025)

2025 estimated

Valuation

$10.2B (confirmed); ~$25B in talks (rumored)

Series C (Sep 2025); new round in talks (Apr 2026)

Time to $100M ARR

Not yet confirmed for Devin-only; combined ~$155M at ~11 months from Devin $1M

NRR

Unknown; >5x expansions confirmed by multiple customer case studies (Bilt 3x growth, Itau 1000-dev waitlist, Goldman hundreds→thousands)

partial

GTM Architecture

Wedge	Autonomous AI agent that plans, writes, debugs, and deploys code end-to-end
ICP	Engineering teams at mid-to-large enterprises (primary); individual developers via low entry point (secondary)
Buyer	CTO/VP Engineering (enterprise); engineering team leads (team adoption)
Pilot	Free tier and low-cost entry for initial evaluation; enterprise pilot with real codebase and FDE support; VPC deployment option
Cycle	Initial adoption: minutes to days; Enterprise formalization: est. 1–3 months
Motion	Usage-based adoption: low entry point ($20–500/mo) → teams adopt and ACU consumption grows → enterprise formalization (VPC, SSO, FDE) for large accounts. Enterprise infrastructure: VP Enterprise hire (Nov 2024) → Windsurf acquisition (Jul 2025, 350+ enterprise customers) → Microsoft co-selling → 4 SI partnerships → government (FedRAMP)

Prestige anchor: Goldman Sachs, Citi, Palantir, NASA-JPL as named customers

Domain expert note: All three founders are IOI gold medalists; hired Gennady Korotkevich (8x IOI gold); David Morse (VP Enterprise) ex-CRO Hebbia

Commercial Structure

Pricing	Freemium + usage-based (Agent Compute Units) + enterprise custom · ACU (1 ACU ≈ 15 min Devin work); tiers from $0 to custom enterprise
ACV Range	Usage-based (ACU metered): light teams ~$60K–95K/yr, active teams ~$240K–420K/yr, enterprise $600K–$2.4M+/yr (one banking customer at $1.5M/yr, expanded >10x)
ACV Anchor	Software engineer labor cost ($150K–$300K+/yr); partial task automation ROI
Gross Margin	Unknown (self-reported capital efficiency: net burn <$20M on $696M+ raised) (est)
Payback	Unknown

Competitive Moats

Primary Moat

Proprietary SWE-1.5/1.6 models (950 tok/s, 13x faster than Claude Sonnet); dual-product architecture (Devin agent + Windsurf IDE)

Secondary Moat

FedRAMP High authorization; 350+ enterprise customers (acquired via Windsurf); IOI gold medalist team credibility

Trust Shortcut

IOI gold medalist founders = 'best programmers building programmer replacement'; Windsurf enterprise base = instant credibility

Data Moat

Proprietary code execution and debugging data from Devin sessions (extent unknown)

Exogenous Catalyst

Google acqui-hire of Windsurf CEO created acquisition opportunity at fraction of market value

Pattern Properties

Wedge Clarity	✓
Prestige-First Beachhead	~
Domain-Expert GTM	✓
Proof Before Scale	✓
Labor-Budget Pricing	✓
Expansion Flywheel (NRR >120%)	✓
SOC2/Compliance	✓
Data Non-Training Commitment	~
Citation Traceability	?
Human-in-the-Loop Design	~
Founder-Led Sales Phase	✓
Domain-Expert AEs/CS	✓
Warm-Intro GTM	✓
Paid Pilot	✓
ICP Qualification Discipline	✓
Hyper-Personalized Demo	✓

✓ confirmed · ~ partial · — absent · ✗ explicitly absent

Growth Rates

Year 1: ~7300% (Devin-only: $1M → $73M in 9 months, annualized)

Year 2: Unknown (2026 figures not disclosed)

Year 3: N/A

Full Analysis Memo

Cognition AI (Devin) — Playbook Analysis

Classification: Strategy / Internal Version: V4 — April 2026 (Phase 6 structural rebuild: ACU-economics-centered analysis, corrected GTM classification, usage-based revenue model) Source archive: source-harvest-phase/cognition/ (20 sources, 3 people profiles) Evidence discipline: [E] sourced | [I] inference | [UV] unverified estimate | [OQ] open question

1. Executive Summary

Cognition AI is the company behind Devin, the autonomous AI software engineer. Founded in August 2023 by three IOI gold medalists, Cognition grew Devin's ARR from $1M to $73M in nine months (September 2024 to June 2025) [E: cognition-blog-funding-growth.md] — one of the fastest revenue ramps in the AI startup cohort. After acquiring Windsurf's commercial entity (July 2025, ~$82M ARR, 350+ enterprise customers) [E: cognition-blog-windsurf-acquisition.md], combined ARR reached ~$155M [UV: Sacra estimate]. The company raised at $10.2B (September 2025) [E: cognition-blog-funding-growth.md] and is reportedly in talks at ~$25B (April 2026) [UV: Bloomberg, anonymous sourcing].

GTM classification: Usage-based developer-tool with team adoption and enterprise expansion. Cognition's revenue model is structurally different from both classic PLG (flat seat pricing, individual adoption) and classic enterprise sales (high ACV, long sales cycles). The core economics are metered consumption via ACUs (Agent Compute Units) — teams adopt Devin at a low entry point ($20–$500/mo), but actual spend scales with usage intensity. Heavy engineering teams can spend $5K–$50K+/month on ACU consumption [I: ACU rate × observed usage patterns]. This means the "$500/month team plan" framing dramatically understates real customer economics.

What the founder says about the customer pattern [E: scott-wu-20vc-transcript.md, VERBATIM]: "I think the pattern is it's really like real engineering teams that bring on Devin and they tag Devin all the time in Slack, they tag Devin in Linear... it is not the majority of our usage [as individual self-serve]. I think the majority is really just kind of like real teams using it."

What the founder says about pricing [E: scott-wu-20vc-transcript.md, VERBATIM]: "It's all usage based. It's essentially by the hour. We try to make things so that they're about 10X cheaper than basically the value of your time."

What the founder says about the revenue model design [E: scott-wu-lennys-podcast-notes.md, VERBATIM]: "We have since the beginning have been laser-focused on agentic coding... It goes all the way to even the revenue model with ACUs and having the usage-based setup."

The adoption path: team signs up → engineers start running Devins → ACU consumption grows as usage deepens → enterprise formalization (VPC, SSO, dedicated support) for large accounts. This creates natural revenue expansion without explicit upsell conversations — usage-based billing scales automatically as teams run more Devins on more tasks.

Cognition's enterprise infrastructure includes VP Enterprise (David Morse, ex-CRO Hebbia, ex-VP Sales Scale AI) [E: The Org], Microsoft marketplace co-selling [E: microsoft-customer-story.md], five SI partnerships (Infosys, Cognizant, Synechron, WWT, Endava) [E: si-partnerships-cognizant-synechron.md; enterprise-gtm-operations.md], FedRAMP authorization and government deployment [E: cognition-blog-government.md], active hiring across the full sales stack [E: cognition-careers-sales-hiring.md], and a GTM team described as "formidable" with "strong sales execution capability" [E: swyx-cognition-analysis.md].

The Windsurf acquisition was explicitly a GTM play: Scott Wu stated Cognition was "a little bit behind on growing all the other functions" beyond engineering, and Windsurf brought "go-to-market, marketing, finance, operations" [E: scott-wu-cheeky-pint.md].

Benchmark relevance: Cognition is most instructive as a boundary case in usage-based developer-tool adoption → enterprise layering, acquisition-as-GTM-shortcut, and the VP Sales hire pattern (David Morse from Hebbia, a company already in this benchmark). Its usage-based + enterprise motion is less directly comparable than purely sales-led benchmarks, but specific mechanisms (deployment engineering, pilots, expansion dynamics, SI channel, co-selling) are analytically relevant.

2. Core Motion

The Wedge

Product wedge: An autonomous AI agent that can independently plan, code, debug, test, and deploy software — operating as a "virtual coworker" rather than a code completion assistant [E: scott-wu-cheeky-pint.md, cognition-blog-funding-growth.md].

Differentiation wedge: While competitors (Cursor, GitHub Copilot, Claude Code) operate as assistive tools within the developer's workflow, Devin operates autonomously — you assign it work and it executes independently. The framing is "The Agent" (fully autonomous) vs. "The IDE" (assistive completion) [I: product positioning across multiple sources].

Pain wedge: Software engineering labor is the single largest cost center in technology companies. With median software engineer compensation at $150K-$300K+ (US), even partial automation of software tasks represents massive budget displacement [I: industry context].

The Beachhead Buyer

Usage-based adoption path: Engineering teams adopt Devin at a low entry point ($20–$500/month base), then ACU consumption scales with usage intensity. Wu describes the typical pattern as "real engineering teams that bring on Devin and tag Devin all the time in Slack, tag Devin in Linear" [E: scott-wu-20vc-transcript.md]. The $20/month open entry appeared with Devin 2.0 (April 2025) [E: venturebeat-devin-2.md]; before that, access was gated/early-access at $500/month [I]. But the base tier price is not the revenue driver — metered ACU consumption is.

Enterprise path: CTO/VP Engineering at mid-to-large enterprises. Enterprise tier includes VPC deployment, SAML/SSO, Devin API, dedicated account team [E: devin-pricing-page.md]. Named enterprise customers include Goldman Sachs, Citi, Dell, Cisco, Ramp, Palantir [E: cognition-blog-funding-growth.md].

Government path: Federal agency engineering leadership. FedRAMP High authorization, forward-deployed team, AWS GovCloud [E: cognition-blog-government.md]. Customers: Army, Navy, Treasury, NASA-JPL [E].

The Category Frame

Cognition frames Devin as a new category: the "AI software engineer" — not a copilot, not an IDE plugin, but an autonomous agent peer. This is category creation in the sense that no prior product occupied this exact positioning [I: product messaging analysis]. However, the category is contested — OpenAI Codex, Claude Code, and others are converging on similar autonomous capabilities [I: competitive context].

3. Growth System Decomposition

Acquisition Layer

Channel	Evidence	Role
Usage-based team adoption	Low entry point ($20–$500/mo base) with ACU metered consumption; "real teams" are the majority of usage (Wu, 20VC) [E: scott-wu-20vc-transcript.md; devin-pricing-page.md]	Primary growth mechanism; revenue scales with ACU consumption, not seat count
Founder-led sales	Scott Wu personally involved in early deals; IOI gold medalist credibility [E: multiple podcasts]	Important for marquee enterprise logos
Windsurf acquisition	350+ enterprise customers acquired; <5% overlap with Devin [E: cognition-blog-windsurf-acquisition.md]	Inorganic enterprise base expansion
Microsoft co-selling	Azure marketplace; joint selling with Microsoft Sales + Black Belt teams [E: microsoft-customer-story.md]	Enterprise pipeline amplification
SI channel (Infosys, Cognizant, Synechron, WWT, Endava)	Global enterprise deployment and enablement partnerships [E: cognition-blog-infosys-partnership.md; enterprise-gtm-operations.md]	Deployment capacity + distribution; partner DEs turn pilots into repeatable SI offerings
PR / media coverage	Bloomberg, TechCrunch, SF Standard, major podcast appearances [E: multiple]	Awareness and credibility
Community / word-of-mouth	swyx (AI community figure) joined team; developer community buzz [E: swyx-cognition-analysis.md]	Developer credibility signal

Named enterprise customers [E: cognition-blog-funding-growth.md, cognition-blog-government.md, microsoft-customer-story.md, devin-customer-case-studies.md, goldman-sachs-citi-enterprise.md]:

With detailed case studies (customer-confirmed, L3): Nubank, Itau Unibanco, Ramp, Gumroad, Eight Sleep, FE fundinfo, Bilt, Linktree, Crossmint With press-confirmed deployment details: Goldman Sachs (CIO quote, hundreds of Devin instances), Citi (40,000 developers), Visma (50% cost reduction) Named without detailed metrics: Dell, Cisco, Palantir, Mercado Libre, Santander, NASA-JPL, U.S. Army, U.S. Navy, Treasury, Anduril, OpenSea, Lumos, Cloudflare, Litera, Curai Health

Conversion Layer

Tier structure (entry points, not ACVs): Free → Pro ($20/mo) → Max ($200/mo) → Teams ($500/seat/mo, 250 ACUs included) → Enterprise (custom). These are entry points; actual customer spend scales with ACU consumption beyond included allocations [E: devin-pricing-page.md, devin-deployment-model.md].

Enterprise conversion: Demo → pilot with real codebase → enterprise contract. Cognition's customer-facing Deployed Engineer (DE) owns technical execution across demo, pilot, deployment, activation, training, and expansion; VP Enterprise (David Morse) and the growing sales team handle commercial pipeline [E: cognition-careers-sales-hiring.md, devin-deployment-model.md; enterprise-gtm-operations.md].

Trust mechanisms: 1. Named marquee logos (Goldman Sachs, Citi, Palantir) with CIO-level quotes [E: goldman-sachs-citi-enterprise.md] 2. FedRAMP High + SOC 2 Type 2 + data-never-used-for-training policy [E: devin-deployment-model.md] 3. VPC deployment option with AWS PrivateLink or IPSec tunnel [E] 4. Microsoft co-selling + Infosys, Cognizant, Synechron, WWT, and Endava SI partnerships [E: si-partnerships-cognizant-synechron.md; enterprise-gtm-operations.md] 5. IOI gold medalist founding team = technical credibility anchor [E] 6. 9 published customer case studies with named individuals and specific metrics [E: devin-customer-case-studies.md]

Expansion Layer (V2 upgrade — customer-confirmed evidence)

Aggregate evidence: - >5x contract expansions in successful implementations [E: swyx-cognition-analysis.md] - One banking customer on $1.5M/yr expanded >10x with multiyear commitment in 8 months [E: swyx-cognition-analysis.md] - Enterprise ARR grew 30% in 7 weeks post-Windsurf acquisition [E: cognition-blog-funding-growth.md]

Customer-confirmed expansion patterns [E: devin-customer-case-studies.md]:

Customer	Expansion evidence
Bilt	Dec 2024 pilot → Feb 2025 org-wide → 106 engineers embedded, >50% using weekly, 3x usage growth
Itau Unibanco	Waitlist of ~1,000 developers within 5 months; 75% of teams using Devin
Goldman Sachs	Starting with hundreds of instances, potentially expanding to thousands (of 12,000 developer team)
Citi	Rolling out across 40,000 developers
Gumroad	Nov 2024 test → Dec 2024 company-wide → #1 contributor in 5/7 repos within weeks

Expansion mechanic — ACU consumption growth [I confirmed by case studies + pricing model]:

The "Five Devins" workflow (Wu: "Most folks on the team are definitely working with up to five Devins at once") means each engineer's ACU consumption can increase 5x as they adopt deeper usage. Combined with team growth (more engineers onboarded), the revenue expansion is multiplicative:

Per-engineer expansion: 1 Devin → 5 Devins = 5x ACU consumption
Team expansion: 10 engineers → 50 → 100+ = linear growth on top
Task scope expansion: simple bugs → migrations → architecture = more complex (longer) Devin sessions = more ACUs per task

This is the mechanism behind the >5x contract expansions and the >10x banking customer growth. It is automatic — the billing scales with usage without requiring a new sales conversation. This is structurally analogous to AWS usage expansion: the customer never signs a bigger contract, they simply consume more resources.

Goldman Sachs "hundreds of instances potentially expanding to thousands" (of 12,000 developers) demonstrates the expansion ceiling: if even 10% of Goldman's 12,000 developers run 3 Devins at moderate usage, the ACU consumption is enormous [I].

Retention Layer

No NRR or retention data published [OQ — unchanged]
Founder-articulated stickiness model [E: scott-wu-lennys-podcast-notes.md, Phase 3 caption extraction]: Wu explicitly rejects "moats" in favor of "stickiness" — "I think it's often less about moats and more about stickiness... Devin will really learn and build its representation of your code base and of your stack and of your process over time." He compares Devin to "an engineer who's been at the company for five years, you wrote half the code yourself" — the switching cost is accumulated institutional knowledge.
Devin as onboarding / knowledge layer [E: scott-wu-lennys-podcast-notes.md]: Teams use Devin to onboard new engineers — a "jagged intelligence... almost like a staff engineer at understanding the code base." This creates a retention mechanism: Devin becomes the institutional knowledge repository, not just a code generator.
Multiplayer collaboration [E: scott-wu-lennys-podcast-notes.md]: Devin sessions in Slack involve multiple engineers contributing context. This embeds Devin deeper into team workflows than individual-use tools.
Customer-reported stickiness: 85% merge rate at Gumroad, 10,000+ hours saved monthly at Ramp, 117 PRs/week at Bilt — deeply embedded in engineering workflows [E: devin-customer-case-studies.md]
Competitive intensity remains the retention risk — Cursor ($2B+), Claude Code ($2B+) are larger [I]

4. Unit Economics and Commercial Logic

What We Know (Sourced)

Metric	Value	Source	Confidence
Devin ARR (Jun 2025)	$73M	cognition-blog-funding-growth.md	Confirmed
Devin ARR (Jul 2025)	~$80M	scott-wu-20vc-transcript.md (oblique confirmation: "we've done roughly that" vs Lovable's $80M)	Confirmed (indirect)
Windsurf ARR (Jul 2025)	~$82M	cognition-blog-windsurf-acquisition.md	Confirmed
Combined ARR (Jul 2025)	~$155-162M	Sacra estimate + 20VC indirect confirmation	Estimated
Total funding	~$696M+	Multiple	Confirmed
Series C valuation	$10.2B	cognition-blog-funding-growth.md	Confirmed
Target valuation (Apr 2026)	~$25B	Bloomberg	Rumored
Net burn since founding	<$20M	cognition-blog-funding-growth.md	Confirmed (self-reported)
Employees	~200-286	TechCrunch, Contrary Research	Estimated
Usage growth (H1 2025)	5-10x in 6 months	scott-wu-20vc-transcript.md	Confirmed (founder, on-record)
PR merge rate (aggregate)	67% (up from 34%)	devin-performance-review-2025.md	Confirmed

Commercial Model — Usage-Based ACU Economics

The tiers are entry points; the real economics are usage-based consumption.

Cognition uses Agent Compute Units (ACUs) as the core billing unit. 1 ACU ≈ 15 minutes of active Devin work [E: devin-pricing-page.md]. The tier structure provides base ACU allocations, but revenue scales with consumption:

Tier	Base price	ACU rate	ACU allocation	Target
Free	$0	—	Limited	Trial
Pro	$20/mo	$2.25/ACU	Pay-as-you-go	Individual developer
Max	$200/mo	Lower	Larger quota	Power user
Teams	$500/mo/seat	$2.00/ACU	250 ACUs/seat/mo	Engineering teams
Enterprise	Custom	Custom	Custom	Large organizations

Why this matters for revenue analysis:

The 250 ACUs included in the Teams plan = ~62.5 hours of Devin work per seat per month. But the "Five Devins" workflow described by Wu [E: scott-wu-lennys-podcast-notes.md] means a single engineer may run 5 Devins for several hours per day. At that usage level, a team of 10 engineers could consume 10,000–16,000+ ACUs/month — far exceeding the 2,500 included ACUs (250 × 10 seats). The overage at $2.00/ACU makes the actual monthly spend $20,000–$30,000+ in ACU consumption alone, on top of the $5,000 base [I: arithmetic from ACU rate × usage pattern].

Revenue model comparison [I]:

Scenario	Base cost	Est. ACU consumption	Total monthly	Annual
Light team (5 engineers, moderate use)	$2,500	~$2,000–5,000	~$5K–8K	~$60K–$95K
Active team (10 engineers, heavy use)	$5,000	~$15K–30K	~$20K–$35K	~$240K–$420K
Enterprise (50+ engineers, intensive)	Custom	Custom	~$50K–$200K+	~$600K–$2.4M+
Banking customer (swyx data point)	—	—	~$125K	$1.5M/yr (expanded >10x)

This is structurally analogous to cloud computing pricing: the listed per-unit rate is low, but consumption scales with actual usage. The $500/month base is not the product's ACV — it's the entry point for a metered service. Heavy engineering teams can spend $5K–$50K+/month depending on usage intensity [I: ACU model arithmetic + Wu's description of usage patterns].

Wu explicitly confirms this design intent [E: scott-wu-lennys-podcast-notes.md, VERBATIM]: "We have since the beginning have been laser-focused on agentic coding, and that is the one thing that we've really believed in. It goes all the way to even the revenue model with ACUs and having the usage-based setup."

Price vs. Alternative — Labor-Budget Framing

Founder-confirmed labor-cost anchoring [E: scott-wu-20vc-transcript.md, VERBATIM]:

Wu: "It's all usage based. It's essentially by the hour. We try to make things so that they're about 10X cheaper than basically the value of your time. Because if you think about say a software engineer being 300 grand a year... 150K then would be that 0.5X that you are adding."

At $2.00–$2.25 per ACU (15 min), Devin costs ~$8–9 per hour of active work. A US software engineer at $300K total comp costs ~$150/hour. Devin is ~17x cheaper per hour of work output — slightly better than Wu's "10x cheaper" framing [I: arithmetic].

This is explicit Law 5 (Labor-Budget Pricing) — the same mechanism used by Harvey (vs. associate cost at $250–400K/yr), Sierra (vs. human CX agent cost at $13/interaction), Abridge (vs. physician documentation time at 15% of payroll), and Decagon (vs. $3.7T US support labor). Cognition anchors pricing against $150K–$300K engineer salaries, not against competing software tools.

vs. competitive tools — pricing model differs fundamentally [I]: - Cursor: ~$20/mo (Pro), ~$40/mo (Business) — flat seat-based, unlimited use. Revenue does not scale with consumption intensity. - GitHub Copilot: $10–39/mo — flat seat-based. Same model as Cursor. - Claude Code: included in Anthropic subscriptions — usage-capped or credit-based. - Cognition's pricing model is structurally different: not a flat subscription, but a metered consumption service with a low entry point. This means Cognition captures more value from heavy users (analogous to AWS vs. fixed-price hosting). The $20/mo entry point competes with Cursor/Copilot on price, but the revenue-per-customer ceiling is dramatically higher for teams that use Devin intensively.

Enterprise ACV data points [E/UV]: - One banking customer at $1.5M/yr (expanded >10x to est. $15M+/yr) [E: swyx-cognition-analysis.md] - Enterprise tier is custom pricing; likely $100K–$2M+ based on org size and usage intensity [UV] - Government contracts: unknown but likely high-ACV, multi-year [UV] - Goldman Sachs: "hundreds of Devin instances, potentially expanding to thousands" — at enterprise ACU rates, this could be a multi-million-dollar account [I: scale × rate]

Revenue Trajectory [I/UV]

Period	Devin ARR	Combined ARR	Source
Sep 2024	$1M	N/A	Confirmed
Jun 2025	$73M	N/A	Confirmed
Jul 2025	$73M+	~$155M	Estimated (Sacra)
Sep 2025	Higher	Higher (+30% enterprise in 7 weeks)	Partial confirmation
Apr 2026	Unknown	Unknown	Company declined to share

Growth rate: $1M → $73M in 9 months = ~73x in 9 months, or roughly ~830% annualized growth rate [I: arithmetic]. This is among the fastest in the benchmark cohort, comparable to Intercom/Fin ($1M→$100M in ~2.5 quarters).

Current ARR [OQ]: Company declined to share 2026 figures. If combined ARR maintained even 2-3x YoY from the $155M base, it could be $300-500M+ by April 2026. However, competitive intensity from Cursor ($2B+) and Claude Code ($2B+) suggests Cognition may be significantly smaller than its top competitors [I: SF Standard competitive context].

5. Sales Cycle and Enterprise GTM Operations

GTM Organization [E: enterprise-gtm-operations.md, TheOrg]

Cognition has built a segmented GTM org reporting to CEO Scott Wu:

Segment	Leader	Scope
Enterprise	David Morse (VP Enterprise)	Large enterprise accounts
Mid-Market	Austin Mead (VP Mid-Market)	Mid-market sales + BD
Strategics	Paxton Duff	Largest/most complex accounts
Federal/Gov	Federal AE (hiring)	DoD, IC, civilian agencies
International	Brooke Peterson (VP MEAP)	Middle East & Asia Pacific
Partnerships	Gardner Johnson (VP)	GSI/SI channel (ex-Codeium/Windsurf)
President	Russell Kaplan	Operational leadership

Notable hiring pattern: Multiple GTM hires from Palantir (Arjun Mishra, Jaime Mizrachi — both ex-Palantir FDEs) and Scale AI (Morse, Peterson, Kaplan). The Palantir "deployed engineer" model is an explicit influence.

Enterprise Sales Process [E: enterprise-gtm-operations.md, job descriptions]

Stage	Who	What happens	Timeline
1. Prospecting	GTM Manager / Account Director	Target account identified. Microsoft co-selling generates warm pipeline via Azure marketplace. SI referrals from Infosys/Cognizant/Synechron.	Ongoing
2. Technical qualification	Deployed Engineer (DE)	DE leads demos against prospect's actual codebase. Identifies highest-ROI use cases (migrations, security remediation, testing, tech debt). VERBATIM from JD: "Deeply understand customer technical pain points to qualify the best use cases for deploying Devin."	1-2 weeks
3. Security/procurement review	Enterprise sales + customer IT	SOC 2 Type II, ISO 27001:2022, CCPA compliance. Trust Center (trust.cognition.ai) provides pen test report, network diagram, DPA. For VPC: scoping AWS PrivateLink or IPSec tunnel.	2-6 weeks (parallel with pilot)
4. Pilot execution	DE + small customer team (VP Eng-sponsored, 3-5 engineers)	Hands-on pilot on real codebase, contained scope. Best starting tasks: "tasks with clear, upfront requirements and verifiable outcomes that would take a junior engineer 4-8 hrs." DE sets up Knowledge base, DeepWiki, Slack integration.	2-6 weeks
5. Enterprise formalization	VP Enterprise / Account Director	VPC deployment, SSO (Okta/Azure AD/SAML), dedicated account team, assigned DE. ACU-based contract negotiation.	2-4 weeks post pilot
6. Onboarding and scale-up	DE + AI Enablement Engineer	Live workshops, pair programming sessions, shared playbook creation. Knowledge base seeded with org-specific standards.	Ongoing
7. Expansion	Automatic (ACU consumption) + DE monitoring	Usage grows organically. >5x contract expansions "not at renewal, but proactively." Per-engineer (1→5 Devins) + team (10→100+) + task scope expansion.	Months 2-12+

The Deployed Engineer (DE) Role — Operational Core [E: enterprise-gtm-operations.md]

Cognition calls this role "Deployed Engineer" (not "FDE"), though the function mirrors Palantir's model. Three variants:

Customer DE — owns the full technical lifecycle. VERBATIM from JD: "You own all technical aspects of the customer lifecycle — from deployment through activation, making sure customers are successful using Devin." Pre-sale: runs demos and pilots. Post-sale: designs training programs to "help customers roll out Devin to thousands of engineers." Locations: SF, Austin, NYC (on-site with customers).

Partner DE — enables SIs to scale. VERBATIM: "Own technical success of GSI partners across onboarding, enablement, and scaled delivery." Builds "repeatable solutions, accelerators, and reference architectures." Handles technical escalations for partner-led deployments. 25-50% travel including frequent India trips (SI delivery centers).

AI Enablement Engineer — workshop/training specialist. Runs "interactive programs for enterprise engineering teams — live workshops, pair programming sessions." Creates shared playbooks and enablement materials. Goal: scale into "a repeatable and productized offering."

Observed Pilot-to-Production Patterns [E: devin-customer-case-studies.md, enterprise-gtm-operations.md]

Customer	Pilot team	What triggered expansion	Time to org-wide
Bilt	VP Eng + tech mgr + few engineers	Organic pull: 3x usage growth	~2 months (106 engineers)
Itau	Single squad, full workflow integration	Squad delivered 2x planned releases; 1,000-dev waitlist formed	~9 months
Gumroad	Handful of engineers	Rapid success on bug fixes	~1 month
Nubank	Migration-focused team	4x speed improvement, 20x cost savings	Weeks (collapsed 18-month project)
Eight Sleep	Data team via Slack	"Couple of hours" to first query	Company-wide data team
Goldman Sachs	Undisclosed	CIO mandate: "hybrid workforce"	Hundreds → thousands of instances

Common pattern [I confirmed by 6 case studies]: Small VP-Eng-sponsored team → contained project → organic demand creates internal pull → expansion is demand-driven, not top-down mandated. Itau CTO VERBATIM: "As people get familiar with Devin, they naturally expand how they collaborate with it." Itau abandoned A/B testing because it "didn't capture how teams organically discovered new applications."

Step-by-Step Technical Onboarding [E: docs.devin.ai, enterprise-gtm-operations.md]

Deployment: Enterprise SaaS (minutes — multi-tenant) or Customer Dedicated SaaS / VPC (longer — single-tenant, AWS PrivateLink, code never leaves customer cloud boundary)
SSO: Okta (OIDC), Azure AD, SAML 2.0
Repository access: GitHub, GitLab, Azure DevOps, Artifactory, CodeArtifact
Slack integration: @Devin, !ask (codebase queries), !deep (research), !fast (accelerated), !dana (data analyst)
Knowledge base: Enterprise Knowledge (cross-org) and Organization Knowledge. Trigger descriptions, macro system. Devin auto-suggests based on session interactions.
DeepWiki: Auto-generates codebase documentation with system diagrams (up to 5M LOC, 500GB repos)
Workshops: AI Enablement Engineer delivers live workshops, pair programming. Creates org-specific playbooks.

SI Channel Mechanics [E: enterprise-gtm-operations.md, si-partnerships-cognizant-synechron.md]

Five confirmed partners: Infosys, Cognizant, Synechron, WWT, Endava.

Infosys — three deployment models: 1. Internal productivity (Devin in Infosys's own teams) 2. Services delivery (human + Devin "hybrid delivery pods") 3. Managed service (Infosys deploys/manages Devin in customer environments)

Cognizant — embedded DE model: Cognition VERBATIM: will "embed our own team of forward-deployed AI engineers for project selection, engineer enablement, and ROI measurement."

Synechron — certification model: "Cognition-trained and certified engineers" for compliant deployment in financial services.

Enterprise Automation [E: enterprise-gtm-operations.md, docs.devin.ai]

Beyond interactive use, enterprise Devin supports event-driven automation: - CI failure triggers Devin API for automated fix attempts - ServiceNow incidents trigger autonomous root-cause investigation - Jira/Linear webhooks trigger Devin on new issue creation - MultiDevin (Enterprise-only): 1 manager + up to 10 worker Devins for parallelized execution

Founder-Led Sales Phase [E: multiple podcasts]

Wu is highly visible (Invest Like the Best, Lenny's, 20VC, Cheeky Pint) — textbook founder-as-credibility-weapon. The IOI gold medalist narrative ("best programmers building a programmer replacement") creates trust with engineering buyers. However, the GTM org is now professionalized: VP Enterprise (Morse), VP Mid-Market (Mead), VP Partnerships (Johnson), VP International (Peterson), and a team of DEs handling technical execution. Wu remains the strategic/product voice, not the primary sales operator.

6. Why the Company Won

Six Compounding Advantages

1. Extraordinary founder credibility [E] Three IOI gold medalists (including Gennady Korotkevich, the most decorated competitive programmer in history, hired as engineer). Scott Wu: 3x IOI gold, Mathcounts champion, Forbes 30 Under 30. This creates a unique "best programmers in the world building a programmer replacement" narrative that no competitor can replicate.

2. First-mover in autonomous agents [E/I] Devin launched as the first major autonomous AI software engineer (March 2024 demo). While the category is now contested, the first-mover advantage created massive PR, investor interest, and developer curiosity. The $21M → $175M Series A extension (6 weeks apart) was a direct result of the launch buzz.

3. Proprietary model development [E] SWE-1.5 and SWE-1.6 models: "950 tokens per second, about 13x faster than Claude Sonnet" [E: cognition-blog-funding-growth.md]. Proprietary models reduce dependency on external providers, improve margins, and create differentiation.

4. Strategic acquisition execution [E] Windsurf acquisition in 72 hours: gained $82M ARR, 350+ enterprise customers, GTM/operations team, and FedRAMP authorization. Cost estimated at ~$250M while Google paid $2.4B for just the CEO + research leads [I: SaaStr analysis]. This was exceptional M&A execution.

5. Dual-product architecture [I] Devin (asynchronous agent) + Windsurf (synchronous IDE) = coverage of both developer modalities. <5% customer overlap pre-acquisition means the products serve different segments, not cannibalize each other [E].

6. Extreme capital efficiency [E, self-reported] "Net burn since founding < $20M" with $696M+ raised [E: cognition-blog-funding-growth.md]. Even if this refers to net burn before large raises, it signals operational discipline unusual in AI startups.

What's NOT an advantage

Market position vs. Cursor/Claude Code: Cognition is significantly smaller (~$155M combined vs. $2B+ each for Cursor and Claude Code) [I: SF Standard]. The autonomous agent category may not win vs. assistive tools.
Enterprise maturity: Sales org is still being built. Active hiring for SDR, AE, Director roles suggests the team is early [E: cognition-careers-sales-hiring.md].
Retention data: No published NRR or churn figures. In a market with fast-evolving competition, retention is not guaranteed [OQ].

7. Benchmark Relevance and Transferability Boundaries

What the case demonstrates

1. VP Sales hire from adjacent AI company in the cohort David Morse came from Hebbia (CRO) and Scale AI (VP Sales). Hiring someone who built enterprise motion at a similar-stage AI company is a repeatable pattern across the cohort (for example, senior GTM operators moving between Scale AI, Hebbia, Cognition, Harvey, Sierra, and adjacent AI infrastructure companies) [I: cross-company pattern].

2. SI channel as enterprise distribution Infosys, Cognizant, Synechron, and WWT partnerships show how a developer-tool company can extend enterprise reach without building all services capacity in-house [E: si-partnerships-cognizant-synechron.md]. This is a general enterprise-AI distribution pattern, not a company-specific recommendation.

3. Usage-based pricing with enterprise floor The ACU model scales with adoption while enterprise contracts provide procurement structure and predictability [E: devin-pricing-page.md]. Cognition shows how usage-based economics can coexist with enterprise packaging.

4. Named customer list as social proof Cognition's official communications name Goldman Sachs, Citi, Dell, Cisco, Ramp, Palantir, Nubank, Itau, and others. That creates a trust cascade for enterprise buyers evaluating a new category [E: devin-customer-case-studies.md; goldman-sachs-citi-enterprise.md].

What is not broadly transferable

1. Developer-led adoption as the primary growth engine Cognition's developer-tool context makes individual/bottom-up adoption much more plausible than in high-implementation enterprise software. But even here, the early ramp was not necessarily open self-serve from day one: $20/month self-serve appears to start with Devin 2.0 in April 2025. Do not generalize its developer-led mechanics to categories where value requires heavy setup, data integration, or domain configuration.

2. Acquisition as GTM shortcut The Windsurf acquisition was a capital-intensive, timing-dependent move enabled by unusual market circumstances and funding capacity. It is an important benchmark event, but not a generally replicable GTM playbook.

3. Developer-tool adoption dynamics Developers can adopt tools individually, prove value locally, and pull them into teams. That adoption loop differs from buyer-led enterprise categories where procurement, implementation, and change management dominate.

What requires careful interpretation

1. Hybrid motion classification Cognition is best understood as PLG/developer-led adoption followed by enterprise layering, not as a clean sales-led benchmark. The enterprise motion is real, but the early revenue ramp should not be treated as proof of a pure enterprise sales motion.

2. Expansion mechanics Customer case studies show usage expansion and >5x/>10x account growth, but the company does not publish formal NRR, churn, or enterprise/self-serve revenue mix. Expansion is strongly evidenced directionally, not fully quantified.

8. McKinsey-Style Factor Analysis

Factor	Rating	Evidence basis
Market timing	★★★★★	LLM capabilities crossed autonomous coding threshold exactly when Cognition was ready [E]
Founder credibility	★★★★★	IOI gold medalists building a coding tool = maximum credibility [E]
Product-market fit	★★★★☆	$1M→$73M in 9 months is strong; but competitive convergence is a risk [E/I]
GTM execution	★★★☆☆	Developer-led growth strong; exact split between gated early access, self-serve, teams, and enterprise remains unpublished [I]
Capital efficiency	★★★★★	<$20M net burn on $696M+ raised [E, self-reported]
Competitive moat	★★★☆☆	Proprietary models + Windsurf + FedRAMP; but Cursor/Claude Code are larger and converging [I]
Pricing power	★★★☆☆	$500→$20 price cut suggests limited pricing power at bottom; enterprise pricing unknown [E/I]
Retention/expansion	★★★★☆	>5x expansions reported; but no published NRR [E: partial, I: partial]
Category defensibility	★★☆☆☆	"Autonomous AI engineer" category is contested by all major AI companies [I]
Ecosystem lock-in	★★★☆☆	Knowledge onboarding + workflow integrations create some friction; but low absolute switching cost [I]

9. Risks and Fragilities

1. Competitive convergence [HIGH RISK] Cursor ($29B+, $2B ARR), Claude Code ($2B+), GitHub Copilot (4.7M subs), OpenAI Codex — all converging on autonomous capabilities. Cognition's differentiation may erode as larger players add agent features to their existing platforms [I: SF Standard competitive context].

2. Revenue scale gap [MEDIUM RISK] At ~$155M combined ARR (Jul 2025), Cognition is ~13x smaller than Cursor. The "autonomous agent" positioning may be a niche within the broader "AI-assisted development" market rather than the dominant paradigm [I].

3. Post-acquisition integration [MEDIUM RISK] Windsurf acquisition brought complexity: layoffs reported, team integration challenges, two product lines to maintain. Post-M&A execution risk is real [I: TechCrunch reports of ~200 employees vs. ~450+ combined pre-layoffs].

4. ACU margin compression [MEDIUM RISK] The $20/mo entry point competes with Cursor/Copilot on sticker price, but the real revenue is ACU consumption. If competitors match the autonomous-agent capability and undercut on per-unit compute pricing, ACU margins compress. The proprietary SWE-1.5/1.6 models (13x faster than Claude Sonnet) may provide cost-structure advantage, but this depends on maintaining a performance lead [I].

5. Enterprise org immaturity [MEDIUM RISK] Sales team still being built (active SDR/AE/Director/CS hiring). Enterprise motion is < 18 months old (VP Enterprise hire Nov 2024). Not yet proven at scale [I: cognition-careers-sales-hiring.md].

6. Key person risk [LOW-MEDIUM RISK] Scott Wu is the primary public voice and strategic driver. Gennady Korotkevich adds unique credibility. Loss of either would be significant [I].

10. Final Benchmark Assessment

Classification verdict

Cognition qualifies for the benchmark cohort as a usage-based developer tool with team adoption and enterprise expansion. It is not classic PLG (the revenue model is metered consumption, not flat subscriptions), and it is not classic enterprise sales (the entry point is low and the initial adoption is team-driven, not procurement-driven). The core economic mechanism is ACU consumption scaling — teams adopt at a low entry point, usage deepens as engineers run more Devins on more tasks, and revenue expands automatically through metered billing without requiring new sales conversations.

Wu's description of the growth pattern — "real engineering teams... tag Devin all the time in Slack, tag Devin in Linear... and then they kind of use it and grow it and share it that way" — is a consumption-growth story, not a PLG story and not a traditional enterprise sales story. The correct label is Usage-based team adoption + Enterprise — closer to the cloud-computing model (AWS, Snowflake) than to either PLG SaaS (Slack, Figma) or traditional enterprise sales (Salesforce, ServiceNow).

Primary benchmark value

VP Sales hire pattern: David Morse (ex-CRO Hebbia → VP Enterprise Cognition) demonstrates cross-pollination of enterprise sales talent across the AI cohort. This reinforces the operator/GTM-talent pattern, even though Morse has no public operator interview.
Acquisition as GTM accelerant: The Windsurf acquisition is a unique data point: Cognition bought an enterprise customer base and GTM capability rather than building it all organically. This is analytically important but should be treated as an exceptional path, not a default playbook.
Expansion mechanics: >5x contract expansions and >10x banking customer growth suggest that usage-based pricing plus demonstrable value can create natural expansion without a traditional upsell-heavy motion.
Speed benchmark with caveat: $1M→$73M in nine months is useful as a growth-velocity reference. The mechanism was team-level adoption at $500/month, not open $20/month self-serve PLG. Wu explicitly states "real teams" are the majority of usage. The $20/month tier only existed for the last 2 of 9 months. This ramp is best understood as team/company adoption of a developer tool, not classic PLG.

Boundary conditions

Do not benchmark pure enterprise sales conversion rates against Cognition without separating gated early access, self-serve, teams, enterprise, and acquired Windsurf revenue.
Do not assume developer-tool adoption dynamics generalize to non-developer enterprise categories.
Do not overweight the revenue numbers without acknowledging the revenue scale gap vs. top competitors and the lack of published NRR / revenue mix.

Final inclusion status

Tier-2 benchmark. Cognition belongs in the corpus because its growth velocity, enterprise layering, customer evidence, SI channel, and acquisition-as-GTM event are analytically important. It is less directly comparable than pure sales-led companies, but it is a valuable boundary case for team-level developer-tool adoption scaling into enterprise sales.

V2 Update (April 25, 2026) — Phase 2 Corrective Pass

New sources added (6): devin-customer-case-studies.md (9 named customer case studies with metrics), goldman-sachs-citi-enterprise.md (CIO-level quotes), si-partnerships-cognizant-synechron.md (4 SI partners total), scott-wu-lennys-podcast-notes.md (Five Devins workflow, 25%→50% PR target), devin-deployment-model.md (FDE model, VPC, time-to-value), devin-performance-review-2025.md (aggregate metrics: 67% merge rate, 4x faster YoY).

Key upgrades: - Customer evidence: upgraded from "partial" to "strong" — 9 named case studies with L3 customer-confirmed metrics (Nubank 8-12x, Itau 20-30%, Ramp 10K+ hrs/mo, Gumroad 85% merge rate) - Deployment/implementation: upgraded from "partial" to "strong" — FDE model, VPC options, hours-to-weeks time-to-value documented - Outcome/ROI: upgraded from "limited" to "strong" — customer-confirmed metrics across multiple companies, aggregate 67% PR merge rate - SI partnerships: expanded from 1 (Infosys) to 4 (Infosys, Cognizant, Synechron, WWT) - Pricing: corrected Teams tier from $80 to $500/seat/mo - David Morse: exhaustive search confirmed structural absence of public content (not a research gap; he doesn't do public content)

Remaining gaps: Current 2026 ARR (company declined), enterprise vs. self-serve revenue mix, gated early-access vs. open self-serve split, NRR/retention, sales team org chart, David Morse voice (confirmed absent).

Evidence basis: 22 primary sources, 3 people profiles (updated). Key confirmed metrics: $1M→$73M ARR (9 months), $10.2B valuation, $82M Windsurf ARR, 350+ enterprise customers, <$20M net burn, 9 customer case studies with named contacts, 4 SI partnerships, FDE deployment model, 67% PR merge rate across hundreds of thousands of PRs.

V3 Update (April 26, 2026) — GTM Classification Correction

Trigger: Owner challenged whether founders actually said self-serve/PLG drove the growth. Full evidence audit conducted (see EVIDENCE-AUDIT-gtm-classification.md).

Finding: The "PLG" label was sourced from a single Sacra analyst characterization, not from any founder or company statement. Scott Wu explicitly stated in the 20VC interview that "real teams" are the majority of usage, not individual self-serve sign-ups. The $1M→$73M ramp happened at $500/month pricing (7 of 9 months), which is a team/company purchase, not classic PLG.

Changes made: - GTM classification changed from "Hybrid (PLG + Enterprise)" to "Team-adoption-first with enterprise layering" - Wu's verbatim about teams being the majority promoted to executive summary - Sacra's "primarily self-serve" characterization explicitly demoted from framing assumption to third-party inference with contradiction noted - Final benchmark assessment updated to reflect corrected classification - Evidence audit document created as permanent record

V4 Update (April 26, 2026) — Structural Rebuild: ACU Economics

Trigger: Owner input that the analysis over-anchored on the "$500/month team plan" price point, missing that the real business economics are usage-based ACU consumption where heavy teams spend $5K–$50K+/month.

Key structural change: The analysis now treats Cognition's revenue model as metered consumption (analogous to cloud computing) rather than flat-rate subscription tiers. The tier prices ($20, $200, $500) are entry points, not ACVs. Real customer spend scales with ACU consumption: teams running multiple Devins at intensity can generate $240K–$2.4M+/year in ACU revenue.

Changes made: - Unit Economics section rewritten around ACU economics model with computed spend scenarios - GTM classification changed from "Team-adoption-first" to "Usage-based team adoption + Enterprise" — positioning Cognition closer to AWS/Snowflake consumption model than to PLG SaaS or traditional enterprise sales - Expansion mechanics rewritten to explain the multiplicative ACU growth mechanism (per-engineer × team size × task scope) - Pricing comparison rewritten to show structural difference from flat-rate competitors (Cursor, Copilot) - Risk #4 updated to reflect ACU margin dynamics rather than sticker-price competition - Labor-budget framing preserved and strengthened with per-hour cost comparison ($8–9/hr Devin vs $150/hr engineer)

V5 Update (April 26, 2026) — Enterprise GTM Operations Deep Dive

Trigger: Owner flagged that sales, GTM, pilots, FDEs, and enterprise adoption mechanics were the most underdeveloped area.

New source: enterprise-gtm-operations.md — compiled from official docs (docs.devin.ai), job descriptions (Ashby, Glassdoor), org chart (TheOrg), SI partner announcements, and trust center documentation.

Key additions: - Full GTM org chart (6 segments: Enterprise, Mid-Market, Strategics, Federal, International, Partnerships) - Three Deployed Engineer variants (Customer DE, Partner DE, AI Enablement Engineer) with JD verbatims - Step-by-step enterprise sales process from prospecting through expansion - Technical onboarding sequence (7 steps from deployment to workshops) - Pilot-to-production patterns across 6 named customers with timelines - SI channel operational mechanics (Infosys 3 models, Cognizant embedded DEs, Synechron certification) - Enterprise automation triggers (CI failure, ServiceNow, Jira/Linear, MultiDevin) - Fifth SI partner confirmed: Endava (Feb 2026) - Security/compliance infrastructure details (SOC 2 Type II, ISO 27001, trust center)