If you’re running a commercial organization at a top-15 pharma right now, you’ve probably commissioned ten to twenty AI pilots over the last two years. Maybe a handful are in production. Almost none are moving the line on revenue, share, or HCP engagement quality. And the diagnosis you’re getting from your AI team, your agency partners, and most of the consulting firms in your inbox is some version of “we need more pilots, better data, more change management,” which is the same diagnosis you got two years ago.
The public numbers are now ugly enough that we can stop pretending this is a pilot-quality problem. IDC found that 88% of enterprise AI proofs of concept don’t reach production. MIT NANDA’s 2025 study - 150 leadership interviews, a 350-person survey, 300 public deployments - found that 95% of generative AI pilots deliver no measurable impact on P&L. Gartner forecasts that more than 30% of generative AI projects will be abandoned after proof of concept by end of 2025.
The MIT NANDA study names the cause directly:
It’s not the technology. It’s the integration into workflows, the org alignment.
Aditya is right and yet the response from most commercial leaders is still “let’s commission another pilot.” That’s a coping mechanism, not a strategy. It lets the organization be visibly busy with AI without confronting the harder problem: Most existing pilots actually work as intended, but the commercial model they’re being grafted onto was designed for a pre-AI world. That model can only absorb so much AI before the political capital required to absorb more exceeds the productivity it produces. I’m calling that limit the Retrofit Ceiling. And in most established pharma commercial orgs, you’ve either hit it already or you’ll hit it inside the next twelve months.
The Q4 budget cycle is your window. Miss the opportunity to break the ceiling and you spend another year locked inside the operating model that ate the last fifteen pilots.
What “AI-native” means in this lane, and why the existing conversation misses it
There’s an active conversation about AI-native operating models right now. Most of it doesn’t apply to established pharma. Benchling has been writing about rebuilding biotech for the AI era. ITONICS has published on AI-native pharma startups making the same move. Some recent consulting commentary has framed it as “redesigners versus tinkerers” in the regulatory submissions process. The peer-reviewed Exploring an AI-First Healthcare System makes the same structural argument.
What’s missing from all of it is the specific shape of the problem inside an established pharma commercial organization. Commercial is a fundamentally different beast from biotech startup R&D and regulatory submissions. Two distinctions matter most:
- Established incumbents carry legacy commercial machinery that biotech startups don’t. Sales-force structures, incentive plans, account-team coverage models, brand-team workflows, MLR processes, OKR-KPI ladders. All of it was designed pre-AI, and all of it is necessary for the way day-to-day operations drives revenue today. You can’t rip it out, and you can’t pretend it isn’t there.
- The commercial side faces a regulator that R&D doesn’t experience the same way. Personalized content velocity, dynamic creative variants, AI-generated promotional copy. Every one of those is, formally, a piece of medical communication subject to MLR review. The same AI that’s supposed to give you a 10x lift on engagement is the same AI that breaks the throughput math on your compliance choreography.
The symptom most leaders see (fifteen pilots, four in production, none moving the line) has a different cause in established-pharma commercial than it does in startup R&D or regulatory submissions. The lane matters, and most of the AI-native commentary is written for the wrong lane.
The Retrofit Ceiling, defined
Three things make the Retrofit Ceiling distinct:
- It’s lane-specific. Established pharma, commercial, legacy operating model - not biotech, not R&D, and not generic enterprise AI. The lane defines the claim.
- It is a structural limit, not a stall. Pilot purgatory describes what you see. The Retrofit Ceiling describes why. You can’t pilot your way through it.
- It centers on a political-capital tradeoff. Every additional pilot extracts more capital from the same constrained operators: the CDO, the MLR head, the brand director. The ceiling is the point at which the next pilot would cost more capital than its productivity gain.
What the ceiling actually looks like from inside
The data isn’t saying the models don’t work. They work. The platforms work. The integrations with legacy systems are expensive and annoying, but they work too. Bain’s Q3 2025 read puts AI deal conversion at 47% versus 25% for traditional SaaS. Vendors aren’t the problem either. If you have a sharp data team and familiarity with Cursor / Codex / Claude Code, you can ship a pilot that performs as advertised inside a quarter.
The ceiling shows up when those pilots try to become the operating model.
Every established pharma’s commercial machinery is built around assumptions that were correct at the time. Account teams cover X reps because the human-reach-to-HCP-density math made sense. Brand strategy ran in twelve-month cycles because the data refreshed twelve months after the campaign. Compliance review took six weeks because the bottleneck was human attention on a finite reviewer bench. Customer engagement was macro-segmented because actually personalizing across millions of decisions, let alone hundreds, was operationally impossible.
Each of those constraints was real. Each was a structural wall in some part of the commercial model. And each is exactly the kind of constraint AI dissolves. If you redesign the room around the wall coming down.
The retrofit move
Keep the room intact, move the wall with a chatbot
- Operating model assumed fixed at kickoff
- Pilot performs in demo
- Pilot dies in the field
- Bolt-on eventually gets unplugged
The rebuild move
Redesign the room around the wall coming down
- Operating model designed for continuous decisions
- Workflows assume AI as foundation
- Compliance is a design condition, not a gate
- Data foundation rebuilt before the next solution lands
The retrofit move is the opposite. The retrofit move keeps the room intact and tries to move the wall a foot to the left with a chatbot. The wall doesn’t move; the chatbot eventually gets unplugged, and somewhere in a steering committee deck, the failure gets attributed to “change management.”
That’s what running into the ceiling feels like from the inside. Every pilot performs in a demo. Every pilot dies in the field. The failure mode is always something nobody flagged at kickoff, because the kickoff assumed the surrounding model was fixed or good-as-is. The IDC and MIT numbers aren’t measuring AI failures. They’re measuring the rate at which legacy commercial models reject the AI graft.
Where are you on the Retrofit Ceiling: a three-signal diagnostic
The signals below are the same ones experienced commercial leaders surface when they describe what’s actually happening inside their org. I’ve surfaced them here as a tool you can apply yourself, before you commission a full rebuild assessment.
- 01
Pilot pile-up without production
Your org has at least eight active commercial-AI pilots in the last twenty-four months, and fewer than one in five have reached sustained production. The rate is the signal, not the count. Counts can mislead because the headline number sounds productive in a board update.
- 02
Data-layer veto without data-layer funding
Your data team has flagged at least two pilots in the last quarter as “the data layer can’t support this,” and the rebuild work hasn’t been commissioned, scoped, or named in next year’s budget. The veto is the early warning. The unfunded veto is the ceiling.
- 03
Sponsor rotation faster than pilot maturation
Your AI pilots routinely outlast their executive sponsors. The CCO who commissioned the pilot is gone before it scales. The new CCO inherits commitments without authorship, and the pilot becomes a political liability rather than an operating asset.
How to read the score:
- One of three: You’re approaching the ceiling. You’re approaching the ceiling. The next budget cycle is your window. Frame the data-layer rebuild before someone else’s veto becomes your liability.
- Two of three: You’re at the ceiling. More pilots will not get you through. The next move is operating-model, not technology, and it needs an executive sponsor with at least 24 months of expected runway.
- Three of three: You’ve been hitting the ceiling for six-plus months. The political capital required to redesign the model is depleting. Every quarter that passes makes the rebuild harder to commission, not easier.
I’d estimate with maybe 70% confidence (calibrated against the MIT and IDC samples plus the orgs I’ve actually spent time with) that the majority of top-15 pharma commercial organizations are currently at “two of three” or worse. If that’s right, the next eighteen months get interesting.
Why the ceiling holds: three drivers
Three things make the ceiling particularly hard to see as you approach it in established pharma commercial orgs.
- 01
Data architecture inertia
Your customer data lives in systems built for the cadence and shape of pre-AI commercial decisions. The CDP refreshes nightly - but many underlying data sets refresh monthly. The CRM was built for pipeline math, not for a model that needs continuous signal on what an HCP is doing week over week. The data warehouse was structured for a quarterly SLT and board review, not for a system that needs to ask “what does this physician care about today” twenty thousand times a day.
The evidence is hiding in plain sight if you look at the partnership patterns. IQVIA spent 2025 stitching together identity resolution, OCE CRM, and audience activation through the Throtle acquisition and an expanded Salesforce relationship. Salesforce Life Sciences Cloud only reached GA in October 2025. In other words, the biggest commercial-data vendors in pharma are themselves in the middle of a rebuild, because the data layer they sold for the last decade can’t carry continuous AI workloads. Retrofit AI into that architecture and every model spends the first six months of its life rebuilding the systems underneath it should have been able to assume. By the time it actually works, the team has lost the political room to use it.
- 02
Compliance choreography
Most regulated commercial orgs treat compliance as a gate at the end of a process. Brand team writes the campaign. MLR reviews it. Legal signs off. Campaign goes live. Industry benchmarks put baseline MLR review at roughly 30–40 days per asset, with pre-review steps adding another 5 to 150 days depending on the org. That sequencing was designed for slow content with stable rules.
AI changes the volume of decisions by two or three orders of magnitude. Every personalized email, every dynamic creative variant, every triggered next-best-action is, formally, medical communication that has to clear the same review path. The choreography breaks before the technology does, and the people who feel it first are the people most empowered to slow the AI work down.
And the regulator just changed the risk evaluation. In September 2025, the FDA issued more than 200 enforcement letters on pharmaceutical promotional content in a matter of weeks (the highest annual total in nearly 25 years) and named explicitly that it had used AI to surveil and triage them. The same technology your content team is adopting is being pointed back at you, faster, with longer reach. MLR reviewers, the privacy office, and the field compliance lead aren’t wrong to be cautious. They’re protecting something real. Unfortunately, the model that protection was designed for is already in hospice.
- 03
Operating-model debt
Your org chart, your incentive structures, your accountability lines, your QBR rhythm, your launch-readiness checklists. All of it was designed for a pre-AI tempo. The brand director gets measured on Q4 share. The data science lead gets measured on model accuracy. The MLR head gets measured on review throughput. Nobody gets measured on whether the pilots scale, because “scale the pilots” isn’t anyone’s job. Accountability for the rebuild lives in the seams and baton handoff between three executives and their teams. When the baton drops into the seam, no one believes it’s their job to pick it up.
MIT NANDA makes the same point in different language. It identifies organizational misalignment and governance gaps as primary failure causes, and notes that internal-build pilots succeed roughly one-third as often as pilots structured around external partnerships with clear ownership. The pattern isn’t that internal teams are worse at AI. The pattern is that internal pilots inherit the operating-model debt. External partnerships force a clean ownership line as a condition of the engagement.
When all three of these factors are in place (and in most established pharma commercial orgs they are), pilots become a coping mechanism. They give the org a way to be visibly busy with AI without confronting the rebuild that the busy-ness defers. The pilots aren’t dishonest. They’re just doing a different job than the one their sponsors think they’re doing.
Who hits the ceiling first
The first people to feel the ceiling are usually the ones with the least power to do anything about it. The cascade looks something like this:
- Month 0 to 3Data scienceModels work in lab, die in the field. Data science starts asking for cleaner data, better feature stores, longer integration timelines. From three layers up, that reads as 'the AI team is being precious.'
- Month 3 to 6MLR and complianceVolume hits before headcount catches up. They start asking for more bodies, faster review tooling, clearer rules of the road. From three layers up, that reads as 'compliance is the bottleneck again.'
- Month 6 to 9Field teamsNext-best-action models trained on yesterday's territory geometry tell them to call the wrong physicians. Reps revert to spreadsheets. From three layers up, that reads as 'reps not adopting the new tools.'
- Month 9 to 15Executive layerA flat AI-driven revenue line in a year the deck promised a hockey stick. By the time it surfaces here, it's been visible to the people closer to the work for nine to fifteen months. The org just didn't have the language to name it.
This is where I’d push the political-reality conversation that most consulting work won’t touch. The Retrofit Ceiling makes a small number of senior operators newly accountable for things they weren’t accountable for before, and unless their sponsors have explicitly named that shift, they will (correctly) interpret the AI agenda as a setup. The Chief Data Officer is suddenly responsible for pipeline velocity. The MLR head is suddenly responsible for commercial throughput. The brand director is suddenly responsible for the customer-engagement system rather than the campaign. None of them signed up for those jobs. Without air cover to redraw the accountability lines, the ceiling holds and the AI investment gets quietly reabsorbed into BAU.
If you’re the CCO or CEO reading this and that paragraph sounded like your last steering committee, that’s the problem, not your team.
A predictive bet, and what to watch
The bet, calibrated confidence
By Q4 2027, at least three of the top-15 pharma will have publicly committed to an AI-native rebuild of one named commercial function. Medical-information operations, MLR-and-content, and HCP-engagement orchestration are the most likely candidates. Not 'AI-augmented.' Not 'AI-enabled.' A rebuild with the operating model designed around the AI, a named executive accountable, and a public timeline.
My confidence: ~70%. Still, I’d take that bet on Polymarket at any odds. The reasoning: the underlying numbers are getting harder to ignore. Gartner has already forecast 30% of generative AI projects will be abandoned post-PoC by end of 2025. If even half of that abandonment lands inside life-sciences commercial orgs, the political cost of running the same playbook again in 2026–2027 will exceed the political cost of a public rebuild. At that point one or two CEOs will move first, because the embarrassment of moving second will be greater than the embarrassment of moving early.
How to track whether the bet lands: watch the language in pharma earnings calls and JPM Healthcare keynotes. Listen for the shift from “AI pilots in our commercial organization” to “AI-native [function name].” The first language is retrofit. The second is rebuild. The first leader to use the second language publicly, at scale, will get cited in committee minutes for years. The leaders moving now are the ones positioning to be cited.
If you’re waiting to see who moves first before you do, you’ve already chosen retrofit. The AI-native posture isn’t something you can adopt the day after the bet lands. The data-foundation work has to start now. That’s why the next twelve months and in particular, the next two quarters, are so critical.
The decision
The Retrofit Ceiling has a calendar to it. Budget submitted in October, pilots launched in Q1, results expected in Q2, revisions in Q3, sponsor rotation by January. Miss the October window to do something different and you spend another year inside the operating model that ate the last fifteen pilots.
The decision that breaks the cycle isn’t a bigger pilot. It’s a smaller, more specific commitment, made in writing, before the next budget cycle:
- Name the part of the commercial operating model you’re going to rebuild first. Not “transform commercial.” One named function: medical information, MLR-and-content, HCP orchestration.
- Name the executive accountable. One name. Not a steering committee. With at least 24 months of expected runway in the seat.
- Name the date by which the rebuild has to be standing on its own. Not “successful,” just standing. Walking under its own power. A 12-month commitment is the right shape; anything shorter is a pilot in disguise, anything longer is a steering committee.
The orgs that name those three things in October deliver the rebuild in 9-12 months. The ones still debating them in January re-litigate the question with one less year of runway and one more rotated sponsor. The cost of waiting isn’t theoretical. It’s measured in quarters of compounding operating-model debt while the organizations that moved early start to pull ahead.
What each role does on Monday morning to make that commitment real (Heads of Commercial, Digital, Value and Access, and Medical Affairs) is the conversation I want to have next. I’ll go deep on each one in the next piece.
For now, the question I’ll leave you with:
Where does your org sit on the Retrofit Ceiling - and who in your leadership team is currently carrying it without the air cover to name it?
If you’ve read this far and you’re nodding at signals one, two, or three, book a working session with me. The next twelve months matter more than the calendar implies.
— Sundar
Sources
- IDC, in partnership with Lenovo, Time to Make the AI Pivot (2024). Cited via CIO.com. cio.com
- MIT NANDA initiative, The GenAI Divide: State of AI in Business 2025, lead author Aditya Challapally (August 2025). fortune.com
- Bain & Company, Executive Survey: AI Moves from Pilots to Production (Q3 2025 wave, N=197 executives). bain.com
- IQVIA / Salesforce partnership expansion; Throtle acquisition (Nov 2025); Salesforce Life Sciences Cloud GA (Oct 2025). Public press and industry reporting.
- Veeva and pharma-industry MLR cycle-time benchmarks (2024–2025). veeva.com
- FDA Office of Prescription Drug Promotion enforcement wave (September 2025). fda.gov
- Benchling, Rebuild Biotech for the AI Era. benchling.com
- ITONICS, 7 Lessons from AI-Native Pharma Startups. itonics-innovation.com
- Gartner, “30% of Generative AI Projects Will Be Abandoned After Proof of Concept by End of 2025” (July 2024). gartner.com
- HIT Consultant, The Pilot Purgatory. hitconsultant.net