The Sunk Cost of Intelligence: Lessons from AI’s Failures and False Starts

“The single biggest mistake in AI is not the model—it’s the refusal to walk away when the strategy is broken.”
Thomas H. Davenport, author of Competing on Analytics

When Machines Learn and Leaders Don’t: Why AI Projects Fail and How to Avoid the Same Mistakes

In the field’s early days, the promise felt limitless. In 1956, John McCarthy convened the Dartmouth Conference, coining the term “artificial intelligence” and setting off decades of experimentation, optimism, and hype cycles. Each generation of AI—from expert systems in the 1980s to deep learning in the 2010s—carried with it a familiar story: outsized expectations, massive investment, and an equally massive graveyard of failed projects.

Today, large language models (LLMs) are in the spotlight. Enterprises rush to prove they are “AI-powered,” pouring millions into pilots and proofs of concept. Yet history repeats itself: Gartner has estimated that up to 85% of AI projects never make it to production. Why? The reasons are rarely purely technical.


The Weight of History: Why AI Has a Track Record of Failure

The failures of AI, ML, and now LLM projects often stem less from the models themselves and more from the organizations that deploy them. Let’s look back:

  • Expert Systems (1980s–90s): Companies poured money into rule-based “AI” that could automate decision-making. The systems worked in labs but collapsed in the real world under complexity. Maintenance costs skyrocketed as business rules evolved.
  • Big Data & Predictive Analytics (2000s): Firms invested heavily in Hadoop clusters and predictive modeling. Many never moved beyond dashboards and one-off insights. The issue wasn’t the math—it was data quality, governance, and integration.
  • Deep Learning (2010s): From computer vision to recommendation engines, deep learning promised transformation. But outside of tech giants, most enterprises lacked the scale of data, compute, and engineering discipline to make it stick.

Across these eras, the causes rhyme: lack of clear business alignment, poor data hygiene, underestimating ongoing operational costs, and executives seduced by demos rather than outcomes.


Enter the Sunk Cost Fallacy

If there’s a psychological bias that haunts AI projects, it’s the sunk cost fallacy—the tendency to continue investing in a losing proposition because of what’s already been spent.

In practice, it looks like this:

  • A bank launches a multi-year fraud detection initiative. After three years, accuracy remains poor, but the team argues, “We’ve already invested $20M; pulling out now would waste it.”
  • A healthcare system builds a custom LLM for patient interactions. It consistently hallucinates unsafe responses. Instead of pivoting, leadership doubles down on more fine-tuning, unwilling to admit the premise was flawed.

In both cases, money, talent, and time stay chained to a failing effort, leaving no room for strategic course correction. The toy calculation below makes the error explicit.
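
To see the fallacy in numbers, here is a toy back-of-the-envelope comparison in Python. Every figure is invented for illustration (only the $20M echoes the bank example above); the point is structural: the sunk cost appears in neither option, so it can never favor one over the other.

    # Toy decision model: should the bank keep funding the fraud project?
    # All figures are invented for illustration.

    sunk_cost = 20_000_000  # already spent; identical under both choices

    # Option A: continue for one more year
    continue_cost = 5_000_000          # additional spend required
    p_success = 0.25                   # estimated chance it finally works
    payoff_if_success = 12_000_000     # value if it does
    ev_continue = p_success * payoff_if_success - continue_cost

    # Option B: stop and redeploy the team elsewhere
    pivot_cost = 2_000_000
    pivot_payoff = 4_000_000           # value of the proven alternative
    ev_pivot = pivot_payoff - pivot_cost

    print(f"EV(continue): {ev_continue:+,.0f}")   # -2,000,000
    print(f"EV(pivot):    {ev_pivot:+,.0f}")      # +2,000,000
    # Adding sunk_cost to both options shifts both by the same amount,
    # so it cannot change which choice is better.

The decision turns on forward-looking numbers alone; the $20M is a fact about the past, not an argument for the future.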


Why AI Projects Fail: A Deep Analysis

Looking across decades of AI initiatives, five root causes of failure emerge:

  1. Hype Over Substance
    Organizations chase shiny demos instead of solving business-critical problems. Leaders want an “AI strategy” before defining a business strategy.
  2. Poor Data Foundations
    Models are only as good as the data feeding them. Inconsistent pipelines, lack of lineage, missing governance, and “dark data” lead to garbage-in, garbage-out outcomes.
  3. Lack of Integration Planning
    Pilots show promise, but scaling requires integration into workflows, compliance frameworks, and infrastructure. This is where most proofs of concept die.
  4. Organizational Resistance
    Change management is underestimated. Middle managers fear automation, employees mistrust opaque models, and regulators raise red flags.
  5. Overlooking Total Cost of Ownership (TCO)
    Training a model is a one-time expense; maintaining, monitoring, retraining, and securing it are permanent costs. Leaders often treat AI as a project, not a product (a rough cost sketch follows this list).
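
To make the “project vs. product” point concrete, here is a minimal lifetime-cost sketch in Python. All dollar figures and category names are hypothetical placeholders, not benchmarks; what matters is the shape of the result, where recurring costs routinely dwarf the initial build.

    # Back-of-the-envelope TCO sketch for an ML system.
    # Every figure is a hypothetical placeholder; substitute your own.

    YEARS_IN_PRODUCTION = 4

    one_time = {
        "initial_training": 500_000,   # compute + engineering for v1
        "integration":      250_000,   # wiring into workflows and infra
    }

    recurring_per_year = {
        "monitoring_and_ops":   120_000,  # drift detection, alerting, on-call
        "retraining":           200_000,  # periodic retrains as data shifts
        "compliance_and_audit":  80_000,  # explainability, reviews, reporting
        "security":              60_000,  # access control, patching, red-teaming
    }

    build_cost = sum(one_time.values())
    run_cost = sum(recurring_per_year.values()) * YEARS_IN_PRODUCTION
    tco = build_cost + run_cost

    print(f"One-time build: ${build_cost:,}")                            # $750,000
    print(f"Recurring over {YEARS_IN_PRODUCTION} years: ${run_cost:,}")  # $1,840,000
    print(f"Total cost of ownership: ${tco:,}")                          # $2,590,000
    print(f"Run/build ratio: {run_cost / build_cost:.1f}x")              # 2.5x

Even with these made-up numbers, running the system costs roughly two and a half times what building it did; budgeting only for the build is how “projects” quietly become liabilities.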

Lessons from Success Stories

Contrast this with examples of AI done right:

  • Netflix: Personalized recommendations aren’t magic—they’re the result of relentless A/B testing, user-centered design, and constant retraining. The system evolves with both data and behavior.
  • DeepMind’s AlphaFold: Rather than chasing vague promises, AlphaFold was laser-focused on a specific, high-value scientific challenge: protein folding. This clarity of purpose, paired with cross-disciplinary collaboration, made the project world-changing.
  • Duolingo: The language-learning app integrates ML seamlessly into user experience, making lessons adaptive and engaging. Importantly, ML is not the product itself—it’s an enhancer of the product.

Each of these examples reflects a focus on business or mission value, operational discipline, and alignment between leadership, engineering, and end-users.


Lessons from the Graveyard

On the flip side, poorly executed projects offer cautionary tales:

  • IBM Watson for Oncology: Once heralded as the future of cancer treatment, the system fell apart under scrutiny. Poor data curation and overpromises undermined credibility, and IBM ultimately sold off its Watson Health assets for a fraction of what it had invested.
  • UK Post Office Horizon IT System: Though not branded as AI, the system’s faulty accounting data led to the wrongful prosecution and conviction of hundreds of subpostmasters. It underscores the stakes of blind trust in software without transparency or accountability.
  • Countless Chatbot Projects: Retailers and banks rolled out “AI assistants” that did little more than frustrate users with canned responses. Instead of enhancing customer experience, they eroded trust.

Improving Your Odds of Success

The antidote to failure isn’t just better models; it’s better leadership, governance, and humility. Practical takeaways include:

  • Start with Value, Not Tech: Anchor projects in measurable business outcomes. Ask: what pain point are we solving, and how will success be measured?
  • Invest in Data Before Models: Data quality, lineage, and governance should come before model selection.
  • Treat AI as a Product, Not a Project: Plan for lifecycle costs—monitoring, retraining, compliance, and explainability.
  • Adopt Kill Criteria: Define in advance what success and failure look like. If results aren’t met, pivot or stop, regardless of sunk costs (see the sketch after this list).
  • Focus on Change Management: Communication, training, and user adoption matter as much as accuracy metrics.
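
As a concrete illustration of the kill-criteria idea, here is a minimal Python sketch. The metric names and thresholds are assumptions invented for the example; the discipline is what matters: thresholds are agreed on with stakeholders before launch, reviewed on a schedule, and sunk costs are never an input.

    # A minimal sketch of pre-committed kill criteria for an AI pilot.
    # Metric names and thresholds are illustrative assumptions, fixed
    # with stakeholders before the pilot starts.

    KILL_CRITERIA = {
        "precision_floor":  0.85,    # below this, the model creates more work than it saves
        "monthly_cost_cap": 40_000,  # above this, the unit economics never close
        "adoption_floor":   0.30,    # below this, users have voted with their feet
    }

    def evaluate_pilot(metrics: dict) -> list[str]:
        """Return the criteria the pilot currently fails. Sunk costs are not an input."""
        failures = []
        if metrics["precision"] < KILL_CRITERIA["precision_floor"]:
            failures.append("precision below floor")
        if metrics["monthly_run_cost"] > KILL_CRITERIA["monthly_cost_cap"]:
            failures.append("run cost above cap")
        if metrics["adoption_rate"] < KILL_CRITERIA["adoption_floor"]:
            failures.append("adoption below floor")
        return failures

    # Quarterly review with this quarter's (hypothetical) numbers:
    failed = evaluate_pilot({"precision": 0.78,
                             "monthly_run_cost": 55_000,
                             "adoption_rate": 0.22})
    if failed:
        print("Pivot or stop:", "; ".join(failed))
    else:
        print("Continue: all pre-committed criteria met")

Writing the thresholds down before the first dollar is spent is what makes the eventual decision resistant to the “we’ve already invested $20M” argument.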

Wrapping up…

AI and LLMs are not silver bullets; they are amplifiers of organizational strengths and weaknesses. History shows that failures happen not because the technology doesn’t work, but because leaders underestimate the cultural, structural, and economic realities of deploying it. As Andrew Ng, one of today’s most influential AI voices, once said: “AI is the new electricity.” But like electricity, it can power a city—or burn it down—depending on how wisely it’s harnessed.