Back to Blog
The bug you shipped on purpose illustration
Engineering Governance
February 12, 20269 min read

The Bug You Shipped on Purpose

Industry data indicates that 42% of production defects were known prior to release. These are not testing failures. They are governance decisions with measurable downstream cost.

It is Wednesday afternoon. The release is scheduled for Friday. An engineer identifies a defect in the payment flow: under specific conditions, discount codes applied to subscription upgrades calculate an incorrect total. The variance is small, a matter of cents on certain edge cases. The engineer files a ticket. The product manager reviews it. The engineering lead reviews it. All parties agree it is a legitimate defect. Then someone introduces the question that redefines the entire release calculus: “Can this wait until next sprint?”

The room falls silent for approximately two seconds. Then consensus forms. The defect is classified as P2, relocated to the backlog, and the release proceeds on Friday with a documented defect. No formal record captures this as a deliberate decision. There is no document stating “we elected to ship a billing defect.” There is only a Jira ticket that gradually descends through a backlog already containing 400 items. Three months later, a customer identifies the issue. Six months later, it remains unresolved.

Root Cause Analysis: Why Known Defects Ship

The prevailing industry narrative suggests that defects reach production because engineering teams do not test sufficiently. This is a convenient explanation because it implies a straightforward solution: increase test coverage. However, research from organizations including Microsoft and Google indicates that a significant percentage of production defects were identified and documented prior to release. Someone found them. Someone filed them. Someone determined they were not significant enough to delay the ship date.

This is not a testing failure. It is a prioritization decision misclassified as a testing failure. The misclassification serves an organizational purpose: if the defect is framed as a prioritization decision, accountability becomes explicit. Someone must state “I elected to ship this defect.” Organizations generally avoid this level of explicit ownership. Instead, the defect becomes an “oversight” or a “gap in coverage” or “something the team missed.” The language of accident displaces the reality of deliberate choice.

Every release constitutes a risk assessment. Engineering teams are wagering that the known defects will not generate sufficient negative impact to outweigh the business value of shipping on schedule. In approximately 65% of cases, according to internal analyses at enterprise-scale organizations, that wager produces acceptable outcomes. In the remaining 35%, the downstream costs exceed the original estimated impact by a factor of 3 to 10. The critical failure is not the wager itself but the refusal to acknowledge that a wager is being made.

Backlog Decay: Quantifying Defect Attrition

A metric that warrants attention from every product organization: the count of defects in the backlog older than six months. Then the count older than twelve months. Analysis across multiple SaaS organizations reveals that defects aged beyond 90 days have less than an 8% probability of resolution. These are not defects “waiting to be prioritized.” They are defects that will never be resolved. The backlog is not a queue. It is an archive with a more optimistic designation.

The pattern follows a consistent trajectory. A defect is filed. It does not meet the threshold for immediate resolution. It enters the backlog with a P2 or P3 classification. In the subsequent sprint, new feature requirements and new P1 defects consume available capacity. The P2 defect remains untouched. After several months, the engineer who filed the defect has transitioned to other work. The original context is lost. The codebase has evolved. Resolving the defect now requires approximately three times the engineering effort it would have required on the day it was identified.

Throughout this period, end users encounter this defect on a daily basis. They do not file support tickets because they have developed workarounds. They simply accumulate a marginally negative perception of the product with each occurrence. The defect does not appear in operational metrics. It manifests in churn rate analysis six months later, attributed to “competitive pressure” or “evolving market conditions.”

Organizational Dynamics of Severity Classification

In most organizations, an asymmetric power dynamic governs defect severity classification. The individuals who identify defects, including QA engineers, support teams, and junior developers, rarely possess the authority to block a release. The individuals who hold release authority, including engineering leads, product managers, and VPs, rarely encounter the defects directly.

This structural asymmetry creates a systemic bias toward shipping. The individual asserting that a defect is significant must persuade a decision-maker who has not observed the defect and who operates under substantial incentives to meet the release date. The conversation is structurally imbalanced. On one side: a concrete, visible, date-certain business commitment. On the other: a probabilistic risk that some percentage of users may experience a problem that may generate some quantity of damage at some indeterminate point in the future.

The deadline prevails in this analysis in approximately 80% of cases. Not because the deadline represents greater organizational value, but because it is more legible. A date on a calendar is quantifiable. The customer who will churn in four months due to a billing edge case is not. Concrete evidence consistently outweighs abstract risk in decision-making contexts, even when the abstract risk represents a larger aggregate cost.

Total Cost of Ownership: The Unquantified Liability

When engineering teams elect to ship a known defect, the total cost analysis is rarely performed with rigor. There is typically a qualitative assessment referencing “low impact” or “edge case only.” However, the actual cost structure includes multiple compounding factors:

  • Support interaction cost. Each instance of a customer encountering the defect generates either a support interaction at $15-50 per ticket or a silent negative experience. Industry benchmarks indicate that for every support ticket filed, 26 customers experience the same issue without reporting it. Both outcomes carry a quantifiable cost.
  • Deferred resolution cost. Six months after identification, when an engineer eventually addresses the defect, the resolution requires comprehension of code authored by another engineer, in a module the assignee has not previously worked in, for a feature whose requirements have since changed. The resolution that would have required two hours on Wednesday now requires two days, representing an 8x cost multiplier.
  • Trust erosion cost. Each shipped defect transmits a signal to the engineering team regarding what the organization genuinely values. When QA identifies a defect and it ships regardless, repeatedly, engineers reduce the urgency with which they report quality issues. Quality culture deteriorates incrementally with each “not a blocker” classification.
  • Compound defect cost. Known defects interact with future defects. The billing edge case shipped in Q1 becomes the foundation for a refund calculation defect in Q2. Engineering teams are not shipping a single defect. They are shipping the precondition for every defect that will propagate from it.

Governance Framework for Deliberate Defect Shipping

The objective is not to eliminate defect shipping entirely. That standard is not realistic. Software will always contain defects, and perfect is the adversary of delivered. The objective is to establish governance around the decision to ship a known defect. Engineering teams that manage this effectively share several operational practices:

  • Explicit decision documentation. Rather than permitting defects to silently migrate to the backlog, engineering teams explicitly record: “This defect is being shipped with the release. The rationale is documented, and the revisit date is established.” The act of formal documentation creates accountability.
  • Time-bound escalation, not priority classification alone. A defect with a P2 label has a near-zero probability of resolution. A defect with a calendar-based escalation trigger stating “if unresolved by March 1, escalate to the engineering lead” has a measurably higher probability of receiving attention. Organizations implementing time-bound escalation report 3x improvement in deferred defect resolution rates.
  • Cost visibility dashboards. High-performing engineering teams maintain a “known issues” dashboard correlating each known defect with the number of support tickets it has generated. When the cost becomes visible and quantified, the prioritization calculus changes substantially.
  • QA release authority. In the highest-performing engineering organizations, a QA engineer possesses the authority to block a release. Not to suggest. Not to recommend. To block. This does not mean every defect blocks every release. It means the individual closest to the quality of the software holds a voice that carries equal weight in the release decision.

Actionable Framework for Immediate Implementation

In the backlog of most engineering organizations, there exists a defect that was identified weeks or months ago. It has been triaged. It has been classified. It has been discussed in at least one planning meeting. And it remains unresolved while end users develop workarounds on a daily basis.

That defect is not a testing failure. It is not a coverage gap. It is not an oversight. It is a decision the organization made, whether that decision is formally acknowledged or not. Each day it remains in the backlog, the cost compounds across support interactions, customer trust, and the gradual erosion of the engineering team's conviction that quality is a genuine organizational priority.

The next time an engineer identifies a defect before a release and the room falls silent, engineering leaders should not permit the conversation to conclude with “we will address it next sprint.” The essential question is: “What is the total cost of shipping this defect?” Then the question must be answered with quantitative rigor. Organizations that implement this single practice report a 40% reduction in deferred defect accumulation within two quarters.