# Does BP-OSD Actually Decode Near-Optimally? New Benchmark Says No
Belief propagation with ordered statistics decoding (BP-OSD) — the workhorse decoder deployed across much of the quantum error correction (QEC) research community — deviates **markedly** from optimal decoding performance on cellular automaton codes, even at physical error rates already achievable with today's superconducting and trapped-ion hardware. That is the central finding from new research introducing density matrix propagation (DMP) as a benchmarking tool for practical decoders, published June 27, 2026.
The significance is immediate: BP-OSD is widely treated as a near-optimal heuristic for low-density parity-check (LDPC) codes and their relatives. If it systematically underperforms against the true optimal decoder at error rates in the range of 10⁻³ to 10⁻², then resource estimates for [fault-tolerant quantum computing](https://quantumintel.tech/glossary/fault-tolerant-quantum-computing) built on BP-OSD benchmarks are potentially optimistic. Teams at [IBM Quantum](https://quantumintel.tech/companies/ibm), [Google Quantum AI](https://quantumintel.tech/companies/google-quantum-ai), [Quantinuum](https://quantumintel.tech/companies/quantinuum), and the broader [logical qubit](https://quantumintel.tech/glossary/logical-qubit) engineering community must now evaluate whether their decoder choices are leaving meaningful performance on the table.
---
## What Is Density Matrix Propagation and Why Does It Matter?
Density matrix propagation is a numerically exact method for computing the optimal posterior probability distribution over error configurations, given a syndrome measurement record. Unlike BP-OSD, which approximates marginal probabilities through iterative message passing and falls back on ordered statistics postprocessing when that fails, DMP tracks the full density matrix of the error-syndrome system. This makes it computationally expensive — scaling exponentially with system size — but that cost is precisely the point: DMP provides a provably optimal decoding baseline against which approximate decoders can be rigorously calibrated.
Think of DMP as the "oracle decoder" that no production system could run in real time, but which establishes a performance ceiling. Prior work on surface codes and other topological codes has used maximum-likelihood (ML) decoding as this ceiling, typically via tensor network contraction methods. DMP extends this benchmark framework to codes where efficient ML contraction is not straightforward, including cellular automaton codes.
The researchers' core methodology: run both DMP and BP-OSD on identical syndrome data generated from realistic noise models applied to cellular automaton codes, then compare logical error rates across a range of physical error rates. The gap between the two curves is the decoder suboptimality — the fraction of logical errors that could in principle be avoided with a better decoder.
---
## The Cellular Automaton Code Context
Cellular automaton codes are a class of quantum error correcting codes whose decoding dynamics map onto classical cellular automata rules — local, parallel update rules applied uniformly across a lattice. They have attracted serious research interest because their inherent locality makes them candidates for hardware-efficient decoding: in principle, a cellular automaton decoder can run directly on-chip alongside the quantum processor, with latency scaling favorably compared to software-based decoders running on classical co-processors.
For [fault-tolerant quantum computing](https://quantumintel.tech/glossary/fault-tolerant-quantum-computing) timelines, latency matters enormously. If a decoder cannot keep pace with the syndrome extraction cycle of a superconducting processor — typically in the 1–10 µs range — the quantum processor must stall, eating into [coherence time](https://quantumintel.tech/glossary/coherence-time) and inflating overhead. Cellular automaton decoders promise sub-microsecond decision latency. The open question has always been: at what cost to decoding accuracy?
This new work sharpens that question considerably. It does not merely show that BP-OSD is imperfect — imperfection was always assumed. It quantifies the gap precisely, and that gap is large enough to matter for practical system design.
---
## How Large Is the Performance Gap?
The paper does not report a single headline number — the gap is a function of physical error rate and code distance — but the finding described as "marked deviation" at error rates "achievable with current hardware" is pointed. At physical error rates around 10⁻² (a regime where, for example, superconducting transmon two-qubit gate fidelities of 99% are now routinely demonstrated by IBM, Google, and others), the difference between BP-OSD and optimal decoding translates directly into a higher logical error rate per code cycle.
In practical terms: if a team is engineering a [logical qubit](https://quantumintel.tech/glossary/logical-qubit) and targeting a logical error rate of 10⁻⁶ using a cellular automaton code with BP-OSD, they may need a larger code distance — more physical qubits — than optimal decoding would require. For a fault-tolerant system requiring thousands of logical qubits, that overhead compounds dramatically. The qubit overhead multiplier could shift resource estimates by factors that make otherwise-viable near-term fault-tolerant demonstrations infeasible.
This connects directly to where the industry is heading. Companies like [Riverlane](https://quantumintel.tech/companies/riverlane), which specializes in decoder hardware and software stacks, and [Quantum Machines](https://quantumintel.tech/companies/quantum-machines), whose control systems increasingly integrate real-time decoding, are building products around specific decoder performance assumptions. A rigorous demonstration that those assumptions are overly optimistic for certain code families is commercially relevant, not just academically interesting.
---
## What BP-OSD Was Designed For — and Where It Strains
BP-OSD was originally developed in the classical coding theory community for decoding LDPC codes over the binary erasure and binary symmetric channels. Its adaptation to quantum error correction was a natural extension, and it performs well — often near-optimally — on codes with sparse, locally tree-like Tanner graphs, where belief propagation's independence assumptions are approximately valid.
Cellular automaton codes violate those assumptions. Their syndrome graphs tend to have short cycles and high local connectivity precisely because the cellular automaton structure imposes geometric regularity. Belief propagation on graphs with many short cycles is known to produce correlated, unreliable messages; the ordered statistics postprocessing step in BP-OSD partially compensates but cannot fully recover optimality. DMP now gives us a precise handle on exactly how much is being left behind.
This is not an indictment of BP-OSD as a general tool. For surface codes, where substantial evidence already suggested near-optimal performance, BP-OSD remains a strong practical choice. The new finding is code-class-specific: researchers and engineers choosing cellular automaton codes as the basis for their fault-tolerant architecture need a better decoder, or they need to account for the suboptimality gap explicitly in their resource models.
---
## Industry Implications and Next Steps
**For hardware teams:** Any fault-tolerant roadmap incorporating cellular automaton codes should immediately audit its decoder assumptions. The [error threshold](https://quantumintel.tech/glossary/error-threshold) and logical error rate projections used in resource estimation need to be recomputed against DMP baselines, not BP-OSD performance curves.
**For decoder developers:** DMP as a benchmarking tool opens a new design loop. The goal is no longer to demonstrate that a decoder "works" on a code family, but to quantify and close the gap to DMP performance. Neural network decoders, tensor network decoders, and reinforcement learning approaches should be evaluated on this metric for cellular automaton codes specifically.
**For the broader [fault-tolerant quantum computing](https://quantumintel.tech/glossary/fault-tolerant-quantum-computing) ecosystem:** This paper establishes a methodological standard. Expect DMP benchmarking to become a required comparison in future QEC publications that propose new decoders or code constructions. Journals and conference reviewers will likely begin requesting it.
**The competitive angle:** Teams racing to demonstrate [below threshold](https://quantumintel.tech/glossary/below-threshold) operation on novel code families — and to publish the lowest logical error rates — now have a new axis of competition: proximity to DMP-optimal performance. A decoder that closes 90% of the gap to DMP at hardware-relevant error rates is a publishable, commercially valuable result.
---
## Key Takeaways
- **BP-OSD deviates markedly from optimal decoding** on cellular automaton codes at physical error rates achievable with current superconducting and trapped-ion hardware (≈10⁻² regime).
- **Density matrix propagation (DMP)** provides a provably optimal, numerically exact decoding baseline — the first rigorous benchmark for cellular automaton code decoders.
- **Resource estimates** for fault-tolerant systems using cellular automaton codes with BP-OSD may systematically underestimate physical qubit requirements.
- **The finding is code-class-specific:** surface code BP-OSD performance is not directly challenged by this result, but any architecture built on cellular automaton codes is.
- **A new benchmarking standard** is emerging: DMP comparison should now accompany decoder proposals for non-standard code families, just as ML decoding comparisons accompany surface code work.
- **Commercial decoder developers** — including Riverlane and Quantum Machines ecosystem partners — need to evaluate product roadmaps against this gap.
---
## Frequently Asked Questions
**What is density matrix propagation in quantum error correction?**
Density matrix propagation (DMP) is a numerically exact decoding method that computes optimal posterior probabilities over error configurations by tracking the full density matrix of the error-syndrome system. It is computationally too expensive for real-time use but serves as a provably optimal benchmark — the ceiling against which practical decoders like BP-OSD can be rigorously compared.
**Why does BP-OSD underperform on cellular automaton codes specifically?**
BP-OSD relies on belief propagation, which assumes approximately tree-like graph structure. Cellular automaton codes have syndrome graphs with many short cycles and high local geometric regularity, which cause belief propagation's independence assumptions to break down. The ordered statistics decoding postprocessing step partially compensates but cannot fully recover optimal performance.
**Does this finding affect surface code roadmaps at IBM, Google, or Quantinuum?**
Not directly. The result is specific to cellular automaton codes. Surface code decoder performance — where BP-OSD and related methods have been shown to approach near-optimal behavior — is not challenged by this paper. However, any team considering cellular automaton codes as an alternative fault-tolerant architecture should treat their decoder assumptions as unvalidated without DMP comparison.
**What is the practical consequence for logical qubit resource estimates?**
If BP-OSD incurs a meaningful logical error rate penalty relative to optimal decoding, achieving a target logical error rate (e.g., 10⁻⁶) requires a larger code distance and therefore more physical qubits than optimal-decoder projections suggest. At scale, this compounds across thousands of logical qubits and can shift feasibility timelines significantly.
**What decoders might close the gap to DMP for cellular automaton codes?**
The paper establishes the gap but does not prescribe a solution. Likely candidates include tensor network decoders (which can approximate ML decoding more faithfully than BP), neural network decoders trained specifically on cellular automaton syndrome patterns, and reinforcement learning approaches. Each trades computational cost against decoding accuracy differently, and DMP now provides the metric to optimize against.
RESEARCH
BP-OSD Fails Optimality Test for Cellular Automaton Codes
Published: June 27, 2026 at 14:39 EDTLast updated: June 28, 2026 at 05:03 EDTBy Jonas Vogel, Senior EditorLast reviewed by Jonas Vogel on June 28, 20269 min read
Density matrix propagation reveals BP-OSD deviates markedly from optimal decoding for cellular automaton codes at hardware-achievable error rates.
qecdecodingbelief-propagationosdcellular-automatonfault-tolerantdensity-matrix