Loading
Loading
Cell Line Misidentification Persists Despite Decades of Warnings
Nearly 600 cell lines in the ICLAC registry (v13, 2024) are confirmed misidentified or contaminated, yet researchers continue publishing with them. Cell lines purporting to represent liver, stomach, esophageal, or nasopharyngeal tissue are frequently HeLa (cervical cancer) in disguise. A PubMed search for the contaminated "L-O2" hepatocyte line found 263 papers in 2022 and 666 by March 2025 — the contamination-based publication rate is growing, not shrinking. One in five cell lines is estimated to be misidentified, and an estimated $28 billion annually in irreproducible U.S. research is linked to faulty cell stocks.
Cell lines are the foundational model system for drug discovery, toxicology, and cancer biology. When a "liver cell" experiment is actually a cervical cancer cell experiment, every downstream conclusion — drug sensitivity, gene expression, toxicological response — is invalid for the intended context. Thousands of papers and their citations propagate conclusions built on the wrong biological substrate. Drug candidates that advance based on misidentified cell line data waste clinical trial resources and delay treatments that might work.
STR (short tandem repeat) profiling has been available as a definitive authentication method since the 2000s. Many journals now require authentication statements. But compliance is superficial: researchers state "cells were authenticated" without specifying when, how, or by whom. Authentication at purchase doesn't protect against cross-contamination during passaging in shared laboratory incubators. The deeper structural barrier is incentive misalignment: authentication costs money ($50–150 per line, 1–2 week turnaround), produces no publishable result, and a positive finding (your cells are wrong) invalidates months of work. The incentive structure actively discourages checking.
Mandatory STR profiling results (not just statements) deposited with manuscript submission, analogous to GenBank accession numbers for DNA sequences. Integration of continuous authentication into cell culture workflows — e.g., qPCR-based species-check panels run alongside experiments at each passage. Cultural reform where detecting contamination is treated as a quality-control success rather than a career threat. Institutional core facilities that perform routine authentication as a shared service, removing the cost burden from individual researchers.
A team could audit cell lines available in their institutional cell bank or core facility using STR profiling, compare results against the ICLAC misidentified cell line database and CLASTR reference database, and publish the results as an institutional cell line integrity report. Molecular biology and bioinformatics skills would be most relevant. The Cellosaurus database provides comprehensive reference data.
The growth rate of publications using known-contaminated cell lines (263 → 666 in 3 years for L-O2 alone) demonstrates that awareness campaigns and journal policies have failed. The problem is structurally analogous to `health-research-antibody-validation-crisis` — both involve foundational reagents with known quality problems where the incentive structure discourages quality checking. Together, they represent a "research reagent integrity" cluster that could emerge at the 400-brief analysis.
Research Integrity and Peer Review, "Misidentified cell lines: failures of peer review," 2025; British Journal of Cancer, "HeLa cells cross-contaminated nasopharyngeal carcinoma cell lines," 2024; PLOS ONE, "The ghosts of HeLa," 2017; MDPI, "Genetic Insights into the Economic Toll of Cell Line Misidentification," 2024.