Loading
Loading
Pulse Oximeters Systematically Overestimate Oxygen Levels in Darker-Skinned Patients, Concealing Life-Threatening Hypoxemia
Pulse oximeters — ubiquitous devices used in virtually every clinical setting to measure blood oxygen saturation (SpO2) — produce systematically inaccurate readings in patients with darker skin pigmentation. Melanin absorbs the light wavelengths used by these devices, causing them to overestimate oxygen levels. This overestimation conceals hypoxemia, delays treatment, and contributes to worse clinical outcomes. Despite decades of awareness of this failure mode, no validated hardware or algorithmic correction exists that eliminates the bias across the full range of human skin tones.
Pulse oximeters are used on virtually every hospitalized patient and in millions of home-use contexts. During the COVID-19 pandemic, inaccurate readings in darker-skinned patients delayed supplemental oxygen and escalation of care — an estimated 11.7% of occult hypoxemia cases were missed in Black patients compared to 3.6% in White patients. The installed base of affected devices numbers in the tens of millions globally, and a Johns Hopkins study found Black patients are 31.9% more likely to have readings that overestimate oxygen by 4+ percentage points.
All manufacturers use the same basic dual-wavelength (red 660 nm / infrared 940 nm) technology, and calibration curves were historically derived from predominantly light-skinned study populations. The fundamental physics of pulse oximetry — measuring light absorption through tissue — is confounded by melanin absorption, which is not accounted for in the Beer-Lambert law models used. The FDA issued draft guidance in January 2025 requiring manufacturers to test across diverse skin tones using the Monk Skin Tone scale, but this only affects new device submissions — the installed base of millions of devices remains uncorrected. No manufacturer has yet demonstrated a multi-wavelength or algorithmic approach that eliminates bias across all skin tones to within clinically acceptable accuracy (plus or minus 2% SpO2). An FDA-funded multi-site study found bias values of -0.35% to 3.2% for White subjects versus 0.6% to 5.1% for Black subjects, with far worse precision (standard deviation 2.7-9.1%) for Black subjects. Moving to multi-wavelength systems would require entirely new hardware, not just software updates, and manufacturers lack economic incentive to redesign a commodity product with thin margins.
A multi-wavelength optical approach (beyond the current two-wavelength design) that can distinguish melanin absorption from hemoglobin absorption would address the root cause. Alternatively, an algorithmic correction validated across the full Monk Skin Tone scale using large, diverse clinical datasets could improve accuracy without hardware changes. A rapid, low-cost skin-tone sensing module that enables adaptive calibration could bridge existing and next-generation devices.
A student team could build a benchtop optical test system using additional wavelengths (e.g., 3-5 wavelengths) and tissue-mimicking phantoms with varying melanin concentrations to characterize the relationship between melanin levels and SpO2 measurement error. Another entry point would be developing an algorithmic correction layer that takes existing dual-wavelength signals plus a skin-tone input and outputs a bias-corrected SpO2 estimate. Teams with biomedical engineering, optics, or signal processing backgrounds would be well-suited. A scoped semester project could focus on phantom validation rather than clinical testing.
The FDA Advisory Panel (February 2024) recommended increasing diversity requirements in clinical studies to a minimum of 24 subjects spanning the full Monk Skin Tone scale. Key references include the FDA Executive Summary on Performance Evaluation of Pulse Oximeters (2024), Johns Hopkins Bloomberg School of Public Health study on racial bias (2024), FDA Draft Guidance on Pulse Oximeter Accuracy (January 2025), and Health Affairs analysis on cross-racial pulse oximeter imperatives (2025). This brief connects to broader health equity themes — the same design-for-the-majority pattern appears in other diagnostic devices. The problem is classified as "worsening" because home pulse oximetry use is expanding while the installed base remains uncorrected.
FDA Executive Summary: Performance Evaluation of Pulse Oximeters, FDA CDRH Anesthesiology and Respiratory Therapy Devices Panel, https://www.fda.gov/media/175828/download, accessed 2026-02-19