Alzheimer’s disease diagnostic tests gain FDA approval based on data from controlled clinical studies, yet once released to the market, they often receive minimal oversight to confirm they perform as expected in real-world settings. Post-market monitoring is essential because the conditions under which a test is studied in a laboratory differ significantly from the diverse patient populations, clinical workflows, and technical variations encountered in actual practice. Blood biomarker tests like plasma phosphorylated tau and amyloid-beta ratios, which have entered the market rapidly over the past few years, exemplify this gap: they showed promise in research cohorts but require ongoing surveillance to detect whether they accurately identify Alzheimer’s pathology across different racial and ethnic groups, age ranges, and comorbid conditions.
The FDA’s regulatory framework for in vitro diagnostic tests has historically allowed manufacturers to implement their own post-market surveillance systems with minimal federal coordination. This creates a patchwork of monitoring efforts that may miss trends in diagnostic failure, especially when a test’s limitations emerge only after thousands of patients have been screened. For Alzheimer’s testing specifically, the stakes are high because a false-positive or false-negative result can lead to unnecessary treatment initiation, unnecessary psychological burden, or missed opportunities for early intervention.
Table of Contents
- How FDA Approval Standards Fall Short of Real-World Diagnostic Needs
- The Problem of Incomplete Surveillance Infrastructure
- Real-World Diagnostic Accuracy Beyond Clinical Trial Populations
- The Regulatory Burden of Post-Market Monitoring Without Clear Accountability
- The Emerging Complexity of Multiplex and Composite Tests
- Gaps in Monitoring Long-Term Outcomes
- The Role of Regulatory Modernization and Laboratory Oversight
How FDA Approval Standards Fall Short of Real-World Diagnostic Needs
The fda‘s de novo pathway, through which many novel Alzheimer’s biomarker tests have been approved, requires manufacturers to demonstrate analytical validity (the test measures what it claims to measure) and clinical validity (the test correlates with disease presence). However, analytical validity does not guarantee clinical utility—the ability to improve patient outcomes in routine clinical practice. A plasma phosphorylated tau test might accurately identify amyloid and tau pathology in a research setting with standardized sample handling, temperature control, and personnel training, yet produce false results in a community hospital laboratory where samples sit at room temperature for hours or where technicians lack specific training for biomarker testing. The FDA’s approval standards do not mandate that manufacturers document how their test performs when these real-world conditions deviate from the controlled study environment.
Additionally, FDA approval often relies on data from relatively homogeneous populations—frequently older, white, educated cohorts with access to research medical centers. Alzheimer’s disease affects people of all racial and ethnic backgrounds, yet most validation studies for new biomarker tests have enrolled predominantly white participants. Post-market monitoring is the only mechanism through which to discover whether a test approved based on research in a homogeneous cohort maintains its accuracy when applied to Black Americans, Hispanic Americans, Asian Americans, or other populations, where both the pathological progression of Alzheimer’s and the prevalence of comorbid conditions like hypertension and diabetes differ. Without systematic post-market tracking of test performance across diverse populations, approved tests risk perpetuating health disparities in diagnosis and early treatment access.
The Problem of Incomplete Surveillance Infrastructure
Most Alzheimer’s diagnostic tests currently in use lack a unified post-market surveillance system that would allow public health authorities to detect emerging safety or performance signals in real time. Manufacturers are required to report adverse events (such as a test result that leads to harm) and to establish a complaint handling system, but they are not required to proactively audit how frequently their test produces discordant results when used alongside other diagnostic modalities or to systematically track clinical outcomes of patients screened with their test. This is a critical limitation: if a new Alzheimer’s blood test is approved and deployed in clinical practice, and a subset of patients receive false-negative results that delay diagnosis and treatment, there is no centralized registry or database that flags this pattern across multiple clinical sites until the problem becomes large enough that individual clinicians recognize it or publish case reports.
The FDA does maintain MedWatch, a voluntary reporting system for adverse events, but studies show that adverse event reporting is significantly underutilized for diagnostic test failures compared to adverse drug events. This underreporting occurs because diagnostic test failures are often attributed to operator error, patient factors, or specimen quality rather than to a deficiency in the test itself. A false-negative Alzheimer’s biomarker test might be rationalized by a clinician as resulting from the patient’s atypical disease presentation rather than triggering a formal adverse event report to the FDA. Until post-market surveillance systems specifically designed for diagnostic tests are mandated and funded, the medical community will continue to operate with incomplete visibility into real-world test performance.
Real-World Diagnostic Accuracy Beyond Clinical Trial Populations
In clinical trials, Alzheimer’s biomarker tests are evaluated against neuropathological confirmation (brain autopsy) or against other biomarkers obtained under highly controlled conditions. However, in clinical practice, a test result is often interpreted in isolation or alongside only a clinical interview and cognitive testing, without confirmation through multiple biomarker modalities. A patient might receive a positive blood phosphorylated tau result and be diagnosed with Alzheimer’s disease without concurrent cerebrospinal fluid analysis, positron emission tomography (PET) imaging, or magnetic resonance imaging (MRI)—yet it is precisely in these scenarios where post-market data would reveal whether the blood test’s accuracy holds. Furthermore, patients in real-world practice often have mild cognitive complaints rather than the clear-cut mild cognitive impairment or dementia that defined trial participants, and it is unknown whether tests validated in symptomatic populations maintain their specificity in asymptomatic or minimally symptomatic individuals.
Amyloid-beta PET imaging, approved for clinical use over a decade ago, provides a cautionary example. The test’s FDA approval was based on its ability to detect cerebral amyloid pathology, and it gained rapid clinical adoption for diagnostic purposes. Yet post-market experience revealed that amyloid positivity does not invariably correlate with cognitive impairment—cognitively normal individuals can have significant amyloid burden without symptoms. This discrepancy was not anticipated in pre-approval studies, and post-market surveillance systems were inadequate to prevent widespread off-label use of amyloid PET to make treatment decisions in cognitively asymptomatic people, a use for which evidence of clinical utility remains limited.
The Regulatory Burden of Post-Market Monitoring Without Clear Accountability
Unlike pharmaceutical products, which face explicit regulatory mandates for post-marketing surveillance plans and risk minimization strategies, diagnostic test manufacturers are not uniformly required to submit a detailed post-market surveillance plan as a condition of FDA approval. Some manufacturers voluntarily implement registry studies or outcome tracking, but these efforts are often limited in scope and lack standardization. The result is a fragmented landscape where one manufacturer may track clinical outcomes of patients tested with their product while another tracks only technical performance metrics, making it impossible to compare post-market safety and effectiveness across competing tests or to identify industry-wide trends in diagnostic failure.
A manufacturer might be economically incentivized to minimize the visibility of post-market problems, as acknowledging that a widely adopted test produces inaccurate results in certain populations or clinical scenarios could trigger negative publicity, label changes, clinical guideline updates, or regulatory action. Without a clear mechanism to compel comprehensive, independent post-market surveillance, there is limited external accountability. In contrast, pharmaceutical manufacturers face the threat of FDA warning letters, label changes, and even market withdrawal if post-market data reveal that a drug does not perform as approved; diagnostic test manufacturers face few comparable incentives to prioritize post-market data collection and analysis.
The Emerging Complexity of Multiplex and Composite Tests
Newer Alzheimer’s biomarker tests often include multiple analytes—a combination of different phosphorylated tau variants, amyloid ratios, and other markers—scored through proprietary algorithms to generate a single risk or diagnostic score. These composite tests introduce an additional layer of complexity for post-market monitoring because the clinical performance of the overall test score depends not only on the analytical accuracy of each individual analyte but also on the validity of the algorithm that combines them. An algorithm trained on one population might perform poorly when applied to patients with different demographic characteristics, genetic backgrounds, or concomitant diseases.
Post-market surveillance systems are not routinely designed to detect algorithm drift—gradual degradation in test performance as the population tested shifts away from the population used to develop and validate the algorithm. Furthermore, proprietary algorithms are often protected as trade secrets, creating a barrier to independent verification of how the test result is derived. Clinicians and laboratories using the test may not have sufficient transparency to understand how different patient factors influence the result, yet post-market monitoring has not been mandated to evaluate whether the test performs equally across demographic and clinical subgroups. This lack of transparency and systematic post-market evaluation poses a risk of undetected bias in diagnostic assignment—where the test systematically produces false results in specific populations, leading to differential diagnosis rates across racial, ethnic, or socioeconomic groups.
Gaps in Monitoring Long-Term Outcomes
Post-market surveillance for diagnostic tests typically focuses on short-term technical performance and near-term clinical decisions (whether the test led to a diagnosis, whether the diagnosis was changed based on the result), but it often fails to track longer-term patient outcomes. For Alzheimer’s testing, this is a significant oversight because the ultimate clinical value of early diagnostic testing depends on whether identifying the disease early translates to better long-term outcomes.
A test might be analytically and clinically valid—accurately identifying early Alzheimer’s pathology—yet still lack demonstrated clinical utility if patients diagnosed through testing do not experience improved cognitive trajectories, quality of life, or disease management compared to patients not tested. Systematic post-market tracking of cognitive decline, functional status, medication use, and healthcare outcomes in cohorts tested with approved Alzheimer’s biomarkers would provide evidence of whether diagnostic testing is justified. Without such tracking, the market is filled with tests that detect pathology but have not been proven to improve outcomes, and patients may receive diagnoses that increase anxiety and lead to costly follow-up testing and treatments with unproven benefits.
The Role of Regulatory Modernization and Laboratory Oversight
Laboratory regulations under the Clinical Laboratory Improvement Amendments (CLIA) certify that laboratories meet minimum quality standards, yet CLIA does not explicitly mandate post-market surveillance of diagnostic test accuracy, nor does it require laboratories to systematically verify that FDA-approved tests perform as claimed in their specific laboratory environment. A laboratory can be CLIA-certified and validly report results from an approved Alzheimer’s biomarker test without ever independently verifying that the test’s analytical characteristics match the manufacturer’s published specifications in their own hands, on their own equipment, or in their specific patient population.
Post-market monitoring that includes requirement for periodic verification of test performance across multiple independent laboratories would strengthen confidence that approved tests maintain accuracy in practice. Recent FDA guidance documents have begun to address these gaps by recommending that manufacturers of advanced diagnostic tests implement post-market surveillance, but recommendations lack the force of mandatory requirement. Without legislative or regulatory change to make comprehensive post-market monitoring a condition of diagnostic test approval and ongoing use, Alzheimer’s tests will continue to enter clinical practice with incomplete understanding of their real-world performance, leaving patients and clinicians to navigate uncertainty about diagnostic accuracy, equity in diagnosis across populations, and the true clinical value of early identification.
- —





