Why Alzheimer’s Trial Results Can Be Hard to Interpret

Alzheimer's trials measure different things with different standards, making it hard to know which results actually mean benefit for patients.

Alzheimer’s clinical trials are difficult to interpret because they measure wildly different things—some track brain changes on a scan, others measure thinking ability, still others track daily functioning—and pharmaceutical companies may emphasize whichever metric makes their drug look best. A trial showing that a medication slows cognitive decline by 35 percent sounds promising until you learn that the absolute difference was a few points on a 100-point scale and may or may not translate to meaningful change in how someone lives day-to-day.

The gap between what a trial claims and what it actually proves is where most confusion happens. Multiple layers of complexity compound this gap. Trial participants are healthier and more monitored than typical Alzheimer’s patients, measurements can be subjective or biased, and FDA approval standards have shifted multiple times in recent years, meaning older approvals used different goalposts than newer ones.

Table of Contents

What Do These Trials Actually Measure?

Alzheimer’s trials can measure changes in amyloid or tau deposits (the brain proteins thought to drive the disease), declines in cognitive test scores, changes in activities of daily living like dressing or managing finances, or decline in overall functioning. A drug could reduce amyloid plaques significantly on a PET scan but show no slowing of cognitive decline. Another could slightly preserve memory test scores while having no effect on how independently a person can live.

The aducanumab trials of 2020–2021 showed strong amyloid reduction but failed to clearly prove cognitive benefit—yet the FDA approved it anyway under its accelerated approval pathway, relying on the assumption that reducing amyloid would help cognition. That assumption turned out to be uncertain, and the drug was later withdrawn from the market. The different measures create a credibility problem: a trial reporting a primary outcome (the main measure it was designed to test) sometimes shows no benefit, but secondary analyses or exploratory findings look promising. Readers and patients then hear the promising news, not the actual primary result.

Clinical Significance Versus Statistical Significance

A trial result can be “statistically significant,” meaning unlikely to have happened by random chance, while being clinically meaningless in actual patient life. If a medication slows cognitive decline by 0.5 points per year on the Mini-Cog, that’s detectable in a trial of 1,800 people over 18 months, but a person might notice no difference in memory or daily function.

The CLARITY trial for lecanemab showed a 27 percent slowing of decline, which sounded substantial but translated to about 25 percent improvement on a 100-point scale—roughly three weeks of slowing in a three-year disease course. Conversely, a result that looks modest statistically might be clinically important if it affects something people truly care about. Remaining independent in self-care tasks for six more months might be more valuable to a patient than a five-point change on a cognitive test, even if the cognitive change is “statistically significant.” The problem is that trials measure what researchers can quantify, not always what patients prioritize.

Cognitive Decline Comparison: Trial Results vs Absolute ChangePlacebo group (18mo)4 points on 100-point scaleLecanemab (18mo)2.7 points on 100-point scaleAducanumab trial0.5 points on 100-point scaleTypical mild cognitive impairment5.2 points on 100-point scale3-year disease average12 points on 100-point scaleSource: Alzheimer’s trial composite data (illustrative; actual values vary by trial design and population)

Biomarker Endpoints Versus Functional Outcomes

Modern Alzheimer’s trials often use biomarkers—PET scans of amyloid and tau, blood tests for phosphorylated tau, MRI measures of brain atrophy—as primary endpoints because they can be measured objectively and change can be detected quickly. But biomarker changes do not guarantee functional improvement, a risk that the field learned painfully after aducanumab. A drug might reduce amyloid by 50 percent on a scan and still leave a patient’s memory and daily functioning essentially unchanged.

Trials using functional endpoints (actual memory tests, ability to perform activities like paying bills or cooking) are harder to conduct because they require longer follow-up, larger sample sizes, and careful training of raters to reduce bias. Some newer trials attempt to capture both biomarker and cognitive outcomes, but interpretation gets murky when they diverge. The lecanemab trials showed modest slowing of cognitive decline alongside significant amyloid reduction, but investigators continue to debate whether the cognitive slowing justifies the risk of amyloid-related imaging abnormalities (ARIA), a potentially serious side effect that can include brain microhemorrhages.

Trial Populations Don’t Reflect Typical Patients

Alzheimer’s trial participants are usually diagnosed early (mild cognitive impairment or mild dementia), highly educated, cognitively intact enough to consent, financially stable enough to attend frequent visits, and free from most other serious illnesses. A typical participant is 15 years younger cognitively than the average Alzheimer’s patient in the community, takes fewer medications, and has fewer comorbidities like heart disease or diabetes. Results from these healthy, engaged, monitored volunteers do not reliably predict how a drug will work in an 82-year-old with three chronic conditions, poor adherence, and no access to specialized memory centers.

This selection bias is rarely discussed in headlines. A trial showing a medication works in mild cognitive impairment or mild dementia says little about whether it will help someone in moderate or advanced dementia, where most people with Alzheimer’s live and where most die. The lecanemab trials enrolled predominantly mild stages; its real-world safety and benefit in more advanced disease remains unknown.

Trial Duration and the Problem of Long-Term Uncertainty

Most Alzheimer’s trials run 12 to 24 months, but Alzheimer’s disease progresses over 8 to 20 years. A drug that slows decline slightly over 18 months might lose efficacy over time, or participants might experience side effects that offset benefits in years 3 and 4.

Aducanumab was studied for 18 months in the trial that led to its approval, but after approval, safety signals emerged—some patients developed amyloid-related imaging abnormalities including brain microhemorrhages and microinfarcts that were not seen or reported in the trial. Long-term follow-up of trial participants is rare and expensive, so trials can only show what happens during the study window. Real-world use stretches months into years, and patients on drug registries and in clinical practice often experience outcomes not captured in the controlled trial.

The Role of Trial Design and Hidden Bias

Open-label trials (where everyone, patients and doctors, know who is getting the treatment) are vulnerable to bias: expected benefits can feel real even if the drug is inert, and raters may score cognitive tests more favorably for treated participants. Many early Alzheimer’s trials used this design because they were small or conducted before good outcome measures existed.

Even blinded trials (where neither patient nor rater knows who is treated) can fail if the side effect profile is distinctive—a drug that causes a bothersome rash in the treatment group, for example, unblind the trial because participants and raters figure out who got the active drug. Amyloid-related imaging abnormalities in lecanemab trials created a partial unblinding problem: some treated patients showed visible brain changes on MRI that raters could recognize, potentially biasing how they scored cognitive outcomes. Data analysis choices also matter—whether a trial counts only completers (people who stayed until the end) or includes everyone enrolled (intention-to-treat analysis) can swing results in different directions.

How Regulatory Pathways Shape What Gets Approved

The FDA’s accelerated approval pathway allows drugs to reach patients based on biomarker endpoints (like amyloid reduction) without proving cognitive benefit first, with the promise of later trials to confirm real benefit. Aducanumab was approved this way, and lecanemab was also accelerated, approved initially with a plan for further data. The assumption underlying accelerated approval is that reducing a hypothesized disease cause will help patients, but in Alzheimer’s, that assumption has been wrong more than once.

Standard approval requires proof of clinical benefit on a pre-specified outcome. The difference in standards means that a drug approved five years ago under one pathway cannot be directly compared to one approved today, and older approvals carry more uncertainty about real-world functional impact than newer ones. Regulatory standards continue to shift, influenced by patient advocacy, industry pressure, and evolving scientific understanding—which means the same evidence might result in approval one year and rejection the next, depending on the FDA’s interpretation of the evidence and its confidence in the biomarker-to-benefit link.

Frequently Asked Questions

If a drug reduces amyloid on brain scans, does that mean it will help my memory?

Not necessarily. Amyloid reduction is correlated with cognitive benefit in some trials but not others. Aducanumab reduced amyloid substantially but didn’t clearly slow cognitive decline, and was later withdrawn from the market. Always look for evidence of functional or cognitive improvement, not just biomarker changes.

Why do trial participants do better than real patients on the same drug?

Trial participants are healthier, younger, more adherent, and attend regular visits where cognitive changes are monitored carefully. Typical patients are older, take more medications, have other illnesses, and may miss appointments, so the conditions that made the drug work in a trial don’t exist in real life.

What does “35 percent slowing of decline” actually mean?

It means the treated group declined 35 percent less than the placebo group during the trial. If the placebo group declined 4 points over 18 months and the treated group declined 2.6 points, the slowing is 35 percent—but in absolute terms, the difference is only 1.4 points on a much larger scale, which may not feel noticeable to the patient.

Should I take a drug that’s FDA-approved if I’m worried it might not work?

FDA approval means the evidence met a regulatory threshold for benefit and safety, but doesn’t guarantee it will work for you. Your doctor should discuss what the trials actually showed, what benefits are realistic for your stage of disease, and what side effects are possible—because individual response varies widely.

Why don’t trials just ask patients if they feel better?

Because subjective feeling is prone to bias and expectation, and because “feeling better” is hard to quantify and compare across people. Trials use standardized tests and biomarkers instead, but these don’t always match what patients actually notice or care about in their daily lives.


You Might Also Like