Trained on Us: How AI Codifies Human Prejudice at Scale

The Myth of Machine Neutrality

Here's the thing about artificial intelligence that nobody wants to hear: it isn't neutral. Not even close.

We've built this comfortable narrative around AI as a dispassionate observer—a tool that impartially categorizes data, delivers faster diagnoses, and makes predictions free from the messy baggage of human prejudice. It's a seductive story. If a machine made the decision, surely it must be objective, right? Wrong.

Research is increasingly clear that AI reflects the same psychological biases, stereotypes, and stigmas that shaped the humans who built it. And I mean the same biases. Not subtle echoes. Full-on, unexamined prejudice, now running at the speed of silicon.

The problem starts with a fundamental misunderstanding of what machine learning actually does. AI systems don't learn "facts" the way we think of them. They absorb how recorded events have been interpreted by humans. When a model trains on decades of medical records, hiring data, or news articles, it isn't ingesting raw reality—it's absorbing centuries of human judgment calls, many of which were shaped by sexism, racism, and class bias. The system doesn't know the difference between a genuine pattern and a historical injustice.

There's nothing truly creative about AI. It does what it's told, filtered through the programmer's lens and the data's legacy. And when that data is built on inequality, the output will be too.

The Myth of Machine Neutrality

When Missing Data Means Missed Diagnoses

Let's get specific, because the healthcare examples are where this stops being theoretical and starts being lethal.

Women's health issues have been chronically under-researched for generations. Women from marginalized groups are even more absent from medical datasets. This isn't a minor gap—it's a structural void that AI systems now inherit and amplify.

Consider cardiovascular disease prediction models. These tools are trained on predominantly male datasets, yet heart attack symptoms in women manifest differently than in men. The result? Women get misdiagnosed. Not occasionally. Systematically.

Then there's skin cancer detection. CNN-based lesion classifiers were trained on datasets where Black patients made up only 5–10% of the data. The consequence? Diagnostic accuracy for Black patients drops by roughly 50%, even though melanoma mortality is actually higher in this population. Let that sink in. The system that's supposed to save lives works worst on the people who need it most.

Pulse oximeters—those little clips you put on your finger at the doctor's office—systematically overestimate oxygen saturation in nonwhite patients. Black patients are three times more likely to suffer undetected occult hypoxemia. Three times. That's not a bug. That's what happens when you build life-critical technology on incomplete data.

And perhaps the most chilling example: an AI algorithm used health costs as a proxy for health needs. It concluded that Black patients were healthier than equally sick white patients—because less money was being spent on them. Less spending, not better health. The algorithm then assigned lower priority for life-saving care to the people who needed it most. It didn't "decide" Black patients were healthier. It mirrored a history of systemic disinvestment and then institutionalized it at scale.

When Missing Data Means Missed Diagnoses

Scale Turns Bias Into Systemic Harm

Human heuristics produce systematic biases. We all rely on mental shortcuts—cognitive patterns that help us process information quickly but also generate gender stereotypes and racial assumptions embedded deep in our culture and language.

AI absorbs these patterns from human-generated data. That part isn't new. What's genuinely alarming is what happens when you add scale to the equation.

A single biased human decision affects one patient, one hire, one loan application. An AI system running that same bias across millions of interactions? That's institutional harm, automated and invisible. The patterns get amplified far beyond individual human encounters.

Wang et al. (2025) demonstrated this through rigorous testing—using SC-WEAT, FMAT, and GPT-4o probing—and found that AI is strongly associated with socially dominant groups: men, young people, the wealthy, white individuals, and high-prestige occupations. The research identified what they call an "AI divide," where disadvantaged groups are systematically less likely to benefit from AI advancements.

This isn't just about healthcare. Early adopters of cutting-edge AI are predominantly young, male, and urban-based. The technology shapes public perception in ways that reinforce existing power structures across education, employment, investment, and medical access. The bias doesn't stay contained. It spreads.

Design Choices That Reinforce the Cycle

Here's something most people never think about: the design of AI technologies actively perpetuates gender roles.

Digital assistants—Siri, Alexa, Cortana—are almost universally given female names and voices. They're positioned as helpful, supportive, deferential. Meanwhile, roles associated with authority or expertise are coded masculine. Users expect female-coded AI to be warm and emotionally intelligent, while male-coded AI is seen as more competent and authoritative. Even when designers attempt to create gender-neutral systems, users infer gender from subtle cues—tone of voice, the types of tasks assigned. Our gender schemas are that powerful.

AI-generated images in healthcare settings reproduce narrow stereotypes, shaping expectations about who is taken seriously as a caregiver or patient. These images guide attention, influence memory, and shape understanding—often beyond what any accompanying text can do. When AI consistently presents biased views of healthcare roles, it reinforces stereotypes that affect how care is delivered and who gets believed.

And then there's stereotype threat—the psychological phenomenon where awareness of belonging to a negatively stereotyped group induces anxiety and reduced confidence. This impairs communication, leads to delayed care, and produces worse outcomes. When AI systems signal or reinforce stereotypes, they intensify these effects. A patient who feels an automated system is predisposed to dismiss their symptoms? They're less likely to seek help, follow advice, or engage with professionals at all. The cycle closes on itself.

Why This Matters Beyond the Exam Room

It's easy to treat this as a healthcare problem. It isn't. The mechanisms at work here operate across every domain where AI touches human life.

The Wang et al. research makes this explicit: public perception of AI carries stereotypical bias toward dominant groups, affecting adoption patterns and access to opportunity. The "AI divide" isn't a future risk—it's already emerging across education, employment, investment decisions, and medical access.

But here's what keeps me up at night: improving AI in healthcare requires more than technical fixes and ethical guidelines. It demands an understanding of the complex psychological processes that shape both human thinking and its machine reflections. We need to stop treating bias as a data quality issue and start treating it as what it actually is—a reflection of who we are.

AI is not an impartial observer of reality. It's a mirror of human cognition—our categories, our simplifications, our prejudices. And because it operates at scale, it amplifies these patterns far beyond what any individual human could achieve. The question isn't whether AI will reflect our biases. It already does. The question is whether we'll keep building systems that turn those reflections into policy.

Trained on Us: How AI Codifies Human Prejudice at Scale

The Myth of Machine Neutrality

When Missing Data Means Missed Diagnoses

Scale Turns Bias Into Systemic Harm

Design Choices That Reinforce the Cycle

Why This Matters Beyond the Exam Room

Related blogs

Aligning the Fable: Inside the Safety Debate Behind Claude Fable 5

Hybrid Sovereignty: Reclaiming Cognitive Independence

Policy on the AI Exponential: Dario Amodei's Blueprint for a Post-Mythos World