AI in Bio Explained

Beyond "Human in the Loop": Practical Considerations for Ethical Oversight of Healthcare AI

"Human in the loop" has become shorthand for human oversight over AI models. But in healthcare — where AI operates at a scale that can overwhelm human attention, and reasons in ways we can't always see — the harder question is which kind of oversight belongs where.

Justin Chen
Author
Chief Technology Officer, BioLiterate
Iphigénie Fossati-Kotz
Contributor
Iphigénie Fossati-Kotz
Non-Executive Director, BioLiterate

In discussions about AI ethics, "human in the loop" is a common refrain that implies human oversight of AI-driven processes and systems. The phrase sounds straightforward: by ensuring humans can observe and have authority over an AI process, we can avoid the worst inaccuracies of AI while still benefitting from its efficiency gains.

However, AI presents unique strengths and challenges that distinguish it from previous technological improvements. AI can be deployed at a scale that can very easily overwhelm human oversight, and due to its "black box" nature, has opaque reasoning that makes its behavior difficult for humans to troubleshoot. Because of these challenges and others, there is real nuance in the phrase "human in the loop" that is worth exploring further.

The framework

The European Commission's Ethics Guidelines for Trustworthy AI (2019) offers a useful structure for this discussion. It distinguishes three modalities of human oversight: Human-In-The-Loop, Human-On-The-Loop, and Human-In-Command. These are sometimes misunderstood as a hierarchy of human involvement; in reality, they reflect distinct oversight functions operating at different levels of the AI lifecycle. Applied to healthcare, the appropriateness of each model is contingent on the nature and severity of risk.

HITLHuman-In-The-Loop


HITL is required in patient-specific contexts — diagnostic support, treatment recommendations, or triage decisions — where clinical judgment, contextual interpretation, and professional accountability are indispensable. Its effectiveness depends immensely on the quality of human engagement.

Ethical considerations cannot be satisfied merely by adding a person somewhere near the output. A clinician who is overloaded, under-informed, or overly deferential to the system may add little real protection, even where formal approval is required. The adequacy of oversight therefore depends on whether that human has the competence, authority, context, and time needed to exercise genuine oversight — otherwise HITL risks degenerating into rubber-stamping, actually undermining the control function it is meant to provide.

HOTLHuman-On-The-Loop


By contrast, HOTL is better suited to large-scale or system-level applications — population health monitoring, workflow optimization, or anomaly detection. Its principal advantage is scalability and continuous system evaluation.

Yet it introduces distinct risks, such as automation bias or diminished vigilance, as human operators transition from active decision-makers to passive supervisors. Effective HOTL therefore depends on robust system design: transparency, auditability, calibrated alert mechanisms, and clear intervention thresholds.

HICHuman-In-Command


HIC addresses a different dimension of ethical control, where responsibility remains anchored at the organisational level — with the governance authority determining whether, where, and under what constraints a system should be used at all. It is particularly important in healthcare, where significant harms may arise from flawed deployment decisions (such as inadequate validation across populations or inappropriate clinical integration) rather than from wrong outputs.

HIC works in tandem with the previous paradigms to ensure processes are continuously evaluated for efficacy and safety. Ultimately, it preserves the capacity to constrain, suspend, or terminate the use of an unsafe system.

The real question is not where the human sits in relation to the system in the abstract, but whether human judgment is meaningfully exercised at the points in the lifecycle where it matters most.

Creating AI-enhanced systems with an appropriate "human in the loop" is destined to fail unless the appropriate model of oversight is applied. Each model carries its own failure mode — and each, applied at the wrong layer, can give the illusion of control while providing none.

See also the World Health Organization's Ethics and Governance of Artificial Intelligence for Health (2021).

Figure 01

Oversight happens at multiple layers

HITL Human-In-The-Loop Precision + Validation
HOTL Human-On-The-Loop Monitoring + Control
HIC Human-In-Command Governance + Strategy
Role
Direct decision oversight
Supervised autonomy with intervention capability
Authority over system deployment and use
Focus
Accuracy of data, models, and outputs
Detection of anomalies, drift, and system failures
Alignment with clinical, ethical, and organisational goals
Requires
Domain expertise and contextual judgment
Robust monitoring systems and clear intervention thresholds
Technical literacy, authority, and governance processes
Risk if
missing
Incorrect outputs leading to flawed decisions
Undetected failures at scale
Misuse, poor deployment, and loss of trust

Human oversight operates at multiple layers — each bringing a different form of judgment. Together, they ensure AI systems are not only effective, but safe and accountable in practice.

Source: BioLiterate · adapted from the EC Ethics Guidelines for Trustworthy AI (2019)
About this series

AI in Bio Explained

This is the first article in AI in Bio Explained, a BioLiterate series that unpacks the concepts, frameworks, and vocabulary shaping AI in the life sciences — translating them into plain, practical terms for everyone to understand. Each piece stands on its own, taking one idea at a time and grounding it in how AI actually shows up in biomedical practice.

Justin Chen
Author

Justin Chen

Chief Technology Officer, BioLiterate

An engineering leader with over 16 years in software, Justin most recently served as Senior Director of Engineering at Hewlett Packard Enterprise, where he led the AI Solutions Engineering team building massively parallel GPU compute platforms for some of HPE's largest and most complex customers.

About
BioLiterate
Founded 2025 · Independent

BioLiterate is an independent research and education nonprofit organization founded in 2025 with a focused mission: to equip biomedical professionals with the knowledge and tools to engage with AI critically, confidently, and responsibly.

We produce curated educational content, a quarterly newsletter, and live and online community events — developed in partnership with organizations across the biopharma, clinical, and research ecosystems. Our work is built on a simple conviction: that responsible AI adoption in biomedicine depends not on hype or fear, but on grounded, peer-relevant evidence.