AI is learning when it should and shouldn’t defer to a human

Is a doctor or an AI system better at diagnosing a particular patient? This machine-learning tool makes the call.
August 5, 2020
[Photo: A doctor examines a magnetic resonance image of a human brain. Mark Schiefelbein/AP]

The context: Studies show that when people and AI systems work together, they can outperform either one acting alone. Medical diagnostic systems are often checked over by human doctors, and content-moderation systems filter what they can before requiring human assistance. But algorithms are rarely designed to optimize for this AI-to-human handover. If they were, the AI system would defer to its human counterpart only when that person could actually make a better decision.

The research: Researchers at MIT’s Computer Science and Artificial Intelligence Laboratory (CSAIL) have now developed an AI system that performs this kind of optimization based on the strengths and weaknesses of its human collaborator. It uses two separate machine-learning models: one makes the actual decision, whether that’s diagnosing a patient or removing a social-media post, and the other predicts whether the AI or the human is the better decision maker.
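
To make the two-model design concrete, here is a minimal sketch in Python of how a classifier-plus-rejector pair could be wired together. This is not CSAIL's implementation: the DeferralSystem class, the choice of logistic regression, and the human_correct labels are all hypothetical stand-ins, and the rejector here is trained after the classifier on its mistakes rather than jointly, as a simplification.

import numpy as np
from sklearn.linear_model import LogisticRegression

class DeferralSystem:
    """Hypothetical sketch of the two-model setup, not the paper's code."""

    def __init__(self):
        self.classifier = LogisticRegression(max_iter=1000)  # makes the actual decision
        self.rejector = LogisticRegression(max_iter=1000)    # predicts who should decide

    def fit(self, X, y, human_correct):
        # X: case features; y: true labels; human_correct: 1 where the
        # human expert got the training case right, 0 otherwise.
        self.classifier.fit(X, y)
        model_correct = (self.classifier.predict(X) == y).astype(int)
        # The rejector learns to flag cases where the human outperformed
        # the model, i.e. where deferring would have helped. (Assumes the
        # training data contains both kinds of cases.)
        defer_target = (human_correct > model_correct).astype(int)
        self.rejector.fit(X, defer_target)

    def decide(self, x, human_decision):
        # Hand the case to the expert when the rejector says to defer.
        x = np.atleast_2d(x)
        if self.rejector.predict(x)[0] == 1:
            return human_decision
        return self.classifier.predict(x)[0]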

The latter model, which the researchers call “the rejector,” iteratively improves its predictions based on each decision maker’s track record over time. It can also take into account factors beyond performance, including a person’s time constraints or a doctor’s access to sensitive patient information not available to the AI system.
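
As a rough intuition for how a rejector could improve from each decision maker's track record, here is a toy, feature-free version (my construction, not the authors') that keeps running accuracy estimates for the human and the model, and folds in a cost term standing in for factors like the expert's time:

class OnlineRejector:
    # Toy illustration only: the real rejector conditions on the case
    # itself, while this one tracks overall accuracies. human_cost is a
    # hypothetical stand-in for factors like the expert's time constraints.
    def __init__(self, human_cost=0.0):
        self.human_right = self.human_seen = 0
        self.model_right = self.model_seen = 0
        self.human_cost = human_cost

    def update(self, human_was_right, model_was_right):
        # Called after each case, once both answers can be graded.
        self.human_right += int(human_was_right)
        self.human_seen += 1
        self.model_right += int(model_was_right)
        self.model_seen += 1

    def should_defer(self):
        # Defer only if the human's estimated accuracy, minus the cost
        # of consulting them, still beats the model's estimated accuracy.
        human_acc = self.human_right / max(self.human_seen, 1)
        model_acc = self.model_right / max(self.model_seen, 1)
        return human_acc - self.human_cost > model_acc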

The results: The researchers tested the hybrid human-AI approach in a variety of scenarios, including image-recognition tasks and hate-speech detection. The AI system was able to adapt to the expert’s behavior and defer to them when appropriate, allowing the two decision makers to quickly reach a combined accuracy higher than that of a previous hybrid human-AI approach.

Case study: While these experiments are still relatively simple, the researchers believe such an approach could eventually be applied to complex decisions in health care and elsewhere. Consider an AI system that helps doctors prescribe the right antibiotic. Broad-spectrum antibiotics are highly effective, but their overuse can lead to antibiotic resistance; specific antibiotics avoid that problem but should be used only if they have a high chance of working. Given this trade-off, the AI system could learn to adapt to doctors with different prescribing biases, correcting for tendencies to over- or under-prescribe broad-spectrum antibiotics.
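
As a back-of-the-envelope sketch of that bias correction, the snippet below estimates a doctor's broad-spectrum prescribing rate from their history and defers only when it sits near a rate the system considers appropriate. Every name and number here (prescriptions, target_broad_rate, tolerance) is illustrative, not drawn from the study.

def broad_rate(prescriptions):
    # Fraction of past cases treated with broad-spectrum drugs;
    # 'prescriptions' is a hypothetical non-empty list of
    # "broad"/"specific" strings recording a doctor's choices.
    return sum(p == "broad" for p in prescriptions) / len(prescriptions)

def defer_to_doctor(prescriptions, target_broad_rate=0.4, tolerance=0.1):
    # Defer when the doctor's broad-spectrum rate is close to the rate
    # the system estimates is appropriate; flag or override otherwise.
    return abs(broad_rate(prescriptions) - target_broad_rate) <= tolerance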
