"AI tutors will revolutionize education" — tired of hearing that, right? But this time it's different. They ran randomized controlled trials (RCTs) in real schools, with real students. Not just once, but multiple times. The results? One experiment showed 2 years of learning gains in just 6 weeks, while another found that simply handing students ChatGPT actually dropped their scores by 17%.

3-Second Summary
3 AI tutor RCTs · Well-designed = +127% improvement · No guardrails = -17% backfire · Key: prompt design + teacher oversight

What is this about?

Between 2024 and 2025, a series of randomized controlled trials (RCTs) testing the learning effects of GPT-4-based AI tutors was published. RCTs are the "gold standard" used in medicine to test new drugs: students are randomly split into groups, one uses the AI tutor and the other doesn't, and then the results are compared.
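
A note on what "standard deviations" means in the results below: effect sizes like Nigeria's 0.31 are typically reported as Cohen's d, the difference between group means divided by the pooled standard deviation. Here is a minimal, self-contained sketch of that computation with synthetic scores (made up for illustration; not the studies' data):

```python
import math
import random
import statistics

# Minimal RCT sketch: randomly assign students to groups, then compare
# post-test scores in pooled-standard-deviation units (Cohen's d).
# All numbers below are synthetic, for illustration only.
random.seed(0)
students = list(range(200))
random.shuffle(students)                      # random assignment is what makes it an RCT
treated_ids, control_ids = students[:100], students[100:]

# Hypothetical post-test scores (0-100); a real study measures these in class.
treated = [random.gauss(62, 15) for _ in treated_ids]   # AI-tutored group
control = [random.gauss(57, 15) for _ in control_ids]   # no-intervention group

def cohens_d(a, b):
    """Difference in means divided by the pooled standard deviation."""
    n1, n2 = len(a), len(b)
    v1, v2 = statistics.variance(a), statistics.variance(b)
    pooled_sd = math.sqrt(((n1 - 1) * v1 + (n2 - 1) * v2) / (n1 + n2 - 2))
    return (statistics.mean(a) - statistics.mean(b)) / pooled_sd

print(f"Effect size: {cohens_d(treated, control):.2f} standard deviations")
```

The simulated gap here (5 points against a 15-point standard deviation) corresponds to a true effect of about 0.33, the same ballpark as Nigeria's 0.31.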

Here are the three key experiments summarized.

3 Key RCT Experiments

Nigeria Experiment (World Bank, 2025): 9 public high schools, 6 weeks of after-school GPT-4 tutoring. Achieved 2 years of learning gains at $48 per student. Effect size of 0.31 standard deviations, ranking in the top 20% of educational interventions.
Turkey Experiment (Penn/Wharton, 2025): ~1,000 high school students, GPT-4 in math class. 'GPT Tutor' (with guardrails) improved scores by +127%, 'GPT Base' (no guardrails) by +48%. But when tested later without AI, the Base group showed a -17% backfire effect.
Harvard Experiment (Kestin et al., 2025): In a university physics class, the AI tutor produced higher learning outcomes than active learning instruction. Student engagement and motivation were also higher.

Wharton professor Ethan Mollick synthesized these results and concluded: "Whether AI helps or hurts learning depends not on the AI itself, but on how it's used."

+127%: GPT Tutor score improvement (Turkey)
-17%: unguarded GPT backfire effect
$48: per-student cost (Nigeria)

What's actually changing?

Until now, the "AI tutors good/bad" debate was a battle of opinions. Now there's data. And what the data says is quite nuanced.

|                        | ChatGPT as-is                            | Designed AI tutor                 |
|------------------------|------------------------------------------|-----------------------------------|
| Learning approach      | Gives answers directly (shortcut)        | Guides with hints and questions   |
| Scores during practice | +48% (AI solves for them)                | +127% (students solve themselves) |
| Test without AI        | -17% (dependency backfire)               | Almost no backfire effect         |
| Student perception     | "I feel like I learned a lot" (illusion) | Actually learned                  |
| Cost efficiency        | Unmeasurable (no real learning)          | $48/student for 2 years of gains  |

The scariest finding came from the Turkey experiment. Students who used ChatGPT without guardrails felt they had "learned a lot," but actually scored 17% lower than students who didn't use AI at all. The autopilot analogy fits: just as relying on autopilot erodes a pilot's manual flying skills, relying on unguarded AI erodes the skills students are supposed to be building.

The Nigeria experiment showed the opposite result. What made the difference?

Why the Nigeria experiment succeeded

Teacher oversight: Teachers guided directly but didn't give answers. AI didn't replace teachers — teachers used AI as a tool.
Curriculum alignment: Prompts were designed to match Nigeria's national curriculum. They didn't just throw random topics at it.
Learning science principles applied: Retrieval practice, elaborative interrogation, contextual examples — proven pedagogical methods were baked into the prompts.
Pair learning: Students interacted with the AI in pairs, working through problems with a friend rather than alone.

Stanford's Tutor CoPilot experiment reached the same conclusion. When AI was used to assist human tutors rather than directly teaching students, it worked. Specifically, students of less experienced tutors saw a 9 percentage point increase in math pass rates — at a cost of just $20 per student per year.

The essentials: How to get started

Whether you're a student, parent, or educator — here are the practical principles these studies reveal.

  1. Use a "don't give the answer" prompt
    Instead of telling ChatGPT "solve this problem," start with "I'm learning this concept. Don't give me the answer — guide me with hints and questions. If I'm wrong, explain why." In the Turkey experiment, this difference separated +127% from -17%. (A code sketch of this guardrail follows after this list.)
  2. Maintain teacher/parent supervision
    This was the key success factor in the Nigeria experiment. Don't hand students off to AI — use AI as a tool while a human manages the overall process.
  3. Review without AI after studying
    After studying with AI, always set aside time to work through problems alone. This is the clearest lesson from the Turkey experiment — performing well with AI help is expected; real learning means performing well without it.
  4. Use the Wharton prompt library
    Professor Mollick's team has published education prompts under Creative Commons. If creating your own is difficult, start here.
  5. Consistency is key
    In the Nigeria experiment, each additional day of attendance produced 0.031 standard deviations of additional effect. It's not about trying it once — consistent use matters.
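
As a concrete version of principle 1, here is a minimal sketch of a "don't give the answer" tutor built on the OpenAI Python SDK. The system prompt below is a hypothetical rendering of the guardrail idea, not the actual prompt used in the Turkey experiment:

```python
from openai import OpenAI

client = OpenAI()  # reads OPENAI_API_KEY from the environment

# Hypothetical guardrail prompt illustrating "guide, don't solve";
# the Turkey experiment's actual prompts are not reproduced here.
TUTOR_SYSTEM_PROMPT = """You are a math tutor. Never state the final answer.
Guide the student with one hint or one question at a time.
If the student's step is wrong, explain why it is wrong.
Always ask the student to attempt the next step themselves."""

def tutor_turn(history: list[dict], student_message: str) -> str:
    """Send one student message and return the tutor's guided reply."""
    history.append({"role": "user", "content": student_message})
    response = client.chat.completions.create(
        model="gpt-4o",  # any chat-capable model works for this sketch
        messages=[{"role": "system", "content": TUTOR_SYSTEM_PROMPT}] + history,
    )
    reply = response.choices[0].message.content
    history.append({"role": "assistant", "content": reply})
    return reply

history: list[dict] = []
print(tutor_turn(history, "Solve 3x + 7 = 22 for me."))
```

Resending the system prompt with every call is what keeps the guardrail active across the whole conversation; without it, the model tends to revert to handing over answers.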

Things to keep in mind

The control group in the Nigeria experiment received no intervention at all. This means it wasn't a direct comparison of AI tutor vs. human tutor. Also, the effect was larger for students with higher digital literacy, raising concerns that AI tutors could actually widen the digital divide.