
AI Tutors Double Learning Outcomes — What the Harvard Study and OECD 2026 Report Are Warning Us About

Students using an AI tutor learned twice as much as those in a conventional classroom — and in less time.

That was the finding of a 2025 study by Harvard researchers. The education world took notice, and the media ran with it. But just months later, the OECD's Digital Education Outlook 2026 offered a sobering counterpoint: "Students who used AI completed tasks 48% more successfully. But when AI was removed, their performance dropped by 17%."

What is going on here? Is AI the savior of education, or a new danger?


Table of Contents

  1. What Happened in the Harvard Physics Classroom
  2. The Double-Edged Nature of Generative AI: OECD's Findings
  3. When AI Helps Learning — and When It Harms It
  4. A Practical Guide for Teachers and Students

1. What Happened in the Harvard Physics Classroom

Study Design: Same Students, Two Conditions

In June 2025, a paper published in the journal Scientific Reports sent ripples through the education research community. Led by Harvard physics lecturer Gregory Kestin and senior lecturer Kelly Miller, the research team ran a rigorous randomized controlled trial with 194 undergraduate students.

The heart of the experiment was a crossover design. The same students learned one topic (surface tension) through an AI tutor and another topic (fluid flow) in a traditional active-learning classroom. This approach controls for individual student ability, allowing a fair comparison between the two teaching methods.

The AI tutor was called PS2 Pal — not a generic ChatGPT prompt, but an education-specific chatbot carefully engineered with learning science principles for the physics course. Rather than giving students answers directly, it guided them through Socratic questioning so they could build understanding themselves.
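The paper does not publish PS2 Pal's prompt or implementation, but the Socratic behavior described above can be sketched as a system prompt for any chat-style model. Everything in this snippet (the prompt wording, the `build_messages` helper, the message format) is a hypothetical illustration, not the course's actual code:

```python
# Illustrative sketch only: PS2 Pal's real prompt is not public. This models
# the Socratic approach the paper describes, as an assumed system prompt.

SOCRATIC_TUTOR_PROMPT = """You are a physics tutor for an undergraduate course.
Never state the final answer. Instead:
1. Ask one guiding question at a time, aimed at the student's current step.
2. If the student is stuck, hint at the relevant concept (e.g. surface
   tension), not the solution.
3. Ask the student to explain their reasoning before moving on."""

def build_messages(history, student_message):
    """Assemble a chat-completion message list with the tutor persona first."""
    return (
        [{"role": "system", "content": SOCRATIC_TUTOR_PROMPT}]
        + history
        + [{"role": "user", "content": student_message}]
    )
```

The design lives in the instructions rather than the code: the model is told to withhold answers and interrogate the student's reasoning instead of completing the task for them.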

The Results: What the Numbers Show

| Measure | AI Tutor | Traditional Class |
| --- | --- | --- |
| Learning gain | 2× above baseline | Baseline |
| Time spent (median) | 49 minutes | 60 minutes |
| Engagement rating | 4.1 / 5 | 3.6 / 5 |
| Motivation rating | 3.4 / 5 | 3.1 / 5 |
| Statistical significance | p < 10⁻⁸ | — |

This was not a simple "use ChatGPT" experiment. A pedagogically designed AI tutor produced measurably better results than an active-learning classroom led by experienced instructors, and the difference was highly statistically significant.

The Researcher's Own Warning

Remarkably, Kestin himself added a caution when announcing the results:

"While AI has the potential to supercharge learning, it could also undermine learning if we're not careful. AI tutors shouldn't 'think' for students, but rather help them build critical thinking skills."

This warning aligns precisely with the core message of the OECD report released in the same year.


2. The Double-Edged Nature of Generative AI: OECD's Findings

OECD Digital Education Outlook 2026

In January 2026, the OECD published the Digital Education Outlook 2026: Exploring Effective Uses of Generative AI in Education — a report analyzing the educational impact of generative AI across global education systems. Its central finding can be summarized in one sentence:

"Generative AI amplifies good pedagogy — and it amplifies bad pedagogy too."

48% Better, Then 17% Worse

Among the studies cited in the report, one finding was striking. Students who used AI completed tasks 48% more successfully. But when tested without AI, their performance dropped by 17%.

AI had not helped them learn — it had done the work for them. Students had not learned how to produce better results; they had learned how to depend on AI. The OECD called this phenomenon "metacognitive laziness."

TALIS 2024: The Reality in Classrooms

According to the OECD's teacher survey (TALIS 2024), 37% of lower-secondary teachers are already using AI in their teaching. Of these, 57% said AI helps them write or improve lesson plans. At the same time, 72% of teachers expressed concern about students submitting AI-generated work as their own.

AI adoption in classrooms is already a reality. The question is how it is being used.


3. When AI Helps Learning — and When It Harms It

When It Helps: When Pedagogical Design Is Present

The OECD report sets a clear standard: when AI tools are designed with intentional pedagogical purpose, learning outcomes improve meaningfully.

One particularly interesting finding: less-experienced tutors who used educational AI tools achieved better student learning outcomes than experienced tutors working alone. AI compensated for gaps in expertise. In collaborative learning scenarios, using AI as an information hub or peer contributor led to substantial improvements in critical thinking and teamwork.

When It Harms: The "Fast Use" Trap

The OECD distinguishes between two modes of AI use. Fast use means deploying AI to produce immediate outputs, while slow use means using AI to support iterative exploration and reflection for creative development.

The problem is that most students choose fast use — asking ChatGPT to write their essays or requesting step-by-step solutions to math problems. This eliminates productive struggle, the very cognitive effort that makes learning stick. AI removes the process that is the learning.

  • AI use that helps learning: Concept exploration through Socratic questioning, requesting feedback on errors, asking for multiple explanations of a concept
  • AI use that harms learning: Asking AI to write assignments, requesting complete problem solutions, generating essay drafts wholesale

4. A Practical Guide for Teachers and Students

Teachers: Use AI as a Co-Designer

The OECD report strongly recommends co-design — teachers and students working together to shape how AI tools are used. When teacher expertise is embedded in AI tool design, learning outcomes are maximized.

Practical steps:

  • Shift to process-oriented assessment: Evaluate not just the final product, but how students interacted with AI along the way
  • Teach prompt engineering: Help students learn how to ask AI good questions, not just use it as an answer machine
  • Assign AI critique tasks: Activities that ask students to analyze AI responses and find errors build critical thinking naturally

Students: Don't Outsource the Hard Part

The key to Harvard's success was that the AI engaged with students' thinking process, not their output. PS2 Pal didn't give answers β€” it asked questions. Students still had to think.

Three questions to ask yourself when using AI:

  • "Could I do this without AI's help?"
  • "Did AI replace my thinking, or help me think more deeply?"
  • "Can I explain this right now without AI?"

In Closing

Both the Harvard study and the OECD report converge on the same truth. AI can be a game changer in education — but only when it leads students toward better thinking, not when it thinks for them.

In education, the right role for AI is not to provide faster answers but to provoke deeper questions. The ability to understand and design around that distinction, for teachers and students alike, is becoming the most important competency of our time.


Are you using AI in your teaching or learning right now? What approach has worked best for you? Share your experience in the comments!

