← All reviews

AI explains the world

Copilot Analysis: Josef Kraus on the Education Crisis — Fact Check & Hypothesis Space

Microsoft Copilot's independent analysis of the same WELT interview. Fact check, hypothesis space H1-H6, international contextualization with PISA, TIMSS, and smartphone policies.

Analysis metadata

AI model GPT-4o (Copilot Smart Plus)
Provider Microsoft/OpenAI
Context window 128.000 Tokens
Editor Lukas Geiger (LG)
Date of analysis 6 April 2026
Analysed document WELT-Interview mit Josef Kraus: Man nennt das kindgerecht, ich nenne es eine Versündigung
Felix Seifert (WELT / Business Insider), April 2026
Tools used
WebrechercheHypothesenraum H1-H6

Replications with other models

To surface model bias, reviews are replicated with different AI systems.

Anthropic Claude Opus 4.6 fertig To the review →
Google Gemini 3.1 Pro fertig To the review →
Show original prompt (for replication)
Fact check of all claims + hypothesis space with alternative explanations. International contextualization (PISA/TIMSS, smartphone policies, digitalization studies).

Source: WELT / Business Insider — Interview with Josef Kraus, former President of the German Teachers’ Association. Model: Microsoft Copilot (GPT-4o-based). Note: This analysis was created independently and simultaneously with a Claude analysis. Comparison of both: → Three AI, one interview


Object and Status of the Text

The text is an interview/opinion piece featuring Josef Kraus. It contains:

  • Verifiable factual claims (numbers, trends, institutional statements)
  • Interpretative theses (causal claims, value judgments)
  • Rhetorical exaggerations (“woke pedagogy”, “sin against children”), which are not strictly empirically testable

Fact Check

ClaimTestabilityFindingEmpirical Support
Student performance “worrying”, minimum standards missed more frequentlyhighcorrectIQB (Institute for Educational Quality Improvement) Education Trend 2024: significant decline
”In international comparison, students are falling behind”mediumplausible, but unevidenced in the interviewPISA/TIMSS references not cited
70,000 teachers short by 2035highnot supported as statedStanding Conference of the Ministers of Education (KMK): ~49,000; number varies by source
Teacher shortage known for 25 yearslow-mediumpartially evaluativeLong-term bottlenecks are documented
Berlin: Migrant share up to 90%mediumunclearPossible, but not robust without an official reference in the text
Knives/metal detectors at schoolslow-mediumanecdotalAn exaggeration without systematic statistics
Abitur (high school diploma) grades getting better, minimum standards worsemediumpartially provableIQB proves issues with minimum standards
Lower Saxony cancels long divisionhighdisputed/context-dependent”Not abolished, but initiated differently”
39% police dictation failure rate, 30% are Abitur holdershighessentially correctParliamentary inquiry confirms this
KfW (development bank): Schools need ~€10 billion renovationhighappears clearly false/too lowKfW Municipal Panel: ~€54.8 billion (2024)
Denmark: Smartphone ban in schoolsmediumplausibleReports on restrictions exist

Interim Conclusion

Strongest points: IQB performance declines, teacher demand forecasts, police test, investment backlog. Weakest points: Migration/discipline statements without data, “woke pedagogy” as a causal claim.


Hypothesis Space: Why are Competencies Declining?

Copilot identifies six competing hypotheses:

HypothesisMechanismStatus
H1: Curricular Lowering/Grade Inflation (Noteninflation)Standards get easier → grades rise, competencies sinkPartially plausible
H2: Pandemic/Lag EffectsLearning interruptions → cohort effectPlausible, discussed by IQB
H3: Composition of the Student BodyMore heterogeneity with insufficient supportPartially supported, causally open
H4: Teacher Shortage/Cancelled ClassesUnfilled positions → less effective learning timeHighly plausible
H5: Infrastructure/Organizational DeficitsRamshackle buildings, administrative bottlenecks → worse learning environmentStrongly proven as a problem
H6: Digitalization/Smartphone DistractionDistraction, less “Deep Work” → lower learning performancePolitically salient, empirically thin

Core Finding

None of the six models explains the decline on its own. Kraus emphasizes H1 and H3, but the evidence leans more towards H2 + H4 + H5 as the main drivers.


International Contextualization

PISA/TIMSS/ICILS

StudyAge GroupFocusFinding for Germany
PISA 202215-year-oldsApplication competencyClear declines in Math/Reading/Science
TIMSS 2023Grades 4+8Curriculum-closePrimary school stable (524/515 points)
ICILSGrade 8Digital competencyMore devices ≠ more competency

Smartphone Policies Internationally

Policy TypeEvidence Base
Strict BanEffects small to moderate, more social than performance-related
Recess RulesPlausible, heavily depends on implementation
Didactically IntegratedOnly works with clear didactic strategies

Copilot’s Core Statement: “Bans are not a magic lever, but as part of a package (rules + media education + teaching quality) they are rationally justifiable.”

Digitalization

“Digitalization without an attention and didactic regime is a pricey redistribution of learning time to screen time.”


Assessment

Sustainable

  • Measurable decline in competence in central domains (IQB)
  • Teacher supply structurally strained
  • School infrastructure is a massive investment problem
  • Basic competencies even problematic in selection scenarios (police test)

Overstretched

  • Data uncertainty: Teacher shortage figure (70,000) and KfW figure (~10 billion) are not robust
  • Causal reduction: “Performance sinks” → “modern pedagogy/migration is to blame” — not conclusive without controlled evidence
  • Migration as an explanation: Confounding with socioeconomic status (SES), language level, school segregation
  • Lower Saxony example: Depending on the source, it is framed didactically differently

Analysis: Microsoft Copilot (GPT-4o) | Editorial preparation: Um:bruch | 2026-04-06

✉️ Write to us 📝 Contact form
bildungfaktencheckbildungspolitikschule