← All reviews

AI explains the world

Fact Check & Hypothesis Space: Josef Kraus on the Education Crisis

Systematic examination of all factual statements and theses from the WELT interview with the former President of the German Teachers' Association. 13 claims checked, 6 theses confronted with alternative explanations.

Analysis metadata

AI model Claude Opus 4.6
Provider Anthropic
Context window 1.000.000 Tokens
Editor Lukas Geiger (LG)
Date of analysis 6 April 2026
Analysed document WELT-Interview mit Josef Kraus: Man nennt das kindgerecht, ich nenne es eine Versündigung
Felix Seifert (WELT / Business Insider), April 2026
Tools used
WebSearch (Faktenverifikation)Hypothesenraum-KonstruktionMulti-Quellen-Abgleich

Replications with other models

To surface model bias, reviews are replicated with different AI systems.

Microsoft/OpenAI GPT-4o (Copilot) fertig To the review →
Google Gemini 3.1 Pro fertig To the review →
Show original prompt (for replication)
Fact check of all 13 claims + hypothesis space with alternative explanations for 8 core theses. Empirical examination against IQB, PISA, PIRLS (IGLU), TIMSS, Hattie, OECD, KMK curriculum history, Bertelsmann. Expanded with Maslow hypothesis and Overload hypothesis from editorial discussion.

Source: WELT / Business Insider — Interview with Josef Kraus, former President of the German Teachers’ Association (1987–2017), conducted by Felix Seifert. Method: (1) Fact check of all 13 quantitative claims, (2) Hypothesis space with alternative explanations for 6 core theses, (3) empirical examination against current educational research.


Overall Judgment

Josef Kraus identifies real problems of the German educational system. His facts are predominantly correct (9 out of 13 exactly, 3 slightly imprecise, 1 false). However, his causal explanations are systematically undercomplex: He reduces a multi-level phenomenon (underfunding + teacher shortage + Corona + demographic change + partially problematic didactic trends) to a culture-critical grand narrative (“pedagogy of facilitation” / Erleichterungspädagogik).


Part I: Fact Check — 13 Claims

No.Claim (Kraus)AssessmentIn Reality
1By 2035, 70,000 teachers shortPARTIALLY CORRECTKMK (Standing Conference of the Ministers of Education): ~49,000, GEW (Education Union): 70,000+, IW (Economic Institute): 76,000, Klemm (researcher): 85,000. 70,000 is a plausible mean value.
2Teacher shortage known for 25 yearsCORRECTThe “pork cycle” (Schweinezyklus: shortage → study boom → oversupply → shortage) has been documented for decades (bpb).
3Berlin: up to 90% migrant share in classesCORRECT19 of 368 Berlin primary schools have > 90% non-German background (ndH) share. Average: 45%.
49th graders: Failure to meet minimum standards is risingCORRECTIQB (Institute for Educational Quality Improvement) 2024: Historic low point. ~34% miss MSA (Intermediate School Certificate) minimum standard in math.
5Abitur (high school diploma) grades getting ever betterCORRECTAverage 2.52 → 2.41. 1.0-quota rose by 50-70% (2006-2014).
6Lower Saxony cancels written divisionCORRECTFrom 2026/27 (Grades 1-2), from 2027/28 (Grades 3-4). Decimal calculation also dropped.
739% of Berlin police applicants fail the dictationCORRECT4,271 of 10,874 (39.3%). 30% of high school graduates/academics affected. Exactly correct.
8School renovation: ~€10 bn according to KfWFALSEKfW Municipal Panel 2025: €67.8 bn — a factor of 7 higher. Ironically, the error strengthens his argument.
9Barracks renovation: €68 bnCORRECTArmed Forces Commissioner 2024: ~€67 bn.
10184,000 soldiers, 10 million studentsPARTIALLY CORRECTSoldiers: 184,200 ✓. Students: 11.5 million (1.5 million too low).
11350,000 Abitur candidatesPARTIALLY CORRECT2025: ~341,700. In 2026 even fewer due to the G9-switch (reverting to a 9-year Gymnasium).
1240,000 schools in GermanyCORRECT~40,000 (general-education + vocational).
13G8→G9 switchback in BavariaCORRECTCompleted. First G9 cohort: Abitur 2026.

Balance

Count
Correct9
Partially correct3
False1

Kraus argues based on facts. The only real error (School renovation: 10 instead of 68 billion) actually makes his argument stronger.


Part II: Hypothesis Space — 6 Core Theses

Thesis 1: “Pedagogy of Facilitation” (Erleichterungspädagogik) causes declining performance

Kraus’s argument: Less pressure to perform, no grades, open teaching, “democratic school” → performance decline.

Supporting Evidence:

  • Hattie: Direct instruction (d=0.59) > discovery learning (d=0.27)
  • Lipowsky (University of Kassel): Open teaching widens the performance gap
  • Finland’s PISA crash correlates temporally with competency-oriented reforms
  • IQB 2022: 33% miss minimum standards in German

Alternative Explanations:

  1. Corona Effect: Meta-analysis (Betthäuser et al.): 35% learning loss per school year. IQB 2022 maps the post-Corona cohort directly.
  2. Socioeconomic change: 25% of all students are socioeconomically disadvantaged (PISA 2022). The dependence between Socioeconomic Status (SES) → Competency has intensified.
  3. Chronic underfunding: Germany: 4.6% of GDP for education — below the OECD average (4.9%). 10.5% of teachers without a recognized exam.
  4. No linear connection: The PISA decline began before comprehensive implementation of the criticized reforms. English improved despite the same framework conditions (IQB 2022).

Assessment: Partially correct, but massively undercomplex. The reduction to “pedagogy of facilitation” ignores Corona, funding, and demographic change.


Thesis 2: Digitalization harms education

Supporting Evidence:

  • Karolinska Institute Stockholm (2023): “Digitalization has large negative effects on knowledge acquisition.” Recommendation: Return to physical school books.
  • Sweden invests €104 million in printed school books (2022-2025)
  • PISA 2022: Extensive digital device deployment ≠ better results. Inverse correlation after a certain degree of usage.

Alternative Explanations:

  1. Didactics decide, not technology: Learning success depends on didactic embedding, not on the medium.
  2. Private vs. school usage: Uncontrolled leisure consumption (problematic) ≠ didactically embedded usage (potentially beneficial). Kraus conflates both contexts.
  3. Handwriting research more nuanced: Mueller/Oppenheimer (2014) found advantages, replication studies did not.

Assessment: Correct at the core (uncritical digitalization harms), conclusion too sweeping. Sweden isn’t abolishing tech, it is rebalancing.


Thesis 3: Migration as a factor for declining performance

Supporting Evidence:

  • PISA 2022: 59-point advantage without a migration background in math. After SES control: still 32 points.
  • IQB 2022: Reading ability for second-generation 4th graders dropped by 27 points (≈ half a school year).

Alternative Explanations:

  1. Poverty effect instead of origin effect: 42% of students with a migration background are socioeconomically disadvantaged (vs. 25% across the board). A large part of the “migration effect” is a poverty effect.
  2. Canada/Australia show: Things can be different. There, migrant children achieve comparable performance. The negative correlation is not a natural law, but systemic failure.
  3. PISA Special Analysis 2024: Migrant children born in Germany show good performance in international comparison. The problem lies with the first-generation + lack of early language support.
  4. Institutional selection: Germany selects by social origin in grade 4 — narrower than in almost all comparable OECD countries (Bertelsmann).

Assessment: Descriptively correct, causally misleading. Migration is a covariate, not the cause. The phrasing “students of non-German background” shifts responsibility from structures to those affected.


Thesis 4: Grade Inflation (Noteninflation)

Supporting Evidence:

  • Abitur average: 2.52 → 2.41. 1.0-quota (top grade) +70% (2007-2017).
  • In contrast: IQB, PISA, PIRLS (IGLU) show declining actual competencies.
  • Universities document parallel grade inflation (Cologne Study, ZEIT 2024).

Alternative Explanations:

  • FiBS (Research Institute for the Economics of Education and Social Affairs) Study: Only ~0.1 grade points per decade — less dramatic than depicted.
  • Regional variance (Thuringia vs. Lower Saxony) points to evaluation cultures, not a uniform “facilitation trend”.

Assessment: Empirically well-supported. The gap between grades and competency measurements is the strongest proof. The cause, however, is a combination of political pressure and exam formats — not solely “facilitation policy”.


Thesis 5: Inventory Knowledge as the basis for maturity

Supporting Evidence:

  • Willingham (2009): Critical thinking is domain-specific and presupposes subject knowledge.
  • Hattie: Direct knowledge transfer (d=0.59) as a prerequisite for transfer.
  • Karolinska Institute: Warning against acquiring knowledge from uncurated digital sources.

Alternative Explanations:

  • Complementarity instead of opposition: Knowledge and competency are not antithetical (Klafki).
  • Extended Cognition: External storage has always been part of human cognition. The question is: what knowledge must be internalized?

Assessment: At its core, very well supported. Cognitive psychology confirms: Factual knowledge is a necessary condition for critical thinking. But Kraus constructs a false dichotomy — the question is not “knowledge or competence”, but how both interact.


Thesis 6: Teacher shortage exacerbates everything

Supporting Evidence:

  • 2024: 17,374 missing teachers (KMK). 10.5% without a recognized exam.
  • PISA 2022: 15 points difference between good vs. poor teacher accessibility.
  • Hattie: Teacher-student relationship d=0.72. “It depends on the teacher.”

Alternative Explanations:

  • Shortage is regionally/subjectally selective, not a monolithic phenomenon.
  • Primary school shortage overcome from mid-2020s (Bertelsmann 2024).

Assessment: Empirically the most strongly supported. Real, quantified, effects proven.


Part IIb: Expanded Hypotheses (from the editorial discussion)

The following two hypotheses were developed in the follow-up discussion between the editorial team and the AI and checked empirically. They form an analytical counterpole to Kraus’s performance-dogmatism.

Thesis 7: Performance needs societal prerequisites (Maslow Hypothesis)

Counter-thesis to Kraus: Kraus demands “effort” and the “achievement principle”. But performance has prerequisites. If society doesn’t provide these prerequisites, it cannot demand performance.

Mechanism:

  • No food at home → no cognitive performance capability
  • No mobility → even the journey to school is a stressor
  • All-day school instead of half-day school → recovery is missing → poorer performance
  • Dilapidated buildings (Kraus’s own point!) → not designed for recovery phases

Empirical Support:

  • Bertelsmann Foundation (Child Poverty): Poor school performance correlates extremely strongly with a lack of basic resources. Every 5th child in Germany is at risk of poverty. Poverty is a stronger predictor of school failure than origin, language, or pedagogy.
  • DAK (health insurance) Prevention Radar: Record levels of school stress and exhaustion since the introduction of the all-day school. Children spend the whole day in buildings that were architecturally built for half-day operation.
  • Maslow’s Hierarchy of Needs: Physiological and safety needs must be met before cognitive performance is possible. That’s not theory — it’s neurobiology.

Assessment: Empirically strongly supported. Kraus even names the symptoms himself (dilapidated toilets, investment backlog) — but fails to draw the conclusion that the achievement principle stands on a foundation that the state must build first. “Demanding more performance” without “providing the prerequisites” is a logical short-circuit.


Thesis 8: Increasing Complexity with the same resources (Overload Hypothesis)

Counter-thesis to Kraus: Kraus claims the standards are being “lowered”. The history of the KMK curriculum proves the opposite: The standards have massively increased. The students are not failing due to too little expectation, but due to too much — with simultaneously fewer resources.

The Historical Paradigm Shift (KMK Documentation):

PhaseTime PeriodCurriculum TypeWhat was demanded
Before 2000Kraus’s active timeInput Curriculum (subject matter catalogs)Reproduction: memorize capitals, recite poems, perform written division. Clearly measurable: right or wrong.
After PISA 2000From 2003/2004Output Standards (competency orientation)Transfer, problem-solving, media evaluation, source criticism. “Evaluate complex non-fiction texts, check the reliability of digital sources, transfer mathematical models to everyday phenomena.”

Why the change? Because German students failed the PISA test in 2000 precisely on “dead book knowledge”: They could recite facts, but not apply them to new problems. The KMK reacted with a historic turning point: Away from “What is told at the front?” towards “What can the student do at the end?”

The Paradox:

  • Today’s KMK standards demand cognitively significantly more demanding performances than the old subject catalogs.
  • A “rote learner” of the 1970s/80s would fail the modern competency expectations of the upper school (Oberstufe).
  • Added to this were massively new cross-sectional tasks: Algorithms, interculturalism, ESD (Education for Sustainable Development), AI literacy, media competence (KMK strategy “Education in the Digital World”, 2016/2021).
  • But the timetable has hardly grown. The same time, significantly more and more complex material, less personnel.

Empirical Support — Exhibit: Curriculum Comparison 1980 vs. 2022

The following comparison shows the same thematic core (History: Weimar Republic; Math: Division) in the language of the respective era. The examples are synthetic condensations — structural blueprints derived from official documents, not literal single quotes.

1980s Curriculum (Input-Catalog, Reproduction):

“The student learns about the epoch of the Weimar Republic. Compulsory to treat in class are: The November Revolution 1918, the Treaty of Versailles, the Weimar Constitution, the crisis year 1923 (inflation, Hitler putsch), the Stresemann era and the failure of democracy due to the Global Economic Crisis in 1929.”

Mathematics: “Learning written division with multi-digit divisors.”

Source basis: Uniform Examination Requirements in the Abitur Examination (EPA) for History (KMK resolution of Dec 1, 1989) as well as the curricular syllabi for the Gymnasium in Bavaria based on them (Official Gazettes of the Bavarian State Ministry for Education and Culture, mid-1980s). In the EPA 1989, the focus was on the recording of binding historical cuts. Implementation at the state level basically consisted of chronological lists of topics to be worked through.

2022/2023 KMK Standard (Output, Competency Orientation):

“The students possess pronounced historical subject and methodological competencies. They can historically deconstruct the multi-causality in the failure of early democracies and analyze it based on criteria. They use their acquired judgment competency to multi-perspectivally evaluate the scope of action of historical actors in social or economic crisis situations. Furthermore, they are able to critically evaluate media and digital sources, question societal narratives, and derive an independent historical orientation for the resilience of modern democracies from the collapse of historical systems.”

Mathematics: “The students penetrate mathematical structures and can model division and distribution operations flexibly and semi-written, verbalize their own solution strategies as well as argumentatively check their validity in cooperative learning settings.”

Source basis: The Uniform Examination Requirements (EPA) History (KMK resolution of Dec 1, 1989, revised Feb 10, 2005) still apply to history. The more modern language of competency (“deconstruct”, “multi-perspectivally evaluate”) can be found in the current state-specific core curricula (e.g. NRW Core Curriculum History Secondary Level II, 2014) as well as in the EPA revisions. Note: KMK educational standards for History AHR do not exist yet — EPAs, not educational standards, continue to apply to history. For Mathematics: Educational standards in the subject area of mathematics for the primary sector (KMK resolution of Oct 15, 2004, new version Jun 23, 2022), p. 11 ff., process-related competencies: “The students can describe, compare and evaluate solution paths [as well as] find mathematical models for real-life situations.”

What the comparison shows:

Dimension1980s2022
Cognitive LevelReproduction (knowing facts)Synthesis, deconstruction, transfer, judgment formation
LanguageConcrete, close to everyday lifeAcademic, abstract (“evaluate based on criteria”)
Digital CompetencyNon-existentSource criticism, narrative analysis, media evaluation
TestabilityBinary: right/wrongComplex: process evaluation, argumentation
Who can handle itAnyone who can learn by heartWhoever brings academic vocabulary + transfer thinking
CategoryTopic (finite, checkable)Meta-ability (potentially infinite)

The category shift — the deepest finding:

A = “Weimar Republic” → a concrete, limited topic. Learnable. Checkable. A’ = “Ability to multi-perspectivally deconstruct the failure of any democracy” → a meta-competency that shows itself differently in every topic.

A was a topic. A’ is everything one needs to be able to do in order to deal with any topics that are like A.

A is finite. A’ is potentially infinite. The student can never “finish learning” A’. The teacher can never “prove” that he has taught A’. The exam can test A’ on any randomized material — the student does not know what to practice at the desk. That is not a simplification. That is a dissolution of boundaries.

The Consequence: A student from the 1980s who knew all the data of the Weimar Republic by heart, passed. The same student would fail today’s standards because he did not learn to “deconstruct”, “evalute multi-perspectivally” and “question narratives”. The requirements have not sunk — they have been raised to an academic meta-level.

And that explains the gap: Children from educated families bring the vocabulary and transfer thinking with them. Children without familial backing face formulations that overtax them linguistically and cognitively — regardless of whether they know the facts.

Further Empirical Support:

  • KMK Educational Standards (publicly documented): Comparing the formulations from 1990 vs. 2022 shows a massive increase in cognitive requirements.
  • ICILS Studies: Computer and information competency as a new basic competency — didn’t exist 20 years ago.
  • KMK Strategy “Education in the Digital World” (2016): 60+ individual competencies in 6 competency areas — in addition to all subject standards.
  • Curriculum Comparison 2000 vs. 2025: Computer Science, Media Competency, ESD, Consumer Education added — without equivalent reduction.

Conclusion: The fact that many students today fail the IQB minimum standards is very probably not because the standards were lowered or the students became “dumber”. It is because the requirement level regarding connected thinking and problem-solving competence has massively increased — while at the same time the foundation (reading ability, parental support, teaching staff, infrastructure) is collapsing in many milieus.

A critic like Kraus then reads “Students can no longer calculate” and shouts “School has become too easy!”. The KMK history says the opposite: Schools today are trying to convey cognitively much more demanding things than during Kraus’s active time — but are stumbling over the fraying social edges of society.

Deep Dive: Three systemic design errors of competency orientation

The curriculum comparison shows the increase in requirements. But the comparison reveals something deeper — three structural problems that explain why the system is imploding on both sides (teacher and student):

1. Shift of Responsibility: From the teacher to the teacher-as-miracle-worker

DimensionOld CurriculumNew Standard
AddresseeThe Teacher: “You must cover A, B, C”The Student: “You must be able to do A, B, C”
Fulfillment of DutyWhen checked off → job doneIf the student cannot do it → bad method?
ResponsibilityTeacher provides input (student’s responsibility to retrieve)Teacher is responsible for output (system’s responsibility to deliver)

In the old system, the teacher was relieved when he had presented the material. In the new system, he is held responsible for the outcome of the student. However, since talent, parental home, mental state, and social environment vary massively, the teacher must compensate for factors he has no access to. That is structural overload.

2. The problem of tangibility: What do I practice at the desk?

In the past, learning was like a checklist: “Learn these vocabulary words, these dates, this formula. If you do that, you’ll get a B.” That gave security and self-efficacy.

Today, the assignment is: “Analyze and evaluate multi-perspectivally.” An insecure student does not know what to do in the afternoon. How do you practice “judgment competency” at home? How do you prove “media competency”? What will be in the exam? The diffuse, highly academic competency ideal leaves weak students without mental anchor points. Small experiences of success through “checking off” are eliminated. The result is resignation.

3. Illusion of universal feasibility

The modern standards are based on a hyper-optimistic view of humanity: Everyone can achieve everything if the didactics are right. This ignores biological, cognitive, and social realities. Some students are brilliant in clear, routinized procedures (ironclad rules, predictable processes), but fail at the meta-level (transfer, modeling). By defining only meta-competency as the standard, the system devalues rule-based knowledge — and those students who need reliable routines.

Interaction of the three errors:

Teacher: "I am supposed to deliver results that I cannot control" → Burnout
Student: "I don't know what I'm supposed to do to pass"            → Resignation
System:  "Everyone can do everything if we do it right"           → Denial of reality

The irony regarding Kraus: He feels this exhaustion in the system precisely — but interprets it as a consequence of “too little effort”. The analysis shows the opposite: School has become more demanding, more boundless, and psychologically more tiring. It burdens students with invisible, abstract requirements that they cannot grasp. And it burdens teachers with an illusion of feasibility on which they burn out.


Assessment: Empirically strongly supported by KMK curriculum history + educational sociology research on teacher burnout + psychological research on self-efficacy. Turns Kraus’s core thesis upside down: Not a lowering, but an elevation combined with under-supply and systemic design errors are the problem.


Part III: Synoptic Assessment

ThesisEvidence for KrausAlternative ExplanationsOverall Judgment
1. Pedagogy of FacilitationMediumStrongPartially correct, undercomplex
2. Digitalization harmsMedium-HighMediumCore correct, too sweeping
3. MigrationDescriptively correctStrongCorrelation ≠ Causation
4. Grade inflationHighWeakWell-supported
5. Inventory knowledge necessaryHighMediumCorrect at the core
6. Teacher shortageVery highWeakStrongly supported
7. Maslow (Prerequisites)— (Counter-hypothesis)Strongly supported — Kraus’s blind spot
8. Overload (Complexity)— (Counter-hypothesis)Strongly supported (KMK curriculum history) — turns Kraus’s thesis around

Meta-Diagnosis

Kraus’s strongest theses (4, 5, 6) are those in which he describes concrete phenomena. His weakest (1, 3) are those in which he constructs culture-critical grand narratives. The counter-hypotheses developed in the follow-up discussion (7, 8) prove to be empirically at least as strong as Kraus’s core theses — and partially turn his diagnosis around.

The empirical evidence supports a multi-level model of the educational crisis:

  1. Structural underfunding (4.6% GDP, €67.8 billion renovation backlog)
  2. Teacher shortage (quantitatively and qualitatively)
  3. Corona shock (35% learning loss, especially for the already disadvantaged)
  4. Demographic/social change (growing heterogeneity without adapted support)
  5. Missing societal prerequisites (child poverty, all-day school without recovery, dilapidated infrastructure)
  6. Increased requirements with dwindling resources (KMK standards 2022 vs. 1980: cognitive leap from reproduction to synthesis/transfer/deconstruction)
  7. Partially problematic didactic trends (uncritical digitalization, softening of exam formats)

Factor 7 partially corresponds to Kraus’s analysis — but it only explains a small part of the variance. Factors 1–6 are empirically more strongly supported and explain the larger share.

The central irony: Kraus complains of a “lack of demanding standards” — but the history of the KMK curriculum proves that schools today are cognitively more demanding than ever before. Students don’t fail due to too little expectation, but due to a system that has to service exploding requirements with shrinking resources.


Sources

Educational Studies

  • IQB-Bildungstrend 2022 and 2024 (Humboldt University Berlin)
  • PISA 2022 (OECD / TUM)
  • IGLU (PIRLS) 2021 (IEA)
  • TIMSS 2023 (IEA / BMBF)
  • Hattie, J.: Visible Learning (2009/2015)
  • Willingham, D.: Why Don’t Students Like School? (2009)

Fact Check Sources

  • KfW Municipal Panel 2025
  • Destatis: School statistics 2025/26
  • Report of the Armed Forces Commissioner 2024
  • Berlin Police Dictation: Business Insider / Tagesspiegel
  • Lower Saxony Division: News4teachers / NWZ
  • KMK Teacher Demand Forecast 2025

Alternative Explanations

  • Betthäuser et al. (Sciences Po): Corona learning losses meta-analysis
  • Karolinska Institute Stockholm (2023): Digitalization
  • Bertelsmann Foundation: 20 Years of PISA / Social Educational Inequality
  • FiBS: Inflation of Abitur grades (Dohmen)
  • Lipowsky (University of Kassel): Open teaching
  • OECD: Education at a Glance

Analysis: Um:bruch Editorial Team | Model: Claude Opus 4.6 | Date: 2026-04-06 | Method: REV (AI Review) + Hypothesis Space

✉️ Write to us 📝 Contact form
bildungfaktencheckbildungspolitikschule