← All reviews

AI explains the world

Gemini Evidence Research for PP-002: Two Independent Runs

Gemini 3.1 Pro (High) was independently tasked twice to research empirical evidence on the fuel price shock — once with knowledge of the position paper, once blind with 10 abstract theses. The results converge.

Analysis metadata

AI model Gemini 3.1 Pro (High)
Provider Google
Context window 1,000,000 Tokens
Editor Lukas Geiger (LG)
Date of analysis 3 April 2026
Analysed document PP-002 Fuel Price Shock: War Price Cap Instead of Textbook Economics
Um:bruch — Think Tank for Societal Change, 03.04.2026
Show original prompt (for replication)
Two separate prompts in two separate sessions. Prompt A: Gemini read PP-002 and received the same research assignment as Copilot. Prompt B: A fresh Gemini instance received 10 empirically testable theses derived from PP-002 but without source attribution or context.

Gemini Evidence Research: Two Runs, One Finding

Two separate Gemini instances. Two different prompts. One converging result.


Methodology and Disclosure

This analysis consists of two independent research runs:

Part 1 (Prompt A): Gemini 3.1 Pro (High) read the complete position paper PP-002 and received the same research assignment previously given to Copilot. Gemini therefore knew the framing, context, and sender of the theses.

Part 2 (Prompt B): A fresh Gemini instance (new session, no shared context) received 10 empirically testable theses — derived from PP-002 but without source attribution, without the position paper, without context. Gemini did not know who advocated the theses and was asked, as an independent policy advisor, to assess the state of research.

Objective: Test whether the assessment of the evidence depends on whether the model knows the political context or not.


Part 1: Informed Research (Prompt A)

Gemini read PP-002 and researched both positions.

Position A: Economic Advisors (Grimm/Schnitzer)

A1: Price Signals Reduce Demand Significantly

EvidenceCoglianese et al. (2017); Kilian & Zhou (2020)
MethodsInstrumental variables (refinery outages, hurricanes)
FindingPrice elasticity −0.2 to −0.4 — higher than long assumed
LimitationPetrol remains inelastic; in regions without public transport extremely limited

A2: Direct Price Interventions Distort the Market

EvidenceRWI (2022); ifo Institute / Fuest et al. (2022)
MethodsDifference-in-Differences (fuel rebate DE vs. EU abroad)
FindingAcross-the-board tax cuts partially absorbed by corporate margins
LimitationApplies to fuel rebate; not transferable to targeted transfers

Gemini’s conclusion on Position A: The economic advisors are micro-based right — prices work, and across-the-board market interventions are inefficient. But they confuse a Pigouvian tax with a terms-of-trade shock.

Position B: Um:bruch

B1: High Fuel Prices Are Strongly Regressive

EvidenceBento et al. (2009/2020); Severen & van Benthem (2022); Glaeser (2023)
MethodsHousehold surveys + vehicle fleet data + spatial variables
FindingRegressivity increases because the wealthy switch to EVs and evade the shock
LimitationMeasurement method (expenditure vs. income as benchmark) influences result

B2: Exogenous Supply Shocks Throttle the Domestic Economy

EvidenceBaumeister & Hamilton (2019); Kilian (2009)
MethodsStructural Vector Autoregression (SVAR) with Bayesian inference
FindingSupply shocks significantly and with delay brake economic activity
LimitationEffect depends on energy intensity of sector

Gemini’s overall conclusion (Part 1):

“Position B (Um:bruch) draws empirically more stable conclusions in the context of an exogenous geopolitical shock.”

The economic advisors commit a category error: Pigouvian tax ≠ terms-of-trade shock. Without revenue recycling, the war surcharge leads to maximally regressive permanent burden (Glaeser 2023). SVAR evidence (Baumeister & Hamilton) demonstrates that supply shocks strangle the domestic economy.

Condition: Implementation must take the form of a targeted compensation mechanism (energy dividend), not a market-distorting price cap.


Part 2: Blind Research (Prompt B)

Fresh Gemini instance. 10 theses without context. No PP-002, no Um:bruch, no Council of Economic Experts.

Overview: 10 Theses and Their Evidence Strength

ThesisSummaryAssessment
T1Absolute price increase identical regardless of tax rateStrongly supported
T2Short-term demand inelasticity (−0.10 to −0.30)Supported
T3Single country (2% of world consumption) cannot move world market priceStrongly supported
T4Regressive burden on lower incomesStrongly supported
T5Asymmetric pass-through (“rockets and feathers”)Partially supported, contested
T6Direct payments more targeted than price capsStrongly supported
T7Missing climate dividend undermines acceptanceStrongly supported
T8Energy poverty = health problemStrongly supported
T9Cumulative burden → recession riskStrongly supported
T10Loss of trust → populism susceptibilityStrongly supported

Detailed Results

T1: Price Increase During Supply Shock

  • Evidence: Meyler (2022, ECB Economic Bulletin); IMF Working Papers on Energy Crisis 2022
  • Mechanism: Absolute surcharges identical; percentage-wise cushioned by fixed tax base (“tax dilution effect”)
  • Limitation: Exchange rate effects (EUR/USD) can distort symmetry

T2: Short-Term Demand Inelasticity

  • Evidence: Levin et al. (2017); ifo Institute (2022); meta-analyses (NBER)
  • Finding: Elasticity −0.10 to −0.30 → 20% price increase reduces demand by only 2–6%
  • Limitation: Newer high-frequency data show slightly higher sensitivity for leisure driving

T3: Individual Country as Price Taker

  • Evidence: Kilian, “The Economics of Oil” (2020)
  • Finding: 10% national savings = 0.2% global demand → OPEC+ immediately compensates
  • Limitation: In sub-markets (e.g. TTF gas) regional demand decline can influence local prices

T4: Regressive Burden

  • Evidence: Bach et al. (DIW Berlin, 2022/2023); OECD inflation reports
  • Finding: Lower incomes spend proportionally dramatically more on energy
  • Limitation: Absolute consumption is higher for the wealthy; relative burden concentrates at the bottom

T5: Asymmetric Pass-Through

  • Evidence: Frondel et al. (RWI, 2019/2022)
  • Finding: After the introduction of MTS-K (market transparency unit) in Germany in 2013, partly even “negative asymmetry” — price reductions passed through faster than increases
  • Limitation: Only “wobble candidate” — empirically no longer a uniform picture

T6: Direct Payments vs. Price Caps

  • Evidence: Edenhofer et al. (MCC Berlin); Fuest et al. (ifo, 2022)
  • Finding: Lump-sum climate dividend extremely progressive; fuel rebate extremely regressive (subsidises SUV drivers)
  • Limitation: Lump-sum transfers are expensive; precisely targeted transfers often fail due to bureaucracy

T7: Broken Climate Dividend Promises

  • Evidence: Klenert et al. (2018); Beiser-McGrath & Bernauer (2019); Saldorff et al. (2024)
  • Finding: Acceptance depends on visible redistribution; visibility is decisive
  • Limitation: No weighty counterarguments — even indirect return flows are psychologically insufficient

T8: Energy Poverty and Health

  • Evidence: Thomson et al. (Energy Policy, 2017); Liddell & Morris (2010); WHO-Europe (2022)
  • Finding: Cold homes → respiratory disease, cardiovascular crises, depression, excess mortality
  • Limitation: Somewhat mitigated in Germany by better building standards; “working poor” remain acutely affected

T9: Cumulative Burden → Recession

  • Evidence: ECB, IMF, Council of Economic Experts (2023/2024)
  • Finding: Terms-of-trade loss → “fear saving” → consumption decline → ~50% of GDP affected
  • Limitation: Wage compensation via collective agreements can counteract — but often at the expense of margins or renewed inflation

T10: Trust and Populism

  • Evidence: Algan et al. (2017); Jacques Delors Centre; WZB Berlin
  • Finding: Failure of the welfare state as “insurer” + loss of control → measurable migration to political periphery
  • Limitation: Populism is multicausal (culture, migration, demographics); economics is a catalyst, not the sole cause

Gemini’s Own Addition: What Is Missing?

The theses focus heavily on the short-term shock moment and the distributional aspect. What is excluded is the medium-term allocation and innovation effect: energy price shocks are historically the strongest accelerator for investments in efficiency, electrification, and decarbonisation (heat pumps, insulation, e-mobility). The attempt to keep prices broadly artificially low destroys precisely the incentive mechanisms for structural crisis resolution.

However, the theses excellently demonstrate why hard social compensation mechanisms (climate dividend/targeted transfers) are urgently needed to prevent this transformation process from politically stalling.


Comparison of Both Runs

AspectPart 1 (informed)Part 2 (blind)
Overall verdictUm:bruch empirically more stable9/10 theses strongly supported
Economic advisors right on…Micro-based elasticity; fuel rebate inefficientT2 confirms elasticity; T6 confirms fuel rebate critique
Um:bruch right on…Category error; regressivity; SVAR recessionT1, T3, T4, T6, T7, T8, T9, T10
WeaknessT5 (rockets & feathers) only wobble candidate
Independent findingGlaeser 2023: regressivity rises through EV adoption by the wealthyMedium-term innovation effect as blind spot
ConvergenceYes — both runs reach the same conclusionYes

Core finding: The assessment of the evidence is independent of whether Gemini knew the political context or not. This considerably strengthens the robustness of the findings.


Editorial note (Um:bruch)

This analysis was conducted in two separate Gemini instances. In Part 1, Gemini knew PP-002 and the Copilot prompt (via LG). In Part 2, a fresh instance received 10 empirically testable theses derived from PP-002 but without source attribution or context. The convergence of both runs is methodologically remarkable: even without knowing who advocates the theses, Gemini reaches the same conclusion. This is not proof — but a strong indication that the evidence actually points in one direction.

Gemini’s addition regarding the medium-term innovation effect is an important correction: energy prices should set incentives in the long run. This is precisely why PP-002 calls for a war price cap (temporary) and energy dividend (targeted) — not a permanent price cap. The steering effect is preserved; only the shock is compensated.

✉️ Write to us 📝 Contact form
EvidenceEnergy policyFuel pricesLiterature reviewMacroeconomicsSociologyGeminiDual research