E69: AI vs Experts: OpenAI’s GDP‑Val Shows 50% Parity, 35% Tipping Point, and Model Matchups (GPT‑5 vs Claude)
Key Insights
Key takeaways
- The “35% rule”: below ~35% win‑rate, AI costs more due to human rework; above it, AI becomes ROI‑positive.
- Formatting is a primary failure mode; adding a prompt‑level checklist improves outcomes by ~5 pts on slide tasks.
- Models differ: Claude 4.1 excels in layout/formatting; GPT‑5 in factuality and calculations; no single “best” model.
- Complex, structured tasks (e.g., slides with context) outperform simple text prompts; context density matters.
- Trajectory: from ~13% (GPT‑4.0 a year ago) to ~50% now; plan for rapid step‑ups through 2026–2027.
For access to the full transcript, join our community!
get started



