Emmanuel Prouvèze
AboutProjectsWritingContact
All posts

Tagged: Evals

1 post tagged with "Evals"

June 14, 2026·10 min read

I re-ran the AI Stroop paper on the 2026 frontier. The deficit is real, but the goalposts moved.

A viral paper says transformers fail a 90-year-old attention test as the list gets longer. I reproduced it on the 2026 models. The deficit is real. The cliff just moved out.

AIBuildingEvals

Topics

AI (13)Building (10)GenUI (3)Leadership (3)Architecture (2)Career (2)Design (2)Agents (1)Claude (1)Enterprise (1)Evals (1)Forecasting (1)Health (1)Japan (1)LinkedIn (1)Mindset (1)Technical (1)

Recent Posts

  • I re-ran the AI Stroop paper on the 2026 frontier. The deficit is real, but the goalposts moved.

    June 14, 2026
  • 25 Billion Tokens Later

    June 10, 2026
  • Claude as Health Coach: What I Learned Building HealthPulse

    March 30, 2026
  • How I Gave My AI a Memory

    March 16, 2026
  • AI Won't Do Your Taxes — But It'll Make Them Bearable

    March 13, 2026

© 2026 Emmanuel Prouvèze. All rights reserved.