I re-ran the AI Stroop paper on the 2026 frontier. The deficit is real, but the goalposts moved.
A viral paper says transformers fail a 90-year-old attention test as the list gets longer. I reproduced it on the 2026 models. The deficit is real. The cliff just moved out.