>>88005402
They say it can't be compared because it takes longer, etc., but that isn't true: you can still run the benchmarks and get a score. Pic related is what OpenAI claims. Anthropic got caught off guard, as did everyone else, by Chain of Thought improving benchmark results by leaps and bounds. Of course, it doesn't matter for RP.