OpenAI Scores 0%? DeepSeek V3.2 Speciale Beats GPT-5.2 in New “Impossible” Physics Test
While the internet is laughing at a shocking “0%” score for OpenAI’s flagship model on the new CritPt benchmark, a bigger story is hiding in the data: DeepSeek V3.2 Speciale has officially outperformed GPT-5.1 and Claude Opus 4.5 to become the second-smartest model in the world for hard science. The “Impossible” Test: What is CritPt? … Read more