Are we in a GPT-4-style leap that evals can't see?
martinald Sunday, November 30, 2025
Summary
The article discusses the possibility of a GPT-4 style leap in AI capabilities that current evaluation metrics may not be able to detect. It explores the potential for AI systems to exhibit emergent behaviors that surpass the abilities of their training data and models.
1
0
Summary
martinalderson.com