Aha! Here is the paper on the research behind this story (ht @leftpaddotpy):
https://arxiv.org/abs/2211.00241
The researchers’ abstract puts it quite crisply: “Our results demonstrate that even superhuman AI systems may harbor surprising failure modes.”