A look at the more challenging AI evaluations emerging in response to the rapid progress of models, including FrontierMath, Humanity’s Last Exam, and RE sleon 1 year ago As AI models rapidly advance, evaluations are racing to keep up. Read More » Related posts: Motorola Moto G8 video leak reveals triple rear cameras Microsoft’s latest Windows 10 20H2 previews bring a big pile of fixes Samsung Galaxy Unpacked recap: All the Galaxy S22 and Galaxy Tab S8 news PlayStation VR2 launches February 22, 2023; 11 new titles announced