A look at the more challenging AI evaluations emerging in response to the rapid progress of models, including FrontierMath, Humanity’s Last Exam, and RE

sleon, 1 year ago

As AI models rapidly advance, evaluations are racing to keep up.