Thursday, January 23, 2025

Even some of the best AI can’t beat this new benchmark

on 3:52 PM in features, Tech News with No comments

The nonprofit Center for AI Safety (CAIS) and Scale AI, a company that provides a number of data labeling and AI development services, have released a challenging new benchmark for frontier AI systems. The benchmark, called Humanity’s Last Exam, includes thousands of crowdsourced questions touching on subjects like mathematics, humanities, and the natural sciences. To make […]

Posted from: this blog via Microsoft Power Automate.

Thursday, January 23, 2025

Even some of the best AI can’t beat this new benchmark

0 comments:

Post a Comment