Everyone’s been asking when AI will take our jobs, but almost nobody has systematically measured whether models actually can. OpenAI’s new GDPval benchmark, introduced last Thursday, is the first ...