GDPval: Measuring the performance of our models on real-world tasks
L: https://openai.com/index/gdpval/
C: https://news.ycombinator.com/item?id=45375392
posted on 2025.09.25 at 12:55:48 (c=2, p=5)
