Study identifies weaknesses in how AI systems are evaluated | Heykuki News
oii.ox.ac.uk
416 points by pseudolus, 7 months ago
Paper:
https://openreview.net/pdf?id=mdA5lVvNcU
Related:
https://www.theregister.com/2025/11/07/measuring_ai_models_h...
192 comments