128 reads

Evaluation: AI Benchmarks Beyond ARC-AGI, MMMU, MLE-bench, and the FrontierMath Test

by
January 15th, 2025
featured image - Evaluation: AI Benchmarks Beyond ARC-AGI, MMMU, MLE-bench, and the FrontierMath Test