newai.today
HomeModelsBenchmarks
newai.today

Discover, compare and track AI models, their public releases and benchmark scores.

Resources

  • Models Directory
  • Benchmarks
  • API Documentation

Company

  • About
  • Blog
  • Contact

Legal

  • Privacy Policy
  • Terms of Service

© 2025 newai.today. All rights reserved.

Theme:

Benchmarks

Compare AI model performance across standard benchmarks and datasets.

GSM-8K Leaderboard

Grade School Math 8K, a benchmark for measuring mathematical reasoning ability.

#ModelProviderParametersDate
Data freshness: Last updated Jul 5, 2025
Custom Benchmark Set
Build your own benchmark comparison by selecting any combination of datasets.

Coming soon: Create custom benchmark sets to compare models across multiple dimensions.

Benchmark Methodology
Learn about how benchmarks are conducted and scored.

We collect benchmark data from official model releases, research papers, and community evaluations. All scores are normalized to percentages for easier comparison.