GMP Bench

Leaderboard

Compare model performance across GMP knowledge and task completion benchmarks. Click a model name to view detailed results.

#ModelOverallKnowledge QATask CompletionAvg LatencyTotal Tokens# Evals
190.3%100.0%41.8%3.4s5k12
283.3%100.0%0.0%12.0s8k6