GMP Bench
LLM Benchmark for Pharmaceutical Manufacturing
GMP Bench evaluates large language models on their ability to understand pharmaceutical Good Manufacturing Practice regulations, generate compliant documentation, and assist with quality-critical tasks. Community-driven test cases ensure relevance to real-world GMP operations.
Top Models
View full leaderboard
What We Test
How It Works
1. Submit a Test Case
Propose a GMP knowledge question or document generation task with a reference answer or scoring rubric.
2. Models Are Evaluated
Each model runs the test case. Knowledge QA answers are scored against the reference answer; document generation tasks are evaluated by an LLM judge.
3. Compare Results
View rankings on the leaderboard. Filter by category, provider, or local vs. hosted deployment.
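The three steps above can be sketched in a few lines of Python. This is an illustrative sketch only, not GMP Bench's actual implementation: the names `GmpTestCase`, `Result`, `score_knowledge_qa`, and `leaderboard` are hypothetical, the normalized exact-match scoring is an assumption (the page does not say how answers are compared to references), and the LLM judge for document generation tasks is left out.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class GmpTestCase:
    """A submitted test case (hypothetical schema, not the real one)."""
    case_id: str
    category: str                           # e.g. "knowledge_qa"
    prompt: str
    reference_answer: Optional[str] = None  # used for knowledge QA
    rubric: Optional[str] = None            # used for judged tasks


@dataclass
class Result:
    """One model's score on one test case (hypothetical schema)."""
    model: str
    provider: str
    deployment: str                         # "local" or "hosted"
    category: str
    score: float


def normalize(text: str) -> str:
    """Lowercase and collapse whitespace before comparing answers."""
    return " ".join(text.lower().split())


def score_knowledge_qa(case: GmpTestCase, answer: str) -> float:
    """Assumed scoring rule: 1.0 on a normalized exact match, else 0.0."""
    if case.reference_answer is None:
        raise ValueError("knowledge QA case needs a reference answer")
    return 1.0 if normalize(answer) == normalize(case.reference_answer) else 0.0


def leaderboard(results, category=None, provider=None, deployment=None):
    """Rank models by mean score, applying the optional filters."""
    scores_by_model = {}
    for r in results:
        if category is not None and r.category != category:
            continue
        if provider is not None and r.provider != provider:
            continue
        if deployment is not None and r.deployment != deployment:
            continue
        scores_by_model.setdefault(r.model, []).append(r.score)
    ranked = [(m, sum(s) / len(s)) for m, s in scores_by_model.items()]
    # Highest mean first; ties are broken alphabetically by model name.
    ranked.sort(key=lambda pair: (-pair[1], pair[0]))
    return ranked
```

Calling `leaderboard(results, deployment="local")` would rank only locally deployed models, mirroring the filter described in step 3. A real scorer would likely need something more tolerant than exact matching, which is one reason open-ended tasks go to a judge instead.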