Heykuki News
Top
New
Best
Ask
Show
Jobs
Toggle theme
Login
Top
New
Best
Ask
Show
Jobs
Show HN: LLM Benchmarking Suite
github.com/dhyaneesh
2 points
Dhyaneesh
a year ago
A comprehensive benchmarking suite for evaluating Gemma and other language models on various benchmarks including MMLU (Massive Multitask Language Understanding) and GSM8K (Grade School Math 8K).
No comment yet
Show HN: LLM Benchmarking Suite | Heykuki News