LLM Watch
Subscribe
Sign in
📝 We Need New Benchmarks
Pascal Biese
Jan 5, 2024
3
For a future of holistic and robust LLM evaluation
Read →
Comments
This site requires JavaScript to run correctly. Please
turn on JavaScript
or unblock scripts
📝 We Need New Benchmarks
For a future of holistic and robust LLM evaluation