Hacker News
LiveBench: A Challenging, Contamination-Free LLM Benchmark (livebench.ai)
3 points by foolswisdom 9 months ago
Gemini 2.5 Pro tops LiveBench, +6 pts overall over Claude 3.7 Sonnet Thinking (livebench.ai)
1 point by ankeshanand 10 months ago
Google's latest Gemini-exp-1206 seems to be great, near the top of livebench (livebench.ai)
4 points by KaoruAoiShiho on Dec 6, 2024
LiveBench: A Challenging, Contamination-Free LLM Benchmark (livebench.ai)
1 point by belter on June 13, 2024
LiveBench: A Challenging, Contamination-Free LLM Benchmark (livebench.ai)
6 points by georgehill on June 12, 2024
