Hacker News | weichiang's submissions
1. Judging LLM-as-a-Judge with MT-Bench and Chatbot Arena (twitter.com/lmsysorg)
20 points by weichiang on June 16, 2023
2. Google PaLM 2 ranked 6th on the LLM benchmark in the wild (twitter.com/lmsysorg)
1 point by weichiang on May 25, 2023
3. Chatbot Arena: a crowd-sourced LLM leaderboard (twitter.com/lmsysorg)
1 point by weichiang on May 12, 2023 | 1 comment
4. State-of-the-Art Chatbot, Vicuna-7B, now runs on MacBook with GPU acceleration (twitter.com/lmsysorg)
126 points by weichiang on April 6, 2023 | 84 comments
5. State-of-the-art open-source chatbot, Vicuna-13B, just released model weights (twitter.com/lmsysorg)
271 points by weichiang on April 3, 2023 | 139 comments
6. Who's GPT-4's favorite? Battles between state-of-the-art chatbots. (lmsys.org)
6 points by weichiang on March 30, 2023 | 4 comments
