|
|
| 1. | | Judging LLM-as-a-Judge with MT-Bench and Chatbot Arena (twitter.com/lmsysorg) | | 20 points by weichiang on June 16, 2023 | past | |
| 2. | | Google PaLM 2 ranked 6th on the LLM benchmark in the wild (twitter.com/lmsysorg) | | 1 point by weichiang on May 25, 2023 | past | |
| 3. | | Chatbot Arena: a crowd-sourced LLM leaderboard (twitter.com/lmsysorg) | | 1 point by weichiang on May 12, 2023 | past | 1 comment | |
| 4. | | State-of-the-Art Chatbot, Vicuna-7B, now runs on MacBook with GPU acceleration (twitter.com/lmsysorg) | | 126 points by weichiang on April 6, 2023 | past | 84 comments | |
| 5. | | State-of-the-art open-source chatbot, Vicuna-13B, just released model weights (twitter.com/lmsysorg) | | 271 points by weichiang on April 3, 2023 | past | 139 comments | |
| 6. | | Who's GPT-4's favorite? Battles between state-of-the-art chatbots. (lmsys.org) | | 6 points by weichiang on March 30, 2023 | past | 4 comments | |
|

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact
|