
MoE inference wouldn't be terrible. That being said, there's not a good MoE model in the 70-160B range as far as I'm aware.

