https://twitter.com/EMostaque/status/1564655464406650881
With all the model optimization and distillation papers already out, a few hundred MB doesn't look impossible with similar-quality outputs.