These things are not actually useful. They hyper optimzed it for coding usecase but it still sucks balls at it.
cost isn't the limiting factor in this though. 'even larger' models arn't 'more capable' . where did you get that from?
These things are not actually useful. They hyper optimzed it for coding usecase but it still sucks balls at it.