> Imagine halving the resource costs of AI and what that could mean for the planet and the industry
Google has done this:
"In eighteen months, we reduced costs by more than 90% for these queries through hardware, engineering, and technical breakthroughs, while doubling the size of our custom Gemini model." https://blog.google/inside-google/message-ceo/alphabet-earni...
Even Llama 3.1 can give you perfectly formatted JSON responses for free these days. Also, you really ought to be using YAML instead; you save 30% on tokens.
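If you want to sanity-check the YAML-vs-JSON size claim on your own payloads, here is a rough sketch. It compares character counts, which are only a proxy for tokens (real savings depend on the tokenizer), and `to_yaml` is a hypothetical toy emitter written for this comparison, not a real YAML library. It assumes nested dicts, lists of scalars, and strings that need no quoting.

```python
import json


def to_yaml(value, indent=0):
    """Toy YAML emitter (illustration only): handles dicts whose values are
    scalars, nested dicts, or lists of scalars. No quoting or escaping."""
    pad = "  " * indent
    lines = []
    for key, item in value.items():
        if isinstance(item, dict):
            lines.append(f"{pad}{key}:")
            lines.append(to_yaml(item, indent + 1))
        elif isinstance(item, list):
            lines.append(f"{pad}{key}:")
            lines.extend(f"{pad}  - {entry}" for entry in item)
        else:
            lines.append(f"{pad}{key}: {item}")
    return "\n".join(lines)


# Made-up sample payload; swap in something shaped like your real responses.
sample = {
    "name": "widget",
    "price": 12,
    "tags": ["a", "b"],
    "dims": {"w": 3, "h": 4},
}

json_text = json.dumps(sample, indent=2)  # pretty-printed, as models often emit
yaml_text = to_yaml(sample)

saved = 1 - len(yaml_text) / len(json_text)
print(f"JSON chars: {len(json_text)}, YAML chars: {len(yaml_text)}")
print(f"Character savings: {saved:.0%}")
```

The gap shrinks a lot against compact JSON (`separators=(",", ":")`), so the number you get depends heavily on how the JSON is formatted and on the shape of the data.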
Tried the Gemini Advanced trial last week. For some reason their so-called 1M context model is limited to 10 files at a time, so you can't upload a codebase for it to reference, and even with the extra data the end result is somehow worse than either Sonnet or 4o given hardly any context at all. It's definitely not on the same level as a coding assistant, at least.