
I don’t think it will ever make sense; you can buy so much cloud-based usage for this kind of price.

From my perspective, the biggest problem is that I’m just not going to be using it 24/7, which means I’m not getting nearly as much value out of it as the cloud vendors do from their hardware.

Last but not least, if I want to run queries against open-source models, I prefer to use a provider like Groq or Cerebras, since it’s extremely convenient to get results back nearly instantly.



my issue is that once I have it in my workflow, I’d be pretty latency-sensitive. imagine those record-it-all apps working well; eventually you’d become pretty reliant on it. I don’t necessarily want to be at the whims of the cloud


Aren’t those “record it all” applications implemented as RAG, with snippets injected into the context based on embedding similarity?

Obviously you’re not going to always inject everything into the context window.
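
Something like this, as a rough sketch: it assumes a local sentence-transformers embedding model and plain cosine similarity, and the snippets, model choice, and function name are purely illustrative rather than how any particular app actually does it.

    import numpy as np
    from sentence_transformers import SentenceTransformer

    # Embed every captured snippet once and keep the vectors alongside the text.
    model = SentenceTransformer("all-MiniLM-L6-v2")
    snippets = ["meeting notes from Tuesday", "draft email to Alice", "shopping list"]
    vectors = model.encode(snippets, normalize_embeddings=True)

    def retrieve(query, k=2):
        # With normalized vectors, cosine similarity is just a dot product.
        q = model.encode([query], normalize_embeddings=True)[0]
        scores = vectors @ q
        top = np.argsort(scores)[::-1][:k]
        return [snippets[i] for i in top]

    # Only the top-k matches get injected into the prompt, not the whole archive.
    context = "\n".join(retrieve("what did I write to Alice?"))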


As long as you're willing to wait up to an hour for your GPU to get scheduled when you do want to use it.


I don’t understand what you’re saying. What’s preventing you from using e.g. OpenRouter to run a query against Kimi-K2 from whatever provider?
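
Roughly like this, assuming you have an OpenRouter API key; OpenRouter exposes an OpenAI-compatible endpoint, so the standard client works, though the Kimi-K2 model slug below is my guess at how it’s listed there:

    from openai import OpenAI

    # OpenRouter speaks the OpenAI chat-completions API.
    client = OpenAI(
        base_url="https://openrouter.ai/api/v1",
        api_key="sk-or-...",  # your OpenRouter key
    )

    resp = client.chat.completions.create(
        model="moonshotai/kimi-k2",  # assumed slug; check OpenRouter's model list
        messages=[{"role": "user", "content": "Summarize this thread in one line."}],
    )
    print(resp.choices[0].message.content)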


Because you have Cloudflare (MITM 1), OpenRouter (MITM 2), and finally the "AI" provider, all of whom can read, store, analyze, and resell your queries.

EDIT: Thanks for downvoting what is literally one of the most important reasons for people to use local models. Denying and censoring reality does not prevent the bubble from bursting.


you can use chutes.ai's TEE (Trusted Execution Environment), and Kimi K2 is running there at about 100 t/s right now


and you'll get a faster model this way


I think you’re missing the whole point, which is not using cloud compute.


For privacy reasons? Yeah, I’m not going to spend a small fortune just to be able to use these kinds of models.


There are plenty of examples and reasons to do so besides privacy: because one can, because it’s cool, for research, for fine-tuning, etc. I never mentioned privacy. Your use case is not everyone’s.


You can still do all of those things by renting AI server compute, though? I think privacy and the cool factor are the only real reasons it would be rational for someone to spend *checks the Apple Store* $19,000 on computer hardware...


Why do you look at this as a consumer? Have you never heard of businesses spending money on hardware???


And what reasons would a business have to spend the money on hardware instead of cloud services? Privacy.


Seriously?? You’ve never seen a company want to control its entire stack and hardware for ANY reason but privacy? Cloud is great, but it doesn’t fit every use case.



