Hacker Newsnew | past | comments | ask | show | jobs | submit | dcreater's commentslogin

Based on qwen2.5-coder? seems like a "why not/resume embellish/show VC" type release I guess

"Please don't post shallow dismissals, especially of other people's work. A good critical comment teaches us something."

https://news.ycombinator.com/newsguidelines.html


You can see that Qwen3 does worse than Qwen2.5 on our benchmark. Reason is it's never been pretrained for FIM / autocomplete.

This post is totally not astroturfed

Yes a book is exactly what makes sense in this very immature and fast changing space.

It's especially effective when you litter it around ai company offices in SF


Its ok to just stay on v7 no? Thats what im doing.

Its already here - its called GEO and their are silicon valley startups already pumping out crap to feed next gen models so that you ensure you're product is baked into the weights

The next gen of models are going to need very strict sanitising of input articles as I think the sheer volume of GPT SEO spam is going to be, or already is, quite staggering. Model collapse might not be what happens but certainly a dilution of quality in training data.

This looks like a workflow problem more than a model problem. When inputs aren’t controlled, scale amplifies noise faster than understanding. Tools improve, but the decision boundaries stay the bottleneck.

Agree. And to make the CLI usage more effective/efficient, if you can publish a skill that would be excellent

That's why we're asking for the CLI; so we can write the skills.

you say "local-first" but have placed voyage API for embeddings as the default (had to go to the website and dig to find that you can infact use local embedding models). Please fix

Thank you, yes the docs are overdue for a refresh. It's in the works

Presumably it could update its own docs

Exactly. There's an autodoc feature coming up in the next version

It would be convenient if it could load local SLMs itself, otherwise I'll have to manually start the LLM server before I can use it, and it's not something I leave running all the time.

There already are open source extensions. Visor is one I remember off the top of my head. https://marketplace.visualstudio.com/items?itemName=sidhants...

AI and Claude Code are incredible tools. But use cases like "Organize my desktop" are horrible misapplications that are insecure, inefficient and a privacy nightmare. Its the smart refrigerator of this generation of tech.

I worry that the average consumer is none the wiser but I hope a company that calls itself Anthropic is anthropic. Being transparent about what the tool is doing, what permissions it has, educating on the dangers etc. are the least you can do.

With the example of clearing up your mac desktop: a) macOS already autofolds things into smart stacks b) writing a simple script that emulates an app like Hazel is a far better approach for AI to take


Rust + Native App I take it


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: