Hacker News | baalimago's comments

I thought that context building via tooling was shown to be more effective than RAG in practically every way?

Question being: WHY would I be doing RAG locally?


For code, maybe? For documents, no, text embeddings are magical alien technology.

Here I'm being ridiculous, but I was a bit disappointed that it was a canvas rendering and not a mono-font text block.

Author here. Not ridiculous at all, the name is a bit of a misnomer. I had tried doing true ASCII, but moving the data back to the CPU to render it was too slow. So I opted to recreate the characters as glyphs drawn with signed distance functions, which gets pretty close to looking like real ASCII while staying incredibly performant, since the data never leaves the GPU.

When it's done, it'll be one hell of a project!

Suggestion: continue in the current LLM-generated track and ask Claude (or whatever) to create an example + unit tests validating the idiom. Then tell Claude to remove half the example, leaving only a stub + failing unit tests. Add a go.mod at the root + instructions on how to run all tests. The Go initiate is "certified" once he/she has forked the repository and made the tests pass.


We used to lay the bricks, now we design the pyramids.

There are many open source alternatives to Claude Code. Crush[0] is one, Clai[1] another, opencode[2] a third. These are all vendor agnostic and use API credits from different providers.

[0]: https://github.com/charmbracelet/crush

[1]: https://github.com/baalimago/clai

[2]: https://github.com/anomalyco/opencode


I'm a bit jealous. I would like to experiment with a similar setup, but 10x Opus 4.5 running practically non-stop must amount to a very high inference bill. Is the output really worth it?

From experimentation, I need to coach the models quite closely in order to get enough value. Letting it loose only works when I've given very specific instructions. But I'm using Codex and Clai, perhaps Claude code is better.


I have a coworker who is basically doing this right now. He leads our team and is in second place overall. He regularly runs Opus in parallel; he alone is burning through 1k worth of credits a day.

He is also one of our worst performers.


Wait, what is he second place at?

Credit usage.

I've tried running a number of Claudes in parallel on a CRUD full-stack JS app. Yes, it got features made faster; yes, it definitely did not leave me enough time to actually look at what they did; yes, it definitely produced sub-par code.

At the moment, with one Claude plus manually fixing the crap it produces, I am faster at solving "easier" features (think: add API endpoint, re-build API client, implement frontend logic for the endpoint + UI) than if I write them myself.

For things that are more logic-dense, it tends to produce so many errors that it's faster to solve them myself.


> manually fixing crap it produces

> it tends to produce so many errors

I get some of the skepticism in this thread, but I don't get takes like this. How are you using cc such that the output you look at is "full of errors"? By the time I look at the output of a session, the agent has already run linting, formatting, testing and so on. The things I look at are adherence to the conventions, files touched, libraries used, and so on. And the "error rate" on those has been steadily coming down, especially if you also use a review loop (w/ Codex, since it has been the best at review lately).

You have to set these things up for success. You need loops with clear feedback. You need a project that has lots of clear things to adhere to. You need tight integrations. But once you have these things, if you're looking at "errors", you're doing something wrong IMO.


I don't think he meant syntax errors, but thinking errors. I get these a lot with CC, especially with CSS. It produces so much useless code, it blows my mind. Once I deleted 50 lines of code and manually added 4, which was enough to fix the error.

Yeah, doesn't this guy work for Anthropic? He'd get to use 10x Opus 4.5 for free.

Who is this for? Apart from the contributors ofc, who wish to feel good about eternalizing their 'novel' idea

It's a mix of signaling, busywork and productivity porn for the ingroup.

A few years ago we had GitHub resource-spam about smart contracts and Web3 and AWESOME NFT ERC721 HACK ON SOLANA NEXT BIG THING LIST.

Now we have repos for the "Self-Rewriting Meta-Prompt Loop" and "Gas Town":

https://steve-yegge.medium.com/welcome-to-gas-town-4f25ee16d...

If you haven't got a Rig for your project with a Mayor whose Witness oversees the Polecats who are supervised by a Deacon who manages Dogs (special shoutout to Boot!) who work with a two-level Beads structure and GUPP and MEOW principles... you're not gonna make it.


I thought Gas Town was a satire until I saw the GitHub. Maybe it’s a very involved satire?

It is, right? "Do not use Gas Town."


Hi, author here. Honestly, I just used this as a bookmarking place for myself. Which you could infer if you go through some patterns. I’ve created a flow with CC where I would just dump a new source like a podcast, post, or whatever to have it for reference.

Thank you for putting it together. I looked at a couple of the references and they look like they point to your blog. Do you have a view at all of popular patterns in terms of citations? Might be useful

That’s a good idea, will have to think a bit on how to implement it.

> Who is this for?

Star-farming anno 2026.


See my comment above. The repository is from May when I was intensely exploring everything agentic. I used it as a public bookmarking tool and also in the hope of receiving contributions. Thanks to this HN share, I received four PRs.

Anno 2025. Makes a difference I guess.

They should do a study on this.


Well, then the formal verification will be vibe-coded as well, killing the point.

More likely is the rise of test driven development, or spec driven development.


What paradigms are people using to have AI help generate better specs and then convert those specs into code and test cases? The Kiro IDE from Amazon felt like a step in the direction of applying AI across the entire SDLC.


Then should we apply formal verification to the vibe coded formal verification software?


https://github.com/baalimago/kinoview

An agentic media player, intended as a home media server for.. uhh.. seasonal vacation videos with subtitles. I've experimented a lot with different "levels" of AI automation, starting from simple workflows, to more advanced ones, and now soon to fully agentic.

Pretty good practice project! All written in Go with minimal dependencies and an embedded vanilla-JS frontend built into the binary (it's so small it's negligible).

