Building an interactive shell inside their CLI seems like a very odd technical solution. I can’t think of any use case where the same context gathering couldn’t be gleaned by examining the file/system state after the session ended, but maybe I’m missing something.
On the other hand, now that I’ve read this, I can see how having some hooks between the code agent CLIs and ghostty/etc could be extremely powerful.
LLMs in general struggle with numbers. It's easy to see with medium-sized models on line-replacement commands, where the model has to count lines: it usually takes a couple of tries to get right.
I always imagined they'd have an easier time if they could start a vim instance and send search/movement/insert commands: instead of tracking line numbers and doing arithmetic, they could visually confirm the right thing happening.
I haven't tried this new feature yet, but that was the first thing that came to mind when I saw it; doing edits this way might be easier for LLMs.
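The core of the idea can be sketched without even spawning an editor: address the edit by a search anchor instead of a line number, vim-style. This is a hypothetical helper for illustration, not any agent's actual tooling:

```python
def edit_by_anchor(text: str, anchor: str, old: str, new: str) -> str:
    """Replace `old` with `new` on the first line containing `anchor`.

    The caller locates the edit by pattern, so it never has to count
    or keep track of line numbers.
    """
    lines = text.splitlines()
    for i, line in enumerate(lines):
        if anchor in line:
            lines[i] = line.replace(old, new)
            return "\n".join(lines)
    raise ValueError(f"anchor {anchor!r} not found")

src = "retries = 3\ntimeout = 30\nverbose = False"
print(edit_by_anchor(src, "timeout", "30", "60"))
# retries = 3
# timeout = 60
# verbose = False
```

Failing loudly when the anchor is missing matters here: a miscounted line number silently edits the wrong place, while a missing pattern is an error the agent can retry on.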
Personally haven't had that happen to me, been using Codex (and lots of other agents) for months now. Anecdote, but still. I wrote up a summary of how I see the current difference between the agents right now: https://news.ycombinator.com/item?id=45680796
Still a toss-up for me which one I use. For deep work Codex (codex-high) is the clear winner, but when you need to knock out something small Claude Code (sonnet) is a workhorse.
Also, CC's tool usage is so much better! Many, many times I've seen Codex write a Python script to edit a file, which bypasses the diff view, so you don't really know what's going on.
I would add to the list of the vibe engineer’s tasks:
Knowing when the agent has failed and it’s time to roll back. After four or five turns of Claude confidently telling you the feature is done, but things are drifting further off course, it’s time to reset and try again.
This “article” is clickbait. Controversial title with no substance, asking “why are companies investing heavily in a technology that works for some (limited but valuable) use cases, when they could invest in pure R&D for something that might be better someday”.
Wow, I thought they would feel some pricing pressure from GPT5 API costs, but they are doubling down on their API being more expensive than everyone else.
I think it's the right approach: the cost of running these things as coding assistants is negligible compared to the benefit of even a slight model improvement.
The GPT5 API uses more tokens for answers of the same quality as previous versions. I fell into that trap myself. I use both Claude and OpenAI right now, but will probably drop OpenAI, since the way they roll out changes shows they're not to be trusted.
https://worksonmymachine.ai/p/solving-amazons-infinite-shelf...