I've been underwhelmed by dedicated tools like Windsurf and Cursor; they are usually more annoying than just using ChatGPT. They have their niche, but they are so incredibly flow-destroying that it's hard to use them for long periods of time.
I just started using Codex casually a few days ago and already have 3 PRs. While different tools for different purposes make sense, Codex's fully async nature is so much nicer. It handles simple things like improving consistency and making small improvements quite well, which is really nice. Finally we have something that operates more like an appliance for a certain class of problems. Previously it felt more like a teenager with a learner's license.
Have you tried Claude Code? I'm surprised it's not in this analysis, but in my personal experience the competition doesn't even touch it. I've tried them all in earnest. My toolkit has been (neo)vim and tmux for at least a decade now, so I understand the apprehension from less terminal-inclined folks who prefer other stuff, but it's my jam and it just crushes it.
Right, after the Sonnet 4 release it was the first time I could tell an agent something and just let it run comfortably. As for the tool itself, I think a large part of its ability comes from how it writes recursive todo-lists for itself, which are shown to the user, so you can intervene early on the occasions it goes full Monkey's Paw.
OpenAI nailed the UX/DX with Codex. This completely obsoletes Cursor and similar IDEs. I don't need AI in my tools; I just need somebody to work on my code in parallel with me. I'm happy to interact via pull requests and branches.
I found out on Thursday that I have access to Codex with my Plus subscription. I've created and merged about a dozen PRs with it on my OSS projects since then. It's not flawless, but it's pretty good. I've done some tedious work I had been deferring, got it to complete a few FIXMEs I hadn't gotten around to fixing, had it write some API documentation, got it to update a README, etc. It's pretty easy to review the PRs.
What I like is that it creates and works on its own branch. I can actually check that branch out, fix a few things myself, push it, and then get it to do PRs against that branch. I had to fix a few small compilation issues; in one case, the fix was just removing a single import that it somehow got wrong, after which everything built and the tests passed. Overall it's pretty impressive. Very usable.
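The branch round-trip described above can be sketched roughly like this. The branch name `codex/fix-imports` is hypothetical for illustration; Codex picks its own branch names:

```shell
# Hypothetical branch name; Codex names its branches itself.
git fetch origin
git checkout codex/fix-imports        # check out the agent's branch locally

# ...fix things by hand, e.g. remove the bad import...
git commit -am "Remove incorrect import"
git push origin codex/fix-imports     # push the manual fix back

# Then ask Codex for follow-up PRs targeting this branch,
# so its next round of work builds on top of your fixes.
```

The nice part of this loop is that review and repair happen on the same branch, so the agent's follow-up work incorporates your corrections instead of diverging from them.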
I wonder how it performs on larger code bases. I expect some issues there. I'm going to give that a try next.
I think there are basically three kinds of uses for AI: 1) "Out of loop" - e.g. Codex - it does things while you work on something else. Today it can handle basic things on its own like an appliance. 2) "In the loop" - e.g. Windsurf / Cursor. Here, you know what you are doing but are trying to use AI to essentially type at super human speeds. 3) "Coach mode" - you need to learn something in order to progress. You are using ChatGPT (usually), but possibly other tools as a way to help you get the right context faster.
Of these, "in the loop" seems to be the one that doesn't work that well (yet). The main problem, in my opinion, is latency.
"In the loop" is not really a problem I have. I use IntelliJ, so I'm usually not limited by my ability to type fast; I don't actually type a lot of code.
Building a better auto-complete than the one that already comes with the IDE is actually hard, and most of the AI code-completion approaches I've seen conflict with the built-in auto-complete without actually doing better. I've tried a few and usually end up disabling the auto-complete features they offer because they are pointless for me. What happens is that I get a lot of suggestions for code I definitely don't want, drowning out the completions I do want and messing up my editing flow. On top of that, I have to constantly read through code that is a combination of not what I'm looking for and probably wrong. It's extra work I don't need in my life; a bit of an anti-feature as far as I'm concerned.
But I actually have been using ChatGPT quite a bit. It works for me because it connects to the IDE (instead of interfering with it) and lets me easily ask questions about my code. This is much more useful to me than an AI second-guessing me on every keystroke.
Codex adds to this by being more like a teammate I can delegate simple things to. It would be nice if it could notify me when it's done or when it needs my input, but otherwise it's nice.
I suspect the Codex and ChatGPT desktop UIs will merge soon. There's no good reason to have two modalities here other than that they were probably created by two different teams; Conway's law might be an issue. But I like what OpenAI has done with their desktop client, and they seem to be on top of it.
I love using Codex to just explore code instead of searching. It's a great tool for learning or researching what's happening in a codebase, leaving useful breadcrumbs that lead you to what you need to know.