Hacker News | mohsen1's comments

I was paying for Max, but after trying GLM 4.7 I am a convert. I hardly hit the limit, but even if I do, it is cheaper to get two accounts from Z.ai than one Max from Anthropic.

Mafia Arena -- Benchmarking LLMs for EQ

https://mafia-arena.com

The only problem I have is that it's so effing expensive to run those games that I can't run enough of them to claim this is any sort of legit benchmark. BUT so far the games that I paid for out of pocket and ran are looking good, and I think there is merit to this.

Also had lots of fun building on top of Cloudflare and solving some distributed systems problems along the way.

if you can help me run more games (for science!!) let me know!


People are still asking questions; they're just no longer on the public internet. Google, Anthropic, OpenAI, etc. get to see and use them.

This is concerning on two fronts. The questions are no longer open (SO is CC-BY-SA), and if Q&A content dies, it herds even more people towards LLM use. It's basically draining the commons.

Yup. This, to me, provides another explanation for why the social contract is being used as toilet paper by the owner class. They literally see the writing on the wall.

This is super nice! Thank you for working on this!

Recently I've really been enjoying Cloudflare Workflows (used it in https://mafia-arena.com), and it would be nice to build Workflows on top of this too.


Thanks! Workflows is definitely interesting – it's basically durable execution with steps and retries. It's on the radar, probably after the CLI and GitHub integration.
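
For anyone curious, this is roughly the shape of a Workflow on Cloudflare today. A minimal sketch only: the class, params, and step names are made up, and the wrangler binding/trigger config is omitted.

    // Minimal sketch of a Cloudflare Workflow (illustrative names, config omitted)
    import { WorkflowEntrypoint, WorkflowStep, WorkflowEvent } from 'cloudflare:workers';

    type Env = {};                      // bindings (KV, D1, AI, ...) would go here
    type Params = { gameId: string };   // hypothetical payload

    export class GameWorkflow extends WorkflowEntrypoint<Env, Params> {
      async run(event: WorkflowEvent<Params>, step: WorkflowStep) {
        // Each step is checkpointed: if the Worker is evicted or crashes,
        // the workflow resumes here instead of re-running finished steps.
        const game = await step.do('set up game', async () => {
          return { id: event.payload.gameId, players: 8 };
        });

        // Steps can declare their own retry policy.
        await step.do(
          'run round',
          { retries: { limit: 3, delay: '10 seconds', backoff: 'exponential' } },
          async () => {
            // call a model or another service here, using `game`
          },
        );

        // Durable sleep: no compute is billed while waiting.
        await step.sleep('cool down', '1 minute');
      }
    }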

I think this is a little unfair; it's comparing a model that is optimised for pass@2 and self-improves its output against the other models. It's just test-time scaling, in a way.

I really really want this to be true. I want to be relevant. I don’t know what to do if all those predictions are true and there is no need (or very little need) for programmers anymore.

But something tells me “this time is different” is different this time for real.

Coding AIs design software better than me, review code better than me, find hard-to-find bugs better than me, plan long-running projects better than me, make decisions based on research, literature, and also the state of our projects better than me. I’m basically just the conductor of all those processes.

Oh, and don't ask about coding. If you use AI for the tasks above, what you get out are very well-defined coding tasks, which an AI would ace.

I’m still hired, but I feel like I’m doing the work of an entire org that used to need twenty engineers.

From where I’m standing, it’s scary.


I was a chef in Michelin-starred restaurants for 11 years. One of my favorite positions was washing dishes. The goal was always to keep the machine running on its 5-minute cycle. It was about getting the dishes into racks, rinsing them, and having them ready and waiting for the previous cycle to end—so you could push them into the machine immediately—then getting them dried and put away after the cycle, making sure the quality was there and no spot was missed. If the machine stopped, the goal was to get another batch into it, putting everything else on hold. Keeping the machine running was the only way to prevent dishes from piling up, which would end with the towers falling over and breaking plates. This work requires moving lightning fast with dexterity.

AI coding agents are analogous to the machine. My job is to get the prompts written, and to do quality control and housekeeping after it runs a cycle. Nonetheless, like all automation, humans are still needed... for now.


If it requires an expert engineer/dishwasher to keep the flow running perfectly, the human is the bottleneck in the process. This sounds a lot more like the past before AI to me. What AI does is just give you enough dishes that they don't need to be washed at all during dinner service. Just let them pile up dirty, or throw them away and get new dishes tomorrow; they're so immaterial to replace that washing them doesn't always make sense. But if for some reason you do want to reuse them, then it washes and dries them for you too. You just look over things at the end and make sure they pass your quality standards. If it left some muck on a plate or lipstick on a cup, just tell it not to let that happen again and it won't. So even your QC work gets easier over time. The labor needed to deal with dirty dishes is drastically reduced in any case.


> humans are still needed... for now

"AI" doesn't have a clue what to do on its own. Humans will always be in the loop, because they have goals, while the AI is designed to placate and not create.

The amount of "AI" garbage I have to sift through to find one single gem is about the same or more work than if I had just coded it myself. Add to that the frustration of dealing with a compulsive liar, and it's just a fucking awful experience for anyone that actually can code.


Humans are still needed, but they just got down-skilled.


> got down-skilled.

who's to say that it's a down?

Orchestrating and doing higher level strategic planning, such that the sub-tasks can be AI produced, is a skill that might be higher than programming.


> Coding AIs design software better than me, review code better than me, find hard-to-find bugs better than me, plan long-running projects better than me, make decisions based on research, literature, and also the state of our projects better than me.

That is just not true, assuming you have a modicum of competence (which I assume you do). AIs suck at all these tasks; they are not even as good as an inexperienced human.


For all we know, one of you could be using a Nokia 3310 and the other a workstation PC, but you both just say "this computer is better than that computer".

There are a ton of models out there, run in a ton of different ways, that can be used with different harnesses, and people use different workflows. There are just so many variables involved that I don't think it's either fair or accurate for anyone to claim "This is obviously better" or "This is obviously impossible".

I've been in situations where I hit my head against some hard to find bug for days, then I put "AI" (but what? No one knows) to it and it solves it in 20 minutes. I've also asked "AI" to do trivial work that it still somehow fucked up, even if I could probably have asked a non-programmer friend to do it and they'd be able to.

The variance is great, and the fact that system/developer/user prompts matter a lot for the responses you get makes it even harder to fairly compare things like this without having the actual chat logs in front of you.


> The variance is great

this strikes me as a very important thing to reflect on. when the automobile was invented, was the apparent benefit so incredibly variable?


> was the apparent benefit so incredibly variable?

Yes, lots of people were very vocally against horseless-carriages, as they were called at the time. Safety and public nuisance concerns were widespread, the cars were very noisy, fast, smoky and unreliable. Old newspapers are filled with opinions about this, from people being afraid of horseless-carriages spooking other's horses and so on. The UK restricted the adoption of cars at one point, and some Canton in Switzerland even banned cars for a couple of decades.

Horseless-carriages were commonly ridiculed for being just for "reckless rich hobbyists" and similar.

I think the major difference is that cars produced immediate, visible externalities, so it was easy for opposition to focus on public safety in public spaces. In contrast, AI has less physically visible externalities, although they are as important, or maybe even more important, than the ones cars introduced.


yeah I agree about the negative externalities but I'm curious about the perceived benefits. did anybody argue that cars were actually slower than horse and carriage? (were they at first?)

The cars were obviously faster than typical horse transportation and I don't think anyone tried to argue against that, but laws typically restricted cars so they couldn't go faster than horses, at least in highly populated areas like cities. As others mentioned, the fact that horses don't need roads to go places (while cars do) was highlighted as a drawback of cars too. People argued that while cars might go faster, the result would be that the world would be worse off in total.

sure but my point is people could agree they were faster at least. that is decidedly not true for LLMs. maybe due to alignable vs non-alignable differences

Is this a trick question? Yes it was. A horse could go over any terrain while a car could only really go over very specific terrain designed for it. We had to terraform the world in order to make the automobile so beneficial. And it turned out that this terraforming had many unintended consequences. It's actually a pretty apt comparison to LLMs.


who would I be trying to trick if it was? you didn't answer the question anyways. I'm not wondering whether cars were seen as strictly better than horses in all situations. I'm wondering if people disagreed so vehemently about whether cars were faster road transportation than horses

LLMs generate the most likely code given the problem they're presented with and everything they've been trained on; they don't actually understand how (or even if) it works. I only ever get away with that when I'm writing a parser.


> they don't actually understand how

but if it empirically works, does it matter if the "intelligence" doesn't "understand" it?

Does a chess engine "understand" the moves it makes?


It matters if AGI is the goal. If it remains a tool to make workers more productive, then it doesn't need to truly understand, since the humans using the tools understand. I'm of the opinion AI should have stood for Augmented (Human) Intelligence outside of science fiction. I believe that's what early pioneers like Douglas Engelbart thought. Clearly that's what Steve Jobs and Alan Kay thought computing was for.


AGI is such a meaningless concept. We can’t even fully define what human intelligence is (or what it means when a human fails at it). It’s just philosophy.


AGI is about as well defined as "full self-driving" :D

It's a useless philosophical discussion.


If it empirically works, then sure. If instead every single solution it provides beyond a few trivial lines falls somewhere between "just a little bit off" and "relies entirely on core library functionality that doesn't actually exist" then I'd say it does matter and it's only slightly better than an opaque box that spouts random nonsense (which will soon include ads).


Those are 2024-era criticisms of LLMs for code.

Late 2025 models very rarely hallucinate nonexistent core library functionality - and they run inside coding agent harnesses so if they DO they notice that the code doesn't work and fix it.


get ready to tick those numbers over to 2026!


This sounds like you're copy-pasting code from ChatGPT's web interface, which is very 2024.

Agentic LLMs will notice if something is crap and won't compile and will retry, use the tools they have available to figure out what's the correct way, edit and retry again.
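
In loop form, a harness is doing something like this. A rough sketch with hypothetical interfaces; real agents are smarter about applying edits and picking which checks to run.

    // Hypothetical agent loop: propose an edit, run the project's own checks,
    // feed any failure output back to the model, retry.
    import { execSync } from 'node:child_process';

    interface Model {
      // returns a unified diff; `feedback` is the failure output from the last attempt
      proposeEdit(task: string, feedback: string): Promise<string>;
    }

    async function agentLoop(model: Model, task: string, maxAttempts = 5): Promise<boolean> {
      let feedback = '';
      for (let attempt = 0; attempt < maxAttempts; attempt++) {
        const patch = await model.proposeEdit(task, feedback);
        execSync('git apply -', { input: patch });                 // apply the proposed change
        try {
          execSync('npm run build && npm test', { stdio: 'pipe' });
          return true;                                             // compiles and passes: done
        } catch (err: unknown) {
          const e = err as { stdout?: Buffer; message?: string };
          feedback = String(e.stdout ?? e.message);                // retry with the error output
        }
      }
      return false;                                                // give up and hand back to a human
    }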


This is a semantic dead end when discussing results and career choices


Depends on how he defined "better". If he uses the word "better" to mean "good enough to not fail immediately, and done in 1/10th of the time", then he's correct.


I think I've been using AI wrong. I can't understand testimonies like this. Most times I try to use AI for a task, it is a shitshow, and I have to rewrite everything anyway.


Have you tried Opus 4.5 (or similar recent models)? With Claude Code 2, it's actually harder to mess things up IMO


I remember when, about a year ago, people were asking the same thing about gpt-4.5; the answer is always “yes, I’ve tried them all”


Ok, but have you tried claude-sonnet-GPT-codex-4.5-thinking-fast? That's the game changer. Anyone saying bad things about vibe coding without trying claude-sonnet-GPT-codex-4.5-thinking-fast is like a dinosaur to me, doomed to extinction. Seriously, give claude-sonnet-GPT-codex-4.5-thinking-fast a try, you'll thank me ;)


Fair. Well personally they didn't work well for me (on a huge, complex codebase) until the latest batch. Now they do.


Same. Seems to be the never ending theme of AI.


Try Claude. And partner with it on building something complex.


Yes you want Kiro which uses Claude models under the hood


I don’t know about right/wrong. You need to use the tools that make you productive. I personally find that in my work there are dozens of little scripts or helper functions that accelerate my work. However I usually don’t write them because I don’t have the time. AI can generate these little scripts very consistently. That accelerates my work. Perhaps just start simple.


Instead of generating them, exporting or copy-pasting just seems more reliable to me, and it also takes very little time.

I think what matters most is just what you're working on. It's great for CRUD or working with public APIs with lots of examples.

For everything else, AI has been a net loss for me.


> there are dozens of little scripts or helper functions that accelerate my work. However I usually don’t write them because I don’t have the time

People who write things like this can't expect to be taken seriously.

Before AI you didn't have time to write things that saved you time? So you just ended up spending (wasting) more time by going the long way? That was a better choice than just doing the thing that would have saved you time?


Do you tell the AI the patterns/tools/architecture you want? Telling agents to "build me XYZ, make it gud!" is likely to precede a mess. Telling it to build a modular monolith using your library/tool list, your preferred folder structure, the other patterns/algorithms you use, etc. will end you up with something that might have some minor style issues or not be perfectly canonical, but will be approximately correct within a reasonable margin, or within 1-2 turns of being so.

You have to let go of the code looking exactly a certain way, but having code _work_ a certain way at a coarse level is doable and fairly easy.


We are way beyond this. Now you use your plain text prompt to generate a requirements spec that the AI will follow when implementing your project

https://kiro.dev/


Kiro is just trying to build a product around exactly what I'm talking about. I'm not a fan, because it's simultaneously too heavyweight and agents don't respect all the details of the specs it creates enough to make the time investment in super-detailed specs worthwhile.

I have a spec driven development tool I've been working on that generates structured specs that can be used to do automatic code generation. This is both faster and more robust.


That sounds cool, please do share your tools when they're ready :)


Honestly, even this isn't really true anymore. With Opus 4.5 and 5.2 Codex in tools like Cursor, Claude Code, or Codex CLI, "just do the thing" is a viable strategy for a shockingly large category of tasks.


"Just do the thing" can produce functional code, but even with Opus 4.5/Codex 5.2, there are still plenty of moments where the way it decides to do something is cringe.


Agree. But it's increasingly the case, IME, that for a lot of tasks, you can start with that. If it does it well, great. If it does something stupid, it's easy enough to ask it to completely rework the stupid thing in a better way, and it can do it quickly. That's still a huge shift compared to the olden days (three months ago) where you needed to really break things down into small chunks for it to get to a success state.


>You have to let go of the code looking exactly a certain way, but having code _work_ a certain way at a coarse level is doable and fairly easy.

So all that bullshit about "code smells" was nonsense.


A lot of code smells matter more for humans than LLMs (and LLMs have their own unique code smells). For example, nested ternary operators are a great source of bugs in human code, but agents could care less. Humans, on the other hand, handle multiple files with the same variable names and lots of duplicated code well, whereas this stuff confuses agents.
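
A made-up illustration of that particular smell (nothing from a real codebase):

    // Nested ternaries: an agent parses this fine; a human tends to misread the branch order.
    const shippingNested = (weight: number, express: boolean, intl: boolean): number =>
      intl ? (express ? 45 : 25) : express ? (weight > 10 ? 20 : 15) : weight > 10 ? 10 : 5;

    // The flattened version behaves identically but is much harder for a person to get wrong.
    function shippingFlat(weight: number, express: boolean, intl: boolean): number {
      if (intl) return express ? 45 : 25;
      if (express) return weight > 10 ? 20 : 15;
      return weight > 10 ? 10 : 5;
    }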


>but agents could care less,

The phrase is "couldn't care less". If you "could care less" then you actually care about it. If you "couldn't care less" then there's no caring at all.


have you tried using $NEWEST_MODEL ?


It’s because, depending on the person, the newest model is the one that crossed the line into being useful for them personally. It’s not like a new version crosses the line for everyone. It happens gradually. With each version, more and more people come into the fold.

For me Claude code changed the game.


yes, it is trivially true that each new person who recommends LLMs is a new person coming into the fold

You get new people recommending the latest version all the time to people who are unconvinced because that version is usually what brought them into the fold.

What you’re mocking is somewhat of a signal of actual improvement of the models and that improvement as a result becoming useful to more and more people.


how much time/effort have you put in to educate yourself about how they work, what they excel at, what they suck at, what is your responsibility when you use them…? this effort is directly proportional to how well they will serve you


>> From where I’m standing, it’s scary.

You are being fooled by randomness [1]

Not because the models are random, but because you are mistaking a massive combinatorial search over seen patterns for genuine reasoning. Taleb's point was about confusing luck with skill. Don't confuse interpolation with understanding.

You can read a Rust book after years of Java, then go build software for an industry that did not exist when you started. Ask any LLM to write a driver for hardware that shipped last month, or model a regulatory framework that just passed... It will confidently hallucinate. You will figure it out. That is the difference between pattern matching and understanding.

[1] https://en.wikipedia.org/wiki/Fooled_by_Randomness


I've worked with a lot of interns, fresh outs from college, overseas lowest bidders, and mediocre engineers who gave up years ago. All over the course of a ~20 year career.

Not once in all that time has anyone PRed and merged my completely unrelated and unfinished branch into main. Except a few weeks ago. By someone who was using the LLM to make PRs.

He didn't understand when I asked him about it and was baffled as to how it happened.

Really annoying, but I got significantly less concerned about the future of human software engineering after that.


Have you used an LLM specifically trained for tool calling, in Claude Code, Cursor or Aider?

They’re capable of looking up documentation, correcting their errors by compiling and running tests, and when coupled with a linter, hallucinations are a non issue.

I don’t really think it’s possible to dismiss a model that’s been trained with reinforcement learning for both reasoning and tool usage as only doing pattern matching. They’re not at all the same beasts as the old style of LLMs based purely on next token prediction of massive scrapes of web data (with some fine tuning on Q&A pairs and RLHF to pick the best answers).


I'm using Claude Code to help me learn Godot game programming.

One interesting thing is that Claude will not tell me if I'm following the wrong path. It will just make the requested change to the best of its ability.

For example, in a Tower Defence game I'm making, I wanted to keep turret position state in an AStarGrid2D. It produced code to do this, but the code became harder and harder to follow as I went on. It's only after watching more tutorials that I figured out I was asking for the wrong thing. (TileMapLayer is a much better choice.)

LLMs still suffer from Garbage in Garbage out.


don't use LLMs for Godot game programming.

edit: Major engine changes have occurred after the models were trained, so you will often be given code that refers to nonexistent constants and functions and which is not aware of useful new features.


before coding I just ask the model "what are the best practices in this industry to solve this problem? what tools/libraries/approaches people use?

after coding I ask it "review the code, do you see any parts for which there are common libraries implementing them? are there ways to make it more idiomatic?"

you can also ask it "this is an idea on how to solve it that somebody told me, what do you think about it, are there better ways?"


> before coding I just ask the model "what are the best practices in this industry to solve this problem? what tools/libraries/approaches people use?

Just for the fun of it, and so you lose your "virginity" so to speak, next time the magic machine gives you the answer about "what it thinks", tell it it's wrong in strict language and scold it for misleading you. Tell it to give you the "real" best practices instead of what it spat out. Then sit back and marvel at the machine saying you were right and that it had misled you, producing a completely, somewhat, or slightly different answer (you never know what you get on the slot machine).


Both the before and after are better done manually. What you are describing is fine for the heck of it (I've vibe coded a Whisper-related Rust port today without having any actual Rust skills), but I'd never use fully vibed software in production. That's irresponsible in multiple ways.


Do you also light candles and chant?


Ask a model to

"Write a chess engine where pawns move backward and kings can jump like nights"

It will keep slipping back into real chess rules. It learned chess, it did not understand the concept of "rules"

Or

Ask it to reverse a made up word like

"Reverse the string 'glorbix'"

It will get it wrong on the first try. You would not fail.

Or even better ask it to...

"Use the dxastgraphx library to build a DAG scheduler."

dxastgraphx is a non-existent library...

Marvel at the results...tried in both Claude and ChatGPT....


I‘ve just tried the dxastgraphx one in pi with Opus 4.5. This was its response:

    I couldn't find a library called dxastgraphx in either pip (Python) or npm (JavaScript) package registries. This library doesn't appear to exist.

    Did you perhaps mean one of these popular DAG/graph libraries?

    Python:
    - networkx - comprehensive graph library with DAG support
    - graphlib - Python standard library (3.9+) with TopologicalSorter
    - dask - parallel computing with DAG task scheduling

    JavaScript/TypeScript:
    - graphlib - graph data structures
    - dagre - DAG layout

    Would you like me to build a DAG scheduler using one of these existing libraries, or would you like me to implement one from scratch? Let me know which language and approach you prefer.


Tried ChatGPT (free version). It answered the first question correctly.

Answer to second question:

"I can do that, but there’s a catch: dxastgraphx is not a known or standard Python (or JS) library as of now. I don’t have any public API or documentation for it to target directly.

To avoid guessing wrong, here are the two sensible options:"


Why would I ask the model to reverse the string 'glorbix,' especially in the context of software engineering?


just tried to reverse the string you provided using Gemini. it worked fine on the first try


In gemini:

> The reverse of the string 'gssssyeekxxuoaqaaaiooo9aaa7zjx' is:

xjz7aaa9oooiiaaaqauxxkyeessssg

> Youve made a mistake

``` I see where I tripped up! Let's re-examine that long string carefully.

The original string you gave me was: `gssssyeekxxuoaqaaaiooo9aaa7zjx`

Let's look at the end of that string: `...aaa7zjx`. When we reverse it, it should start with: `xjz7aaa...`

Here is the precise character-by-character reversal:

*xjz7aaa9oooiiaaaqauxxkyeessssg*

Wait—I see it now. In my previous response, I included an extra *'i'* and missed one *'o'* in the middle section. Let's do a strict count to get it perfect:

* *Original:* `g` `ssss` `y` `ee` `k` `xx` `u` `o` `a` `q` `aaa` *`iooo`* `9` `aaa` `7` `z` `j` `x` * *Correct Reverse:* `x` `j` `z` `7` `aaa` `9` *`oooi`* `aaa` `q` `a` `o` `u` `xx` `k` `ee` `y` `ssss` `g`

The correct reversed string is: *xjz7aaa9oooiiaaaqauoxxkyeessssg*

Thank you for catching that. It’s easy for the eyes to glaze over when there are so many repeating vowels!

Since I've stumbled a bit on these, would you like to give me one more string to see if I can get it right on the first try? ```

After more back and forth it consistently fails at this task, even though when strictly dividing the tokens it will get this right. Yet the final answer is always wrong.


Mine said it used python and got: xjz7aaa9oooiaaaqaouxxkeeyssssg


You’re trying to interrogate a machine as you would a human and presenting this as evidence that machines aren’t humans. Yes, you’re absolutely right! And also completely missing the point.


The discussion is not about being human. It's about being fit for purpose...


Why would you expect an LLM or even a human to succeed in these cases? “Write a piece of code for a specification that you can’t possibly know about?” That’s why you have to do context engineering, just like you’d provide a reference to a new document to an engineer writing code.


This is exactly what happened to me: novel or uncommon = hallucinate or invent wrong.

It is ok for getting snippets, for example saying (as I did) "Please make this MVVM style". It is not perfect, but it saves time.

For very broad or novel reasoning, as of today... forget it.


They do all those things you've mentioned more efficiently than most of us, but they fall woefully short as soon as novelty is required. Creativity is not in their repertoire. So if you're banging out the same type of thing over and over again, yes, they will make that work light and then scarce. But if you need to create something niche, something one-off, something new, they'll slip off the bleeding edge into the comfortable valley of the familiar at every step.

I choose to look at it as an opportunity to spend more time on the interesting problems, and work at a higher level. We used to worry about pointers and memory allocation. Now we will worry less and less about how the code is written and more about the result it built.


Take food for example. We don't eat food made by computers even though they're capable of making it from start to finish.

Sure we eat carrots probably assisted by machines, but we are not eating dishes like protein bars all day every day.

Our food is still better enjoyed when made by a chef.

Software engineering will be the same. No one will want to use software made by a machine all day every day. There are differences in the execution and implementation.

No one will want to read books entirely dreamed up by AI. Subtle parts of the books make us feel something only a human could have put right there right then.

No one will want to see movies entirely made by AI.

The list goes on.

But you might say "software is different". Yes but no: in an abundance of choice, when there is a ton of choice for a given type of software due to the productivity increase, choice will become more prominent and the human-driven software will win.

Even today we pick the best terminal emulation software because we notice the difference between exquisitely crafted and bloated cruft.


You should look at other engineering disciplines. How many highway overpasses have unique “chef quality” designs? Very few. Most engineering is commodity replication of existing designs. The exact same thing applies to software engineering. Most of us engineers are replicating designs that came earlier. LLMs are good at generating the rote designs that make up the bulk of software by volume. Who benefits from an artisanal REST interface? The best practices were codified over a decade ago.


> How many highway overpasses have unique “chef quality” designs?

Have you ever built a highway overpass? That kind of engineering is complex and interdisciplinary. You need to carry out extensive traffic pattern analysis and soil composition testing to even know where it should go.

We're at a point where we've already automated all the simple stuff. If you want a website, you don't type out html tags. You use Squarespace or Wordpress or whatever. If you need a backend, you use Airtable. We already spend most of our time on the tricky stuff. Sure, it's nice that LLMs can smooth the rough edges of workflows that nobody's bothered to refine yet, but the software commodities of the world have already been commodified.


Just like cooking in the middle ages. As the kitchen, hygiene, etc. got better, so did the chefs and so did the food.

This is just a transition.

Re: the REST API, you're right. But again, we use roombas to vacuum when the floor layout is friendly to them. Not all rooms can be vacuumed by roombas. A simple REST API can be emitted one-shot from an LLM and there is no room for interpretation. But ask a future LLM to make a new kind of social network and you'll end up with a mash-up of the existing ones.

Same thing, you and I won't use a manual screwdriver when we have 100 screws to get in, and we own an electric drill.

That didn't reinvent screws nor the assembly of complex items.

I'm keeping positive in the sense that LLMs will enable us to do more, and to learn faster.

The sad part about vibe coding is you learn very little. And to live is to learn.

You'll notice people vibecoding all day become less and less attached to the product they work on. That's because they've given away the dopamine hits of the many "a-ha" moments that come from programming. They'll lose interest. They won't learn anymore and die off (career-wise).

So, businesses that put LLM first will slowly lose talent over time, and business that put developers first will thrive.

It's just a transition. A fast one that hits us like a wall, and it's confusing, but software for humans will be better made by humans.

I've been programming since the 80s. The level of complexity today is bat shit insane. I welcome the LLM help in managing 3 code bases of 3 languages spread across different architectures (my job) to keep sane!


I disagree with the vibecoding take. It's a new skill that absolutely has a place in a developer's skillset, and it may be of great importance for some kinds of projects. You can learn so much by vibecoding little projects that otherwise would never see the light of day.


There is a part of this that is true. But when you get to the nuanced parts of every "replicated design", or need tweaks, or what the AI gave you is just wrong, that deteriorates quality.

For many tasks it is ok, for others it is just a NO.

For software maintenance and evolution I think it won't cut it.

The same way a Wordpress website can do a set of useful things. But when you need something specific, you just drop to programming.

You can have your e-commerce website. But you cannot ask it to give you a "pipeline that executes as fast as possible for calculating and solving math for engineering task X". That needs SIMD, parallelization, understanding the niche use you need, etc., which most people probably do not do all the time and which requires specific knowledge.


Is your argument that we only want things that are hand-crafted by humans?

There are lots of things like perfectly machined nails, tools, etc. that are much better done by machines. Why couldn't software be one of those?


> So if you're banging out the same type of thing over and over again, yes, they will make that work light and then scarce.

The same thing over and over again should be a SaaS, some internal tool, or a plugin. Computers are good at doing the same thing over and over again and that's what we've been using them for

> But if you need to create something niche, something one-off, something new, they'll slip off the bleeding edge into the comfortable valley of the familiar at every step.

Even if the high level description of a task may be similar to another, there's always something different in the implementation. A sports car and a sedan have roughly the same components, but they're not engineered the same.

> We used to worry about pointers and memory allocation.

Some still do. It's not in every case that you will have a system that handles allocations and a garbage collector. And even in those, you will see memory leaks.

> Now we will worry less and less about how the code is written and more about the result it built.

Wasn't that Dreamweaver?


I think your image of LLMs is a bit outdated. Claude Code with well-configured agents will get entirely novel stuff done pretty well, and that’s only going to get better over time.

I wouldn’t want to bet my career on that anyway.


I am all ears. What is your setup?


As of today NONE of the known AI codebots can solve correctly ANY of the 50+ programming exercises we use to interview fresh grads or summer interns. NONE! Not even level 1 problems that can be solved in fewer than 20 lines of code with a bit of middle school math.


After 25+ years in this field, having interviewed ~100 people for both my startup and other companies, I'm having a hard time believing this. You're either in an extremely niche field (such as to make your statement irrelevant to 99.9% of the industry), or it's hyperbole, or straight up bs.

Interviewing is an art, and IME "gotcha" types of questions never work. You want to search for real-world capabilities, and like it or not the questions need to match those expectations. If you're hiring summer interns and the SotA models can't solve those questions, then you're doing something wrong. Sorry, but having used these tools for the past three years, this is extremely hard to believe.

I of course understand if you can't, but sharing even one of those questions would be nice.


I agree, it’s hard to believe. Hopefully the original comment author can share one of those questions.


I would love to see just one


I promise you that I can show you how to reliably solve any of them using any of the latest OpenAI models. Email me if you want proof; josh.d.griffith at gmail


I'd watch that show, ideally with a few base rules though, e.g.

- the problems to solve must NOT be part of the training set

- the person using the tool (e.g. OpenAI, Claude, DevStral, DeepSeek, etc) must NOT be able to solve problems alone

as I believe otherwise the 1st is "just" search and the 2nd is basically offloading the actual problem solving to the user.


> the person using the tool (e.g. OpenAI, Claude, DevStral, DeepSeek, etc) must NOT be able to solve problems alone

I think this is a good point, as I find the operator's input is often forgotten when considering the AI's output. If it took me an hour and decades of expertise to get the AI to output the right program, did the AI really do it? Could someone without my expertise get the same result?

If not, then maybe we are wasting our time trying to mash our skills through vector space via a chat interface.


I'm talking about generalized solutions that solve all of them.


It's definitely scary in a way.

However I'm still finding a trend even in my org; better non-AI developers tend to be better at using AI to develop.

AI still forgets requirements.

I'm currently running an experiment where I try to get a design and then execute on an enterprise 'SAAS-replacement' application [0].

AI can spit forth a completely convincing looking overall project plan [1] that has gaps if anyone, even the AI itself, tries to execute on the plan; this is where a proper, experienced developer can step in at the right steps to help out.

IDK if that's the right way to venture into the brave new world, but I am at least doing my best to be at a forefront of how my org is using the tech.

[0] - I figured it was a good exercise for testing limits of both my skills prompting and the AI's capability. I do not expect success.


AI does not forget requirements when you use a spec driven AI tool like Kiro


Are you on the Kiro marketing team?


> I’m basically just the conductor of all those processes.

a car moves faster than you, can last longer than you, and can carry much more than you. But somehow, people don't seem to be scared of cars displacing them (yet)? Perhaps autodriving would in the near future, but there still needs to be someone making decisions on how best to utilize that car - surely, it isn't deciding to go to destination A without someone telling it.

> I feel like I’m doing the work of an entire org that used to need twenty engineers.

and this is great. A combine harvester does the work of what used to be an entire village for a week in a day. More output for less people/resources expended means more wealth produced.


> a car moves faster than you, can last longer than you, and can carry much more than you. But somehow, people don't seem to be scared of cars displacing them(yet)?

People whose lives were based around using horses for transportation were very scared of cars replacing them though, and correctly so, because transportation by horse is something people do for leisure today, not out of necessity. I feel like that's a more apt analogy than comparing cars to any human.

> More output for less people/resources expended means more wealth produced.

This is true, but it probably also means that this "more wealth produced" will be more concentrated, because it's easier to convince one person using AI that you should have half of the wealth they produce, rather than convincing 100 people you should have half of what they produce. From where I'm standing, it seems to have the same effects (but not as widespread or impactful, yet) as industrialization, that induced that side-effect as well.


Analogies are not going to work. But it's just as likely that, in the worst case, we are stagecoach drivers who have to use cars when we just really love the quiet slowness of horses.


And parent is scared of being made redundant by AI because they need their job to pay for their car, insurance, gas and repairs.


> a car moves faster than you, can last longer than you, and can carry much more than you. But somehow, people don't seem to be scared of cars displacing them(yet)?

???

Cars replaced horses, not people.

In this scenario you are the horse.


Well no, you'd be the horse driver who becomes a car driver


> Well no, you'd be the horse driver who becomes a car driver

Well, that's the crux of the argument. The pro-AI devs are making the claim that devs are the horse-drivers; the anti-AI devs are making the claim that devs are the horses themselves.

There is no objective way to verify who is right in this case, we just have to see it play out.


I don't really understand what you are saying... Anyways glad you got what I am saying at least


That's kind of the point of the article, though.

Sure LLMs can churn out code, and they sort of work for developers who already understand code and design, but what happens when that junior dev with no hard experience builds their years of experience with LLMs?

Over time those who actually understand what the LLMs are doing and how to correct the output are replaced by developers who've never learned the hard lessons of writing code line by line. The ability to reason about code gets lost.

This points to the hard problem that the article highlights. The hard problem of software is actually knowing how to write it, which usually takes years, sometimes up to a decade of real experience.

Any idiot can churn out code that doesn't work. But working, effective software takes a lot of skill that LLMs will be stripping people of. Leaving a market there for people who have actually put the time in and understand software.


My experience with these tools is far and away nowhere close to this.

If you're really able to do the work of a 20 man org on your own, start a business.


This is not how I think about it. Me and the coding assistant is better than me or the coding assistant separately.

For me it's not about me or the coding assistant, it's me and the coding assistant. But I'm also not a professional coder; I don't identify as a coder. I've been fiddling with programming my whole life, but never had it as a title. I've worked more from the product side or the stakeholder side, but always got more involved, as I could speak with the dev team.

This also makes it natural for me to work side-by-side with the coding assistant, compared maybe to pure coders, who are used to keeping the coding side to themselves.


I have been using the most recent Claude, ChatGPT and Gemini models for coding for a bit more than a year, on a daily basis.

They are pretty good at writing code *after* I have thoroughly described what to do, step by step. If you miss a small detail they get loose and the end result is a complete mess that takes hours to clean up. This still requires years of coding experience and planning ahead in your head; you won't be able to do without that, or replace developers with LLMs. They are like autocomplete on steroids, that's pretty much it.


Yes what you are describing is exactly what Kiro solves


> Through Kiro, we reinvented how developers work with AI agents.

Even according to its documentation it is still built for developers, so my point still stands. You need dev experience to use this tool, same as other LLM-based coding tools.


I am sorry to say you are not a good programmer.

I mean, AIs can drop something fast the same way you cannot beat a computer at adding or multiplying.

After that, you find mistakes, false positives, code that does not work fully, and the worst part is the last one: code that does not work fully but also, as a consequence, that you do NOT understand yet.

That is where your time shrinks: now you need to review it.

Also, they do not design systems better. Maybe partial pieces. Give them something complex and they will hallucinate worse solutions than what you already know if you have, let us say, over 10 years of experience programming in a language (or maybe 5).

Now multiply this unreliability problem as the code you "AI-generate" grows.

Now you have a system you do not know if it is reliable and that you do not understand to modify. Congrats...

I use AI moderately for the tasks it is good at: generate some scripts, give me this small typical function, and I review it.

Review my code: as a person who knows the language well, I will discard part of your mistakes and hallucinations and will maybe find a few valuable things.

Also, when they reviewed and found problems in my code, I saw that the LLMs really need to hallucinate errors that do not exist to justify their help. This is just something LLMs seem to not be accurate at.

Also, when problems go a bit more atypical or past a level of difficulty, it gets much more unreliable.

All in all: you are going to need humans. I do not know how many, I do not know how much they will improve. I just know that they are not reliable, and this "generate fast but unreliable vs. now I do not know the codebase" problem is a fundamental obstacle that I think is very difficult, if not impossible, to work around.


I feel you, it's scary. But the possibilities we're presented with are incredible. I'm revisiting all these projects that I put aside because they were "too big" or "too much for a machine". It's quite exciting


>> Coding AIs design software better than me

Absolutely flat out not true.

I'm extremely pro-faster-keyboard. I use the faster keyboards at almost every opportunity I can, I've been amazed by debugging skills (in fairness, I've also been very disappointed many times), I've been bowled over by my faster keyboard's ability to whip out HTML UIs in record time, and I've been genuinely impressed by my faster keyboard's ability to flag flaws in PRs I'm reviewing.

All this to say, I see lots of value in faster keyboards, but add all the prompts, skills and hooks you like, explain in as much detail as you like about modularisation, and still "agents" cannot design software as well as a human.

Whatever the underlying mechanism of an LLM (to call it a next-token predictor is dismissively underselling its capabilities), it does not have a mechanism to decompose a problem into independently solvable pieces. While that remains true - and I've seen zero precursor of a coming change here; the state of the art today is equivalent to having the agent employ a todo list - LLMs cannot design better than humans.

There are many simple CRUD line-of-business apps where they design well enough (or, more accurately stated, the problem is small/simple enough) that this lack of design skill in LLMs or agents doesn't matter. But don't confuse that with being able to design software in the more general case.


Exactly. For the thing that has been done on GitHub 10,000 times over, LLMs are pretty awesome and they speed up your job significantly (though it's arguable whether you would be better off using some already-built abstraction in that case).

But try to do something novel and... they become nearly useless. Not like anything particularly difficult, just something that's so niche it's never been done before. It will most likely hallucinate some methods and call it a day.

As a personal anecdote, I was doing some LTSpice simulations and tried to get Claude Sonnet to write a plot expression to convert reactance to apparent capacitance in an AC sweep. It hallucinated pretty much the entire thing, and got the equation wrong (assumed the source was unit intensity, while LTSpice models AC circuits with unit voltage. This surely is on the internet, but apparently has never been written alongside the need to convert an impedance to capacitance!).


Try having your engineers pick up some product work. Clients do NOT want to talk to bots.


> Coding AIs design software better than me, review code better than me, find hard-to-find bugs better than me, plan long-running projects better than me, make decisions based on research, literature, and also the state of our projects better than me.

They don't do any of that better than me; they do it poorer and faster, but well enough for most of the time.


Then you are using the wrong AI tools or using them poorly


There will be a need. Don't worry. Most people still haven't figured out how to properly read and interpret instructions. So they build things incorrectly - with or without AI

Seriously. The bar is that low. When people say "AI slop" I just chuckle because it's not "AI" it's everyone. That's the general state of the industry.

So all you have to do is stay engaged, ask questions, and understand the requirements. Know what it is you're building and you'll be fine.


More than any other effect they have, LLMs breed something called "learned helplessness". You just listed a few things it may stay better than you at, and a few things that it is not better than you at and never will be.

Planning long running projects and deciding are things only you can do well!! Humans manage costs. We look out for our future. We worry. We have excitement, and pride. It wants you to think none of these things matter of course, because it doesn't have them. It says plausible things at random, basically. It can't love, it can't care, it won't persist.

WHATEVER you do, don't let it make you forget that it's a bag of words and you are something almost infinitely more capable, not in spite of human "flaws" like caring, but because of them :)


Plus I think I've almost never seen so little competition for what I think are the real prizes! Everyone's off making copies of copies of copies of the same crappy infrastructure we already have. They're busy building small inconsequential side projects so they can say they built something using an LLM.


> They're busy building small inconsequential side projects

Unironically, sending a program off to build those for me has saved me an almost endless amount of time. I'm a pretty distracted individual, and pretty anal about my workflow/environment, so lots of times I've spent hours going into rabbit-holes to make something better, when I could have just sucked it up and done it the manual way instead, even if it takes mental energy.

Now, I can still do those things, but not spend hours, just a couple of minutes, and come back after 20-30 minutes to something that lets me avoid that stuff wholesale. Once you start stacking these things, it tends to save a lot of time and more importantly, mental energy.

So the programs by themselves are basically "small inconsequential side projects" because they're not "production worthy and web scale SaaS ready to earn money", but they help me and others who are building those things in a big way.


But isn't that exactly the kind of learned helplessness being discussed? As a fellow distracted individual, I have seen instant gratification erode all of my most prized hobbies and skills. Why read a book when I can scroll on my phone? My distress tolerance is lower than ever. LLMs feel like a bridge too far, for me anyway.


Nothing has been eroded for me, in fact it had the opposite effect. It's easier to get into new hobbies, easier to develop skills, I value reading on my own more than I did before. At least for me, LLMs act as multipliers of what I can and want to do, it hasn't removed my passion for music production, 3D, animation or programming one bit, if anything it's fueled those passions and let me do stuff within them faster and better.


Nothing I could make would be very good. So the only reason I would, say, write, is in order to write, not to have produced an essay. Hobbies are ways to pass time productively. If it took less time, it wouldn't be a better use of time, but a worse one.


It's not about being able to do more faster, but be able to faster get help doing what you wanted to do. For example, before LLMs, if I wanted to figure out how to do something with a specific analog synth I basically spent time reading manuals and browsing internet forums, piecing together whatever I could find into something actionable, sometimes slightly wrong, but at least in the right direction.

Nowadays, I fire off the LLM to figure it out for me, then try out what I get back, and I can move on to actually having fun playing on the synth, rather than trying to figure out how to do what I wanted to do.

The end goal for me with my hobbies is more or less the same, have fun. But for me the fun is not digging through manuals, it is to "do" or "use" or "perform" or whatever. I like music production because I like to make music, not because I like digging through manuals for some arcane knowledge.


But looking up information via an LLM is an entirely different category of usage. I have no problem with that (well, much less of a problem).


The point is "things that used to take me hours, can now be done by a magic computer program in the background, while I do other things". It's applicable for small unix utilities I create to make my development UX better, it's applicable for when I'm doing music production and it's applicable in a wide-range of tasks both professionally and for my hobbies.

It saves me from stuff I find boring yet necessary, so I can focus more on the fun stuff. I guess this was the overall point I was trying to make in this comment-chain.


Yea I've been seeing very similar behavior from people. They think of themselves as static, unchanging, uncreative but view LLMs as some kind of unrelenting and inevitable innovative force...

I think it's people's anxieties and fears about the uncertainty about the value of their own cognitive labor demoralizing them and making them doubt their own self-efficacy. Which I think is an understandable reaction in the face of trillion dollar companies frothing at the mouth to replace you with pale imitations.

Best name I could think of calling this narrative / myth is people believing in "effortless AI": https://www.insidevoice.ai/p/effortless-ai


You are still in denial of what an LLM actually is capable of in the near-mid term.

In the current architecture there are mathematical limitations on what it can do with information. However, tool use and external orchestration allow it to work around many (maybe all) of those limitations.

The current models have brittle parts and some bad tendencies.. but they will continue to eat up the executive thought ladder.

I think it is better to understand this and position yourself higher and higher on that chain while learning what are the weak areas in the current generation of models.

Your line of thinking is like hiding in a corner while the water is rising. You are right, it is a safe corner, but probably not for long.


I don't think the limitations on what it can do are mathematical at all. It has no faith, no conviction, no sense of self. No philosophy, no ability to learn. How could it undertake a major effort?

I'm as high on the chain as it is possible to get! I don't use AI at all. Models help people follow, but I'm leading. Bite me.


No reason to be uncivil.

Just so we are clear, you are saying you don't use it at all, but you are providing advice about it? Specifically detailing with certainty that the current state of the art has or doesn't have certain traits or abilities.


Yes. I'm not providing advice on how to use it, I'm providing advice on whether or not to use it. A million people cried out that I would be obsolete. I would be replaced: left behind. Career suicide one said LOL.

I think I'm the perfect person to be qualified to stand up and say "if they tell you you can't live without it, they are lying to your face." Only someone who has lived without it as I have would be in a position to know


Where the hell was all this fear when the push for open source everything got fully underway? When entire websites were being spawned and scaffolded with just a couple lines of code? Do we not remember all those impressive tech demos of developers doing massive complex thing with "just one line of code"? How did we not just write software for every kind of software problem that could exist by now?

How has free code, developed by humans, become more available than ever and yet somehow we have had to employ more and more developers? Why didn't we trend toward less developers?

It just doesn't make sense. AI is nothing but a snippet generator, a static analyzer, a linter, a compiler, an LSP, a google search, a copy paste from stackoverflow, all technologies we've had for a long time, all things developers used to have to go without at some point in history.

I don't have the answers.


> Coding AIs design software better than me, review code better than me, find hard-to-find bugs better than me, plan long-running projects better than me, make decisions based on research, literature, and also the state of our projects better than me

ChatGPT, is that you?


Perfect economic substitution in coding won't happen for a long time. Meanwhile, AI appears as an amplifier to the human and vice versa. That the work will change is scary, but the change also opens up possibilities, many of them hard to imagine right now.


Notice who makes these predictions that programmers will become irrelevant.


Stop freaking out. Seriously. You're afraid of something completely ridiculous.

It is certainly more eloquent than you regarding software architecture (which was a scam all along, but that's a conversation for another time). It will find SOME bugs better than you, that's a given.

Review code better than you? Seriously? What are you using, and what do you consider code review? Assume I identified that one change broke production and you reviewed the latest commit. I am pinging you and you better answer. Ok, Claude broke production, now what? Can you begin to understand the difference between you and the generative technology? When you hop on the call, you will explain to me in a great deal of detail what you know about the system you built, and explain decision making and changes over time. You'll tell me about what worked and what didn't. You will tell me about the risks, behavior, and expectations. About where the code runs, its dependencies, users, usage patterns, load, CPU usage and memory footprint; you could probably tell what's happening without looking at logs, just at metrics. With Claude I get: you're absolutely right! You asked about what it WAS, but I told you about what it WASN'T! MY BAD.

Knowledge requires a soul to experience and this is why you're paid.


We use code rabbit and it's better than practically any human I've worked with at a number of code review tasks, such as finding vulnerabilities, highlighting configuration issues, bad practices, etc. It's not the greatest at "does this make sense here" type questions, but I'd be the one answering those questions anyway.

Yeah, maybe the people I've worked with suck at code reviews, but that's pretty normal.

Not to say your answer is wrong. I think the gist is accurate. But I think tooling will get better at answering exactly the kind of questions you bring up.

Also, someone has to be responsible. I don't think the industry can continue with this "AI broke it" BS. Our jobs might devolve into something more akin to an SDET role plus writing the "last mile" of novel code the AI can't produce accurately.


  > We use code rabbit and it's better than practically any human
code rabbit does find things occasionally, but it also calls things 'critical' that aren't, flags issues that don't actually exist, and even lies in replies sometimes...

it also is extremely verbose, to the point of being a slog to go through... and the haikus: they are so cringe and infantilizing...

maybe it's our config, but code rabbit has been underwhelming...


> Review code better than you? Seriously?

Yes, seriously (not OP). Sometimes it's dumb as rocks, sometimes it's frighteningly astute.

I'm not sure at which point of the technology sigmoid curve we find ourselves (2007 iPhone or 2017 iPhone?) but you're doing yourself a disservice to be so dismissive


Copilot reviews are enabled company-wide and comments must be resolved manually. I wish I could be so dismissive, lol. I cannot; I literally do not have the ability to be dismissive.


>I really really want this to be true. I want to be relevant

Think of yourself as a chef and LLMs as ready-to-eat meals or a recipe app. Can ready-to-eat meals OR recipe apps put a chef out of business?


The AI is pretty scary if you think most of software engineering is about authoring individual methods and rubber ducking about colors of paint and brands of tools.

Once you learn that it's mostly about interacting with a customer (sometimes this is yourself), you will realize the AI is pretty awful at handling even the most basic tasks.

Following a product vision, selecting an appropriate architecture and eschewing 3rd party slop are examples of critical areas where these models are either fundamentally incapable or adversely aligned. I find I have to probe ChatGPT very hard to get it to offer a direct implementation of something like a SAML service provider. This isn't a particularly difficult thing to do in a language like C# with all of the built in XML libraries, but the LLM will constantly try to push you to use 3rd party and cloud shit throughout. If you don't have strong internal convictions (vision) about what you really want, it's going to take you for a ride.
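To make the "built-in libraries are enough" point concrete, here is a rough stdlib-only sketch (in Python rather than the C# mentioned above, with made-up names; a real service provider must also validate the XML signature and conditions, which is deliberately left out):

  # Minimal illustration: pull the subject and attributes out of a SAML 2.0
  # response using only the standard library. Signature and condition checks,
  # which a real service provider must do, are deliberately omitted.
  import base64
  import xml.etree.ElementTree as ET

  NS = {
      "samlp": "urn:oasis:names:tc:SAML:2.0:protocol",
      "saml": "urn:oasis:names:tc:SAML:2.0:assertion",
  }

  def parse_saml_response(encoded_response: str) -> dict:
      root = ET.fromstring(base64.b64decode(encoded_response))
      name_id = root.findtext(".//saml:Subject/saml:NameID", namespaces=NS)
      attributes = {
          attr.get("Name"): [v.text for v in attr.findall("saml:AttributeValue", NS)]
          for attr in root.findall(".//saml:AttributeStatement/saml:Attribute", NS)
      }
      return {"name_id": name_id, "attributes": attributes}

The point is not that this is production-ready; it is that the core of the exchange is plain XML handling, which the platform already gives you.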

One other thing to remember is that our economies are incredibly efficient. The statistical mean of all information in sight of the LLMs likely does not represent much of an arbitrage opportunity at scale. Everyone else has access to the same information. This also means that composing these systems in recursive or agentic styles means you aren't gaining anything. You cannot increase the information content of a system by simply creating another instance of the same system and having it argue with itself. There usually exists some simple prompt that makes a multi agent Rube Goldberg contraption look silly.

> I’m basically just the conductor of all those processes.

"Basically" and "just" are doing some heroic weight lifting here. Effectively conducting all of the things an LLM is good at still requires a lot of experience. Making the constraints live together in one happy place is the hard part. This is why some of us call it "engineering".


This reads like shilling/advertisement. Coding AIs struggle with anything remotely complex, make up crap and present it as research, write tests that are just "return true", and won't ever question a decision you make.

Those twenty engineers must not have produced much.


I think part of what is happening here is that different developers on HN have very different jobs and skill levels. If you are just writing a large volume of code over and over again to do the same sort of things, then LLMs probably could take your job. A lot of people have joined the industry over time, and it seems like the intelligence bar moved lower and lower over time, particularly for people churning out large volumes of boilerplate code. If you are doing relatively novel stuff, at least in the sense that your abstractions are novel and the shape of the abstraction set is different from the standard things that exist in tutorials etc online, then the LLM will probably not work well with your style.

So some people are panicking and they are probably right, and some other people are rolling their eyes and they are probably right too. I think the real risk is that dumping out loads of boilerplate becomes so cheap and reliable that people who can actually fluently design coherent abstractions are no longer as needed. I am skeptical this will happen though, as there doesn’t seem to be a way around the problem of the giant indigestible hairball (I.e as you have more and more boilerplate it becomes harder to remain coherent).


Indeed, discussions on LLMs for coding sound like what you would expect if you asked a room full of people to snatch up a 20 kg dumbbell once and then tell you if it's heavy.

> I think the real risk is that dumping out loads of boilerplate becomes so cheap and reliable that people who can actually fluently design coherent abstractions are no longer as needed.

Cough front-end cough web cough development. Admittedly, original patterns can still be invented, but many (most?) of us don't need that level of creativity in our projects.


Absolutely this, and TFA touches on the point about natural language being insufficiently precise:

AI can write you an entire CRUD app in minutes, and with some back-and-forth you can have an actually-good CRUD app in a few hours.

But AI is not very good (anecdotally, based on my experience) at writing fintech-type code. It's also not very good at writing intricate security stuff like heap overflows. I've never tried, but would certainly never trust it to write cryptography correctly, based on my experience with the latter two topics.

All of the above is "coding", but AI is only good at a subset of it.


Generating CRUD is like solving cancer in mice; we already have a dizzying array of effective solutions… Ruby on Rails, Access 97, model-first ORMs with GUI mappers. SharePoint lets anyone do all the things easily.

The issue is and always has been maintenance and evolution. Early missteps cause limitations, customer volume creates momentum, and suddenly real engineering is needed.

I’d be a lot more worried about our jobs if these systems were explaining to people how to solve all their problems with a little Emacs scripting. As is they’re like hyper aggressive tech sales people, happy just to see entanglements, not thinking about the whole business cycle.


Go with Laravel and some admin packages and you can generate CRUD pages in minutes. And with Django, I think that's built in.
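(The built-in part being the Django admin: register a model and you get list/add/edit/delete pages for free. A minimal sketch, with a made-up model:)

  # myapp/models.py - a made-up example model
  from django.db import models

  class Customer(models.Model):
      name = models.CharField(max_length=200)
      email = models.EmailField()
      created_at = models.DateTimeField(auto_now_add=True)

  # myapp/admin.py - this one registration gives you full CRUD pages under /admin
  from django.contrib import admin
  from .models import Customer

  admin.site.register(Customer)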

But I don’t think I’ve seen pure CRUD on anything other than a prototype. Add an Identity and Access Management subsystem and the complexity of the requirements will explode. Then you add integration with external services and legacy systems, and that’s where the bulk of the work is. And there’s the scalability issue that is always looming.

Creating a CRUD app is barely a level over starting a new project with the IDE wizard.


>Creating a CRUD app is barely a level over starting a new project with the IDE wizard.

For you, maybe. But for a non-programmer who's starting a business or just needs a website, it's the difference between hiring some web dev firm and doing it themselves.


  > it's the difference between hiring some web dev firm and doing it themselves.
anecdote, but i've had a lot of acquaintances who started at both "hiring some web dev firm" and "doing it themselves", with the results largely being the same: "help me fix this unmaintainable mess and i will pay you x"...

jmo but i suspect llms will allow the latter to go further before the "help me" phase, but i feel like that ain't going away completely...


Just like my previous comments, much depends on the specifics.

My wife's sister and her husband run a small retail shop in $large_city. My sister-in-law taught herself how to set up and modify a website with a shopify storefront largely with LLM help. Now they take online orders. I've looked at the code she wrote and it's not pretty but it generally works. There will probably never be a "help me fix this unmaintainable mess and I will pay you" moment in the life of that business.

The crux of my point is this: In 2015 she would have had to hire somebody to do that work.

This segment of the software industry is where the "LLMs will take our jerbs" argument is coming from.

The people who say "AI is junk and it can't do anything right" are simply operating in a different part of the industry.


> and with some back-and-forth you can have an actually-good CRUD app in a few hours

Perhaps the debate is on what constitutes "actually-good". Depends where the bar is I suppose.


Beauty is in the eye of the beholder. Litigating our personal opinions about "actually-good" is irrelevant and pointless.


> different developers on HN have very different jobs and skill levels.

Definitely this. When I use AIs for web development they do an ok job most of the time. Definitely on par with a junior dev.

For anything outside of that they're still pretty bad. Not useless by any stretch, but it's still a fantasy to think you could replace even a good junior dev with AI in most domains.

I am slightly worried for my job... but only because AI will keep improving and there is a chance it will be as good as me one day. Today it's not a threat at all.


Yea, LLMs produce results on par with what I would expect out of a solid junior developer. They take direction, their models act as the “do the research” part, and they output lots of code: code that has to be carefully scrutinized and refined. They are like very ambitious interns who never get tired and want to please, but often just produce crap that has to be totally redone or refactored heavily in order to go into production.

If you think LLMs are “better programmers than you,” well, I have some disappointing news for you that might take you a while to accept.


> LLMs produce results on par with what I would expect out of a solid junior developer

This is a common take but it hasn't been my experience. LLMs produce results that vary from expert all the way to slightly better than markov chains. The average result might be equal to a junior developer, and the worst case doesn't happen that often, but the fact that it happens from time to time makes it completely unreliable for a lot of tasks.

Junior developers are much more consistent. Sure, you will find the occasional developer that would delete the test file rather than fixing the tests, but either they will learn their lesson after seeing your wth face or you can fire them. Can't do that with llms.


I think any further discussion about quality just needs to have the following metadata:

- Language

- Total LOC

- Subject matter expertise required

- Total dependency chain

- Subjective score (audited randomly)

And we can start doing some analysis. Otherwise we're pissing into ten kinds of winds.

My own subjective experience: it's earth-shattering at web apps in HTML and CSS (because I'm terrible and slow at that), annoyingly good but usually a bit wrong at planning and optimization in Rust, and horribly lost at systems design or debugging a reasonably large Rust system.


I agree that these discussions (this whole HN thread, tbh) are seriously lacking the concrete examples needed to be more than holy wars 3.0.

Besides one point: junior developers can learn from their egregious mistakes; LLMs can't, no matter how strongly worded you are in their system prompt.

In a functional work environment, you will build trust with your coworkers little by little. The pale equivalent in LLMs is improving system prompts and writing more and more ai directives that might or might not be followed.


This seems to be one of the huge weaknesses of current LLMs: Despite the words "intelligence" and "machine learning" we throw around, they aren't really able to learn and improve their skills without someone changing the model. So, they repeat the same mistakes and invent new mistakes by random chance.

If I was tutoring a junior developer, and he accidentally deleted the whole source tree or something egregious, that would be a milestone learning point in his career, and he would never ever do it again. But if the LLM does it accidentally, it will be apologetic, but after the next context window clear, it has the same chances of doing it again.


> Besides one point: junior developers can learn from their egregious mistakes, llms can't no matter how strongly worded you are in their system prompt.

I think if you set off an LLM to do something and it makes an "egregious mistake" in the implementation, and then you adjust the system prompt to explicitly guard against that or steer it toward a different implementation and restart from scratch, yet it makes the exact same "egregious mistake", then you need to try a different model/tool than the one you've been using.

It's common with smaller models, or bigger models that are heavily quantized, that they aren't great at following system/developer prompts, but that really shouldn't happen with the available SOTA models. I haven't had something ignored like that in years by now.


And honestly this is precisely why I don't fear unemployment, but I do fear less employment overall. I can learn and get better and use LLMs as a tool. So there's still a "me" there steering. Eventually this might not be the case. But if automating things has taught me anything, it's that removing the person is usually such a long tail cost that it's cheaper to keep someone in the loop.

But is this like steel production or piloting (a few highly trained experts stay in the loop), or more like warehouse work (where automation removed most of the skilled work, like driving or inventory)?


I can in fact fire an LLM. It's even easier than firing a junior developer.

Or rather, it's more like a contractor. If I don't like the job they did, I don't give them the next job.


You say this as if web development isn't 90% of software.


> If you are just writing a large volume of code over and over again

But why would you do that? Wouldn't you just have your own library of code eventually that you just sell and sell again with little tweaks? Same money for far less work.


People, at least novice developers, tend to prefer fast and quick boilerplate that makes them look effective, over spending one hour sitting just thinking and designing, then implementing some simple abstraction. This is true today, and been true for as long as I've been in programming.

Besides, not all programming work can be abstracted into a library and reused across projects, not because it's technically infeasible, but because the client doesn't want to, cannot for legal reasons or the developer process at the client's organization simply doesn't support that workflow. Those are just the reasons from the top of my head, that I've encountered before, and I'm sure there is more reasons.


But people don't stay novices after years/decades. Of course, when you write the boilerplate for the 20th time maybe you still accept that, but when you write it for the 2000th time, I bet you do the lazy thing and just copy it.

> cannot for legal reasons or ...

Sure, you can't copy trade secrets, but that's also not the boilerplate part. Copying e.g. a class hierarchy and renaming all the names and replacing the class contents that represent the domain, won't be a legal problem, because this is not original in the first place.


> But people don't stay novices after years/decades

Some absolutely do. I know programmers who entered web development at the same time as me, and now after decades they're still creating typical CRUD applications for whatever their client today is, using the same frameworks and languages. If it works, makes enough money and you're happy, why change?

> Copying e.g. a class hierarchy and renaming all the names and replacing the class contents that represent the domain, won't be a legal problem, because this is not original in the first place.

Some code you produce for others definitively fall under their control, but obviously depends on the contracts and the laws of the country you're in. But I've written code for others that I couldn't just "abstract into a FOSS library and use in this project", even if it wasn't trade secrets or what not, just some utility for reducing boilerplate.


> "abstract into a FOSS library and use in this project"

That is not what I meant. My idea was more like "copy ten lines from this project, then lines from that project, the class from here, but replace every line before the commit ...".

I shouldn't have used the word library, as I did not mean output from the linker, but rather a colloquial meaning: a loose collection of snippets.


That’s a very good point I hadn’t heard explained that way before. Makes a lot of sense and explains a lot of the circular debates about AI that happen here daily.


>at least in the sense that your abstractions are novel and the shape of the abstraction set is different from the standard things that exist

People shouldn't be doing this in the first place. Existing abstractions are sufficient for building any software you want.


> Existing abstractions are sufficient for building any software you want.

Software that doesn't need new abstractions is also already existing. Everything you would need already exists and can be bought much more cheaply than you could do it yourself. Accounting software exists, unreal engine exists and many games use it, why would you ever write something new?


>Software that doesn't need new abstractions is also already existing

This isn't true due to the exponential growth of how many ways you can compose existing abstractions. The chance that a specific permutation will have existing software is small.


I'm supposing that nobody who has a job is producing abstractions that are always novel, but there may be people who find abstractions that are novel for their particular field because it is something most people in that field are not familiar with, or that come up with novel abstractions (infrequently) that improve on existing ones.


The new abstraction is “this corporation owns this IP and has engineers who can fix and extend it at will”. You can’t git clone that.

But if there is something off the shelf that you can use for the task at hand? Great! The stakeholders want it to do these other 3000 things before next summer.


Software development is a bit like chess. 1. e4 is an abstraction available to all projects, 3. Nc3 is available to 20% of projects, while 15. Nxg5 is unique to your own project.

Or, abstractions in your project form a dependency tree, and the nodes near the root are universal, e.g. C, Postgres, json, while the leaf nodes are abstractions peculiar to just your own project.


The possible chess moves are already known ahead of time. Just because an AI can't make up a move like Np5 the way a human could, that doesn't mean the AI can't play chess. It will be fine just using the existing moves that have been found so far. Needing humans to come up with new chess moves is not a requirement for playing chess.


No, it doesn't read like shilling or advertisement. It's tiring hearing people continually dismiss coding agents when they have massively improved, are driving real value despite their limitations, and are only just getting started. I've done things with Claude I never thought possible for myself, and I've done things where Claude made the whole effort take twice as long and 3x more of my time. It's not that people are ignoring the limitations; it's that people can see how powerful they already are and how much more headroom there is even within existing paradigms, not to mention the compute scaling happening in '26-'27 and the idea pipeline from the massive hoarding of talent.


When prices go down or product velocity goes up we'll start believing in the new 20x developer. Until then, it doesn't align with most experiences and just reads like fiction.

You'll notice no one ever seems to talk about the products they're making 20x faster or cheaper.


+1 - I wish at least one of these AI boosters had shown us a real commercialised product they've built.


AI boosters? Like people are planted by Sam Altman, the way they hire crowds for political events or something? Hey! Maybe I'm AI! You're absolutely right!

In seriousness: I'm sure there are projects that are heavily powered by Claude. I and a lot of other people I know use Claude almost exclusively to write code, then leverage it as a tool when reviewing. Almost everyone I hear with this super negative, hostile attitude references some "promise" that has gone unfulfilled, but it's so silly: judge the product they are producing, and maybe, just maybe, consider the rate of progress to _guess_ where things are heading.


I never said "planted", that is your own assumption, albeit a wrong one. I do respect it though, as it is at least a product of a human mind. But you don't have to be "planted" to champion an idea, you are clearly championing it out of some kind of conviction, many seem to do. I was just giving you a bit of reality check.

As for showing me how to "guess where things are heading": I am actually one of the early adopters of LLMs and have been engineering software professionally for almost half my life now. Why do you think I was an early adopter? Because I was skeptical or afraid of that tech? No, I was genuinely excited. Yes, you can produce mountains of code, even more so if you were already an experienced engineer, like myself for example.

Yes, you can even get it to produce somewhat acceptable outputs, with a lot of effort at prompting it and the fatigue that comes with it. But at the end of the day, as an experienced engineer, I am not more productive with it; I end up being less productive because of all the sharp edges I have to take care of, all the sloppily produced code, unnecessary bloat, hallucinated or injected libraries, etc.

Maybe for folks who were not good at maths or had trouble understanding how computers work this looks like a brave new world of opportunities. Surely that app looks good to you, how bad can it be? Just so you and other such vibe-coders understand, here is a parallel.

It is actually fairly simple for a group of aviation enthusiasts to build a flying airplane. We just need to work out some basic mechanics and controls, and attach engines. It can be done; I've seen a couple of documentaries too. However, those planes are shit. Why? Because me and my team of enthusiasts don't have the depth of knowledge of a team of aviation engineers to inform our decisions.

What is the tolerance for certain types of movements, what kind of materials do I need to pick, what should be my maintenance windows for various parts, etc.? There are things experts can decide on almost intuitively, yet with great precision, based on their many years of craft and that wonderful thing called human intelligence. So my team of enthusiasts puts together an airplane. Yeah, it flies. It can even be steered. It rolls, pitches and yaws. It takes off and lands. But to me it's a black box, because I don't understand many, many factors, forces, pressures, tensors, effects etc. that affect an airplane during its flight and takeoff. I am probably not even aware WHAT I should be aware of, because I don't have that deep education in mechanical engineering, materials, aerodynamics etc. Neither does my team. So my plane, while impressive to me and my team, will never take off commercially, not unless a team of professionals takes it over and remakes it to professional standards. It will probably never even fly in a show. And if me or someone on my team dies flying it, you guessed it - our insurance sure as hell won't cover the costs.

So what you are doing with Claude and other tools, while it may look amazing to you, is not that impressive to the rest of us, because we can see the wheels beginning to fall off even before your first takeoff. Of course, before I can even tell that, I'd have to actually see your airplane, its design plans, etc. So perhaps first show us some of those "projects heavily powered by Claude" and their great success, especially commercial success (otherwise it's a toy project), before you talk about them.

The fact that you are clearly not an expert on the topic of software engineering should guide you here - unless you know what you are talking about, it's better to not say anything at all.


How would you know whether he is an expert on the topic of software engineering or not?

For all I know, he is more competent than you; he figured out how to utilize Claude Code in a productive way, which is a point for him.

I'd have to guess whether you are an expert working on software not well suited for AI, or just average with a stubborn attitude towards AI and potentially not having tried the latest generation of models and agentic harnesses.


> How would you know whether he is an expert on the topic of software engineering or not?

Because of their views on the effectiveness of AI agents for generating code.


Considering those views are shared by a number of high profile, skilled engineers, this is obviously no basis for doubting someone's expertise.


I think it's worth framing things back to what we're reacting to. The top poster said:

> I really really want this to be true. I want to be relevant. I don’t know what to do if all those predictions are true and there is no need (or very little need) for programmers anymore.

The rest of the post is basically their human declaration of obsolescence to the programming field. To which someone reacted by saying that this sounds like shilling. And indeed it does for many professional developers, including those that supplement their craft with LLMs. Declaring that you feel inadequate because of LLMs only reveals something about you. Defending this position is a tell that puts anyone sharing that perspective in the same boat: you didn't know what you were doing in the first place. It's like when someone who couldn't solve the "invert a binary tree" problem gets offended because they believed they were tricked into an impossible task. No, you may be a smart person that understands enough of the rudiment of programming to hack some interesting scripts, but that's actually a pretty easy problem and failing to solve it indeed signals that you lack some fundamentals.

> Considering those views are shared by a number of high profile, skilled engineers, this is obviously no basis for doubting someone's expertise.

I've read Antirez, Simon Willison, Bryan Cantrill, and Armin Ronacher on how they work or want to work with AI. From none of them have I gotten this attitude that they're no longer needed as part of the process.


> Considering those views are shared by a number of high profile, skilled engineers, this is obviously no basis for doubting someone's expertise

Again, a lot of fluff, a lot of "a number ofs", "highly this, highly that". But very little concrete information. What happened to the pocket PhDs promised for this past summer? Where are the single-dude billion-dollar companies built with AI tools? Or even the multiple-dude billion-dollar companies? What are you talking about?


I've yet to see it from someone who isn't directly or indirectly affiliated with an organisation that would benefit from increased AI tool adoption. Not saying it's impossible, but...

Whereas there are what feels like endless examples of high profile, skilled engineers who are calling BS on the whole thing.


You can say the same about people saying the opposite. I haven't heard from a single person who says AI can't write code who doesn't have a financial interest, directly or indirectly, in humans writing code.


Nobody says AI "can't write code". It very clearly can.


That seems rather disingenuous to me. I see many posts which clearly come from developers like you and me who are happy with the results they are getting.

Every time, people on here comment something about "shilling" or "boosters". It would seem to me that only in the rarest of cases does someone share their opinion to profit from it, while you act like that is super common.


Right: they disagree with me and so must not know what they're talking about. Hey, guess how I know neither of you is as good as you think you are: your egos! You know what the brightest people at the top of their respective fields have in common? They tend not to think that new technologies they don't understand how to use are dumb, and they don't think everyone who disagrees with them is dumb!


> you are clearly not an expert on the topic of software engineering should guide you here - unless you know what you are talking about, it's better to not say anything at all.

Yikes, pretty condescending. Also wrong!

IMO you are strawmanning pretty heavily here.

Believe it or not, using Claude to improve your productivity is pretty dissimilar to vibe coding a commercial airplane(?) which I would agree is probably not FAA approved.

I prefer not to toot my own horn, but to address the idea you seem to have that I don't know math or CS(?): I have a PhD in astrophysics and a decade of industry experience in tech and other domains, so I'm fairly certain I know how math and computers work. But maybe not!


I’m an expert in what I do. A professional, and few people can do what I do. I have to say you are wrong. AI is changing the game. What you’ve written here might’ve been more relevant about 9 months ago, but everything has changed.


This is a typical no-proof "AI"-boosting response, and from an account created only 35 days ago.


Right I’m a bot made to promote AI like half the people on this thread.

I don’t know if you noticed a difference from other hype cycles but other ones were speculative. This one is also speculative but the greater divide is that the literal on the ground usefulness of AI is ALREADY going to change the world.

The speculation is that the AI will get better and will no longer need hand holding.


I'm having a lot of trouble understanding what you're trying to convey. You say there's a difference from previous "speculation" but also that it's still speculation. Then you go on to write "ALREADY going to" which is future tense (speculation), even clarifying what the speculation is.

Is this sarcasm, ragebait, or a serious argument?


Serious.

So let me explain it more clearly. AI as it is now is already changing the game. It will reduce the demand for SWEs across every company as an eventuality, even if we hold technological progress fixed. There is no speculation here. This comes from on-the-ground evidence: what I see day to day, what I do, and my experience pair programming things from scratch with AI.

The speculation is this: if we follow the trendlines of AI improvement for the past decade and a half, the projection of past improvement indicates AI will only get better and better. It’s a reasonable speculation, but it is nonetheless speculative. I wouldn’t bet my life on continuous improvement of AI to the point of AGI but it’s now more than ever before a speculation that is not unrealistic.


>AI is ALREADY going to change the world.

Nice slop response. This is the same thing said about blockchain and NFTs, same schtick, different tech. The only thing "AI" has done is convince some people that it's a magical being that knows everything. Your comments seem to be somewhere on that spectrum. And, sure what if it isn't changing the world for the better, and actually makes things much worse? You're probably okay with that too, I guess, as long as your precious "AI" is doing the changing.

We've seen what social media and every-waking-hour access to tablets and the internet has done to kids - so much harm that some countries have banned social media for people under a certain age. I can see a future where "AI" will also be banned for minors to use, probably pretty soon too. The harms from "AI" being able to placate instead of create should be obvious, and children shouldn't be able to use it without adult supervision.

>The speculation is that the AI will get better and will no longer need hand holding.

This is nonsense. No AI is going to produce what someone wants without telling it exactly what to do and how to do it, so yes, it will always need hand holding, unless you like slurping up slop. I don't know you, if you aren't a bot, you might just be satisfied with slop? It's a race to the bottom, and it's not going to end up the way you think it will.


>This is nonsense. No AI is going to produce what someone wants without telling it exactly what to do and how to do it, so yes, it will always need hand holding, unless you like slurping up slop. I don't know you, if you aren't a bot, you might just be satisfied with slop? It's a race to the bottom, and it's not going to end up the way you think it will.

You're not thinking clearly. A couple of years ago we didn't even have AI that could do this; then ChatGPT came out and we had AI that could barely do it; then we had AI that could do simple tasks with a lot of hand holding; now we have AI that can do complex human tasks with minimal hand holding. Where do you think the trendline is pointing?

Your hypothesis is going against all the evidence. It's wishful thinking, and irrational. It's a race to the bottom because you wish it to be a race to the bottom, and we both know the trendline is pointing in the opposite direction.

>We've seen what social media and every-waking-hour access to tablets and the internet has done to kids - so much harm that some countries have banned social media for people under a certain age. I can see a future where "AI" will also be banned for minors to use, probably pretty soon too. The harms from "AI" being able to placate instead of create should be obvious, and children shouldn't be able to use it without adult supervision.

I agree AI is bad for us. My claim is that it's going to change the world and is already replacing human tasks. That's all. Whether that's good or bad for us is an ORTHOGONAL argument.


I use AI every day, and it's honestly crap. No, it isn't significantly improving; it's hitting a wall. Every new model release gets less and less impressive, so no, the "trendline" is not going up as much as you seem to think it is. It's plateaued. The only way "AI" is going to change the world is if stupid people put it in places that it really shouldn't be, thinking it will solve problems rather than create even more of them.


Proof of what? Should you also have to prove you are not a bot sponsored by short-sellers? It's all so, so silly. The anti-AI crowd on HN rehashes so many of the same tired arguments it's ridiculous:

- Bad for the environment: how? Why?

- Takes all creative output and doesn't credit it: Common Crawl has been around for decades and models have been training for decades; the difference is that now they're good. Regurgitating training data is a known issue for which there are mitigations, but welcome to the world of things not being as idealistic as the Stallman-esque hellscape everyone seems to want to live in.

- It's bad, so no one should use it, and any professionals who do don't know what they're doing: I have been fortunate to personally know some of the brightest minds on this planet (astro departments, AI research labs) and the majority of them use AI for their jobs.


>Should you also have to prove you are not a bot sponsored by short-sellers?

On a 35 day-old account, yes. Anything "post-AI" is suspect now.

The rest of your comment reads like manufactured AI slop, replying to things I never even wrote in my one sentence comment. And no surprise coming from an account created 1 day ago.


I think it’s quite obvious I’m not writing AI slop.

The latest ChatGPT, for example, will produce comments that are distinguishable from the real thing only because they're much better written. It's insane that the main visible marker rn is that the arguments and writing it crafts are superior to what your average joe can write.

My shit writing can't hold a candle to it, and that's pretty obvious. AI slop is not accepted here, but I can post an example of what AI slop will now look like. If AI responded to you, it would look like this:

Fair to be skeptical of new accounts. But account age and “sounds like AI” are not workable filters for truth. Humans can write like bots, bots can write like humans, and both can be new. That standard selects for tenure, not correctness.

More importantly, you did not engage any claim. If the position is simply “post-AI content from new accounts is suspect,” say that as a moderation concern. But as an argument, suspicion alone does not refute anything.

Pick one concrete claim and say why it is wrong or what evidence would change your mind. Otherwise “this reads like slop” is just pattern matching. That is exactly the failure mode being complained about.


I accused another user of writing AI slop in this specific thread, and here you are inserting yourself as if you are replying to a comment I made to the other user. You certainly seem desperate to boost "AI" as much as you can. Your 37-day-old account is also just as suspect as their 3-day-old account. I'm not engaging with you any more, so replying is kind of pointless.


> I’m an expert in what I do. A professional, and few people can do what I do

Are you an astronaut?


Obviously not, troll. I know I'm bragging. But I have to emphasize that it is not some stupid "only domain experts know AI is shit; everyone else is too stupid to understand how bad it is" thing. That is patently wrong.

Few people can do what I do and as a result I likely make more money than you. But now with AI… everyone can do what I do. It has leveled the playing field… what I was before now matters fuck all. Understand?

I still make money right now. But that’s unlikely to last very long. I fully expect it to disappear within the next decade.


You are wrong. People like yourself will likely be smart enough to stay well employed into the future. It's the folks who are arguing with you trying to say that AI is useless who will quickly lose their jobs. And they'll be all shocked Pikachu face when they get a pink slip while their role gets reassigned to an AI agent


> It's the folks who are arguing with you trying to say that AI is useless who will quickly lose their jobs.

Why is it that in every hype cycle there are always guys like you who want to punish the non-believers? It's not enough to be potentially proven correct; your anger requires the demise of the heretics. It was the same story with cryptocurrencies.


He/she is probably one of those poor souls working for an AI-wrapper-startup who received a ton of compensation in "equity", which will be worth nothing when their founders get acquihired, Windsurf style ;) But until then, they get to threaten us all with the impending doom, because hey, they are looking into the eye of the storm, writing Very Complex Queries against the AI API or whatever...


Isn’t this the same type of emotional response he’s getting accused of? You’re speculating that he will be “punished”, just as he speculated about you.

There are emotions on both sides, and the goal is to call them out, throw them to the side and cut through to the substance. The attitude should be “which one of us is actually right?” rather than the “I’m right and you’re a fucking idiot” attitude I see everywhere.


Mate, I could not care less whether he or she got "punished" or not. I was just guessing at what might be driving someone to go and try to answer each and every one of my posts with very low quality comments, reeking of desperation and "elon-style" humour (cheap, cringe puns). You are assuming too much here.


Maybe he was just assuming something negative as well.

Both certainly look very negative and over the top.


Not too dissimilar to you. I wrote long rebuttals to your points and you just descended into put-downs, stalking and false accusations. You essentially told me to fuck off from all of HN in one of your posts.

So it’s not like your anger is any better.


Bro idk why you waste your time writing all this. No one cares that you were an early adopter; all that means is that you used the rudimentary LLM implementations that were available from 2022-2024, which are now completely obsolete. Whatever experience you think you have with AI tools is useless because you clearly haven't kept up with the times. AI platforms and tools have been changing quickly. Every six months the capabilities have massively improved.

Next time before you waste ten minutes typing out these self aggrandizing tirades maybe try asking the AI to just write it for you instead


Maybe he's already ahead of you by not using current models, 2026 models are going to make 2025 models completely obsolete, wasting time on them is dumb.


Hear hear!


This is such a fantastic response. And outsiders should very well be made aware what kind of plane they are stepping into. No offence to the aviation enthusiasts in your example but I will do everything in my power to avoid getting on their plane, in the same way I will do everything in my power to avoid using AI coded software that does anything important or critical...


  > but I will do everything in my power to avoid getting on their plane
speaking of airplanes... considering how much llm usage is being pushed top-down in many places, i wonder how long until news drops of some catastrophic one-liner that got through via llm-generated code...


Are you joking? You realize entire companies and startups are littered with ppl who only use AI.


> littered with ppl who only use AI

"Littered" is a great verb to use here. Also I did not ask for a deviated proxy non-measure, like how many people who are choking themselves to death in a meaningless bullshit job are now surviving by having LLMs generate their spreadsheets and presentations. I asked for solid proof of succesful, commercial products built up by dreaming them up through LLMs.


The proof is all around you. I am talking about software professionals, not some bullshit spreadsheet thing.

What I'm saying is this: from my POV, everyone is using LLMs to write code now. The overwhelming majority of software products in existence today are now being changed with LLM code.

The majority of software products being created from scratch are also mostly LLM code.

This is obvious to me. It's not speculation; where I live, where I'm from, and where I work it's the obvious status quo. When I see someone like you, I figure that, because the change happened so fast, you're one of the people living in a bubble. Your company and the people around you haven't started using it because the culture hasn't caught up.

Wait until you have that one coworker who's going at 10x the speed of everyone else and you find out it's because of AI. That is what will slowly happen to these bubbles. To keep pace you will have to switch to AI to see the difference.

I also don’t know how to offer you proof. Do you use google? If so you’ve used products that have been changed by LLM code. Is that proof? Do you use any products built by a start up in the last year? The majority of that code will be written by an LLM.


> Your company and the people around you haven’t started using it because the culture hasn’t caught up.

We have been using LLMs since 2021, if I haven't repeated that enough in these threads. What culture do I have to catch up with? I have been paying for top-tier LLM models for my entire team since it became an option. Do you think you are proselytizing to the un-initiated here? That is a naive view at best. My issue is that the tools are at best a worse replacement for the pre-2019 Google search, and at worst a huge danger in the hands of people who don't know what they are doing.


Doesn’t make sense to me. If it’s bad why pay for the tool?

Obviously your team disagrees that it's a worse replacement for Google, or else why demand it against your will?

> at worst a huge danger in the hands of people who dont know what they are doing.

I agree with this. But the upside negates this and I agree with your own team on that.

Btw, if you're paying top dollar for AI... your developers are unlikely to be using it as a Google search replacement. At top dollar, AI is used as an agent. What it ends up doing in this mode is extremely different from a Google search. That may be good or bad, but it is a distinctly different outcome than a Google search, and that makes your Google analogy ill-fitted to what your team is actually using it for.


Have you had your head in the sand for the past two years?

At the recent AWS conference, they were showcasing Kiro extensively, with real-life products that have been built with it. And the Amazon developers all allege that they've been using Kiro and other AI tools and agents heavily for the past year-plus to build AWS's own services. Google and Microsoft have also reported similar internal efforts.

The platforms you interact with on a daily basis are now all being built with the help of AI tools and agents

If you think no one is building real commercial products with AI then you are either blind or an idiot, or both. Why don't you just spend two seconds emailing your company's AWS ProServe folks and ask them? I'm sure they'll give you a laundry list of things they're using AI for internally, and sign you up for a Kiro demo as well.


Amazon, Google and Microsoft are balls-deep invested in AI; a rational person should draw zero conclusions from them showcasing how productive they are with it.

I'd say it's more that the fear of their $50 billion+ investments not paying off is creeping up on them.


It’s OK to have this prior, but these are not speculative tools and capabilities; they exist today. If you remain unimpressed by them, that’s fine, but real people (not bots!) and real companies (we measure lots of stuff; I’ve seen the data at a large MAANG and have used their internal and external tools) get serious benefits _today_, and we still have about 4 more orders of magnitude to scale _existing_ paradigms, so the writing on the wall is obvious. It’s fine and reasonable to be skeptical, and there are so many serious societal risks and issues to worry about and champion, but if your position is akin to “this is all hype”, it makes absolutely no sense to me.

I'm sure you're interacting with a ton of tools built via agents; ironically, even in software engineering, people are trying to human-wash AI code due to anti-AI bias from people who should know better (if you think 100% of LLM outputs are "slop", with no quality consideration factored in, you're hopelessly biased). The "commercialized" requirement seems like an arbitrary and pointless bar; I've seen some hot garbage that's "commercialized" and some great code that's not.


> The "commercialized" requirement seems like an arbitrary and pointless bar

The point is that without mentioning specific software that readers know about, there isn’t really a way to evaluate a claim of 20x.


> I'm sure you're interacting with a ton of tools built via agents, ironically even in software engineering people are trying to human-wash AI code due to anti-AI bias

Please, just for fun, reach out to (for example) Klarna support via their website and tell me how much of your experience can be attributed to anti-AI bias and how much to the fact that LLMs are complete shit for any important production use case.


My man here is reaching out to Klarna support. This tells a LOT about his life decision-making skills, which clearly shine through in his comments on the topic of AI as well.


Klarna functions as a payment provider as well, not just a payday loan service (which you are implying I assume). This comment says more about you.


Who is saying anything about 20x? Sorry did I miss something here?


> work of an entire org that used to need twenty engineers.

From the OP. If you think that's too much then we agree.


You’ve never read Simon Willison’s blog? His repo is full of work that he’s created with LLMs. He makes money off of them. There are plenty of examples; you just need to look.


The paradigm shift hit the world like a wall. I know entire teams where the manager thinks AI is bullshit and the entire team is not allowed to use AI.

I love coding. But reality is reality and these fools just aren’t keeping pace with how fast the world is changing.


Or we're in another hype cycle and billions of dollars are being pumped in to sustain the current bubble with a lot of promises about how fast the world is changing. Doesn't mean AI can't be a useful tool.


When people say “hype cycle”, that can mean so many different things. That valuations are too high and many industry “promises” are wrong may be true, but to me it's irrelevant; this isn't speculative. I think most posters who are positive on agents in these threads are talking about two things: current, existing tools, and the existing rate of progress. Check out e.g. Epoch.ai for great industry analyses. Comparing AI to crypto is disingenuous; they are completely different, and crypto is a technology that fundamentally makes no sense in a world where governments want to (and arguably should) control the money supply. You may or may not agree with that take, but AI is something that governments will push aggressively and see as crucial to national security and control. It means this is not going away.


> I’ve done things with Claude I never thought possible for myself to do,

That's the point champ. They seem great to people when they apply them to some domain they are not competent in; that's because they cannot evaluate the issues. So you've never programmed but can now scaffold a React application and a basic backend in a couple of hours? Good for you, but for the love of god have someone more experienced check it before you push it into production. Once you apply them to any area where you have at least moderate competence, you will see all sorts of issues that you just cannot unsee. Security and performance are often issues, not to mention the quality of the code....


This is remarkably dismissive and comes across as arrogant. In reality they assist many people with expert skills in a domain in getting things done in areas they are competent in, without getting bogged down in tedium.

They need a heavy hand to police to make sure they do the right thing. Garbage in, garbage out.

The smarter the hand of the person driving them, the better the output. You see a problem, you correct it. Or make them correct it. The stronger the foundation they're starting from, the better the production.

It's basically the opposite of what you're asserting here.


> So you've never programmed but can now scaffold a React application and basic backend in a couple of hours?

Ahaha, weren’t you the guy who wrote an opus about planes? Is this your baseline for “stuff where LLMs break and real engineering comes into the room”? There’s a harsh wake up call for you around the corner.


What wake-up call, mate? I've been on board as an early adopter since the GH Copilot closed beta in 2021, back when you hadn't even heard of LLMs. I am just being realistic about the limits of the technology. In the 90s, we did not need to convince people about the Internet. It just worked. Also - what opus? Have the LLMs affected your attention span so much that you consider what a primary-school first-grader would typically read in their first class an "opus", no less? No wonder you are so easily impressed.


I expect it’s your “I’m an expert and everyone else is merely an idiot child” attitude that’s probably making it hard to take you seriously.

And don’t get me wrong - I totally understand this personality. There are a similar few I’ve worked with recently who are broadly quite skeptical of what seems to be an obvious fact to me - their roles will need to change and their skillsets will have to develop to take advantage of this new technology.


I am a bit tired of explaining, but I run my own company, so it's not like I have to fear my "roles and responsibilities" changing - I am designing them myself. I am also not a general skeptic of the "YAGNI" type - my company and I have been early adopters of many trends. Those that made sense, of course. We also tried to be early adopters of LLMs, all the way back in 2021. And I am sorry if that sounds arrogant to you, but anyone still working on them and with them looks to me like the folks who were trying to build computers and TVs with vacuum tubes. With the difference that vacuum-tube computers were actually useful at the time.


95% of companies fail. Yours will too, don't worry. Amazon themselves have already been using in-house versions of this to build AWS for over a year (https://kiro.dev/). You can either continue adopting AI in your company, or you can start filing your company's bankruptcy papers.


What would you need to see to change your mind? I can generate at mind-boggling scale. What’s your threshold for realizing you might not have explored every possible vector for AI capabilities?


> That's the point champ.

Friendly reminder that this style of discourse is not very welcome on HN: https://news.ycombinator.com/newsguidelines.html


What you wrote here was relevant about 9 months ago. It's now outdated. The pace and velocity of AI improvement can only be described as violent. It is so fast that there are many people like you who don't get it.


The last big release from OpenAI was a big giant billion-dollar flop. Its lackluster update was written about far and wide, even here on HN. But maybe you're living in an alternate reality?


I use Claude code.

My experience comes from the fact that after over a decade of working as a SWE, I no longer write code. It's not some alternate-reality thing or reading headlines. It's my daily life that has changed.


  > I no longer write code
do you review it before checking it in?


Have you used AI before? Agentic systems are set up so you get a diff before it even commits to a change. Sounds like you haven't really used AI agentically yet.


Yeah, sure buddy :)


Disrespect the trend line and get rolled over by the steamroller. The labs are cooking, and what is available commercially is lobotomized for safety and alignment. If your baseline for current max capability is Sonnet 4.5, released just this summer, you're going to be very surprised in the next few months.


Right, like I was steamrolled by the "Team of Pocket PhD Experts" announced earlier this year with ChatGPT 5? Remember that underwhelming experience? The Grok to which you could "paste your entire source code file"? The constantly degrading Claude models? Satya Nadella desperately dropping down to a PO role and bypassing his executives to try to micro-manage Copilot product development, because the O365 Copilot experience is getting MASSIVE pushback globally from teams and companies forced to use it? Or is there another steamrolling coming around? What is it this time? Zuckerberg implements 3D avatars in a metaverse, with legs, that can walk around and talk to us via LLMs? And then they sit down at virtual desks and type on virtual keyboards to produce software? Enlighten me please!


First examine your post. Can you create a 3D avatar with legs that can walk and talk?

If not, then in this area you've been steamrolled.

Anyway, the main point is: you're looking at the hype headlines, which are ludicrous. Where most optimists come from is that they are using it daily to code. To them it's right in front of their eyes.

I'm not sure what your experience is, but my opinion on AI doesn't come from speculation. It comes from on-the-ground experience of how AI has completely changed my job role. If I hold the technology fixed, assuming it never improves into the future, my point still stands. I'm not speculating. Most AI optimists aren't speculating.

The current on-the-ground performance is what's causing the divide. Some people have seen it fully; others have only had a rudimentary trial.


I have a hard time trusting the judgement of someone writing this:

> I no longer write code. I’ve been a swe for over a decade. AI writes all my code following my instructions. My code output is now expected to be 5x what it was before because we are now augmented by AI. All my coworkers use AI. We don’t use ChatGPT we use anthropic. If I didn’t use AI I would be fired for being too slow.

https://news.ycombinator.com/item?id=46175628


You should drop the prejudice and focus on being aware of the situation. This is happening all over the world; most people who have crossed this bridge just don't share it, the same way they don't share that they brushed their teeth this morning.


I think I'll keep defaulting to critical thinking rather than some kinda pseudo-religious "crossing the bridge" talk.


Just a metaphor - he used to code by hand, now he doesn't, but he still produces software. Keep religion out of this.


No one shrugs off 5x like brushing one's teeth in the morning. That makes no sense.


You're confusing critical thinking with having an axe to grind it seems. Bye.


People are sharing it. Look at this entire thread. It’s so conflicted.

We have half the thread saying it’s 5x and the other half saying they’re delusional and lack critical thinking.

I think it’s obvious who lacks critical thinking. If half the thread is saying on the ground AI has changed things and the other half just labels everyone as crazy without investigation… guess which one didn’t do any critical thinking?

Last week I built an app that cross-compiles to both Tauri and Electron and is essentially a Google Earth clone for farms. It uses Mapbox and deck.gl, and you can play back GPS tracks of tractor movements; the GPS traces change color as the tractor moves, in real time. There's pausing, seeking, bookmarking, skipping. All of it happens in real time because it's optimized to use shader code and uniforms for these updates rather than redrawing the layers. There's also color grading by GPS fix value and satellite count, which the user can switch to instantly with zero slowdown on tracks with thousands and thousands of points. It all interfaces with an API that scans GCP storage for GPS tracks and organizes them into a queryable API that ties into our Firebase-based authentication. The backend is deployed by Terraform, written in strictly typed TypeScript, and automatically deployed and checked by GHA. Of course the Electron and Tauri apps have GUI login interfaces that work fully correctly with the backend API, and it all looks professionally designed, like a movie player merged with Google Earth for farm orchards.

I have only a rudimentary understanding of many of the technologies involved in the above. But I was able to write that whole internal tool in less than a week thanks to AI. I couldn't have pulled it off without that rudimentary understanding - a novice SWE couldn't really have done it without the optimizations I used - but that's literally all I needed. I had never written shader code for prod in my life, and left to its own devices the AI would have come up with an implementation that's too laggy to work properly.

That's all that's needed. Some basic high-level understanding, AI did everything else, and now our company has an internal tool that is polished beyond anything that would have gotten the effort before AI.
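
(Illustrative only, not the actual tool's code: a minimal TypeScript sketch assuming deck.gl's TripsLayer, where playback time rides on the `currentTime` prop, which maps to a shader uniform, so per-frame updates don't rebuild geometry. The `Track` shape and `loadedTracks` are made up.)

    // Minimal sketch of uniform-driven GPS playback with deck.gl's TripsLayer.
    import {Deck} from '@deck.gl/core';
    import {TripsLayer} from '@deck.gl/geo-layers';

    type Track = {
      path: [number, number][]; // [lon, lat] per GPS fix
      timestamps: number[];     // seconds, aligned with `path`
    };

    const loadedTracks: Track[] = []; // hypothetical: filled from the backend API

    function trackLayer(currentTime: number) {
      return new TripsLayer({
        id: 'tractor-tracks',
        data: loadedTracks,
        getPath: (d: Track) => d.path,
        getTimestamps: (d: Track) => d.timestamps,
        getColor: [253, 128, 93], // static color; per-point grading needs custom accessors/shaders
        widthMinPixels: 4,
        trailLength: 600,         // seconds of trail kept visible
        currentTime               // the only prop that changes per animation frame
      });
    }

    const deck = new Deck({
      initialViewState: {longitude: -120.0, latitude: 36.0, zoom: 13},
      controller: true,
      layers: [trackLayer(0)]
    });

    let t = 0;
    (function tick() {
      t += 1 / 60;                               // advance the playback clock
      deck.setProps({layers: [trackLayer(t)]});  // cheap: only a uniform changes, buffers stay put
      requestAnimationFrame(tick);
    })();

The color grading by fix quality mentioned above would need custom accessors or shader tweaks on top of this; the sketch only covers the uniform-driven playback part.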

I’m willing to bet you didn’t use AI agents in a meaningful way. Maybe copying and pasting some snippets of code into a chatbot and not liking the output. And then you do it every couple of weeks to have your finger on the pulse of AI.

Go deeper. Build an app with AI. Hand-hold it into building something you never built before. It's essentially a pair-programming endeavor. I'm willing to bet you haven't done this. Go in with the goal of building something polished and don't automatically dismiss it when the AI does something stupid (it inevitably will). Doing this is what actual "critical thinking" is.


> I think it’s obvious who lacks critical thinking.

My critical thinking is sharp enough to recognize that you're the recently banned ninetyninenine user [0]. Just as unbalanced and quarrelsome as before I can see. It's probably better to draw some conclusion from a ban and adjust, or just leave.

[0] https://news.ycombinator.com/item?id=45988923


I’m not that guy lol.

Why don't you respond to my points rather than attack me?


> Why don’t you respond to my points

Because I believe you have a "flexible" relationship to the truth, so I'm not wasting any more time.


Like your BS accusations? Alright then. Good day to you, sir.


Explain to me why my judgement is flawed. What I’m saying is true.


Because, among other claims, "5x now or you're fired!" is completely ridiculous.


Bro, no one said "5x now or you're fired"; that's your own imagination adding flavor to it.

It's obvious to anyone that if your output is 5x lower than everyone else's, you will eventually be let go. There's no paradigm shift where the boss suddenly announces that. But the underlying, unsaid expectation is obvious given what everyone is doing.

What happened was this: a couple of new hires and some current employees started using AI. Their output was magnified, and they were not only producing more but also deploying code outside their areas of expertise - DevOps, infra, backend, and frontend.

This spread, and within months everyone in the company was doing it. The boss can now throw a frontend job at a backend developer and expect completion in a day or less. That isn't every task, but for the majority of tasks that kind of output is the norm.

If you’re not meeting that norm it’s blindingly obvious. The boss doesn’t need to announce anything when everyone is faster. There was no deliberate culture shift where the boss announced it. The closest equivalent is the boss hiring a 10x engineer to work alongside you and you have to scramble to catch up. The difference is now we know exactly what is making each engineer 10x and we can use that tool to also operate at that level.

Critical thinking my ass. You’re just labeling and assuming things with your premeditated subconscious bias. If anything it’s your perspective that is religious.


> they were deploying code outside their areas of expertise doing dev ops, infra, backend and frontend.

> The boss can now throw a frontend job to a backend developer and now expect completion in a day or less.

Right. So essentially vibe coding in unknown domains, sounds great. Truly professional.


Also, can you please stop stalking me and just respond to my points instead of digging through my whole profile and attempting character assassination based on what I wrote in the past? Thanks.


Whether you agree with it or not is beside the point. The point is it's happening.

Your initial stance was disbelief. Now you’re just looking down at it as unprofessional.

Bro, I fucking agree. It’s unprofessional. But the entire point initially was that you didn’t believe it and my objective was to tell you that this is what’s happening in reality. Scoff at it all you want, as AI improves less and less “professional” people will be able to enter our field and operate at the same level as us.


He won't be steamrolled. But he will eat his words.


meh. I'll believe it when I see it. We've been promised so many things in this space, over and over, that never seem to materialize.


I don't understand this idea that non-believers will be "steamrolled" by those who are currently adopting AI into their workflows. If their claims are validated and the new AI workflows end up achieving that claimed 10x productivity speedup, or even a 2x speedup, nobody is cursed to be steamrolled - they'll simply adopt those same workflows, same as everyone else. In the meantime they aren't wasting their time trying to figure out the best way to coax and beg the LLMs into better performance.


That's actually what I'm arguing for; use tools where they are applicable. I'm against blind contrarianism and the 'nothing ever happens' attitude since that IME is being proven more wrong each week.


Sure. Just hurry up bro, because Kurzweil is not getting any younger.


Right, the Singularity will be here any day now. We can all just sit back and collect our UBI while plugging into the Matrix. /s


Seems fine, works, is better than if you had me go off and write it on my own. You realize you can check the results? You can use Claude to help you understand the changes as you read through them? I just don't get this weird "it makes mistakes and it's horrible if you understand the domain it is generating over" - I mean, yes, definitely sometimes, and definitely not other times. What happens if I DON'T have someone more experienced to consult, or they ignore me because they're busy, or they're wrong because they're also imperfect and not focused? It's really hard to be convinced that this point of view is not just a knee-jerk reaction justified post hoc.


Yes, you can ask them "to check it for you". The only little problem is, as you said yourself, "they make mistakes"; therefore: YOU CANNOT TRUST THEM. Just because you tell them to "check it" does not mean they will get it right this time. Again, however "fine" it seems to you, please, please, please have a more senior person check that crap before you inflict serious damage somewhere.


Nope. You read its code, ask it to summarize changes to guide your reading, ask it why it made certain decisions you don't understand, and if you don't like its explanations you change the code (with the agent!). Own and be responsible for the code you commit. I am the "most senior", and at large tech companies that track it, higher-level ICs correspond to more AI usage - hmm, almost like it's a useful tool.


Ok but you understand that the fundamental nature of LLMs amplifies errors, right? A hallucination is, by definition, a series of tokens which is plausible enough to be indistinguishable from fact to the model. If you ask an LLM to explain its own hallucinations to you, it will gladly do so, and do it in a way that makes them seem utterly natural. If you ask an LLM to explain its motivations for having done something, it will extemporize whichever motivation feels the most plausible in the moment you're asking it.

LLMs can be handy, but they're not trustworthy. "Own and be responsible for the code you commit" is an impossible ideal to uphold if you never actually sit down and internalize the code in your code base. No "summaries," no "explanations."


So your argument is that if people don't use the tool correctly they might get incorrect results? How is that relevant? If you Google search for the wrong query you'll similarly get incorrect results


While LLMs do sometimes improve productivity, I flatly cannot believe a claim - at least without direct demonstration or evidence - that one person is doing the work of 20 with them in December 2025.

I mean from the off, people were claiming 10x probably mostly because it's a nice round number, but those claims quickly fell out of the mainstream as people realised it's just not that big a multiplier in practice in the real world.

I don't think we're seeing this in the market, anywhere. Something like 1 engineer doing the job of 20 - what you're talking about is basically whole departments at mid-sized companies compressing to one person. Think about that: it has implications for all the additional management staff on top of the 20 engineers too.

It'd either be a complete restructure and rethink of the way software orgs work, or we'd be seeing just incredible, crazy deltas in output of software companies this year of the type that couldn't be ignored, they'd be impossible to not notice.

This is just plainly not happening. Look, if it happens, it happens, 26, 27, 28 or 38. It'll be a cool and interesting new world if it does. But it's just... not happened or happening in 25.


I would say it varies from 0x to a modest 2x. It can help you write good code quickly, but I only spent about 20-30% of my time writing code anyway before AI. It definitely makes debugging and research tasks much easier as well. I would confidently say my job as a senior dev has gotten a lot easier and less stressful as a result of these tools.

One other thing I have seen, however, is the 0x case, where you have given too much control to the LLM, it codes both you and itself into Pan's Labyrinth, and you end up having to take a weed whacker to the whole project or start from scratch.


Ok, if you're a senior dev, have you 'caught' it yet?

Ask it a question about something you know well, and it'll give you garbage code that it's obviously copied from an answer on SO from 10 years ago.

When you ask it for research, it's still giving you garbage out of date information it copied from SO 10 years ago, you just don't know it's garbage.


That's why you don't use LLMs as a knowledge source without giving them tools.

"Agents use tools in a loop to achieve a goal."

If you don't give any tools, you get hallucinations and half-truths.

But give one a tool to do, say, web searches, and it's going to be a lot smarter. That's where 90% of the innovation with "AI" today is coming from. The raw models aren't getting that much smarter anymore, but the scaffolding and frameworks around them are.

Tools are the main reason Claude Code is as good as it is compared to the competition.
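
(A minimal TypeScript sketch of that "tools in a loop" pattern; `callModel` and the `web_search` tool are hypothetical placeholders, not any particular vendor's SDK.)

    // The scaffold asks the model what to do next, executes the requested tool,
    // feeds the result back, and repeats until the model produces a final answer
    // or hits a step limit.
    type ToolCall = { name: string; args: Record<string, unknown> };
    type ModelReply =
      | { kind: 'tool_call'; call: ToolCall }
      | { kind: 'final'; text: string };

    // Hypothetical model API: takes the transcript so far, returns the next action.
    declare function callModel(transcript: string[]): Promise<ModelReply>;

    const tools: Record<string, (args: Record<string, unknown>) => Promise<string>> = {
      // Hypothetical tool: in practice this wraps a real search or file-system API.
      web_search: async args => `results for: ${String(args.query)}`
    };

    async function runAgent(goal: string, maxSteps = 10): Promise<string> {
      const transcript = [`GOAL: ${goal}`];
      for (let step = 0; step < maxSteps; step++) {
        const reply = await callModel(transcript);
        if (reply.kind === 'final') return reply.text;            // model says it's done
        const tool = tools[reply.call.name];
        const result = tool
          ? await tool(reply.call.args)                           // run the requested tool
          : `unknown tool: ${reply.call.name}`;
        transcript.push(`TOOL ${reply.call.name} -> ${result}`);  // ground the next step
      }
      return 'step limit reached without a final answer';
    }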


  > The raw models aren't getting that much smarter anymore, but the scaffolding and frameworks around them are.
Yes, that is my understanding as well, though it gets me thinking: if that is true, then what real value is the LLM on the server compared to doing that locally + tools?


You still can't beat an acre of specialized compute with any kind of home hardware. That's pretty much the power of cloud LLMs.

For a tool use loop local models are getting to "OK" levels, when they get to "pretty good", most of my own stuff can run locally, basically just coordinating tool calls.


Of course, step one is always to think critically and evaluate for bad information. For research, I mainly use it for things that are testable/verifiable - for example, I used it for a tricky proxy chain setup. I did try to use it to learn a language a few months ago, which I think was counterproductive for the reasons you mentioned.


How can you critically assess something in a field you're not already an expert on?

That Python you just got might look good, but could be rewritten from 50 lines to 5, it's written in 2010-style, it's not using modern libraries, it's not using modern syntax.

And it is 50 to 5. That is the scale we're talking about in a good 75% of AI-produced code unless you challenge it constantly: not using modern syntax to reduce boilerplate, over-guarding against impossible states, ridiculous amounts of error handling. It is basically a junior dev on steroids.

Most of the time you have no idea that most of that code is totally unnecessary unless you're already an expert in that language AND libraries it's using. And you're rarely an expert in both or you wouldn't even be asking as it would have been quicker to write the code than even write the prompt for the AI.


I use web search (DDG) and I don't think I ever need more than one query in the vast majority of cases. Why? Because I know where the answer is; I'm using the search engine as an index to where I can find it. Like "csv python" to find that page in the docs.


It's entirely dependent on the type of code being written. For verbose, straightforward code with clear-cut test scenarios, one agent running 24/7 can easily do the work of 20 FT engineers. This is a best-case scenario.

Your productivity boost will depend entirely on a combination of how much you can remove yourself from the loop (basically, the cost of validation per turn) and how amenable the task/your code is to agents (which determines your P(success)).

Low P(success) isn't a problem if there's no engineer time cost to validation, the agent can just grind the problem out in the background, and obviously if P(success) is high the cost of validation isn't a big deal. The productivity killer is when P(success) is low and the cost of validation is high, these circumstances can push you into the red with agents very quickly.

Thus the key to agents being a force multiplier is to focus on reducing validation costs, increasing P(success) and developing intuition relating to when to back off on pulling the slot machine in favor of more research. This is assuming you're speccing out what you're building so the agent doesn't make poor architectural/algorithmic choices that hamstring you down the line.
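
(A toy way to state that tradeoff - all numbers below are made up for illustration: expected engineer-minutes per merged task is roughly spec time plus validation time divided by P(success), and the agent only wins when that total beats doing the task by hand.)

    // Toy model of the tradeoff above. Assumes each failed attempt still costs a full
    // validation pass and attempts are independent, so on average you validate
    // 1 / pSuccess times before one passes (geometric distribution).
    function expectedAgentMinutes(specMinutes: number, validationMinutes: number, pSuccess: number): number {
      return specMinutes + validationMinutes / pSuccess;
    }

    function speedup(manualMinutes: number, agentMinutes: number): number {
      return manualMinutes / agentMinutes;
    }

    // High P(success), cheap validation: a clear win.
    console.log(speedup(120, expectedAgentMinutes(10, 5, 0.8)));  // ~7.4x

    // Low P(success), expensive validation: the agent puts you in the red.
    console.log(speedup(120, expectedAgentMinutes(10, 60, 0.2))); // ~0.39x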


Respectfully, if I may offer constructive criticism, I’d hope this isn’t how you communicate to software developers, customers, prospects, or fellow entrepreneurs.

To be direct, this reads like a fluff comment written by AI with an emphasis on probability and metrics. P(that) || that.

I've written software used by everything from a local real estate company to the Mars Perseverance rover. AI is a phenomenally useful tool. But be wary of preposterous claims.


I'll take you at your word regarding "respectfully". That was an off-the-cuff attempt to explain the real levers that control the viability of agents under particular circumstances. The target market wasn't your average business potato but someone who might care about a hand-waved "order approximate" estimator, kind of like big-O notation, which is equally hand-wavy.

Given that, if you want to revisit your comment in a constructive way rather than doing an empty drive by, I'll read your words with an open mind.


> It's entirely dependent on the type of code being written. For verbose, straightforward code with clear cut test scenarios, one agent can easily 24/7 the work of 20 FT engineers. This is a best case scenario.

So the "verbose, straightforward code with clear cut test scenarios" is already written by a human?


>For verbose, straightforward code with clear cut test scenarios, one agent can easily 24/7 the work of 20 FT engineers

I have been working professionally in software development for ~16 years, and scenarios like this have been about 5% of my work.


> I mean from the off, people were claiming 10x probably mostly because it's a nice round number,

Purely anecdotal, but I've seen that level of productivity from the vibe tools we have in my workplace.

The main issue is that 1 engineer needs to have the skills of those 20 engineers so they can see where the vibe coding has gone wrong. Without that it falls apart.


Could be that speed/efficiency was the wrong dimension to optimize for, and it's leading the industry down a bad path.

An LLM helps most with surface area. It expands the breadth of possibilities a developer can operate on.


  > one person is doing the work of 20 with them in december 2025 at least
It reminds me of the OOP hype from the '90s, but maybe it will indeed eventually be true this time...?


My experience is that you get out what you put in. If you have a well-defined foundation, AI can populate the stubs and get it 95% correct. Getting to that point can take a bit of thought, and AI can help with that, too, but if you lean on it too much, you'll get a mess.

And of course, getting to the point where you can write a good foundation has always been the bulk of the work. I don't see that changing anytime soon.


This is completely wrong. Codex 5.2 and Claude Sonnet 4.5 don't have any of these issues. They will regularly tell you that you're wrong if you bother to ask them and they will explain why and what a better solution is. They don't make up anything. The code they produce is noticeably more efficient in LoC than previous models. And yes they really will do research, they will search the Internet for docs and articles as needed and cite their references inline with their answers.

You talk as if you haven't used an LLM since 2024. It's now almost 2026 and things have changed a lot.


With apologies, and not GP, but this has been the same feedback I've personally seen on every single model release.

Whenever I discuss the problems that my peers and I have using these things, it's always something along the lines of "but model X.Y solves all that!", so I obediently try again, waste a huge amount of time, and come back to the conclusion that these things aren't great at generation, but they are fantastic at summarization and classification.

When I use them for those tasks, they have real value. For creation? Not so much.

I've stopped getting excited about the "but model X.Y!!" thing. Maybe they are improving? I just personally haven't seen it.

But according to the AI hypers, just like with every other tech hype that's died over the past 30 years, "I must just be doing it wrong".


A lot of people are consistently getting their low expectations disproven when it comes to progress in AI tooling. If you read back in my comment history, six months ago I was posting about how AI is overhyped BS. But I kept using it, and eventually new releases of models and tools solved most of the problems I had with them. If it has not happened for you yet, then I expect it eventually will. Keep using the tools and models and follow their advancements, and I think you'll eventually get to the point where your needs are met.


The same response ("you are using model X instead of Y") has been perpetuated since 2024, and will still be perpetuated in 2026.


I'd be willing to give you access to the experiment I mentioned in a separate reply (I have a GitHub repo), to show the kind of output you can get for a complex app buildout.

I'll admit it's not great (probably not even good), but it definitely has throughput despite my absolute lack of caring that much [0]. Once I get past a certain stage I am thinking of doing an A/B test where I take an earlier commit and try again while paying more attention... (But I at least want to get to where there is a full suite of UOW cases before I do that, for comparison's sake.)

> Those twenty engineers must not have produced much.

I've been considered a 'very fast' engineer at most shops (e.g. at multiple shops, stories assigned to me would have a <1 multiplier for points [1]).

20 is a bit bloated, unless we are talking WITCH tier. I definitely can get done in 2-3 hours what would otherwise take me a day. I say it that way because at best it's 1-2 hours but other times it's longer; some folks remember the 'best' rather than the median.

[0] - It started as 'prompt only', although after a certain point I did start being more aggressive with personal edits.

[1] - IDK why they did it that way instead of capacity, OTOH that saved me when it came to being assigned Manual Testing stories...


> Will admit It's not great (probably not even good) but it definitely has throughput

Throughput without being good will just lead to more work down the line to correct the badness.

It's like losing money on every sale but making up for it with volume.


> Will admit It's not great (probably not even good)

You lost me here. Come back when you're proud of it.


Ok, let's say the 20 devs claim is false [1]. What if it's 2? I'd still learn and use the tech. Wouldn't you?

[1] I actually think it might be true for certain kinds of jobs.


It's not 20 and it's not 2. It's not a person. It's a tool. It can make a person 100x more effective at certain specific things. It can make them 50% less effective at other things. I think, for most people and most things, it might be like a 25% performance boost, amortized over all (impactful) projects and time, but nobody can hope to quantify that with any degree of credibility yet.


  > but nobody can hope to quantify that with any degree of credibility yet
I'd like to think that if it was really good, we would see product quality improve over time; in other words, fewer reported bugs, fewer support incidents, increased sign-ups, etc. That could easily be quantified, no?


Jevons paradox: more software will be produced, rather than fewer software engineers being employed.


Post model


> I’m basically just the conductor of all those processes.

Orchestrating harmony is no mean feat.


AI is absolutely rock-bottom shit at all that.


Yeah, it makes me wonder whether I should start learning to be a carpenter or something. Those who either support AI or think "it's all bullshit" cite a lack of evidence for humans truly being replaced in the engineering process, but that's just the thing; the unprecedented levels of uncertainty make it very difficult to invest oneself in the present, intellectually and emotionally. With the current state of things, I don't think it's silly to wonder "what's the point" if another 5 years of this trajectory is going to mean not getting hired as a software dev again unless you have a PhD and want to work for an AI company.

What doesn't help is that the current state of AI adoption is heavily top-down. What I mean is the buy-in is coming from the leadership class and the shareholder class, both of whom have the incentive to remove the necessary evil of human beings from their processes. Ironically, these classes are perhaps the least qualified to decide whether generative AI can replace swathes of their workforce without serious unforeseen consequences. To make matters worse, those consequences might be as distal as too many NEETs in the system such that no one can afford to buy their crap anymore; good luck getting anyone focused on making it to the next financial quarter to give a shit about that. And that's really all that matters at the end of the day; what leadership believes, whether or not they are in touch with reality.


His logic is off, and his experience is irrelevant because it doesn't encompass the scale needed to have been exposed to an actual paradigm-shifting event. Civilizations and entire technologies have been overturned, so he can't say it won't happen this time.

What we do know is this: if AI keeps improving at its current rate, it will eventually hit a point where we don't need software engineers. That's inevitable. The only way for it not to happen is for this technology to hit an impenetrable wall.

This wave of AI came so fast that there are still stubborn people who think it’s a stochastic parrot. They missed the boat.


It's strange that the article says the white-collar worker in NYC and the small business owner in suburban Texas are not the same market. To many businesses they are in the same market; McDonald's, Home Depot, etc. don't make different products for those two individuals.


Author here. I think this thread is mixing two very different kinds of markets, so let me clarify the scope of the argument.

I agree with the point that markets are often defined by legal and operational systems — how contracts work, how labor is regulated, how payments and compliance function. That’s exactly why country or jurisdiction boundaries sometimes matter a lot.

Where I think we’re talking past each other is the Home Depot / McDonald’s examples.

Those are low-involvement, highly standardized, commodity-style businesses. Their products, pricing logic, and purchasing situations are intentionally broad. In that world, a white-collar worker in NYC and a small business owner in suburban Texas can absolutely be treated as “the same market” for many decisions, because the offer is designed to ignore sharp differences.

The article isn’t arguing against that. It’s explicitly about sharper products — especially startups, B2B tools, workflow software, education, compliance-heavy or behavior-changing products — where the purchasing situation narrows quickly.

In those cases, what matters isn’t whether two people can physically buy the same thing, but whether the same offer survives the same constraints and produces the same outcome. Authority to buy, risk tolerance, institutional expectations, and default alternatives diverge much faster there, even within the same legal system.

So yes, commodity retail is a valid counterexample — but it’s also a special case. The failure pattern the article is pointing at shows up when teams implicitly assume their product behaves like a Big Mac or a box of nails, when in reality it behaves more like a change in how work, learning, or decision-making happens.

That mismatch is where "same country = same market" becomes dangerous.


Home Depot doesn't have locations in Manhattan - I don't even need to check that made-up fact to believe it. The market in Manhattan cannot support Home Depot as it operates in the Texas suburbs. Even if they do happen to have a store there, it would have to be different.


Maybe check the fact because I've gone to Home Depot in Manhattan before myself


Just try Gemini Live on your phone. That's state of the art


I'm just enjoying the last few years of this career. Let me have fun!

Joking aside, we have to understand that this is the way software is being created and this tool is going to be the tool most trivial software (which most of us make) will be created with.

I feel like the industry is telling me: adopt or become irrelevant.


I already miss the fun heads down days of unraveling complex bugs.

Now I'm just telling AI what to do.


Actually kind of worse: adopt and become irrelevant.


Meh, I am also old enough to have experienced what the GP post mentioned, and I remember that when Visual Basic 6 was released, a similar sentiment appeared:

Suddenly, every 13-year-old cousin could implement apps for their uncle's dental office, laboratory, parts-shop billing, tourism office management, etc. Some people also believed that software developers would become irrelevant in a couple of years.

For me, as an old programmer, I am having A BLAST using these tools. I have used enough tools (TurboBasic, Rational Rose (model-based development, ha!), NetBeans, Eclipse, VB6, Borland C++ Builder) to be able to identify their limits and work with them.


That's great! I am also having a blast, and trying hard to take advantage of the new capability while not turning into the programmer equivalent of the passengers of the BNL Axiom. We aren't the intended audience for this post.


Since they are not showing how this model compares against its competitors on the benchmarks they cite, here is a quick view with the public numbers from Google and Anthropic. At least this gives some context:

    SWE-Bench (Pro / Verified)

    Model               | Pro (%) | Verified (%)
    --------------------+---------+--------------
    GPT-5.2-Codex       | 56.4    | ~80
    GPT-5.2             | 55.6    | ~80
    Claude Opus 4.5     | n/a     | ~80.9
    Gemini 3 Pro        | n/a     | ~76.2

And for terminal workflows, where agentic steps matter:

    Terminal-Bench 2.0

    Model               | Score (%)
    --------------------+-----------
    Claude Opus 4.5     | ~60+
    Gemini 3 Pro        | ~54
    GPT-5.2-Codex       | ~47

So yes, GPT-5.2-Codex is good, but when you put it next to its real competitors:

- Claude is still ahead on strict coding + terminal-style tasks

- Gemini is better for huge context + multimodal reasoning

- GPT-5.2-Codex is strong but not clearly the new state of the art across the board

It feels a bit odd that the page only shows internal numbers instead of placing them next to the other leaders.


Where are you getting SWE-Bench Verified scores for 5.2-Codex? AFAIK those have not been published.

And I don't think your Terminal-Bench 2.0 scores are accurate. Per the latest benchmarks: Opus 4.5 is at 59%, GPT-5.2-Codex is at 64%.

See the charts at the bottom of https://marginlab.ai/blog/swe-bench-deep-dive/ and https://marginlab.ai/blog/terminal-bench-deep-dive/


I like Opus 4.5 a lot, but a general comment on benchmarks: the number of subtasks or problems in each one is finite, and many of the benchmarks are saturating, so the effective number of problems at the frontier is even smaller. If you think of the generalizable capability of the model as a latent feature to be measured by benchmarks, we therefore have only rather noisy estimates. People read too much into small differences in numbers. It's best to aggregate across many, Epoch has their Capabilities Index, and Artificial Analysis is doing something similar, and probably others I don't know or remember.

And then there's the part of models that is hard to measure. Opus has some sort of HAL-like smoothness I don't see in other models, but meanwhile, I haven't tried GPT-5.2 for coding yet. (Nor Gemini 3 Pro; I'm not claiming superiority of Opus, just that something in practical usability is hard to measure.)


I'm finding that the newer GPT models are much more willing to leverage tools/skills than Claude, reducing interventions requesting approval. Just an observation.


Ahhh, there it is.

My rule of thumb with OpenAI is, if they don’t publish their benchmarks beside Anthropic’s numbers it’s because they’re still not caught up.

So far my rule of thumb has held true.

