> Screen size makes little difference for an individual they can just sit closer
This is silly. Most people don’t want to sit in a chair 3 feet from their TV to make it fill more of their visual area. A large number of people are also not watching movies individually. I watch TV with my family far more than I watch alone.
Tell that to everyone streaming on their tablets sitting on their stomachs. People even watch movies on their phones but they aren’t holding them 15’ away.
No one says the experience of watching on their tablet matches the experience of watching a movie in the theater.
But this isn’t the point. TVs are furniture. People generally have a spot where the TV naturally fits in the room regardless of its size. No one buys a TV and then arranges the rest of their furniture to sit close enough to fill their visual space. If the couch is 8 feet from the TV, it’s 8 feet from the TV.
People watching their tablet on a couch in front of a 55+” TV with a surround sound speaker system are saying on some level it’s a better experience. I’ve seen plenty of people do this, enough to say it’s common behavior.
> No one buys a TV and then arranges the rest of their furniture to sit close enough to fill their visual space. If the couch is 8 feet from the TV, it’s 8 feet from the TV.
It’s common in open floor plans / large rooms for a couch to end up at a completely arbitrary distance from a TV rather than next to a wall. Further, setting up the TV on the width vs length vs diagonal of a room commonly provides two or more options for viewing distance.
> People watching their tablet on a couch in front of a 55+” TV with a surround sound speaker system are saying on some level it’s a better experience.
It’s a more private/personal experience. Turning on the TV means everyone watches.
> It’s common in open floor plans / large rooms for a couch to end up at a completely arbitrary distance from a TV rather than next to a wall. Further, setting up the TV on the width vs length vs diagonal of a room commonly provides two or more options for viewing distance.
You’re essentially arguing that people can arrange their furniture for the best viewing experience. Which is true, but also not what people actually do.
The set of people willing to arrange their furniture for the best movie watching experience in their home are the least likely to buy a small TV.
People still do this while home alone, you’re attacking a straw man.
> least likely to buy a small TV.
People can only buy what actually exists. My point was that large TVs “have been out for decades they really aren’t a replacement”: people who owned them still went to the movies.
> People still do this while home alone, you’re attacking a straw man.
Maybe? You’re making blind assertions with no data. I have no idea how frequently the average person sits in front of their 60” TV by themselves and watches a movie on their tablet. My guess is not very often but again, I have no data on this.
> My point was that large TVs “have been out for decades they really aren’t a replacement”: people who owned them still went to the movies.
And we come back to the beginning where your assertion is true but also misleading.
Most people have a large TV in their homes today. Most people did not have this two decades ago, despite them being available.
The stats agree. TV sizes have grown significantly.
> Maybe? You’re making blind assertions with no data.
I’ve seen or talked to more than five people doing it (i.e. called them, showed up at their house, etc.), and even more people mentioned doing the same when I asked. That’s plenty of examples to say it’s fairly common behavior even if I can’t give you exact percentages.
Convenience vs using the TV remote was mentioned, but if it’s not worth using the remote it’s definitely not worth going to the movies.
I do. I’ve researched the optimal distance for a smallish TV screen (which fits between the studio monitor stands). I move the TV closer when watching a film; it stands on a hacked-together wooden-box-like thing with wheels that holds some yoga tools and film magazines. Crazy stuff.
There is a flipchart-like drawing of my daughter that normally covers the TV, which we flip when watching films.
Living rooms are not that big to start with. I don't think you actually asked anyone's opinion on this! :D
Small TVs are not comfortable to watch. No one I know is okay with getting a smaller TV and moving their sofa closer. That sounds ridiculous. If there's any comfort to this capitalistic economy, it is the availability of technology at throwaway prices. Most people would rather spend on a TV than save the money.
As for the theatre being obsolete, I do agree with you, at least to some extent. I think everyone is right here. All factors combined is what makes going to the theatre not worth the effort for most movies. It's just another nice thing, not what it used to be.
Also, there's the generational difference. I think teens and adolescents have a lot of ways to entertain themselves. The craze for movies isn't the same as it used to be. And we grew old(er). With age, I've grown to be very picky with movies.
HBO is expensive and most people don't have it. Ergo most people never see or hear about their lower quality content. Only the good stuff that their rich friends rave about.
You not recognizing their shows doesn't mean they are bad. I've seen most of those and the overwhelming majority are at least solid. I understand Netflix's business model, I'm just annoyed that they're buying HBO because they will likely make it worse. Maybe Netflix wants more prestige content and will let HBO be HBO, but I doubt it.
Yeah, until Netflix adds tiered pricing for content and you end up paying more than what Netflix + HBO Max together would have cost, because Netflix is the only game in town for that content.
I think like all media consolidation this will send a lot of people back to the seven seas.
Honest question: given all the companies and people working on anti-cheat systems for the last 20+ years of multiplayer video games, don't you think it would all be server-side if it could be, by now?
No, game companies are simply unwilling to pay for the talent and man hours that it takes to police their games for cheaters. Even when they are scanning your memory and filesystem they don't catch people running the latest rented cheat software.
Cheating is a social problem, not a technical issue. Just give the community the option of dedicated servers (remember how back in the day games used to ship with dedicated server binaries?) and the community can police for free! Wow!
Yes, I would also prefer that servers were community run as in the hl2 days.
I would still argue that there are technical issues leading to some amount of cheating. In extraction shooters like Hunt: Showdown, Escape From Tarkov and a few others, people run PCIe devices that rip player location and other information from the machine's memory in order to inject it into an overlay with a second computer, and they do go to these lengths to cheat, giving them a huge advantage. It wouldn't be possible to rip that info from memory for these "ESP cheats" if the server didn't needlessly transmit position information for players that aren't actually visible. IMO this is a technical failure. There are other steps that could be taken as well, which just aren't because they're hard.
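To make the "don't transmit what isn't visible" point concrete, here's a toy sketch of server-side interest management. All names, thresholds and the crude line-of-sight test are made up for illustration, not taken from any real engine:

```python
# Toy server-side interest management: only replicate an enemy's position to a
# client if a distance and a (very crude) line-of-sight check pass. Anything
# that fails the checks is simply never sent, so a memory-reading ESP cheat
# has nothing to display. All numbers and names are illustrative.
import math

VISIBILITY_RANGE = 150.0  # hypothetical max replication distance

def has_line_of_sight(a, b, occluders):
    """Blocked if any occluder point sits near the segment between a and b."""
    ax, ay = a
    bx, by = b
    for ox, oy in occluders:
        seg_len2 = (bx - ax) ** 2 + (by - ay) ** 2
        if seg_len2 == 0:
            continue
        # project occluder onto the segment, clamp to its ends
        t = max(0.0, min(1.0, ((ox - ax) * (bx - ax) + (oy - ay) * (by - ay)) / seg_len2))
        px, py = ax + t * (bx - ax), ay + t * (by - ay)
        if math.hypot(ox - px, oy - py) < 2.0:  # occluder "radius"
            return False
    return True

def snapshot_for(client, players, occluders):
    """Build the state update sent to one client: only in-range, visible enemies."""
    cx, cy = players[client]
    update = {}
    for name, (x, y) in players.items():
        if name == client:
            continue
        if math.hypot(x - cx, y - cy) > VISIBILITY_RANGE:
            continue
        if not has_line_of_sight((cx, cy), (x, y), occluders):
            continue
        update[name] = (x, y)
    return update

players = {"alice": (0.0, 0.0), "bob": (50.0, 10.0), "carol": (400.0, 0.0)}
walls = [(25.0, 5.0)]
print(snapshot_for("alice", players, walls))  # bob filtered by the wall, carol by range
```

The real-world versions (occlusion culling against actual level geometry, plus "fog of war" style grace windows so peeking doesn't lag) are much harder, which is presumably why many games skip it.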
Yes, because players want to spend time moderating other players instead of playing the game. Sounds fun!
Community servers literally invented anti-cheat. All current big name anti-cheats started as anti-cheats for community servers. And admins would choose to use them. Game developers would see that and integrate it. Quake 3 Arena even added PunkBuster in a patch.
Modern community servers like FiveM for GTAV, or FACEIT and ESEA for CS2, have more anti-cheats, not less.
No, because most companies will make decisions based on time/effort/profitability, and because client-side anticheat is stupid simple and cheap, that's what they go with. Why waste their own server resources, when they can waste the user's?
So it is the company prioritising their bottom line at the expense of their customer's computers. More simply, they move cost from their balance sheet and convert it into risk on the customer's end.
Which is actively customer-abusive behavior, and customers should treat it with the contempt it deserves. The fact that customers don't is what enables such abuse.
This is such a weird take. In an online multiplayer game the cheaters are the risk to the company's bottom line.
If a game is rampant with cheaters, honest paying players stop showing up, and less new players sign up. The relatively small percentage of cheaters cost the company tons of sales and revenue.
It is actively in a company's best interest to do everything they possibly can to prevent cheating, so the idea that intentionally building sub-par anti-cheat is about "prioritising their bottom line" seems totally absurd to me.
Not to mention these abstract "the company" positions completely ignore all the passionate people who actually make video games, and how much most of them care about fair play and providing a good experience to their customers. No one hates cheaters more than game developers.
> because most companies will make decisions based on time/effort/profitability, and because client-side anticheat is stupid simple and cheap, that's what they go with. Why waste their own server resources, when they can waste the user's?
And my comment was a response to that statement. In context of that statement, companies are indeed choosing to prioritise their commercial interests in a way that increases the risk to the computers of their customers.
> Not to mention these abstract "the company" positions completely ignore all the passionate people who actually make video games
Irrelevant. Companies and their employees are two different distinct entities and a statement made about one does not automatically implicate the other. Claiming, for example, that Ubisoft enables a consistent culture of sexual harassment does not mean random employees of that company are automatically labeled as harassers.
Coming to anti-cheat, go ahead and fight them all you want. That's not a problem. Demanding the right to introduce a security backdoor into your customer's machines in order to do that, is the problem.
> pretty much entirely just generalizations of their own experience, but phrased as if they're objective truth
I mean you're describing 90% of blog and forum posts on the Internet here.
This (IMO - so it's not ironic) is the biggest leap most people need to make to become more self-aware and to better parse the world around them: recognizing there is rarely an objective truth in most matters, and accepting the idea that "my truth is not your truth, both can be different yet equally valid" (again in most cases, not all cases).
I think my issue is that the blog post comes across to me as, in essence, an argument that the person communicating shouldn't be dissuaded by potential reactions to what they say, but it fails to account for the difference between good-faith and bad-faith reactions. There's a huge difference between a bad-faith misinterpretation and a good-faith misunderstanding in my opinion, as the latter seems to come just as often from a failure on the part of the communicator to be clear as from any fault on the listener. It's hard for me not to get the impression that the author either can't or doesn't see the value in differentiating between those cases, given that there's such significant room for improvement in clarifying their views in their paragraph about remote work, which is why I called it out.
A question I don't see addressed in all these articles: what prevents Nvidia from doing the same thing and iterating on their more general-purpose GPU towards a more focused TPU-like chip as well, if that turns out to be what the market really wants?
The big difference is that Google is both the chip designer *and* the AI company. So they get both sets of profits.
Both Google and Nvidia contract TSMC for chips. Then Nvidia sells them at a huge profit. Then OpenAI (for example) buys them at that inflated rate and them puts them into production.
So while Nvidia is "selling shovels", Google is making their own shovels and has their own mines.
On top of that, Google is also a cloud infrastructure provider - contrary to OpenAI, which needs someone like Azure to plug in those GPUs and host the servers.
The own shovels for own mines strategy has a hidden downside: isolation. NVIDIA sells shovels to everyone - OpenAI, Meta, xAI, Microsoft - and gets feedback from the entire market. They see where the industry is heading faster than Google, which is stewing in its own juices. While Google optimizes TPUs for current Google tasks (Gemini, Search), NVIDIA optimizes GPUs for all possible future tasks. In an era of rapid change, the market's hive mind usually beats closed vertical integration.
Selling shovels may still turn out to be the right move: Nvidia got rich off the cryptocurrency bubble, now they're getting even richer off the AI bubble.
Having your own mines only pays off if you actually do strike gold. So far AI undercuts Google's profitable search ads, and loses money for OpenAI.
So when the bubble pops the companies making the shovels (TSMC, NVIDIA) might still have the money they got for their products and some of the ex-AI companies might least be able to sell standard compliant GPUs on the wider market.
And Google will end up with lots of useless super specialized custom hardware.
It seems unlikely that large matrix multipliers will become useless. If nothing else, Google uses AI extensively internally. It already did in ways that weren’t user-visible long before the current AI boom. Also, they can still put AI overviews on search pages regardless of what the stock market does. They’re not as bad as they used to be, and I expect they’ll improve.
Even if TPUs weren’t all that useful, they still own the data centers and can upgrade equipment, or not. They paid for the hardware out of their large pile of cash, so it’s not debt overhang.
Another issue is loss of revenue. Google cloud revenue is currently 15% of their total, so still not that much. The stock market is counting on it continuing to increase, though.
If the stock market crashes, Google’s stock price will go down too, and that could be a very good time to buy, much like it was in 2008. There’s been a spectacular increase since then, the best investment I ever made. (Repeating that is unlikely, though.)
How could Google's custom hardware become useless? They've used it for their business for years now and will do so for years into the future. It's not like their hardware is LLM specific. Google cannot lose with their vast infrastructure.
Meanwhile OpenAI et al dumping GPUs while everyone else is doing the same will get pennies on the dollar. It's exactly the opposite to what you describe.
I hope that comes to pass, because I'll be ready to scoop up cheap GPUs and servers.
Same way cloud hardware always risks becoming useless. The newer hardware is so much better you can't afford to not upgrade, e.g. an algorithmic improvement that can be run on CUDA devices but not on existing TPUs, which changes the economics of AI.
> And Google will end up with lots of useless super specialized custom hardware.
If it gets to the point where this hardware is useless (I doubt it), yes Google will have it sitting there. But it will have cost Google less to build that hardware than any of the companies who built on Nvidia.
Right, and the inevitable bubble pop will just slow things down for a few years - it's not like those TPUs will suddenly be useless, Google will still have them deployed, it's just that instead of upgrading to a newer TPU they'll stay with the older ones longer. It seems like Google will experience much less repercussions when the bubble pops compared to Nvidia, OpenAI, Anthropic, Oracle etc. as they're largely staying out of the money circles between those companies.
I think people are confusing the bubble popping with AI being over. When the dot-com bubble popped, it's not like internet infrastructure immediately became useless and worthless.
that's actually not all that true... a lot of fiber that had been laid went dark, or was never lit, and was hoarded by telecoms in an intentionally supply-constrained market in order to drive up the usage cost of what was lit.
If it was hoarded by anyone, then by definition not useless OR worthless. Also, you are currently on the internet if you're reading this, so the point kinda stands.
Google uses TPUs for its internal AI work (training Gemini for example), which surely isn't decreasing in demand or usage as their portfolio and product footprint increases. So I have a feeling they'd be able to put those TPUs to good use?
Deepmind gets to work directly with the TPU team to make custom modifications and designs specifically for deepmind projects. They get to make pickaxes that are made exactly for the mine they are working.
Everyone using Nvidia hardware has a lot of overlap in requirements, but they also all have enough architectural differences that they won't be able to match Google.
OpenAI announced they will be designing their own chips, exactly for this reason, but that also becomes another extremely capital intensive investment for them.
This also doesn't get into the fact that Google already has S-tier datacenters and datacenter construction/management capabilities.
Isn’t there a suspicion that OpenAI buying custom chips from another Sam Altman venture is just graft? Wasn’t that one of the things that came up when the board tried to oust him?
> Deepmind gets to work directly with the TPU team to make custom modifications
You don't think Nvidia has field-service engineers and applications engineers with their big customers? Come on man. There is quite a bit of dialogue between the big players and the chipmaker.
They do, but they need to appease a dozen different teams from a dozen different labs, forcing nvidia to take general approaches and/or dictating approaches and pigeonholing labs into using those methods.
Deepmind can do whatever they want, and get the exact hardware to match it. It's a massive advantage when you can discover a bespoke way of running a filter, and you can get a hardware implementation of it without having to share that with any third parties. If OpenAI takes a new find to Nvidia, everyone else using Nvidia chips gets it too.
This ignores the way it often works: Customer comes to NVDA with a problem and NVDA comes up with a solution. This solution now adds value for every customer.
In your example, if OpenAI makes a massive new find they aren't taking it to NVDA.
Nvidia has the advantage of a broad base of customers that gives it a lot of information on what needs work and it tries to quickly respond to those deficiencies.
Nvidia doesn't have the software stack to do a TPU.
They could make a systolic array TPU and software, perhaps. But it would mean abandoning 18 years of CUDA.
The top post right now is talking about the TPU's colossal advantage in scaling & throughput. Ironwood is massively bigger & faster than what Nvidia is shooting for, already. And that's a huge advantage. But IMO that is a replicable win. Throw gobs more at networking and scaling and Nvidia could do similar with their architecture.
The architectural win of what TPU is more interesting. Google sort of has a working super powerful Connection Machine CM-1. The systolic array is a lot of (semi-)independent machines that communicate with nearby chips. There's incredible work going on to figure out how to map problems onto these arrays.
Whereas on a GPU, main memory is used to transfer intermediary results. It doesn't really matter who picks up work; there's lots of worklets with equal access time to that bit of main memory. The actual situation is a little more nuanced (even in consumer GPUs there's really multiple different main memories, which creates some locality), but there's much less need for data locality on the GPU, while the TPU has much, much tighter needs: the whole premise of the TPU is to exploit data locality, because sending data to a neighbor is cheap, while storing and retrieving data from memory is slower and much more energy intensive.
CUDA takes advantage of, and relies strongly on, the GPU's main memory being (somewhat) globally accessible. There's plenty of workloads folks do in CUDA that would never work on a TPU, on these much more specialized data-passing systolic arrays. That's why TPUs are so amazing: they are much more constrained devices that require much more careful workload planning to get the work to flow across the 2D array of the chip.
Google's work on projects like XLA and IREE is a wonderful & glorious general pursuit of how to map these big crazy machine learning pipelines down onto specific hardware. Nvidia could make their own or join forces here. And perhaps they will. But the CUDA moat would have to be left behind.
But it's still something grafted onto the existing architecture, of many grids with many blocks with many warps, and lots and lots of coordination and passing intermediary results around. It's only a 4x4x4 unit, afaik. There's still a lot of main memory being used to combine data, a lot of orchestration among the different warps and blocks and grids, to get big matrices crunched.
The systolic array is designed to allow much more fire-and-forget operations. Its inputs are 128 x 128 and each cell is its own compute node, basically, shuffling data through and across (but not transiting a far-off memory).
TPU architecture has plenty of limitations. It's not great at everything. But if you can design work to flow from cell to neighboring cell, you can crunch very sizable chunks of data with amazing data locality. The efficiency there is unparalleled.
Nvidia would need a radical change of their architecture to get anything like the massive data locality wins a systolic array can do. It would come with massively more constraints too.
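For anyone who hasn't seen one, here's a toy simulation of an output-stationary systolic array doing a matmul where every cell only ever talks to its left/top neighbour. Purely illustrative: real TPUs pipeline a 128x128 (or larger) array and do far more than this sketch shows.

```python
# Toy simulation of an output-stationary systolic array: each cell accumulates
# one multiply-add per cycle using only values received from its left and top
# neighbours, so no intermediate result ever touches "main memory".
import numpy as np

def systolic_matmul(A, B):
    n = A.shape[0]                      # assume square n x n for simplicity
    acc = np.zeros((n, n))              # per-cell accumulators (output-stationary)
    a_reg = np.zeros((n, n))            # value each cell holds from its left neighbour
    b_reg = np.zeros((n, n))            # value each cell holds from its top neighbour

    # Enough cycles for the skewed wavefront to cross the whole array.
    for cycle in range(3 * n - 2):
        # Shift values one cell right / down (neighbour-only communication).
        new_a = np.zeros_like(a_reg)
        new_a[:, 1:] = a_reg[:, :-1]
        new_b = np.zeros_like(b_reg)
        new_b[1:, :] = b_reg[:-1, :]
        # Feed the skewed edges: element k of row i enters at cycle i + k,
        # element k of column j enters at cycle j + k, so A[i,k] and B[k,j]
        # meet at cell (i, j) on the same cycle.
        for i in range(n):
            k = cycle - i
            new_a[i, 0] = A[i, k] if 0 <= k < n else 0.0
            new_b[0, i] = B[k, i] if 0 <= k < n else 0.0
        a_reg, b_reg = new_a, new_b
        # Every cell performs one local multiply-accumulate.
        acc += a_reg * b_reg
    return acc

A = np.arange(9, dtype=float).reshape(3, 3)
B = np.arange(9, 18, dtype=float).reshape(3, 3)
assert np.allclose(systolic_matmul(A, B), A @ B)
```

The point of the sketch is the data movement pattern: operands march through the grid, results stay put, and memory is only touched at the edges.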
It's not that the TPU is better than an NVidia GPU, it's just that it's cheaper since it doesn't have a fat NVidia markup applied, and is also better vertically integrated since it was designed/specified by Google for Google.
TPUs are also cheaper because GPUs need to be more general purpose whereas TPUs are designed with a focus on LLM workloads meaning there's not wasted silicon. Nothing's there that doesn't need to be there. The potential downside would be if a significantly different architecture arises that would be difficult for TPUs to handle and easier for GPUs (given their more general purpose). But even then Google could probably pivot fairly quickly to a different TPU design.
The T in TPU stands for tensor, which in this context is just a fancy matrix. These days both are optimised for matrix algebra, i.e. general ML workloads, not just LLMs.
If LLMs become unfashionable they’ll still be good for other ML tasks like image recognition.
Nothing in principle.
But Huang probably doesn't believe in hyper specializing their chips at this stage because it's unlikely that the compute demands of 2035 are something we can predict today.
For a counterpoint, Jim Keller took Tenstorrent in the opposite direction. Their chips are also very efficient, but even more general purpose than NVIDIA chips.
How is Tenstorrent h/w more general purpose than NVIDIA chips? TT hardware is only good for matmuls and some elementwise operations, and plain sucks for anything else. Their software is abysmal.
Of course there's the general-purpose RISC-V CPU controller component, but also each NPU is designed in troikas: one core reading data in, one core performing the actual kernel work, and a third core forwarding data out.
For users buying H200s for AI workloads, the "ASIC" tensor cores deliver the overwhelming bulk of performance. So they already do this, and have been since Volta in 2017.
To put it into perspective, the tensor cores deliver about 2,000 TFLOPs of FP8, and half that for FP16, and this is all tensor FMA/MAC (comprising the bulk of compute for AI workloads). The CUDA cores -- the rest of the GPU -- deliver more in the 70 TFLOP range.
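Using those ballpark figures, the tensor cores' share of peak throughput works out like this (rough arithmetic on the numbers quoted above, nothing more):

```python
# Rough share of an H200's peak FLOPs delivered by the tensor cores, using the
# ballpark figures quoted above (FP8 tensor throughput vs. CUDA-core throughput).
tensor_fp8_tflops = 2000
cuda_core_tflops = 70

share = tensor_fp8_tflops / (tensor_fp8_tflops + cuda_core_tflops)
print(f"tensor cores: {share:.1%} of peak FLOPs")   # ~96.6%
```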
So if data centres are buying nvidia hardware for AI, they already are buying focused TPU chips that almost incidentally have some other hardware that can do some other stuff.
I mean, GPUs still have a lot of non-tensor general uses in the sciences, finance, etc, and TPUs don't touch that, but yes a lot of nvidia GPUs are being sold as a focused TPU-like chip.
Is it the CUDA cores that run the vertex/fragment/etc. shaders in normal GPUs? Where do the ray tracing units fit in? How much of a modern Nvidia GPU is general purpose vs specialized to graphics pipelines?
Except the native width of Tensor Cores are about 8-32 (depending on scalar type), whereas the width of TPUs is up to 256. The difference in scale is massive.
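Putting the sizes mentioned in this thread side by side (the 4x4x4 MMA unit from above vs a 128x128 systolic array); this is per-unit scale only, since a GPU ships hundreds of tensor cores, so it's not a whole-chip comparison:

```python
# Back-of-the-envelope scale comparison using the sizes mentioned in the thread.
tensor_core_macs = 4 * 4 * 4        # multiply-accumulates per 4x4x4 MMA unit per step
tpu_array_macs = 128 * 128          # one MAC per cell per cycle in a 128x128 array

print(tensor_core_macs, tpu_array_macs, tpu_array_macs // tensor_core_macs)
# 64 16384 256 -> a single systolic array does ~256x the MACs of one tensor-core unit
```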
That's pretty much what they've been doing incrementally with the data center line of GPUs versus GeForce since 2017. Currently, the data center GPUs now have up to 6 times the performance at matrix math of the GeForce chips and much more memory. Nvidia has managed to stay one tape out away from addressing any competitors so far.
The real challenge is getting the TPU to do more general purpose computation. But that doesn't make for as good a story. And the point about Google arbitrarily raising the prices as soon as they think they have the upper hand is good old fashioned capitalism in action.
For sure, I did not mean to imply they could do it quickly or easily, but I have to assume that internally at Nvidia there's already work happening to figure out "can we make chips that are better for AI and cheaper/easier to make than GPUs?"
> what prevents Nvidia from doing the same thing and iterating on their more general-purpose GPU towards a more focused TPU-like chip as well, if that turns out to be what the market really wants?
Nothing prevents them per se, but it would risk cannibalising their highly profitable (IIRC 50% margin) higher end cards.
It’s not binary. It’s not existential. What’s at stake for Nvidia is its HUGE profit margins. 5 years from now, Nvidia could be selling 100x as many chips. But its market cap could be a fraction of what it is now if competition is so intense that it’s making a 5% profit margin instead of 90%.
My personal guess would be that what drives the cost and size of these chips is the memory bandwidth and the transceivers required to support it. Since transceivers/memory controllers are on the edge of the chip, you get a certain minimum circumference for a given bandwidth, which determines your minimum surface area.
It might even be 'free' to fill it with more complicated logic (especially logic that lets you write clever algorithms that save on bandwidth).
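A toy version of that reasoning, with made-up numbers (the GB/s-per-mm-of-edge figure is purely a placeholder, not a real PHY spec):

```python
# Illustrative only: if each millimetre of die edge ("beachfront") can carry a
# fixed amount of off-chip bandwidth, a bandwidth target implies a minimum
# perimeter and therefore a minimum die area, regardless of how much logic you
# actually need inside. Both numbers below are placeholders.
gbps_per_mm_edge = 100         # hypothetical beachfront density, GB/s per mm of edge
target_bandwidth = 8000        # hypothetical HBM-class target, GB/s

min_perimeter = target_bandwidth / gbps_per_mm_edge   # mm of edge needed
min_side = min_perimeter / 4                          # assume a square die
min_area = min_side ** 2                              # mm^2

print(f"min perimeter {min_perimeter:.0f} mm -> min die area ~{min_area:.0f} mm^2")
# With these placeholder numbers: 80 mm of perimeter forces ~400 mm^2 of silicon.
```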
Dram alternates between feast and famine; it's the nature of a business when the granularity of investment is so huge (you have a fab or you don't, and they cost billions -maybe trillions by now). So, it will swing back. Unfortunately it looks like maybe 3-5 years on average, from some analysis here:
https://storagesearch.com/memory-boom-bust-cycles.html
(That's just me eyeballing it, feel free to do the math)
I am so glad that both the top-rated comment and the majority of comments on HN finally understand the DRAM industry, instead of the constant "DRAM is a cartel, that is why things are expensive."
Also worth mentioning that Samsung's DRAM and NAND profits are what keep Samsung Foundry fighting TSMC. Especially for those who think TSMC is somehow a monopoly.
Another thing to point out that hasn't been mentioned yet: China is working on both DRAM and NAND. Both LPDDR5 and stacked NAND are already in production and waiting for yield and scale. Higher prices will finally be the perfect timing for them to join the commodity DRAM and NAND race. Good for consumers I suppose, not so good for a lot of other things which I won't go into.
Most of us who've been on Earth for a while know that courts often get it wrong. Even if the particular court decision you mention was correct, that does not mean price fixing is the main reason or the underlying reason DRAM prices sometimes go up.
And I am 100% sure a lot of other commodity industries would have been convicted of price fixing if we looked into it. And I say this as someone who has witnessed it first hand.
Unfortunately the commodity business is not sexy; it doesn't get the press, nor does it get taught even in business schools. But a lot of the time this so-called price fixing is a natural phenomenon.
I won't even go into the fact that what gets decided in court isn't always right.
I will also add that we absolutely want the DRAM and NAND industries, or in fact any industries, to make profits, or as much profit as they can. What is far more important is where they spend those profits. I didn't look into SK Hynix, but both Samsung and Micron spend a significant amount on R&D to at least try to lower the total production cost of DRAM per GB. We want them to make a healthy margin selling DRAM at $1/GB, not losing money and then going bankrupt.
Look man I’m a PhD economist I know the difference between monopolistic competition and collusion. All that price fixing does is transfer monopoly rents from you and me to the DRAM cartel (or whatever industry is doing the price fixing).
The firms can coordinate by agreeing on a strategy they deem necessary for the future of the industry; that strategy requires significant capital expenditures; the industry does not get (or does not want) outside investment to fund it; and if any of the firms defects and keeps prices low, the others cannot execute on the strategy, so they all agree to raise prices.
Then, after the strategy succeeds, they have gotten addicted to the higher revenues, they do not allow prices to fall as fast as they should, their coordination becomes blatantly illegal, and they have to get smacked down by regulators.
> The firms can coordinate by agreeing on a strategy they deem necessary for the future of the industry.. Then, after the strategy succeeds, they have gotten addicted to the higher revenues, they do not allow prices to fall as fast as they should, their coordination becomes blatantly illegal..
So said and did the infamous Phoebus cartel, to unnaturally "fix" the prices and quality of light bulbs.
For more than a century, one strange mystery has puzzled the world: why do old light bulbs last for decades while modern bulbs barely survive a couple of years?
The answer lies in a secret meeting held in Geneva, Switzerland in 1924, where the world’s biggest light bulb companies formed the notorious Phoebus Cartel.
Their mission was simple but shocking: control the global market, set fixed prices, and most importantly… reduce bulb lifespan.
Before this cartel, bulbs could easily run for 2500+ hours. But after the Phoebus Cartel pact and actions, all companies were forced to limit lifespan to just 1000 hours. More failure meant more purchases. More purchases meant more profit. Any company who refused faced heavy financial penalties.
The most unbelievable proof is the world-famous Livermore Fire Station bulb in California, glowing since 1901. More than 120 years old. Still alive.
While our new incandescent bulbs die in 1–2 years.
Though the Phoebus cartel was dissolved in the 1930s due to government pressure, its impact still shadows modern manufacturing. Planned obsolescence didn’t just begin here… but Phoebus made it industrial.
The Phoebus cartel didn't collude just to make the light bulbs have a shorter lifespan. They upped the standard illumination a bulb emitted so that consumers needed fewer of them to see well. With an incandescent you have a kind of sliding scale of brightness:longevity (with curves on each end that quickly go exponential, hence the longest lasting light bulb that's so dim you can barely read by its light). The brighter the bulb, the shorter the lifespan.
Also, incandescent lightbulb lifespan is reduced by repeated power cycling. Not only is the legendary firehouse bulb very dim, it has been turned off and back on again very few times. Leaving all your lights on all the time would be a waste of power for the average household, and more expensive than replacing the bulbs more frequently.
Also lightbulb dimmers were a thing back in the day, so you could always buy more lightbulbs and lower the brightness of each to take advantage of that exponential curve in lifespan.
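The commonly quoted rule-of-thumb exponents for incandescents (they vary by source and lamp type, so treat them as illustrative, not authoritative) show how steep that trade-off is:

```python
# Approximate incandescent scaling rules of thumb: light output scales roughly
# with V^3.4 and lifetime roughly with V^-13 relative to rated voltage. The
# exponents are commonly cited approximations, not exact physics.
def relative_output(v_ratio):    # v_ratio = applied voltage / rated voltage
    return v_ratio ** 3.4

def relative_life(v_ratio):
    return v_ratio ** -13

for v in (1.00, 0.95, 0.90):
    print(f"{v:.2f}x voltage: {relative_output(v):.2f}x light, "
          f"{relative_life(v):.1f}x lifetime")
# With these exponents, 0.95x voltage gives ~0.84x the light but ~1.9x the lifetime,
# which is why slight dimming (or a dim 1901 firehouse bulb) lasts so much longer.
```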
I wouldn't be so sure. I've seen analyses making the case that this new phase is unlike previous cycles and DRAM makers will be far less willing to invest significantly in new capacity, especially into consumer DRAM over more enterprise DRAM or HBM (and even there there's still a significant risk of the AI bubble popping). The shortage could last a decade. Right now DRAM makers are benefiting to an extreme degree since they can basically demand any price for what they're making now, reducing the incentive even more.
The most likely direct response is not new capacity, it's older capacity running at full tilt (given the now higher margins) to produce more mature technology with lower requirements on fabrication (such as DDR3/4, older Flash storage tech, etc.) and soak up demand for these. DDR5/GDDR/HBM/etc. prices will still be quite high, but alternatives will be available.
...except the current peak in demand is mostly driven by the build-out of AI capacity.
Both inference and training workloads are often bottlenecked on RAM speed, and trying to shoehorn older/slower memory tech there would require a non-trivial amount of R&D to go into widening the memory bus on CPUs/GPUs/NPUs, which is unlikely to happen - those are in very high demand already.
Even if AI stuff does really need DDR5, there must be lots of other applications that would ideally use DDR5 but can make do with DDR3/4 if there's a big difference in price
I mean, AI is currently hyped, so the most natural and logical assumption is that AI drives these prices up primarily. We need compensation from those AI corporations. They cost us too much.
Do we really think the current level of AI-driven data center demand will continue indefinitely? The world only needs so many pictures of bears wearing suits.
The pop culture perception of AI just being image and text generators is incorrect. AI is many things, they all need tons of RAM. Google is rolling out self-driving taxis in more and more cities for instance.
Congrats on engaging with the facetious part of my comment, but I think the question still stands: do you think the current level of AI-driven data center demand will continue indefinitely?
I feel like the question of how many computers are needed to steer a bunch of self-driving taxis probably has an answer, and I bet it's not anything even remotely close to what would justify a decade's worth of maximum investment in silicon for AI data centers, which is what we were talking about.
Do you know comparatively how much GPU time training the models which run Waymo costs compared to Gemini? I'm genuinely curious, my assumption would be that Google has devoted at least as much GPU time in their datacenters to training Waymo models as they have Gemini models. But if it's significantly more efficient on training (or inference?) that's very interesting.
No, the 10% best scenario return on AI won't make it. The bubble is trying to replace all human labor, which is why it is a bubble in the first place. No one is being honest that AGI is not possible in this manner of tech. And Scale won't get them there.
Doesn't the same factory produce enterprise (i.e. ECC) and consumer (non-ECC) DRAM?
If there is high demand for the former due to AI, they can increase its production to generate higher profits. This cuts the production capacity of consumer DRAM and leads to higher prices in that segment too. Simple supply & demand at work.
Conceptually, you can think of it as "RAID for memory".
A consumer DDR5 module has two 32-bit-wide buses, which are both for example implemented using 4 chips which each handle 8 bits operating in parallel - just like RAID 0.
An enterprise DDR5 module has a 40-bit-wide bus implemented using 5 chips. The memory controller uses those 8 additional bits to store the parity calculated over the 32 regular bits - so just like RAID 4 (or RAID 5, I haven't dug into the details too deeply). The whole magic happens inside the controller, the DRAM chip itself isn't even aware of it.
Given the way the industry works (some companies do DRAM chip production, it is sold as a commodity, and others buy a bunch of chips to turn them into RAM modules) the factory producing the chips does not even know if the chips they have just produced will be turned into ECC or non-ECC. The prices rise and fall as one because it is functionally a single market.
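A minimal sketch of that RAID-4 analogy (real server ECC uses stronger symbol-correcting codes in the memory controller, but the geometry of 32 data bits + 8 extra bits per beat is the same idea):

```python
# Four 8-bit "data chips" plus one 8-bit "parity chip" holding their XOR,
# RAID-4 style. This is a simplification of real ECC (SEC-DED / symbol codes),
# kept simple to match the analogy above.
def write_beat(data_lanes):
    """Take four 8-bit values and return what the five chips would store."""
    assert len(data_lanes) == 4 and all(0 <= b < 256 for b in data_lanes)
    parity = 0
    for b in data_lanes:
        parity ^= b
    return data_lanes + [parity]

def recover_lane(stored, bad_lane):
    """Reconstruct one lane that is known to be bad (an erasure)."""
    value = 0
    for i, b in enumerate(stored):
        if i != bad_lane:
            value ^= b
    return value

beat = write_beat([0x12, 0x34, 0x56, 0x78])
assert recover_lane(beat, 2) == 0x56   # "lose" chip 2, rebuild it from the other four
```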
Each memory DIMM/stick is made up of multiple DRAM chips. ECC DIMMs have an extra chip for storing the error-correcting parity data.
The bottleneck is with the chips and not the DIMMs. Chip fabs are expensive and time consuming, while making PCBs and placing components down onto them is much easier to get into.
Yes, but if new capacity is also redirected to be able to be sold as enterprise memory, we won't see better supply for consumer memory. As long as margins are better and demand is higher for enterprise memory, the average consumer is screwed.
Does it matter that AI hardware has such a shorter shelf life/faster upgrade cycle? Meaning we may see the ram chips resold/thrown back into the used market quicker than before?
I mean, the only difference we care about is how much of it is actual RAM vs HBM (to be used on GPUs) and how much it costs. We want it to be cheap. So yes, there's a difference if we're competing with enterprise customers for supply.
I don't really understand why every little thing needs to be spelled out. It doesn't matter. We're not getting the RAM at an affordable price anymore.
A LOT of businesses learned during Covid they can make more money by permanently reducing output and jacking prices. We might be witnessing the end times of economies of scale.
The idea is someone else comes in that's happy to eat their lunch by undercutting them. Unfortunately, we're probably limited to China doing that at this point as a lot of the existing players have literally been fined for price fixing before.
It seems more likely that someone else comes in and either colludes with the people who are screwing us to get a piece of the action or gets bought out by one of the big companies who started all this. Since the rare times companies get caught they only get weak slaps on the wrist where they only pay a fraction of what they made in profits (basically just the US demanding their cut) I don't have much faith things will improve any time soon.
Even China has no reason to reduce prices much for memory sold to the US when they know we have no choice but to buy at the prices already set by the cartel.
I expect that if China does start making memory they'll sell it cheap within China and export it at much higher prices. Maybe we'll get a black market for cheap DRAM smuggled out of China though.
I think in part it is a system-level response to the widespread just-in-time approach of those businesses' clients. A just-in-time client is very "flexible" on price when supply is squeezed. After that back and forth I think we'll see a return to some degree of supply buffering (warehousing) to dampen down the supply-level/price shocks in the pipelines.
Historically, yes. But we haven't had historical demand for AI stuff before. What happens when OpenAI and NVIDIA monopolize the majority of DRAM output?
In a traditional pork cycle there's a relatively large number of players and a relatively low investment cost. The DRAM market in the 1970s and 1980s operated quite similarly: you could build a fab for a few million dollars, and it could be done by a fab which also churned out regular logic - it's how Intel got started! There were dozens of DRAM-producing companies in the US alone.
But these days the market looks completely different. The market is roughly equally divided up between SK Hynix, Micron, and Samsung. Building a fab costs billions and can easily take 5 years - if not a decade - from start to finish. Responding to current market conditions is basically impossible; you have to plan for the market you expect years from now.
Ignoring the current AI bubble, DRAM demand has become relatively stable - and so has the price. Unless there's a good reason to believe the current buying craze will last over a decade, why would the DRAM manufacturers risk significantly changing their plans and potentially creating an oversupply in the future? It's not like the high prices are hurting them...
Also, current political turbulence makes planning for the long term extremely risky.
Will the company be evicted from the country in 6 months? A year? Will there be 100% tariffs on competitors' imports? Or 0%? Will there be an anti-labor gov’t in effect when the investment might mature, or a pro-labor one?
The bigger the investment, the longer the investment timeframe, and the more sane the returns - the harder it is to make the investment happen.
High risk requires a correspondingly high potential return.
That everyone has to pay more for current production is a side effect of the uncertainty, because no one knows what the odds are of even future production actually happening, let alone the next fancy whiz-bang technology.
No, a wafer is very much not a wafer. DRAM processes are very different from making logic*. You don't just make memory in your fab today and logic tomorrow. But even when you stay in your lane, the industry operates on very long cycles and needs scale to function at any reasonable price at all. You don't just dust off your backyard fab to make the odd bit of memory whenever it is convenient.
Nobody is going to do anything if they can't be sure that they'll be able to run the fab they built for a long time and sell most of what they make. Conversely fabs don't tend to idle a lot. Sometimes they're only built if their capacity is essentially sold already. Given how massive the AI bubble is looking right now, I personally wouldn't expect anyone to make a gamble building a new fab.
* Someone explained this at length on here a while ago, but I can't seem to find their comment. Should've favorited it.
Sure, yes the cost of producing a wafer is fixed. Opex didn’t change that much.
Following your reasoning, which is common in manufacturing, the capex needed is already allocated.
So, where does the 2x price hike come from if not supply/demand?
The cost to produce did not go up 100%, or even 20%
Actually, DRAM fabs do get scaled down, very similar to the Middle East scaling down oil production.
> So, where does the 2x price hike come from if not supply/demand?
It absolutely is supply/demand. Well, mostly demand, since supply is essentially fixed over shorter time spans. My point is that "cost per square mm [of wafer]" is too much of a simplification, given that it depends mostly on the specific production line and also ignores a lot of the stuff going on down the line. You can use it to look at one fab making one specific product in isolation, but it's completely useless for comparing between them or when looking at the entire industry.
It's a bit like saying the cost of cars is per gram of metal used. Sure, you can come up with some number, but what is it really useful for?
DRAM/flash fab investment probably did get scaled down due to the formerly low prices, but once you do have a fab it makes sense to have it produce flat out. Then that chunk of potential production gets allocated into DRAM vs. HBM, various sorts of flash storage etc. But there's just no way around the fact that capacity is always going to be bottlenecked somehow, and a lot less likely to expand when margins are expected to be lower.
> Sometimes they're only built if their capacity is essentially sold already.
"Hyperscalers" already have multi-year contracts going. If the demand really was there, they could make it happen. Now it seems more like they're taking capacity from what would've been sold on the spot or quarterly markets. They already made their money.
Well, I've experienced both to some degree in the past. The previous long stretch with very similar hardware performance was when PCs were exorbitantly expensive and the Commodore 64 was the main "home computer" (at least in my country), through the late 80s and early 90s.
That period of time had some benefits. Programmers learned to squeeze absolutely everything out of that hardware.
Perhaps writing software for today's hardware is again becoming the norm rather than being horribly inefficient and simply waiting for CPU/GPU power to double in 18 months.
I was lucky. I built my AM5 7950X Ryzen PC with 2x48GB DDR5 two years ago. I just bought a 4x48GB kit a month ago with the idea of building another home server with the old 2x48GB kit.
Today my old G.Skill 2x48GB kit costs double what I paid for the 4x48GB.
Furthermore, I bought two used RTX 3090s (for AI) back then. A week ago I bought a third one for the same price... (for VRAM in my server).
> It's kinda sad when you grow up in a period of rapid hardware development and now see 10 years going by with RAM $/GB prices staying roughly the same.
But you’re cherry picking prices from a notable period of high prices (right now).
If you had run this comparison a few months ago or if you looked at averages, the same RAM would be much cheaper now.
I think that goes to show that official inflation benchmarks are not very practical / useful in terms of the buckets of things that people actually buy or desire. If the bucket that measured inflation included computer parts (GPUs?), food and housing - i.e. all the things that a geek really needs - inflation would be way higher...
> If the bucket that measured inflation included computer parts (GPUs?), food and housing - i.e. all the things that a geek really needs - inflation would be way higher...
A house is $500,000
A GPU is $500
You could put GPUs into the inflation bucket and it wouldn’t change anything. Inflation trackers count cost of living and things you pay monthly, not one time luxury expenses every 4 years that geeks buy for entertainment.
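Back-of-the-envelope, with round illustrative numbers rather than official CPI weights:

```python
# Amortised over a typical upgrade cycle, a GPU is a rounding error next to
# housing. Both figures are round illustrative numbers, not CPI weights.
gpu_price = 500            # USD, bought roughly every 4 years
housing_per_month = 2000   # USD rent/mortgage, illustrative

gpu_per_month = gpu_price / (4 * 12)
print(f"GPU: ~${gpu_per_month:.0f}/month vs housing ${housing_per_month}/month "
      f"({gpu_per_month / housing_per_month:.1%} as large)")
# ~$10/month, i.e. about half a percent of the housing line item alone.
```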
Also need to account for the dollar decline vs other currencies (which yes is possibly somewhat factored into dollar inflation so you'd have to do the inflation calculation in Euros then convert to dollars accounting for the decline in value).
I just gave up and built an AM4 system with a 3090 because I had 128GB of DDR4 UDIMMs on hand; the whole build cost less than just the memory would have for an AM5/DDR5 build.
Really wish I could replace my old Skylake-X system, but even DDR4 RDIMMs for an older Xeon are crazy now, let alone DDR5. Unfortunately I need slots for 3x Titan V's for the 7.450 TFLOPS each of FP64. Even the 5090 only does 1.637 TFLOPS for FP64, so I'm just hoping that old system keeps running.
If you don't need full IEEE 754 double precision, the Ozaki scheme (emulation with tensor cores) might do the trick. It's been added (just a little bit) to cuBLAS recently.
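For a flavour of how that kind of emulation works: this is not the actual Ozaki scheme (which slices mantissas so integer/tensor units can accumulate exactly), just a toy split-and-accumulate illustration of why FP64 matmuls can be recovered from lower-precision pieces:

```python
# Toy illustration of emulated-FP64 matmul: split each double into a "head"
# that is exactly representable in FP32 plus a residual "tail", do several
# partial products, and sum them. NOT the real Ozaki scheme; just the flavour.
import numpy as np

def split(x):
    hi = x.astype(np.float32).astype(np.float64)   # FP32-representable head
    lo = x - hi                                    # residual tail
    return hi, lo

def emulated_matmul(A, B):
    A_hi, A_lo = split(A)
    B_hi, B_lo = split(B)
    # On hardware each partial product would run on lower-precision units with
    # a wide accumulator; float64 matmul of FP32-representable values stands in
    # for that here. The tiny lo @ lo term is dropped.
    return A_hi @ B_hi + A_hi @ B_lo + A_lo @ B_hi

rng = np.random.default_rng(0)
A = rng.standard_normal((64, 64))
B = rng.standard_normal((64, 64))
exact = A @ B
plain_fp32 = (A.astype(np.float32) @ B.astype(np.float32)).astype(np.float64)
print(np.abs(plain_fp32 - exact).max())             # plain FP32: noticeably off
print(np.abs(emulated_matmul(A, B) - exact).max())  # split version: orders of magnitude closer
```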
My 64gb DDR5 kit started having stability issues running XMP a few weeks out of warranty. I bought it two years ago. Looked into replacing it and the same kit is now double the price. Bumping the voltage a bit and having better cooling gets it through memtest thankfully. The fun of building your own computer is pretty much gone for me these days.
Such is life. I suggest finding a less volatile hobby, like crocheting.
Actually, the textile market is pretty volatile in the US these days with Joan's out of business. Pick a poison, I guess? There's little room for stability in a privately-owned-world.
Last night, while writing a LaTeX article, with Ollama running for other purposes, Firefox with its hundreds of tabs, multiple PDF files open, my laptop's memory usage spiked up to 80GB RAM usage... And I was happy to have 128GB. The spike was probably due to some process stuck in an effing loop, but the process consuming more and more RAM didn't have any impact on the system's responsiveness, and I could calmly quit VSCode and restart it with all the serenity I could have in the middle of the night.
Is there even a case where more RAM is not really better, except for its cost?
> Is there even a case where more RAM is not really better, except for its cost?
It depends. It takes more energy, which can be undesirable in battery powered devices like laptops and phones. Higher end memory can also generate more heat, which can be an issue.
But otherwise more RAM is usually better. Many OS's will dynamically use otherwise unused RAM space to cache filesystem reads, making subsequent reads faster and many databases will prefetch into memory if it is available, too.
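On Linux you can see that cache in action: "free" memory is often small while "available" memory (free plus reclaimable cache) stays large. A quick Linux-only sketch reading /proc/meminfo:

```python
# Read /proc/meminfo and show how much apparently "used" RAM is really just
# reclaimable page cache. Linux-only; values in the file are reported in kB.
def meminfo():
    fields = {}
    with open("/proc/meminfo") as f:
        for line in f:
            key, value = line.split(":", 1)
            fields[key] = int(value.strip().split()[0])
    return fields

m = meminfo()
for key in ("MemTotal", "MemFree", "MemAvailable", "Cached"):
    print(f"{key:>12}: {m[key] / 1024 / 1024:.1f} GiB")
```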
That said, I wholeheartedly agree that "more RAM less problems". The only case I can think of when it's not strictly better to have more is during hibernation (cf sleep) when the system has to write 128GB of ram to disk.
I've got ~5k+ tabs, and I've also seen basically zero crashes in the last decade. I'm on Macos, not very many extensions though one of them is Sidebery (and before that Tree Style Tabs) which seems to slow things down quite a lot.
I likely don't need all the tabs. Some were opened only because they might be useful or interesting. Others get opened because they cover something I want to dig into further later on, but in this case it's the buildup of multiple crash>restore cycles. Eventually I'll get to each tab and close it or save the URL separately until it's back to 0, but even in that process new tabs/windows get opened so it can take time.
On consumer chips the more memory modules you have the slower they all run. I.e. if you have a single module of DDR5 it might run at 5600 MT/s, but if you have four of them they all get throttled to 3800 MT/s.
Mainboards have two memory channels, so you should be able to reach 5600 MT/s on both, and dual-slot mainboards have better routing than quad-slot mainboards. This means the practical limit for consumer RAM is 2x48GB modules.
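For rough context, peak theoretical bandwidth at those two transfer rates (DDR5 moves 8 bytes per transfer per 64-bit channel, and consumer boards have two channels regardless of slot count):

```python
# Rough theoretical bandwidth for the two configurations mentioned above.
def dual_channel_gbps(mt_per_s):
    bytes_per_transfer = 8          # 64 bits per channel per transfer
    channels = 2                    # consumer platforms
    return mt_per_s * 1e6 * bytes_per_transfer * channels / 1e9

print(f"2 DIMMs @ 5600 MT/s: ~{dual_channel_gbps(5600):.0f} GB/s")   # ~90 GB/s
print(f"4 DIMMs @ 3800 MT/s: ~{dual_channel_gbps(3800):.0f} GB/s")   # ~61 GB/s
```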
Intel's consumer processors (and therefore the mainboards/chipsets) used to have four memory channels, but around the year 2020 this was suddenly limited to two channels since the 12th generation (AMD's consumer processors have always had two channels, with the exception of Threadripper?).
However this does not make sense, as for more than a decade processors have grown mainly by increasing the number of threads, so two channels sounds like a negligent and deliberately imposed bottleneck on memory access if one uses all those threads (let's say 3D rendering, video post-production, games, and so on).
And if one wants four channels to get past such an imposed bottleneck, the mainboards that have four channels nowadays aren't aimed at consumer use, so they come with one or two USB connectors and three or four LAN connectors, at prohibitive prices.
We are talking about ten-year-old consumer quad-channel DDR4 machines, widely spread, that remain competitive with current consumer ones, if not better. It is as if everything had been frozen all these years (and who knows what remains to be seen with such a pattern).
Now it is rumoured that AMD may opt for four channels for its consumer lines due to the increased number of pin connectors (good news if true).
It is a bad joke what the industry is doing to customers.
> Intel's consumer processors (and therefore the mainboards/chipsets) used to have four memory channels, but around the year 2020 this was suddenly limited to two channels since the 12th generation (AMD's consumer processors have always had two channels, with the exception of Threadripper?).
You need to re-check your sources. When AMD started doing integrated memory controllers in 2003, they had Socket 754 (single channel / 64-bit wide) for low-end consumer CPUs and Socket 940 (dual channel / 128-bit wide) for server and enthusiast desktop CPUs, but less than a year later they introduced Socket 939 (128-bit) and since then their mainstream desktop CPU sockets have all had a 128-bit wide memory interface. When Intel later also moved their memory controller from the motherboard to the CPU, they also used a 128-bit wide memory bus (starting with LGA 1156 in 2008).
There's never been a desktop CPU socket with a memory bus wider than 128 bits that wasn't a high-end/workstation/server counterpart to a mainstream consumer platform that used only a 128-bit wide memory bus. As far as I can tell, the CPU sockets supporting integrated graphics have all used a 128-bit wide memory bus. Pretty much all of the growth of desktop CPU core counts from dual core up to today's 16+ core parts has been working with the same bus width, and increased DRAM bandwidth to feed those extra cores has been entirely from running at higher speeds over the same number of wires.
What has regressed is that the enthusiast-oriented high-end desktop CPUs derived from server/workstation parts are much more expensive and less frequently updated than they used to be. Intel hasn't done a consumer-branded variant of their workstation CPUs in several generations; they've only been selling those parts under the Xeon branding. AMD's Threadripper line got split into Threadripper and Threadripper PRO, but the non-PRO parts have a higher starting price than early Threadripper generations, and the Zen 3 generation didn't get non-PRO Threadrippers.
At some point the best "enthusiast-oriented HEDT" CPUs will be older-gen Xeon and EPYC parts, competing fairly in price, performance and overall feature set with top-of-the-line consumer setups.
Based on historical trends, that's never going to happen for any workloads where single-thread performance or power efficiency matter. If you're doing something where latency doesn't matter but throughput does, then old server processors with high core counts are often a reasonable option, if you can tolerate them being hot and loud. But once we reached the point where HEDT processors could no longer offer any benefits for gaming, the HEDT market shrank drastically and there isn't much left to distinguish the HEDT customer base from the traditional workstation customers.
I'm not going to disagree outright, but you're going to pay quite a bit for such a combination of single-thread peak performance and high power efficiency. It's not clear why we should be regarding that as our "default" of sorts, given that practical workloads increasingly benefit from good multicore performance. Even gaming is now more reliant on GPU performance (which in principle ought to benefit from the high PCIe bandwidth of server parts) than CPU.
I said "single-thread performance or power efficiency", not "single-thread performance and power efficiency". Though at the moment, the best single-thread performance does happen to go along with the best power efficiency. Old server CPUs offer neither.
> Even gaming is now more reliant on GPU performance (which in principle ought to benefit from the high PCIe bandwidth of server parts)
A gaming GPU doesn't need all of the bandwidth available from a single PCIe x16 slot. Mid-range GPUs and lower don't even have x16 connectivity, because it's not worth the die space to put down more than 8 lanes of PHYs for that level of performance. The extra PCIe connectivity on server platforms could only matter for workloads that can effectively use several GPUs. Gaming isn't that kind of workload; attempts to use two GPUs for gaming proved futile and unsustainable.
You have a processor with more than eight threads, at the same bus bandwidth; what do you choose, a dual-channel or a quad-channel processor?
That number of threads will hit a bottleneck accessing memory through only two channels.
I don't understand why you brought up the topic of single-threading in your response to the user, given that processors reached a frequency limit of 4 GHz, and 5 GHz with overclocking, a decade ago. This is why they increased the number of threads, but if they reduce the number of memory channels for consumer/desktop...
Larger capacity is usually slower though. The fastest RAM is typically 16 or 32 GB capacity.
The OP is talking about a specific niche of boosting single-thread performance. It's common with gaming PCs since most games are single-thread bottlenecked. A 5% difference may seem small, but people are spending hundreds or thousands for smaller gains... so buying the fastest RAM can make sense there.
If you are working on an application that has several services (database, local stack, etc.) as docker containers, those can take up more memory. Especially if you have large databases or many JVM services, and are running other things like an IDE with debugging, profiling, and other things.
Likewise, if you are using many local AI models at the same time, or some larger models, then that can eat into the memory.
I've not done any 3D work or video editing, but those are likely to use a lot of memory.
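As a rough illustration of how such a stack adds up, here is a sketch with assumed, illustrative per-service footprints (none of these figures are measurements from the thread); the point is only that a handful of containers, JVM services, and tools can consume most of a 32 GB machine.

```python
# Illustrative only: assumed memory footprints for a local dev stack,
# to show how a 32 GB machine can fill up. Adjust to your own workload.
footprints_gb = {
    "Postgres container with a large dataset": 4,
    "Three JVM microservices (heap + metaspace)": 6,
    "IDE with debugger and profiler attached": 5,
    "Browser with many tabs": 4,
    "Quantized 7B local model": 5,
    "OS and everything else": 4,
}

total = sum(footprints_gb.values())
for name, gb in footprints_gb.items():
    print(f"{name}: ~{gb} GB")
print(f"Total: ~{total} GB out of 32 GB")
```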
Having recently upgraded from 96 GB to 192 GB, I'm pretty happy. I run many containers, have 20 windows of VS Code, and so on. Plus AI inference on the CPU when 48 GB of VRAM is not enough.
Interesting that Samsung put their prices up 60% today, and a retailer who bought their stock at the old price feels compelled to put their prices up 2.5x.
When the AI bubble bursts we can get back to the old price
The cost of inventory on the shelves basically doesn’t matter. The only thing that matters is the market rate.
If those retailers didn’t increase their prices when the price hike was announced, anyone building servers would have instantly purchased all of the inventory anyway at the lower prices, so there wouldn’t actually have been weeks of low retail RAM prices for everyone.
Every once in a while you can catch a retailer whose pricing person missed the memo and forgot to update the retail price when the announcement came out. They go out of stock very rapidly.
> If those retailers didn’t increase their prices when the price hike was announced, anyone building servers would have instantly purchased all of the inventory anyway at the lower prices
But that retailer would have made a lot of money in a very short time.
In the scenario where they don't raise prices, they sell out immediately. In the scenario where they do raise prices, it's too expensive so you don't buy it. In the scenario where they keep prices low, and do a lottery to see who can buy them, you don't get picked.
No matter what, you are not getting those modules at the old price. There are few things that trip up people harder than this exact scenario, and it happens everywhere: concert tickets, limited releases, water during crises, hot Christmas gifts, pandemic GPUs, etc.
Once understood you can stop getting mad over it like it's some conspiracy. It's fundamental and natural market behavior.
I guess I lucked out. I bought a 768 GB workstation (with a 9995WX CPU and an RTX 6000 Pro Blackwell GPU) in August. 96 GB modules were better value than 128 GB. That build would be a good bit pricier today, it looks like.
Yeah, you are not alone here in being annoyed. I think we need to penalise all who drive the prices up - that includes the manufacturers but also AI companies etc...
Those price increases are not normal at all. I understand that most of it still comes from market demand, but this is also skewing the market in unfair ways. Such increases smell of criminal activity too.
> I think we need to penalise all who drive the prices up - that includes the manufacturers but also AI companies etc...
You want to penalize companies for buying things and penalize companies for selling things at market rate?
There are a lot of good examples through history about how central planning economics and strict price controls do not lead to good outcomes. The end result wouldn’t be plentiful cheap RAM for you. The end result would be no RAM for you at all because the manufacturers choose to sell to other countries who understand basic economics.
I think there's a case for banning the sale of services well below the marginal cost of supplying that service - loss leaders, or "dumping" - when it's done on such a scale as AI marketing.
I think it's somewhat useful long-term advice, and I would add that prices for different parts tend to move asynchronously.
Building a PC in a cost-efficient manner generally requires someone to track parts prices over years, buy parts at different times, and buy at least a generation behind.
The same applies to many other markets/commodities/etc...
That is terrible. It only lasted less than half a year! If all countries keep building AI data centers, it will take a long time for prices to come back down to a reasonable level.
Huh, I had not connected those (hypothetical) dots, but I could see it...
Or maybe there's 2 next-gen Steam Decks, an ultra-portable ARM-based one that's as small as can be, and a more performant x86 one with AMD's next-gen APU...
Yeah, there's a real gap in the market for a relatively compact handheld which can play low-spec PC games. The AMD-based handheld PCs available today are all pretty chunky.
You're right, I was mistaken. I've seen some YouTubers playing games on it, but they use GameHub to run Steam games; somehow I thought it was running SteamOS.
There's plenty of "relatively compact" ARM-based handhelds targeting the retro market already, but many of them are shipping with a pitiful amount of RAM (1GB or so) making them an absolute non-starter, while others (selling for significantly higher prices) run crappy Android-based OS's that will never be updated. There is a gap in the market for a good-quality retro-like handheld shipping with a Linux-native OS (or even just enabling one to be installed trivially after-the-fact, with everything working and no reliance on downstream hacked-together support packages).
There are handhelds for less than $200 with very good screens and controls that can play all of these. Not to mention streaming (via Steam or other software) from your PC!
If they did an AMD CPU on the same TSMC node that Apple uses for its Arm CPUs, it wouldn't be that much less power efficient and would have much greater compatibility.
They would realistically gain the most efficiency by getting Nvidia to design a modern, super power-efficient GPU like the one used in the original Switch and the Nvidia Shield. AMD GPUs can be great for desktop gaming, but in terms of performance per watt Nvidia is way ahead.
An AMD CPU paired with an Nvidia GPU might be a hard thing to actually negotiate, however, given that AMD is big in the GPU space as well. As far as I know, most "APUs" aren't really that special and are just a combo of GPU and CPU.
APUs have the GPU and CPU on the same package, or sometimes even the same die (with tiling). If there was to be an Nvidia GPU and AMD CPU type system, they would have to be separate packages.
> Apple demonstrated to the world that it can be extremely fast and sip power.
Kinda. Apple silicon sips power when it isn't being used, but under a heavy gaming load it's pretty comparable to AMD. People report 2 hours of battery life playing Cyberpunk on Macs, which matches the Steam Deck. It's only in lighter games where Apple pulls ahead significantly, and that really has nothing to do with it being ARM.
Sure, but Apple isn't selling its silicon to anyone else, and Valve, successful as they are, don't have Apple's money and economies of scale to throw at designing their own state-of-the-art CPU/GPU cores and building them on TSMC's leading-edge processes. Valve will have to roll with whatever is available on the open market, and if that happens to suck compared to Apple's stuff, then tough shit.
I'm definitely dreaming, but I think it could be a win-win situation if Apple decided to license its chips to Valve: the resulting handhelds and VR headsets would be power-efficiency monsters, and PC devs would finally have a good reason to target ARM, which could finally bring native PC gaming to Macs.
This doesn't feel like anything Apple has done in modern times. The last thing I remember them licensing was the iPod+HP from 2004-2005. Apple barely does enterprise support; they're very focused on selling their products to consumers and I don't think they're at all interested in selling CPUs to others.
Apple waffles and sometimes talks about gaming on Macs, but they lack the commitment that is needed. A lot of people like to buy a game and keep playing it for years, even after the developer has moved on to something else, or to buy years-old games on sale. But you can't expect a media- and GPU-intensive macOS app compiled three to five years ago to run on today's macOS. There will have been mandatory developer updates, and it won't work.
Win32 is the only stable desktop ABI... and games need a stable ABI.
The Nintendo Switch already provides >160 million reasons for gamedevs to care about native ARM support, but that hasn't moved the needle for the Mac. Being ARM-based is the least of its problems, the problem is that it's a relatively tiny potential market owned by a company which is actively hostile to the needs of game developers.
The Switch is underpowered to the point that most A(AA) games cannot run on it without a ton of effort and compromise; an M-chip-powered device would be a different story. But anyway, it's never going to happen, just daydreaming about a perfect gaming setup...
Valve isn't in the position to make their own best-in-class ARM chips like Apple is. They'd have to find a vendor which can sell them the chip they need.
Which SoC on the market do you think fits the bill?
Large 4K TVs being this accessible/affordable for most households has not been an option for "decades".