Mom needs something very like a grocery store to get food from. LLM training does not need HTTP, HTTPS, HTML, CSS, JavaScript, hyperlinks, banner ads, cookies, and so on. Yes the web was a convenient source to find a lot of text but it's not the only source.
It's obviously not even close to the same logic. One obvious difference is that "mom" (who gets their dinner from "mom"? Not me) pays the grocery store for the food. If I get all my meals via DoorDash, the farmers still get paid. If I get all my news via LLMs rather than a subscription to the NYT, the NYT and its investigative reporters get no revenue from me.
Seems like the same logic to me. Isn't the Web where the information came from?