
I always like to think of LLMs as Markov models in the same way that real-world computers are finite state machines. It's technically true, but not a useful level of abstraction at which to analyze them.

Both LLMs and n-gram models satisfy the Markov property, and you could in principle go through and compute explicit transition matrices (something on the order of vocab_size*context_size, I think). But LLMs aren't trained as n-gram models, so beyond giving you autoregressiveness, there's not really much you learn by viewing them as Markov models.



> Both LLMs and n-gram models satisfy the Markov property, and you could in principle go through and compute explicit transition matrices (something on the order of vocab_size*context_size, I think).

Isn’t it actually (vocab_size)^(context_size)?


Yes, you're right. I typed "**" (exponentiation), but HN ate the second star since I forgot to escape it.
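
A minimal sketch of the point, with toy numbers I made up (vocab_size=4, context_size=3): each Markov state is a full context window, so there are vocab_size**context_size states, and any fixed-window next-token model induces an explicit transition matrix over them. toy_next_token_probs below is a hypothetical stand-in for an actual LLM forward pass, not anything from the thread.

    import itertools
    import numpy as np

    vocab_size = 4     # assumed toy vocabulary: tokens 0..3
    context_size = 3   # assumed toy context window

    # Each Markov state is one full window of context_size tokens.
    states = list(itertools.product(range(vocab_size), repeat=context_size))
    assert len(states) == vocab_size ** context_size  # 4**3 = 64 states

    def toy_next_token_probs(window):
        # Hypothetical stand-in for an LLM forward pass; here just uniform.
        return np.full(vocab_size, 1.0 / vocab_size)

    # From window (t1, ..., tn) the only reachable states are (t2, ..., tn, next),
    # weighted by the model's next-token probabilities.
    state_index = {s: i for i, s in enumerate(states)}
    T = np.zeros((len(states), len(states)))
    for s in states:
        probs = toy_next_token_probs(s)
        for nxt, p in enumerate(probs):
            T[state_index[s], state_index[s[1:] + (nxt,)]] += p

    # Rows sum to 1, as they must for a Markov transition matrix.
    assert np.allclose(T.sum(axis=1), 1.0)
    print(T.shape)  # (64, 64); already astronomical for realistic vocab/context sizes

The state count is what blows up: even a 50k-token vocab with a 2k context gives 50000**2000 states, which is why "it's a Markov chain" is true but not something you can actually compute with.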



