Depending on the shape of the data, a slightly different kernel implementation (e.g. for matrix multiplication) will be optimal, and those implementations can give slightly different results. There can also be other sources of non-determinism depending on the implementation (e.g. some kernels are inherently non-deterministic because they use tricks to go faster, such as atomic adds whose ordering varies between runs).
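For anyone who wants to see this concretely, here's a minimal NumPy sketch (no inference stack involved, just the underlying floating-point behavior): the same dot product summed in different chunk sizes, which is roughly what different kernel tilings do internally, lands on slightly different values.

```python
import numpy as np

rng = np.random.default_rng(0)
a = rng.standard_normal(4096).astype(np.float32)
b = rng.standard_normal(4096).astype(np.float32)

# Same mathematical dot product, three different summation orders,
# mimicking what different kernel tilings/splits do internally.
full = np.dot(a, b)
chunked_64 = sum(np.dot(a[i:i+64], b[i:i+64]) for i in range(0, 4096, 64))
chunked_512 = sum(np.dot(a[i:i+512], b[i:i+512]) for i in range(0, 4096, 512))

print(full, chunked_64, chunked_512)
# float32 addition is not associative, so these typically differ in the
# last bits -- enough to flip a near-tied argmax over logits.
```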
Yep, this. I see a lot of other worryingly confident answers in the thread that are wrong.
SGLang finally has at least some notes[0], but I’m always surprised there isn’t more of a community-wide effort to track down the sources of non-determinism.
In my experience with other regular models, once the context starts to fill up, quality starts to degrade.
Wouldn't getting batched at the end of a batch have a similar effect on the results, where your prompt might receive less attention overall, if the context window is almost full?
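It wouldn't change how much attention your prompt gets, but batch composition can change the numerics: the matmul shapes depend on the whole batch, and a different shape can select a different kernel or tiling. A rough PyTorch sketch of the idea (hypothetical shapes; on CPU the rows often do match bitwise, while on GPU at realistic sizes they frequently don't):

```python
import torch

torch.manual_seed(0)
W = torch.randn(1024, 1024)
x = torch.randn(1, 1024)
others = torch.randn(31, 1024)

alone = x @ W                                 # your request on its own
in_batch = (torch.cat([x, others]) @ W)[0:1]  # same request, batch of 32

# Mathematically identical rows; whether they're bitwise identical
# depends on which kernel the backend picks for each shape.
print(torch.equal(alone, in_batch))
```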