Yeah...the whole point is that you're testing the model on something it hasn't s...

		pertymcpert 10 months ago \| parent \| context \| favorite \| on: An analysis of DeepSeek's R1-Zero and R1 Yeah...the whole point is that you're testing the model on something it hasn't seen already. If the problems were in the training set by definition the model has seen them before.