Hacker News
new
|
past
|
comments
|
ask
|
show
|
jobs
|
submit
login
pertymcpert
10 months ago
|
parent
|
context
|
favorite
| on:
An analysis of DeepSeek's R1-Zero and R1
Yeah...the whole point is that you're testing the model on something it hasn't seen already. If the problems were in the training set by definition the model has seen them before.
Guidelines
|
FAQ
|
Lists
|
API
|
Security
|
Legal
|
Apply to YC
|
Contact
Search: