
They are testing with a different dataset. The authors are saying that they have not tested a version of o3 that has not seen the training set.


Yeah... the whole point is that you're testing the model on something it hasn't already seen. If the problems were in the training set, then by definition the model has seen them before.



