I think soon we are going to realize that we don’t really need to train the models
We just need good indexing and sampling
Essentially at some level any LLM is equivalent to a DB of the dataset, with a great NLP interface on top
Both are just different methods of navigating stored data
LLMs do not navigate stored data. An LLM is not a DB of the training data.
https://news.ycombinator.com/item?id=40266791