I would point to the problem that chatbots fail not at having a “spark” but at t...

fnordpiglet · on Sept 20, 2023

LLMs are just one tool in a collection. Intelligence is based on many models, not just the language parts of our brain - and I expect AI to incorporate more models in a system approach. Why does it matter if LLMs are able to play chess at a grandmaster level or not? They can delegate the actual chess optimization problem to a chess optimizing program. While it’s interesting language alone is as powerful as it is, it’s very myopic to judge the tool alone and not as a part of a toolbox.

FrustratedMonky · on Sept 20, 2023

Exactly It is NOT all about LLM's. There are a lot of other successful models. From AlphaGo, to visions systems, robotics. LLM is just the latest shiny thing.

At some point they will all be tied together, and at that point it will start to look a lot more like sections of our brain, one is vision, one is language, one is movement. etc....

candiodari · on Sept 20, 2023

I think it's already been made clear that the main reason for the "asymptote" is wrong data input. These models attempt to learn from random internet text ... and this turns out to not be all that accurate.

Also, I've observed a model I was training having the same problem as I do myself. If I at any point learn wrong data, which happens of course, then getting that wrong data back out is very hard and requires 10 or 50 times the effort I spent learning the initial data. In fact I strongly suspect I never unlearn bad data, I just additionally learn "if I say X, it's wrong, say Y instead".

jacquesm · on Sept 20, 2023

Brains suck for exact work such as database work or precise calculations over longer chains. But they excel at approximate work, and that's a very useful skill to have as long as if you have to you can fire up the pencil and paper and do your precise calculations that way. And paper works fine for database work as well and will remember all of those sports stats for as long as you care (and even after you're dead).

Brains are so powerful because they are universal, they can use auxiliary data stores and co-processors just fine.

pmarreck · on Sept 21, 2023

So basically we have to give the LLM access to (both read and add to) a tool that can deal with structured knowledge/state strictly, same thing we have to do for humans- like calculators, databases, clocks/alarms, computer language executors… That way if we tell it “remember that my birthday is April 5” it can enter it into a calendar tool in such a way that it can quickly retrieve it later to confirm its “LLM guesswork” or get triggered a reminder of it on that date.

I’ve been experimenting with prompting to get GPT4 to realize it has a “memory” (just a flat file for now) which it can contextually retrieve and write to, coupled with a process that interprets any requests it makes of this “memory” and adds them to the conversation. Limited success so far. End goal is a “life agent” that does things like remind me of things in a human-like way, sums up my emails, etc.