Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

Yes. “Superalignment” (admittedly a corny term) refers to the specific case of aligning AI systems that are more intelligent than human beings. Alignment is an umbrella term which can also refer to basic work like fine-tuning an LLM to follow instructions.


Is this not something of an oxymoron? If there exists an ai that is more intelligent than humans, how could we mere mortals hope to control it? If we hinder it so that it cannot act in ways that harm humans, can we really be said to have created superintelligence?

It seems to me that the only way to achieve superalignment is to not create superintelligence, if that is even within our control.


Not self-evident. Fungus can control ant. Toxoplasma gondii can control human. Who is more intelligent? So if control of more intelligent being is possible, could it be symbiotic to permit? Alpha-proteobacteria sister to ancestor proto-mitochondria and now we live aligned. But those beings lacked conscious agency. We have more than them. Not self-evident we will fail at this.


Another example is the alignment between our hindbrain, limbic system and neocortex. Neocortex is smarter but is usually controlled by lower level processes…

Note that misalignment between these systems is very common.


Many people share your views, but others believe it is possible.


Huh! All this time I thought the "super" was just for branding/differentiation.


Alignment was the original term, but has been largely coopted to mean a vaguely similar looking concept of public safety around the capabilities of current models.


That was definitely part of it.


Then why don't they call politicians "super-politicians"?

Their purpose is to control the population by being lesser beings who feed off corporations and just push their message.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: