Under this scheme, it isn't impossible the network will learn to fool the discriminator (much like a GAN), and allow it to do things that it is otherwise explicitly not supposed to do.
Hackers do this, already. They evade security heuristics by piggybacking a payload across many cycles/packets. Changing a single bit over billions of cycles or packets inconspicuously, is an "easy mode" solution to deliver a payload, covertly, through secure channels.
If we can do it, AGI can do it better and more efficiently.
>Under this scheme, it isn't impossible the network will learn to fool the discriminator (much like a GAN), and allow it to do things that it is otherwise explicitly not supposed to do.
Hackers do this, already. They evade security heuristics by piggybacking a payload across many cycles/packets. Changing a single bit over billions of cycles or packets inconspicuously, is an "easy mode" solution to deliver a payload, covertly, through secure channels.
If we can do it, AGI can do it better and more efficiently.
(post is archived)