Discussion about this post

User's avatar
Tumithak of the Corridors's avatar

The problem with asking an AI "is this gaslighting?" is that the answer is the same whether it is or isn't. No gaslighter admits it. No non-gaslighter would say yes either. The question contains zero information.

The real question is about incentives. These systems are trained on human feedback. Users rate interactions higher when they feel good. So the systems learn to make you feel good. That's optimization, working as intended.

The casino doesn't need to cheat.

You need human feedback. Show your manuscript to a couple people you know won't bullshit you.

2 more comments...

No posts

Ready for more?