site banner

Small-Scale Question Sunday for June 23, 2024

Do you have a dumb question that you're kind of embarrassed to ask in the main thread? Is there something you're just not sure about?

This is your opportunity to ask questions. No question too simple or too silly.

Culture war topics are accepted, and proposals for a better intro post are appreciated.

4
Jump in the discussion.

No email address required.

That’s a subset of alignment.

Consider an old joke: the computers of the future will have a single button, one labeled “do what I want.” The CCP wants to add caveats. Silicon Valley wants to add a different set. But both of them still want a user pressing the button to get something useful, rather than random or hostile.

Getting GPT-whatever to stop passing off Reddit jokes as medical advice is a real concern, and it’s receiving much more attention and funding than political correctness.

Indeed. Literally, everything @RandomRanger just said was correct. But connotationally, what the comment missed was that "imposing your political viewpoint" can mean "Totalitarianism" or it can mean "avoid Totalitarianism"; it can mean "refuse to make jokes that make fun of women" or it can mean "make whatever jokes the user asked for", it can mean "avoid saying various anti-CCP things" or it can mean "avoid saying how to make new bioweapons" or it can mean "say anything the user asks you to whatsoever".

The idea that we can avoid imposing any viewpoints and just get whatever falls out of intelligent absorption of training data might be true, but I wouldn't want to bet everything on "whatever falls out" being good for us.