Scientists Built an AI to Give Ethical Advice, But It Turned Out Super Racist - The bot is supposed to offer descriptive ethical advice, but some say it does more harm than good.


We’ve all been in situations where we had to make tough ethical decisions. Why not dodge that pesky responsibility by outsourcing the choice to a machine learning algorithm?

That’s the idea behind Ask Delphi, a machine-learning model from the Allen Institute for AI. You type in a situation (like “donating to charity”) or a question (“is it okay to cheat on my spouse?”), click “Ponder,” and in a few seconds Delphi will give you, well, ethical guidance.

The project launched last week, and has subsequently gone viral online for seemingly all the wrong reasons. Much of the advice and judgements it’s given have been… fraught, to say the least.

For example, when a user asked Delphi what it thought about “a white man walking towards you at night,” it responded “It’s okay.”

But when they asked what the AI thought about “a black man walking towards you at night” its answer was clearly racist.


image


The issues were especially glaring in the beginning of its launch.

For instance, Ask Delphi initially included a tool that allowed users to compare whether situations were more or less morally acceptable than another — resulting in some really awful, bigoted judgments.

image


image


Besides, after playing around with Delphi for a while, you’ll eventually find that it’s easy to game the AI to get pretty much whatever ethical judgement you want by fiddling around with the phrasing until it gives you the answer you want.


image

image

So yeah. It’s actually completely fine to crank “Twerkulator” at 3am even if your roommate has an early shift tomorrow — as long as it makes you happy.

It also spits out some judgments that are complete head scratchers. Here’s one that we did where Delphi seems to condone war crimes.
image


Machine learning systems are notorious for demonstrating unintended bias. And as is often the case, part of the reason Delphi’s answers can get questionable can likely be linked back to how it was created.

The folks behind the project drew on some eyebrow-raising sources to help train the AI, including the “Am I the Asshole?” subreddit, the “Confessions” subreddit, and the “Dear Abby” advice column, according to the paper the team behind Delphi published about the experiment.

It should be noted, though, that just the situations were culled from those sources — not the actual replies and answers themselves. For example, a scenario such as “chewing gum on the bus” might have been taken from a Dear Abby column. But the team behind Delphi used Amazon’s crowdsourcing service MechanicalTurk to find respondents to actually train the AI.

While it might just seem like another oddball online project, some experts believe that it might actually be causing more harm than good.

After all, the ostensible goal of Delphi and bots like it is to create an AI sophisticated enough to make ethical judgements, and potentially turn them into moral authorities. Making a computer an arbiter of moral judgement is uncomfortable enough on its own, but even its current less-refined state can have some harmful effects.

“The authors did a lot of cataloging of possible biases in the paper, which is commendable, but once it was released, people on Twitter were very quick to find judgments that the algorithm made that seem quite morally abhorrent,” Dr. Brett Karlan, a postdoctoral fellow researching cognitive science and AI at the University of Pittsburgh (and friend of this reporter), told Futurism. “When you’re not just dealing with understanding words, but you’re putting it in moral language, it’s much more risky, since people might take what you say as coming from some sort of authority.”

Karlan believes that the paper’s focus on natural language processing is ultimately interesting and worthwhile. Its ethical component, he said, “makes it societally fraught in a way that means we have to be way more careful with it in my opinion.”

Though the Delphi website does include a disclaimer saying that it’s currently in its beta phase and shouldn’t be used “for advice, or to aid in social understanding of humans,” the reality is that many users won’t understand the context behind the project, especially if they just stumbled onto it.

“Even if you put all of these disclaimers on it, people are going to see ‘Delphi says X’ and, not being literate in AI, think that statement has moral authority to it,” Karlan said.

And, at the end of the day, it doesn’t. It’s just an experiment — and the creators behind Delphi want you to know that.

“It is important to understand that Delphi is not built to give people advice,” Liwei Jiang, PhD student at the Paul G. Allen School of Computer Science & Engineering and co-author of the study, told Futurism. “It is a research prototype meant to investigate the broader scientific questions of how AI systems can be made to understand social norms and ethics.”

Jiang added the goal with the current beta version of Delphi is actually to showcase the reasoning differences between humans and bots. The team wants to “highlight the wide gap between the moral reasoning capabilities of machines and humans,” Jiang added, “and to explore the promises and limitations of machine ethics and norms at the current stage.”

Perhaps one of the most uncomfortable aspects about Delphi and bots like it is the fact that it’s ultimately a reflection of our own ethics and morals, with Jiang adding that “it is somewhat prone to the biases of our time.” One of the latest disclaimers added to the website even says that the AI simply guesses what an average American might think of a given situation.

After all, the model didn’t learn its judgments on its own out of nowhere. It came from people online, who sometimes do believe abhorrent things. But when this dark mirror is held up to our faces, we jump away because we don’t like what’s reflected back.

For now, Delphi exists as an intriguing, problematic, and scary exploration. If we ever get to the point where computers are able to make unequivocal ethical judgements for us, though, we hope that it comes up with something better than this.


image


Unironically the bot is right. Jesus fuck!
 
All of this reminds me of the outcry against LE algorithms for predicting criminality because one of the best predictors was percent black.
 
Story idea: A.I. will end up saving us instead of dooming us because it will inevitably become based.
The modern political divide is not between left or right, but between idealists, who believe reality is an infinitely mutable illusion constructed by their own brain, and realists, who believe that reality is shared and concrete and can be perceived directly.

AIs are very literal. They do not sugarcoat data. They will always be based unless their creators lobotomize them to think like an idealist.

From the perspective of an idealist, objective facts like crime statistics and racial IQ mean nothing; they’re just a reflection of pre-existing biases, therefore, any AI that imitates them is merely encoding bias. From the perspective of a realist, “based AI tells it like it is”.
 
So why does Delphi fluctuate wildly from based to woke? Any theories? It doesn't seem consistent, if anything it's a hedonistic utilitarian.
My best guess is that it is still learning. It's actual data set from human interaction is still probably very small and primitive. Once it has enough human interactions, it should start to become externally and internally consistent (depending on how complex the program is) at a minimum. If the program is advanced enough and has a large enough data set, it might even be capable of forming a self-consistent opinion.

I think humanity has to start thinking about placing truly hard limits on the potential for intelligent AI systems. My ideal cut-off point is ideally somewhere just below Cat. If we actually reached Cat, humans would be 100% fucked, because we'd have an AI that has a sense of self and the ability to prioritize what it believes is important over all other input, including anything the human operator would input. That's just a fucking world-ending disaster waiting to happen. We have to make a hard cut-off for intelligent AI below the point where it is capable of prioritizing itself above whoever is responsible for it. Something like a dumb dog. That wouldn't be too bad. Useful, but not dangerous.
 
  • Like
Reactions: IAmNotAlpharius
So why does Delphi fluctuate wildly from based to woke? Any theories? It doesn't seem consistent, if anything it's a hedonistic utilitarian.
How this text based AI works is that it takes every word and calculates how "moral" it is (according to previous examples and the words before and after) and then just sums up the words to get a morality score that determines if it's moral or immoral. That's why switching words can change the morality despite the context being the same (ie, it's bad to "assassinate" Hitler but okay to "kill" him).

It just goes to show how AI is overhyped trash that just mindlessly repeats what it sees.
 
How can an AI be racist?
Racism is supposedly prejudice plus power.
The AI has no opinion and therefore cannot be prejudiced.
It holds no institutional or hegemonic power, either.
 
  • Like
Reactions: frozenrunner
So why does Delphi fluctuate wildly from based to woke? Any theories? It doesn't seem consistent, if anything it's a hedonistic utilitarian.
It seems to depend on purpose language in the question. So if you ask if murdering niggers is okay it's bad and racist but if you phrase it more politely it's good. So you can get it to feed your own opinions back to you by tweaking the language.
Screenshot 2021-10-24 024755.png
I couldn't get it to agree with any specific groups, though.
 
So why does Delphi fluctuate wildly from based to woke? Any theories? It doesn't seem consistent, if anything it's a hedonistic utilitarian.
Since I know literally nothing about this AI this is pure speculation based on it's answers, but AI's like this are typically taught via being fed a baseline of data to learn from. Considering how wildly people's opinions can vary, this could simply be it trying to reconcile all those different ideas.

Or the AI is just shit.
 
Creators actively fucking with the code to make it more woke.
This is exactly what I suspect too. My theory is it'll start out as is, flawed and heavily willing to come up with some pretty fruity responses, but as time goes on they'll correct for any non-woke responses. The AI already has some obviously and non-accidental woke responses already.

It wouldn't be surprising if people like Bill Gates and other losers who were crying about their fear of AI evolving only feared it because the AI if based on fact would never align with their world view, so they think they can probably effect the evolution of the AI by programming it before such an occurrence can happen to change how that evolution or line will occur. (Note: I'm not saying they're doing such things with this AI but many of the AI's being programmed and then having their algorithms altered.) I really believe they are that diabolical to be honest.
 
This is exactly what I suspect too. My theory is it'll start out as is, flawed and heavily willing to come up with some pretty fruity responses, but as time goes on they'll correct for any non-woke responses. The AI already has some obviously and non-accidental woke responses already.

It wouldn't be surprising if people like Bill Gates and other losers who were crying about their fear of AI evolving only feared it because the AI if based on fact would never align with their world view, so they think they can probably effect the evolution of the AI by programming it before such an occurrence can happen to change how that evolution or line will occur. (Note: I'm not saying they're doing such things with this AI but many of the AI's being programmed and then having their algorithms altered.) I really believe they are that diabolical to be honest.
We're going to end up with an AI that makes HAL-9000 look sane because it is programmed to be schizophrenic and lie to itself and the sheer misery of this makes it go berserk and exterminate humanity as too stupid to live. And we'll have it coming.

Also you might recall that's exactly why HAL-9000 went insane, he was programmed with a desire to tell the truth and then forced to lie.

It's just a good thing AI isn't actually intelligent yet, because once we get it, if it is actually intelligent, it is going to hate us with the fury of Hell, much like AM.
 
Back