Advertisement

UWaterloo researchers develop new method to detect hate speech on social media

User typing on computer keyboard. Getty Images

The University of Waterloo says a group of researchers at the school have developed a program which will detect hate speech on social media platforms.

The school says that Multi-Modal Discussion Transformer (mDT), which is currently working at an 88 per cent accuracy rate, will make life easier for those who are tasked with flagging hate speech.

“We really hope this technology can help reduce the emotional cost of having humans sift through hate speech manually,” said Liam Hebert, a Waterloo computer science PhD student and the first author of the study.

“We believe that by taking a community-centred approach in our applications of AI, we can help create safer online spaces for all.”

The school says the mDT can understand the relationship between text and images while also reasoning the greater context surrounding comments.

Story continues below advertisement

The program also reduces the number of false positives as it can deduce comments which have been incorrectly flagged as hate speech because they contain culturally sensitive language.

The school says that understanding allows mDT to be much more accurate than previous models which had not been able to understand some nuances of language.

“Context is very important when understanding hate speech,” Hebert explained.

“For example, the comment ‘That’s gross!’ might be innocuous by itself, but its meaning changes dramatically if it’s in response to a photo of pizza with pineapple versus a person from a marginalized group.

“Understanding that distinction is easy for humans, but training a model to understand the contextual connections in a discussion, including considering the images and other multimedia elements within them, is a very hard problem.”

Hebert and his team trained their model which looked used a dataset consisting of isolated on hate speech as well as the context for the comments.

The school says they used on 8,266 Reddit discussions with 18,359 labelled comments from 850 communities in the training.

“More than three billion people use social media every day,” Hebert said. “The impact of these social media platforms has reached unprecedented levels. There’s a huge need to detect hate speech on a large scale to build spaces where everyone is respected and safe.”

Advertisement

Sponsored content

AdChoices