UWaterloo researchers develop new method to detect hate speech on social media

By Kevin Nielsen Global News

Posted May 30, 2024 4:27 pm

2 min read

If you get Global News from Instagram or Facebook - that will be changing. Find out how you can still connect with us.

User typing on computer keyboard. Getty Images

The University of Waterloo says a group of researchers at the school have developed a program which will detect hate speech on social media platforms.

The school says that Multi-Modal Discussion Transformer (mDT), which is currently working at an 88 per cent accuracy rate, will make life easier for those who are tasked with flagging hate speech.

“We really hope this technology can help reduce the emotional cost of having humans sift through hate speech manually,” said Liam Hebert, a Waterloo computer science PhD student and the first author of the study.

“We believe that by taking a community-centred approach in our applications of AI, we can help create safer online spaces for all.”

Get daily National news

Get daily Canada news delivered to your inbox so you'll never miss the day's top stories.

By providing your email address, you have read and agree to Global News' Terms and Conditions and Privacy Policy.

The school says the mDT can understand the relationship between text and images while also reasoning the greater context surrounding comments.

Story continues below advertisement

The program also reduces the number of false positives as it can deduce comments which have been incorrectly flagged as hate speech because they contain culturally sensitive language.

The school says that understanding allows mDT to be much more accurate than previous models which had not been able to understand some nuances of language.

“Context is very important when understanding hate speech,” Hebert explained.

“For example, the comment ‘That’s gross!’ might be innocuous by itself, but its meaning changes dramatically if it’s in response to a photo of pizza with pineapple versus a person from a marginalized group.

“Understanding that distinction is easy for humans, but training a model to understand the contextual connections in a discussion, including considering the images and other multimedia elements within them, is a very hard problem.”

Hebert and his team trained their model which looked used a dataset consisting of isolated on hate speech as well as the context for the comments.

The school says they used on 8,266 Reddit discussions with 18,359 labelled comments from 850 communities in the training.

“More than three billion people use social media every day,” Hebert said. “The impact of these social media platforms has reached unprecedented levels. There’s a huge need to detect hate speech on a large scale to build spaces where everyone is respected and safe.”

Stick to the Facts

Add Global News as a Preferred Source on Google to see more of our stories in your search results.

Journalistic standards Report an error

Menu

Topics

TV Programs

Connect

Local

your local region

Share

Calendar

Calendar

Search

Quick Search

Add Global News to Home Screen

Comments