• Home
  • Latest
  • Fortune 500
  • Finance
  • Tech
  • Leadership
  • Lifestyle
  • Rankings
  • Multimedia
TechThe Mobile Executive

A New Kind of AI Spots 90% of Online Abuse

By
David Z. Morris
David Z. Morris
Down Arrow Button Icon
By
David Z. Morris
David Z. Morris
Down Arrow Button Icon
July 30, 2016, 3:04 PM ET
105208312
Troll Road Sign, Trollstigen (The Troll Path)Photograph by Douglas Pearson — Getty Images

Researchers at Yahoo (yes, for the moment, it’s still Yahoo) have unveiled an algorithm that uses machine learning and natural language processing to detect online abuse and hate speech. Abusive behavior online has been in the limelight lately, both because it’s so inherently vile, and because it could alienate users of platforms like Twitter (TWTR) and Yahoo (YHOO), arguably threatening their bottom line, or even the entire digital economy.

Most such platforms use a combination of user reporting, keyword filtering, and monitoring by legions of trained humans to detect and block trolls and harassers. But filters are easy to work around through creative spelling (the example “kill yrslef a$$hole” pops up early in the researchers’ report).

Get Data Sheet, Fortune’s technology newsletter.

Slurs and insults also shift rapidly, making blacklists ineffective, while some more subtle abuse can be expressed without any single objectionable word. All of that – plus the likelihood of false positives from sarcastic or satirical posts—makes the problem a thorny one for artificial intelligence.

The Yahoo researchers set their AI to evaluate a set of messages already flagged as abusive for common traits. The comment dataset came from Yahoo! Finance and News, which you wouldn’t think of as exactly the dank basement of the internet—but it turns out a whopping 7% of comments on Finance and 16.4% on News were deemed abusive by human screeners.

The program trained itself by scanning those comments for specific sequences of characters, which helped it catch non-standard spellings of offensive words. The processor also tracked linguistic features like comment length, use of capital letters, and punctuation style. It could even parse so-called “dependencies” to find complex phrases that added up to abuse.

The program was then tested by comparing its judgment to the majority opinion of human screeners. At its best, researchers found that their model was more accurate than prior models by a substantial margin, matching human judgment in as many as 90% of its classifications.

For more on the problem of online abuse, watch our video.

What’s most interesting about the results is that the model was most effective when its ‘training’ was updated with new data over time, indicating how fluid online abuse is. In fact, while larger data sets produced better results, even using a much smaller but more recent comment database led to fairly accurate results, which could be an important finding from an efficiency perspective.

The researchers have said they will soon make their datasets available through Yahoo’s Webscope program. However, that database is explicitly available for use only by non-commercial researchers—which means this work may wind up being a part of Yahoo that’s actually worth something to its new owners.

About the Author
By David Z. Morris
See full bioRight Arrow Button Icon

Latest in Tech

Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025
Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025
Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025
Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025
Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025
Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025

Most Popular

Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025
Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025
Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025
Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025
Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025
Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025
Fortune Secondary Logo
Rankings
  • 100 Best Companies
  • Fortune 500
  • Global 500
  • Fortune 500 Europe
  • Most Powerful Women
  • Future 50
  • World’s Most Admired Companies
  • See All Rankings
Sections
  • Finance
  • Fortune Crypto
  • Features
  • Leadership
  • Health
  • Commentary
  • Success
  • Retail
  • Mpw
  • Tech
  • Lifestyle
  • CEO Initiative
  • Asia
  • Politics
  • Conferences
  • Europe
  • Newsletters
  • Personal Finance
  • Environment
  • Magazine
  • Education
Customer Support
  • Frequently Asked Questions
  • Customer Service Portal
  • Privacy Policy
  • Terms Of Use
  • Single Issues For Purchase
  • International Print
Commercial Services
  • Advertising
  • Fortune Brand Studio
  • Fortune Analytics
  • Fortune Conferences
  • Business Development
  • Group Subscriptions
About Us
  • About Us
  • Editorial Calendar
  • Press Center
  • Work At Fortune
  • Diversity And Inclusion
  • Terms And Conditions
  • Site Map
  • About Us
  • Editorial Calendar
  • Press Center
  • Work At Fortune
  • Diversity And Inclusion
  • Terms And Conditions
  • Site Map
  • Facebook icon
  • Twitter icon
  • LinkedIn icon
  • Instagram icon
  • Pinterest icon

Latest in Tech

Stressed out job seeker on laptop
Successjob hunting
Job seekers aren’t imagining things: the number of candidates ghosted by employers just reached a three-year high thanks to AI
By Emma BurleighMarch 20, 2026
3 hours ago
SuccessCareers
AI boom is fueling demand for skilled trades—and demand for technicians, HVAC workers, and electricians is soaring, with six-figure salaries to match
By Preston ForeMarch 20, 2026
3 hours ago
LawX
Three Tennessee teenagers are suing Elon Musk’s xAI for creating sexually explicit images of them
By The Associated Press and Travis LollerMarch 20, 2026
4 hours ago
Trump standing waving hi at a crowd
AIDonald Trump
The White House has a plan for AI regulation, and it starts with keeping states out of it
By The Associated Press and Seung Min KimMarch 20, 2026
5 hours ago
london
Commentaryinvestment banking
The 19th century banking problem that AI hasn’t solved yet
By Silvio Savarese and Sabastian NilesMarch 20, 2026
6 hours ago
spreng
CommentaryVenture Capital
Unicorns are flush with cash and stuck. A new kind of startup crisis is taking hold in 2026
By David SprengMarch 20, 2026
6 hours ago

© 2026 Fortune Media IP Limited. All Rights Reserved. Use of this site constitutes acceptance of our Terms of Use and Privacy Policy | CA Notice at Collection and Privacy Notice | Do Not Sell/Share My Personal Information
FORTUNE is a trademark of Fortune Media IP Limited, registered in the U.S. and other countries. FORTUNE may receive compensation for some links to products and services on this website. Offers may be subject to change without notice.