Instagram uses AI to warn users before they post offensive captions

Instagram is launching a feature designed to make users think twice before posting videos or photos with offensive captions.

Moving forward, whenever someone tries to post a caption that Instagram detects as being potentially offensive, the user will see an alert asking them to reconsider their words. The alert system is entirely automated, with Instagram leaning on data garnered from previous bullying reports that contain similarly worded captions.
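Instagram has not published how its detection works, so the following is only a rough illustration of the general idea: compare a new caption against previously reported ones and prompt the user when the wording is close. Everything here is an assumption made for the example, including the `REPORTED_CAPTIONS` list, the 0.5 threshold, and the word-overlap (Jaccard) measure, which stands in for whatever model Instagram actually uses.

```python
import re

# Hypothetical stand-in for captions taken from past bullying reports.
REPORTED_CAPTIONS = [
    "you are so ugly and stupid",
    "nobody likes you loser",
]

def tokens(text):
    """Lowercase the text and return its set of word tokens."""
    return set(re.findall(r"[a-z']+", text.lower()))

def similarity(a, b):
    """Jaccard similarity between the word sets of two strings."""
    ta, tb = tokens(a), tokens(b)
    if not ta or not tb:
        return 0.0
    return len(ta & tb) / len(ta | tb)

def should_warn(caption, threshold=0.5):
    """True if the caption closely resembles any reported caption."""
    return any(similarity(caption, r) >= threshold for r in REPORTED_CAPTIONS)

def post_caption(caption, confirm):
    """Show a warning only for flagged captions; the user can still post.

    `confirm` is a callable returning True if the user chooses to post anyway.
    """
    if should_warn(caption) and not confirm():
        return "edited"
    return "posted"
```

As in the feature described, the check never blocks a post outright: a flagged caption still goes through if the user confirms it.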

This is essentially an extension of a similar feature Instagram rolled out back in July aimed at the comment section — again, Instagram uses AI to detect language, warn the user, and ultimately “encourage positive interactions.”

Above: Abuse alert for captions in Instagram

Of course, there is nothing stopping the user from going against the advice and posting an abusive comment or caption anyway, but the idea here is that a gentle prompt could be all that’s needed to remove at least some abuse from Facebook’s mega-popular photo- and video-sharing platform.

“As part of our long-term commitment to lead the fight against online bullying, we’ve developed and tested AI that can recognize different forms of bullying on Instagram,” Instagram wrote in a blog post. “Earlier this year, we launched a feature that notifies people when their comments may be considered offensive before they’re posted. Results have been promising, and we’ve found that these types of nudges can encourage people to reconsider their words when given a chance.”

Online abuse and bullying are perennial problems faced by most social platforms, and studies have shown that people under the age of 25 who are subjected to cyberbullying are more than twice as likely to self-harm or attempt suicide.

Monitoring the comments and captions of billions of users is a near-impossible challenge for humans alone, which is why all the major platforms have increasingly turned to automated tools — Twitter recently reported that it now proactively removes half of all abusive tweets without anyone reporting them first.

Instagram’s latest bullying alert will be landing first in “select countries,” before arriving in global markets in the months that follow.


Author: Paul Sawers
Source: Venturebeat
