Reddit introduces an AI-powered instrument that can detect on-line harassment

[

Reddit has launched an AI-powered security filter that can assist take away posts that comprise harassing or different objectionable content material.

Reddit explains that the “harassment filter” — quietly added to the platform's help web page final week and detected by Android Authority — makes use of a big language mannequin (LLM), which was eliminated by Reddit's inner instruments and enforcement groups. The moderator is educated on duties and content material. The instrument is meant to help the already troublesome work of Reddit moderators who’re tasked with monitoring the web communities they’re part of.

Simply final month, bloomberg The report says Reddit has signed a content material licensing cope with a serious “AI participant” that can supply the location and consumer information to coach potential AI know-how.

See additionally:

New studies hyperlink meta and 'momfluencers' in selling on-line baby exploitation

When a neighborhood and its moderators activate the filter, a brand new flag will seem within the web site's mod queue indicating content material (posts and feedback) that has been “flagged as”potential harassmentModerators can then approve or take away the content material, and report again to Reddit whether it is decided to be correct.

Forward of its inventory market debut this month, the platform has launched a lot of new options and up to date experiences in current months. Final yr, Reddit introduced the ModMail harassment filter, which acts like a “spam” folder for moderator messages containing probably abusive content material.

Learn how to arrange Reddit's harassment filter

For desktop, go to the About Group tab on the proper sidebar and choose Mod Instruments. For iOS and Android, click on the Mod Instruments button below your neighborhood banner.
Go to moderation. Click on Safety.
Choose the Harassment Filter choice and activate the toggle.
Select from low or excessive filter choices. Low filtering blocks the least quantity of content material, however is extra correct in detecting harassment. The upper filter performs a wider sweep of posts, and thus will block extra posts. In case your neighborhood encounters a “important quantity of disturbing content material,” Reddit recommends utilizing the upper choice.

Whereas Reddit says directors will proceed to mechanically take away posts that straight violate Reddit's content material coverage, Harassment filters present communities with oversight over objectionable however nonetheless “policy-compliant” content material that will slip by way of the cracks.

Learn how to arrange Reddit's harassment filter

Leave a Comment Cancel reply