A Reddit moderation tool is flagging ‘Luigi’ as potentially violent content

Reddit’s automatic moderation tool is flagging the word “Luigi” as potentially violent — even when the content isn’t. But Reddit does appear to be flagging comments that mention “Luigi” in some cases, even those unrelated to Mangione — just not in the way that it first appeared to be. The Reddit spokesperson said that because…

Read More

Mistral launches a moderation API

AI startup Mistral has launched a new API for content moderation. The API, which is the same API that powers moderation in Mistral’s Le Chat chatbot platform, can be tailored to specific applications and safety standards, Mistral says. It’s powered by a fine-tuned model (Ministral 8B) trained to classify text in a range of languages,…

Read More