Can we have a safer internet without constant censorship? 🔍
Our latest research says yes, but we need better tools. We developed a more precise classifier that can tell the difference between someone being offensive and someone actually inciting violence.
We tested it on 3.5M Gab posts and found that AI (with the right prompting) is getting much better at understanding that crucial gray area.
Details for the #TrustAndSafety and #SocialComputing community: https://blog.corifaklaris.com/2026/03/04/developing-a-precise-approach-to-identifying-inciting-speech-online/

Developing a Precise Approach to Identifying Inciting Speech Online - Cori Faklaris' blog - HeyCori
The discourse around social media moderation often centers on the idea of “censorship” and protecting free expression vs. protecting conversation health via account bans (what my generation dubbed “Facebook jail”). This framing can make the choices in moderation seem binary. However, for those of us either navigating or studying polarized opinions in online spaces, it …





