Balancing transparency in AI content moderation is a complex challenge: explaining decisions helps accountability but can aid bad actors. As regulations tighten, platforms must find ways to explain at scale without compromising security.
Discover more at https://smarterarticles.co.uk/the-transparency-trap-when-explaining-ai-moderation-helps-bad-actors?pk_campaign=rss-feed
#HumanInTheLoop #AIinContentModeration #RegulatoryTech #TrustAndSafety
The Transparency Trap: When Explaining AI Moderation Helps Bad Actors

Every second, an unfathomable volume of content floods the world's largest social media platforms. TikTok videos, Instagram Reels, YouT...

SmarterArticles

We are still hiring engineers, with a bit more well-scoped job descriptions now. Come work with us on open source trust and safety tools. 👀

We are a global, remote organization so let us know if you’re interested and have relevant experience!

https://roost.tools/infrastructure-engineer/

#OpenSource #TrustAndSafety #getFediHired

Infrastructure Engineer | ROOST Careers

Join ROOST as an Infrastructure Engineer to build safety infrastructure for the AI era.

Ctrl-Alt-Speech: For Meta Or Worse

Ctrl-Alt-Speech is a weekly podcast about the latest news in online speech, from Mike Masnick and Everything in Moderation’s Ben Whitelaw. Subscribe now on Apple Podcasts, Overcast, Spotify, …

Techdirt

Domain Bugs Cost More Than Code Bugs

Domain-Driven Design, ubiquitous language, and bounded contexts matter because product teams ship the wrong workflow when legal, product, and engineering mean different things by the same word.

🔗 Read more at ryanw.eu

In addition to our two flagship open source projects, we maintain a simple list of open source safety tools:

https://github.com/roostorg/awesome-safety-tools

If you're building an online platform or community, you don't have to reinvent everything from scratch; lean on what has come before.

Tell us if you discover a new or interesting tool from this list, or if you think there's something missing!

#OpenSource #TrustAndSafety #ROOST

GitHub - roostorg/awesome-safety-tools: Directory of open source tools for online safety

Directory of open source tools for online safety. Contribute to roostorg/awesome-safety-tools development by creating an account on GitHub.

GitHub

RE: https://mastodon.blaede.family/@cassidy/116288502597156855

In the (admittedly unlikely) event you happen to be following us here on the fediverse but are also attending ATmosphereConf, be sure to find our ROOSTers @cassidy and @julietshen! They're giving a talk about Coop, our open source trust moderation and review dashboard, and will also be attending other talks and hanging out with folks to help spread the word about open source trust & safety tools.

Even if you're not attending, they can be your line of communication between the fediverse and ATproto. What should they talk to ATproto folks about? We're all ears!

#OpenSource #TrustAndSafety #ATproto #Bluesky

@strypey I nearly joined #Chatroulette, but then Covid and the Ukraine war started, so that never went anywhere. I had a different direction I wanted to take them, though.

#Bumble #TrustAndSafety ? Well, the industry isn't exactly full of dating site whistleblowers, but safety workers in the industry do talk to me. I know more than I would ever want to know about the inside workings of most dating sites, and sadly. It's not really an amazing revelation that Bumble is as bad as the rest, though.

None of the dating sites puts more than a veneer of effort into T&S. They weigh the cost of litigation against the cost of actually doing the work, and so far, litigation costs have always been lower.

Ctrl-Alt-Speech: Money For Nothing And Clicks For A Fee

Ctrl-Alt-Speech is a weekly podcast about the latest news in online speech, from Mike Masnick and Everything in Moderation’s Ben Whitelaw. Subscribe now on Apple Podcasts, Overcast, Spotify, …

Techdirt

We released Coop v0 a month ago, and we have heard a lot of feedback about what would make it easier to use, explore, adopt, contribute to, etc. Check out our Coop simplification plan; we welcome any and all ideas and suggestions!

https://github.com/roostorg/coop/discussions/123

#OpenSource #TrustAndSafety #OnlineSafety

Coop Code Simplification Plan · roostorg coop · Discussion #123

Since Coop v0 was released, community members have raised consistent feedback multiple times on the architecture and complexity of the codebase for what the project is: Deployment complexity Coop r...

GitHub

Although the DMCA and DSA are important tools, this report shows that they are not immune to misuse—particularly as bad actors increasingly weaponize AI to exploit them.

https://transparency.automattic.com/2026/02/23/transparency-report-update-july-december-2025/ #TrustAndSafety #transparency #reports

Transparency Report Update: July – December 2025

The 25th edition of our biannual transparency report, covering the period from July through December 2025 is now available. The work of Automattic’s Trust & Safety team is grounded in key princ…

Transparency Report