Someone spotted a Substack account with slop content impersonating a specific "laid off WaPo journalist." The CEO of Substack's advice:

"Hit the little “x” button in the top right if you see stuff you don’t like"

https://substack.com/profile/2-chris-best/note/c-213612659 #substack #fraud #contentModeration #worstPractices

Chris Best (@cb)

Hit the little “x” button in the top right if you see stuff you don’t like Edit: this will make your own experience better, and helps the system cope with this stuff. If there is impersonation of a specific person it helps our systems catch it, and it helps counteract low quality stuff

Substack

Leading AI video systems are generating hateful and extremist content in up to 40% of problematic prompts, exposing critical safety and ethical failures. Urgent action is needed to prioritise responsible development, regulation, and community-led safeguards.
Discover more at https://dev.to/rawveg/the-video-ai-hate-problem-5h9h
#HumanInTheLoop #AIGeneration #ContentModeration #AIethics
The Video AI Hate Problem

In October 2025, researchers at the Anti-Defamation League's Centre on Technology and Society...

DEV Community

Joseph Gordon-Levitt Goes To Washington DC, Gets Section 230 Completely Backwards

You may have heard last week that actor Joseph Gordon-Levitt went to Washington DC and gave a short speech at an event put on by Senator Dick Durbin calling for the sunsetting of Section 230. It’s …

Techdirt

Ctrl-Alt-Speech: Panic! At The Discord

Ctrl-Alt-Speech is a weekly podcast about the latest news in online speech, from Mike Masnick and Everything in Moderation’s Ben Whitelaw. Subscribe now on Apple Podcasts, Overcast, Spotify, …

Techdirt

On Its 30th Birthday, Section 230 Remains The Linchpin For Users’ Speech

For thirty years, internet users have benefited from a key federal law that allows everyone to express themselves, find community, organize politically, and participate in society. Section 230…

Techdirt
India forces social media into three-hour takedown deadline: India mandates social media platforms remove unlawful content within three hours, down from 36 hours, creating compliance challenges for Meta, Google, and X. https://ppc.land/india-forces-social-media-into-three-hour-takedown-deadline/ #SocialMedia #ContentModeration #India #TechPolicy #DigitalCompliance
PPC Land

"Two independent analyses of social media content in the lead-up to the German federal election in 2025 have shown that extremist parties, in particular the right-wing Alternative for Germany (AfD), were disproportionately favored by X, TikTok, YouTube, and Instagram. A report prepared by the German nonprofit organization Bertelsmann Stiftung found that on TikTok, for example, 50% of all suggested political content was found to be AfD-related, with the mainstream conservatives a distant second at 15%. The outsized prominence of extremist content cannot be explained by the parties’ actions alone, because they all used very similar social media strategies. Another study, which has been shared by the authors via the pre-publication platform Arxiv, showed that the X algorithm disproportionately amplified content by extreme parties, especially on the extreme right. This selective amplification is particularly concerning in light of earlier research conducted by me and my team, which showed that German politicians from the extreme right and left share far more untrustworthy content on Twitter than politicians of the four mainstream parties.

A recent field experiment investigated the consequences of algorithmic amplification by re-ranking content favored by the X/Twitter algorithm that expressed antidemocratic attitudes and partisan animosity. When antidemocratic content was downranked, participants’ outgroup animosity declined compared to a control group that was exposed to the standard X/Twitter algorithm, both during the study and afterwards. Reduced exposure to antidemocratic content also reduced people’s negative emotions during the study. This is not an isolated finding but adds to existing evidence that social media causally contributes to hate crimes and xenophobia.

The DSA was designed to address such challenges."

https://www.science.org/doi/full/10.1126/science.aee9835

#SocialMedia #SocialNetworks #ContentModeration #Algorithms #AlgorithmicRecommendation #EU #DSA

When Patriotism Becomes a Loyalty Test - Dominus Owen Markham

There’s a moment in the opening ceremony of the Milan-Cortina Winter Games that keeps rattling around my head. The American team walks in, flag raised, and when Trump appears on the big screens…

Dominus Owen Markham

"The old way of thinking about how to make #socialplatforms safer was that you had to make them do more #contentmoderation.
But by the mid-2020s, almost everyone knew both adults & children who struggled to regulate their usage of apps and suffered as a result.
Regulators & plaintiffs’ attorneys began new investigations into whether a #socialapp might be held liable not for what people said on it, but rather how it worked.
Increasingly, it appears they will."
#Section230
https://www.platformer.news/social-media-addiction-trial-eu-tiktok-investigation/
Why the infinite-scroll childhood may be coming to an end

A lawsuit that begins in LA this week, along with a new investigation into TikTok by the European Commission, could change social apps forever. PLUS: AI ads at the Super Bowl and in ChatGPT

Platformer

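Summary of OpenAI's Moderation API: it is free to use and supports the newer omni-moderation-latest model (multimodal: text and images) as well as the older text-moderation-latest model (text only). Alongside usage examples, the guide shows that the response returns flagged, categories, category_scores, category_applied_input_types, and related fields, indicating which inputs may violate which categories. Some categories are text-only, and scores may need to be recalibrated when the model is upgraded.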

https://developers.openai.com/api/docs/guides/moderation

#moderation #safety #openai #omnimoderation #contentmoderation

Moderation | OpenAI API

Learn how to use OpenAI's moderation endpoint to identify harmful content in text and images.
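
For anyone wondering what reading such a response might look like, here is a minimal Python sketch. It only parses a payload; it does not call the API. The field names (flagged, categories, category_scores, category_applied_input_types) come from the summary above, but the sample payload, its scores, and the `summarize_result` helper are assumptions made up for illustration, not real API output or an official client.

```python
from typing import Any, Dict, List, Optional, Tuple

# Hypothetical payload shaped like the fields named in the summary above.
# The scores are invented for illustration; they are not real API output.
SAMPLE_RESPONSE: Dict[str, Any] = {
    "model": "omni-moderation-latest",
    "results": [
        {
            "flagged": True,
            "categories": {"harassment": False, "violence": True},
            "category_scores": {"harassment": 0.12, "violence": 0.91},
            "category_applied_input_types": {
                "harassment": ["text"],
                "violence": ["text", "image"],
            },
        }
    ],
}


def summarize_result(
    result: Dict[str, Any], threshold: Optional[float] = None
) -> List[Tuple[str, float, List[str]]]:
    """Return (category, score, input_types) rows, highest score first.

    With threshold=None, a category counts as hit when its boolean in
    `categories` is true. With a numeric threshold, the boolean is ignored
    and `category_scores` is compared against the caller's own cutoff.
    """
    applied = result.get("category_applied_input_types", {})
    rows = []
    for category, score in result["category_scores"].items():
        if threshold is None:
            hit = bool(result["categories"].get(category, False))
        else:
            hit = score >= threshold
        if hit:
            rows.append((category, score, list(applied.get(category, []))))
    return sorted(rows, key=lambda row: row[1], reverse=True)


first = SAMPLE_RESPONSE["results"][0]
print(first["flagged"])
print(summarize_result(first))
print(summarize_result(first, threshold=0.1))
```

The optional `threshold` argument is one way to handle the recalibration caveat: a cutoff tuned against one model version is a number you own, so it is worth re-checking whenever the underlying model changes.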