Alexander Loth

@xlth
24 Followers
23 Following
114 Posts
Researcher exploring how generative AI reshapes disinformation & public trust. Building JudgeGPT · Author of books on data visualization & AI · iOS developer (Trackless Links, Mindful Coffee) · Views my own.
🌐 Websitehttps://alexloth.com
🎓 Google Scholarhttps://scholar.google.com/citations?user=ofZZ8LgAAAAJ
📚 Bookshttps://alexloth.com/books
🔗 Linkshttps://linktr.ee/xlth

Can we steer visual representations like we prompt LLMs? This paper shows how to inject text into vision encoders via early fusion, creating steerable features that stay strong for core vision tasks while focusing on any concept you ask for.

Read the full paper: http://arxiv.org/abs/2604.02327v1

Most fake news generators still spit out text you can’t reproduce or compare.
RogueGPT ships a controlled pipeline - multi-model, multilingual, style-locked news with full provenance - beyond what GROVER or FACTGEN allowed.
If we can generate disinfo this precisely, how should evaluation tools change?

Full paper here: https://github.com/aloth/RogueGPT

@rwitherspoon Thanks for the tip! Will check it out 👀

#introduction

I'm Alexander Loth -- researcher & author in Frankfurt.

PhD: how generative AI reshapes disinformation & trust.

Tools I build:
🔬 JudgeGPT -- humans detect AI news at 52% (coin flip)
🤖 RogueGPT -- controlled misinfo stimuli
🔍 Origin Lens -- on-device C2PA verification
📊 CRED-1 -- domain credibility dataset

Also: books on data viz & AI, iOS apps.

https://scholar.google.com/citations?user=ofZZ8LgAAAAJ
https://github.com/aloth

#AcademicMastodon #GenAI #Disinformation #C2PA #WebScience #AIethics

Alexander Loth

Microsoft AI for Good Research Lab - 171-mal zitiert - Generative AI - Information Integrity - Computational Social Science - Human-AI Interaction - AI Ethics

@GroupNebula563 thank you!!!

People think they can spot AI-written news. Turns out they mostly can’t.
In a large human study, GPT-4 news was judged about as authentic as real journalism, with accuracy hovering near chance.
If readers can’t tell, what happens to trust when anyone can publish at scale?

Full paper here: https://github.com/aloth/JudgeGPT

We still verify images by squinting at pixels and vibes.
Origin Lens does on-device cryptographic C2PA verification, showing who signed an image and if it was altered.
When trust is math instead of guesswork, which would you rely on?

Try it out on iOS: https://apps.apple.com/us/app/origin-lens/id6756628121

Really interesting tool! The challenge of bridging scientific verification with public accessibility is underrated. I've been working on something adjacent — Origin Lens, which focuses on C2PA/content provenance for images. Complementary angles on the same trust problem.

PsyPost: New research suggests truth has a natural competitive edge over misinformation. “Truthful messages are more persuasive and more likely to be shared than false ones, according to new research published in the Journal of Personality and Social Psychology. The findings, drawn from four large experiments, challenge the widespread belief that misinformation naturally spreads more […]

https://rbfirehose.com/2026/04/02/psypost-new-research-suggests-truth-has-a-natural-competitive-edge-over-misinformation/
PsyPost: New research suggests truth has a natural competitive edge over misinformation

PsyPost: New research suggests truth has a natural competitive edge over misinformation. “Truthful messages are more persuasive and more likely to be shared than false ones, according to new …

ResearchBuzz: Firehose

It's International Fact-Checking Day.

Since 2015, EUvsDisinfo has documented nearly 20,000 cases exposing the Kremlin’s lies and information manipulation.

The world’s largest publicly available.database of pro-Kremlin disinformation: https://euvsdisinfo.eu/disinformation-cases/

#EUvsDisinfo #EU #Russia #Europe #Kremlin #Ukraine #factchecking

Database - EUvsDisinfo

EUvsDisinfo database – the only searchable, open-source repository of its kind, updated weekly with most recent samples of pro-Kremlin disinformation.

EUvsDisinfo