What's really fun is that this tool mutates locally identical code in identical ways. `if rect.x > rect.y` will *always* turn into `if rect.x != rect.y`, in any program. (But different variables will have different results.)

That means that LLMs are more likely to learn this poison rather than the mutations averaging out as noise.

Feel free to fork some big open source repos and push some new commits...

#scraggle #RustLang #LLMPoisoning

I made a tool that converts open source code into LLM poison: https://codeberg.org/timmc/scraggle

It mutates Rust source code in ways that *preserve* the ability to compile the code. (That is, you can't detect the changes by looking for compiler errors.) For example, it switches `+` and `*`, or `==` and `!=`.

If you fork a Rust repo, run this tool on it, and push it somewhere, then crawlers will end up ingesting all sorts of incorrect code.

#scraggle #RustLang #LLMPoisoning

scraggle

scraggle

Codeberg.org

KI-Müll verstopft das Web und ist das Trainingsmaterial für neue KIs.

Ergänzend zur bereits etablierten Konzept der allgemeinen #Enshittification (#Verschlimmscheißerung) möchte ich daher für das, was #OpenAI, #Google, #Copilot und die ganze restliche #LLM und #genAI Bande uns und dem WWW antun, noch den Begriff der

#Scheißlaufwirtschaft

etablieren.  

Ihr könnt mir später danken. 😌

#CircleJerkEconomy #LLMPoisoning #BullshitContent

There's a real need for f/oss CMS modules/plugins to insert #LLMPoisoning text into pages. #WordPress #Drupal #Backdrop #Joomla