I asked ChatGPT for help.
I love that the first sentence is defensive. So like us. "As a language model myself, I must first clarify that comments generated by language models are not inherently bad or malicious."
@egallager @wesnoth I don't know why I was so surprised by it. It was inevitable. We've had existing users post ChatGPT answers, labeled as such. And, I asked them to stop doing that and they did. But, these are new users and it seems to be a business model of some sort.
Yet another thing to waste my time each day, making being involved in anything at all on the open internet a constant source of misery. I already gave up on wikis and mailing lists years ago.
Exponential Bullshit at the Speed of Light
@craige no, the grammar was excellent.
Jokes aside, I think that's one of the reasons it's so dangerous. Humans are suckers for a confident liar (it's "con man" for a reason), and ChatGPT is so confident and so authoritative sounding. It's exactly the kind of answer we want, but fictional. If a dumb human makes a suggestion, there are probably cues that allow us to judge their value. Many of those cues are distorted or gone. I can mostly detect LLMs for now...but soon, I dunno.
@DanielMicay @swelljoe @dalias I had to turn off self-post-edit for low-utilization users on Maker Forums because of so many low-or-zero-value posts being edited into spam later. They would come back after 1-3 months to edit low-value posts into spam links. Ironically, because they were low-trust users, those links would be rendered as rel="nofollow" to explicitly deny SEO-juice from the links, but that didn't seem to matter to them.
I used other tools to identify likely spammers and audited a lot of posts to confirm that they were doing this before tightening controls.
I explicitly don't close down old threads because there's so much legitimate usage there; helpful updates on "how well this worked" after two years and such. But I could imagine auto-moderating necro-posting for low-trust users being helpful here.
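A minimal sketch of what such a necro-posting rule might look like. The threshold values, the numeric trust-level scale, and the function name are assumptions for illustration; this is not Discourse's actual API or settings.

```python
from datetime import datetime, timedelta

# Assumed thresholds, chosen only for illustration.
NECRO_AGE = timedelta(days=180)   # a thread counts as "old" after this much inactivity
LOW_TRUST_MAX = 1                 # trust levels at or below this count as low-trust


def should_hold_for_review(trust_level: int, last_activity: datetime, now: datetime) -> bool:
    """Return True when a low-trust user replies to a long-inactive thread.

    Established users can still post follow-ups on old threads freely;
    only low-trust necro-posts are queued for a moderator to look at.
    """
    is_low_trust = trust_level <= LOW_TRUST_MAX
    is_necro = (now - last_activity) > NECRO_AGE
    return is_low_trust and is_necro
```

Holding the post for review rather than rejecting it keeps the legitimate "how well this worked after two years" updates possible for everyone.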
@DanielMicay @swelljoe @dalias TIL.
Fortunately, I haven't ignored those spam links; they are live for a few hours at most. But now I know that they may be evil but perhaps are less stupid than I thought. ☹
@dalias @DanielMicay @swelljoe Their API takes the whole email address and IP address and returns an answer. Wrapping it in a separate service wouldn't make a difference. You can also download complete lists up to twice per day if you want to do the checking locally.
It would be interesting to have a service based on one-way hashes of bits of data that returns all possible matches for those hashes, so that decisions can be made locally, while still reporting new spammers to contribute actual matches to the database.
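Roughly what I have in mind, as a sketch only: the client sends a short prefix of a hash, gets back every full hash sharing that prefix, and does the final comparison locally. The class below is a local stand-in for such a service, which does not exist as far as I know; the prefix length, the choice of SHA-256, and all names are assumptions.

```python
import hashlib

PREFIX_LEN = 5  # hex characters of the hash revealed to the service (assumed value)


def digest(value: str) -> str:
    """One-way hash of a normalized email or IP address."""
    return hashlib.sha256(value.strip().lower().encode("utf-8")).hexdigest()


class HashPrefixIndex:
    """Stand-in for the hypothetical service: maps hash prefixes to full hashes."""

    def __init__(self, known_spammer_values):
        self._buckets = {}
        for value in known_spammer_values:
            h = digest(value)
            self._buckets.setdefault(h[:PREFIX_LEN], set()).add(h)

    def candidates(self, prefix: str) -> set:
        """Return every stored hash that starts with the given prefix."""
        return set(self._buckets.get(prefix, set()))


def is_known_spammer(value: str, index: HashPrefixIndex) -> bool:
    """Query by prefix only, then decide locally against the full hash."""
    h = digest(value)
    return h in index.candidates(h[:PREFIX_LEN])
```

The service only ever sees the prefix, so it cannot tell which of the many addresses sharing that prefix was being checked.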
They don't list a plug-in for discourse, so I'd end up using it manually anyway. 🤷
Overview The Stop Forum Spam plugin (unofficial) can help weed out human spammers who are able to bypass Discourse's built-in spam tools (thanks to their awesome human powers). Right after a new user signs up on your forum (before they have time to post), this plugin will check the user's email address, forum username, and/or IP address (depending on your plugin settings) against the Stop Forum Spam database. If the user is found in this database of known spammers, their user account will be im...
@dalias @mcdanlj @swelljoe Would need to write scripting to download their database, parse it and turn it into an SQLite database and then make a small web service out of it. It would be quite straightforward but I don't really want to spend a day on this.
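The parse-and-load part could look roughly like this. The dump format used here (`kind,value` CSV rows) is an assumption for illustration and not the real Stop Forum Spam file layout; the table schema and function names are hypothetical too. The download step and the web service wrapper are left out.

```python
import csv
import io
import sqlite3


def build_db(dump_text: str, conn: sqlite3.Connection) -> int:
    """Load an assumed 'kind,value' CSV dump into SQLite; return rows parsed."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS spammers ("
        " kind TEXT NOT NULL,"
        " value TEXT NOT NULL,"
        " PRIMARY KEY (kind, value))"
    )
    rows = [
        (row[0].strip(), row[1].strip().lower())
        for row in csv.reader(io.StringIO(dump_text))
        if len(row) == 2  # skip blank or malformed lines
    ]
    # INSERT OR IGNORE makes re-importing a fresh dump idempotent.
    conn.executemany("INSERT OR IGNORE INTO spammers VALUES (?, ?)", rows)
    conn.commit()
    return len(rows)


def is_listed(conn: sqlite3.Connection, kind: str, value: str) -> bool:
    """Check locally whether an email or IP address is in the imported list."""
    cur = conn.execute(
        "SELECT 1 FROM spammers WHERE kind = ? AND value = ? LIMIT 1",
        (kind, value.strip().lower()),
    )
    return cur.fetchone() is not None
```

With the composite primary key, lookups are an index hit, so a thin HTTP layer on top would have very little to do.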
They reuse the same IP addresses and emails for months or even years across many forums. They mainly seem to use gmail addresses. It seems they index all the forums running each forum software and spam them systematically, with only part of the process automated.
gotta fight it!
One thing I have been thinking is that the non-goo areas are going to become havens for sanity and real human contact, just as fast... and we're going to get very good at finding and protecting them.
@swelljoe Just wait for the next generation “AI” trained with the bullshit produced by the current generation.
(“AI” is the most inflated term ever. There's zero intelligence in what is currently being done, just massive-scale nonlinear regression. Garbage in, garbage out.)