@fishidwardrobe @0xabad1dea Most of the patterns under the "Language and tone" section are actually good writing practices in the right context, such as writing a college essay, a news article, or a travel brochure, but an important part of writing is knowing what you're writing for, and adapting your language to that format. Most of the "Language and tone" category are language patterns that would be inappropriate in an encyclopedia format (such as Wikipedia), but may be perfectly fine elsewhere.
I'd bet a lot of these "tells" for AI-generated text probably also detect a lot of plagiarised edits, where something is just copied directly from an article, travel brochure, etc. instead of rephrasing it in objective language for an encyclopedia. I guess an argument could be made that the generative AI is also just plagiarism with extra steps.
I'm relieved in ways I can't express that I'm out education entirely because based on attitudes I encountered I have total confidence that I'd encounter so many people with no revulsion to just submitting slop as assignments.
I've high confidence that many lecturers would not give a damn about false positive accusations of slop submissions to students, to the point of lecturers handing responsibiliy of detecting slop to an LLM, & feeding student work to that LLM.
@0xabad1dea #TIL that I'm just an LLM bot
I'm using some Russian #typography rules which differs from the same English rules and, sadly , the LLMs tends to use same rules when they comes to the dashes.
In RU typography en-dash used to divide numbers and it doesn't have spaces on the left and right. Like this: 123–456–789.
And the em-dash used to divide parts of sentence and it should have space on the left and right — like this
@evgandr
@0xabad1dea Well… it is on the English Wikipedia referring to English text 😉
But on RU keyboards, you use the same Unicode codepoints, right? So U+002D for everything, right?
@tajpulo @0xabad1dea Yep, it was just funny to treat myself like a bot 🤖 beep-boop🙂. And also I wanted to write a bit about ru-typography
> you use the same Unicode codepoints
Yes, most of the people just use the same codepoint (-). But for people who want to use the typography symbols properly there are a Birman's layout (https://ilyabirman.net/typography-layout/) or Compose key in the X-server-based systems
@0xabad1dea Isn't this basically just another "Who's adapting faster" situation?
The more detailed the list becomes, the easier it is to simply adapt AI generated content to avoid these things, eventually making it harder and harder to tell if something is written by AI.
@0xabad1dea The average say social media comment and the like definitely wouldn't be changed because of this, but I'd argue for example news sites could definitely come up with "refining" steps fairly easily.
The potential gain in viewers from not looking like AI compared to the flood of obvious AI spam enough would justify a few extra steps already.