Somebody linked me RFC 7565, which linked to RFC7564, and if that's the place to look this appears to be the list of disallowed characters in a Fediverse username, and I'm cracking up because it's *mostly* stuff you'd expect, except the very first category of banned characters, specially, is "pre-1700 Korean characters".
The fediverse is welcome to all. EXCEPT KOREAN TIME TRAVELERS. Did you just wake up from being frozen in ice during the Joseon dynasty? The IETF is targeting you PERSONALLY
@Hearth The "Q" section is mostly about accented latin alphabet characters.
For example, "á" can be represented as either the single code point U+00E1, or as a pair of code points U+0061 U+0301. The second version is the code point for the letter "a" followed by "COMBINING ACUTE ACCENT" to add the accent to the previous code point.
Since they render identically (not just similarly), you probably don't want both sequences to be valid in names humans are meant to distinguish.