
@0xabad1dea I don’t know if there are photos online of the CJK-JRG 1991 Tokyo meeting. The earliest IRG meeting with group photos on https://www.unicode.org/irg/meetings.html is Macao 2002. So « what actually happened » might look more like this, although that photo is still ten years later:


Robin Leroy Feb 28

@0xabad1dea A lot of the CJK work is on the ISO side, see https://www.unicode.org/irg/.

On the UTC side, see the relevant WG https://www.unicode.org/consortium/cjkunihan.html.

On the history see https://www.unicode.org/versions/Unicode17.0.0/core-spec/appendix-e/.

Henry Chan (now IRG ORT manager) had an interesting thread on Twitter on the necessity of unification, see https://web.archive.org/web/20220115002546/https://twitter.com/FakeUnicode/status/1455676926568271873. See also https://www.unicode.org/notes/tn26/.

Ideographic Research Group


Robin Leroy Feb 21

@mcc @manishearth says it’s a typo.


Robin Leroy Feb 21

@mcc Yeah, two is just wrong for all versions of Unicode for that string.

But then to your earlier question, the actual string seems weird (a virama on a vowel?). The Old Hindi Wiktionary entry mentioned above doesn’t have the first virama, and thus is two (modern) EGCs.


Robin Leroy Feb 21

@mcc I think your post is missing some words so I am not sure what the other grapheme count is; but the relevant rule changed a couple of years ago, so this may be a mismatch in version of grapheme cluster segmentation. See PU UAX #29 for 15.1, https://www.unicode.org/reports/tr29/tr29-42.html#GB9c.

(Assuming I ran the various segmentation algorithms in my head correctly—a daring assumption, I have a cold—if the count is 3, this is a version mismatch; if the count is 4, it is EGC vs. LGC.)

UAX #29: Unicode Text Segmentation
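When two tools disagree on a grapheme count, a first step is to look at the code points themselves, for example to see where the viramas sit. Below is a minimal sketch for that. It does not implement UAX #29 segmentation, and the sample string is a made-up Devanagari sequence, not the string discussed in the thread; the helper name `dump` is hypothetical too.

```python
import unicodedata

# Canonical combining class 9 is the "Virama" class in the Unicode Character Database.
VIRAMA_CCC = 9

def dump(text):
    """Return one (code point, name, general category, is_virama) row per code point."""
    rows = []
    for ch in text:
        rows.append((
            f"U+{ord(ch):04X}",
            unicodedata.name(ch, "?"),
            unicodedata.category(ch),
            unicodedata.combining(ch) == VIRAMA_CCC,
        ))
    return rows

# Hypothetical sample (KA, VIRAMA, SSA); NOT the string from the thread.
sample = "\u0915\u094D\u0937"

for row in dump(sample):
    print(*row)
```

A virama flagged right after something that is not a consonant letter is the kind of oddity that is worth checking against the source text before comparing segmentation results.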

Robin Leroy Jan 27

Eve

Jan 27

happy Large boulder the size of a small boulder day to all who celebrate


Robin Leroy Jan 10

@whitequark I feel like most games I enjoy function as a lesson in something or other.


Robin Leroy Nov 21

@luna Yes, U+22C7 ⋇ DIVISION TIMES.

And in Unicode 18 there will be a U+1CEF3 LEIBNIZIAN MULTIPLICATION-DIVISION SIGN to go with U+1CEF1 LEIBNIZIAN DIVISION SIGN and U+1CEF2 LEIBNIZIAN MULTIPLICATION SIGN.
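For anyone who wants to check these names locally, here is a small sketch using Python's `unicodedata` module. The helper name `describe` is made up for illustration. Whether the U+1CEF1..U+1CEF3 names resolve depends on the Unicode version bundled with the Python build, so the code falls back to a placeholder instead of assuming they are present.

```python
import unicodedata

def describe(ch: str) -> str:
    """Format a single character as 'U+XXXX NAME', with a placeholder for unknown names."""
    name = unicodedata.name(ch, "<no name in this Unicode version>")
    return f"U+{ord(ch):04X} {name}"

# Unicode version of the character database shipped with this Python.
print("Unicode data version:", unicodedata.unidata_version)

print(describe("\u22C7"))

# These code points are only named if the bundled database is recent enough.
for cp in (0x1CEF1, 0x1CEF2, 0x1CEF3):
    print(describe(chr(cp)))
```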

Robin Leroy Oct 8

Robert McNees Oct 8

Astronomer Ejnar Hertzsprung was born #OTD in 1873.

Emoji Hertzsprung-Russell diagram:

🔵 🟦 🟥
🔵 🟧
⚪️ 🟨
⚪️
🟡🟡
⚬ ⚬ 🟠🟠
⚬ ⚬ ⚬ 🔴
⚬ 🔴
⚫️


Glyph Oct 7

the biggest problem we *already have* in open source right now, which we have oversimplified into the term "supply chain security", is the lack of understanding that putting a dependency in your project's dependency set (package.json, pyproject.toml, requirements.txt, cargo.toml, etc) is not just "downloading some code", it is *establishing an ongoing trust relationship with a set of human beings*. this fact is *way* too obscured in all the tools we use.