Robin Leroy

@eggrobin
132 Followers
117 Following
138 Posts
๐’‰ญโ€‹๐’„ฟ๐’ˆพโ€‹๐’€€๐’€Š๐’€โ€‹๐’Šญโ€‹๐’พ๐’ŽŒโ€‹๐’‰ˆ๐’๐’Œ‘๐’Œ
An egg drowning in a sea of papers
๐’‰ญโ€‹๐’€€๐’€Š๐’€โ€‹๐’พ๐’€๐’…—โ€‹๐’€๐’€ญ๐’‹ข๐’‹ข
一颗在论文海中的蛋
๐’€ญ๐’๐’†ณ๐’† https://bsky.app/profile/eggrobin.bsky.social
@0xabad1dea I don’t know if there are photos online of the CJK-JRG 1991 Tokyo meeting. The earliest IRG meeting with group photos on https://www.unicode.org/irg/meetings.html is Macao 2002. So « what actually happened » might look more like this, although that photo is still ten years later:

@0xabad1dea A lot of the CJK work is on the ISO side, see https://www.unicode.org/irg/.

On the UTC side, see the relevant WG https://www.unicode.org/consortium/cjkunihan.html.

On the history see https://www.unicode.org/versions/Unicode17.0.0/core-spec/appendix-e/.

Henry Chan (now IRG ORT manager) had an interesting thread on Twitter on the necessity of unification, see https://web.archive.org/web/20220115002546/https://twitter.com/FakeUnicode/status/1455676926568271873. See also https://www.unicode.org/notes/tn26/.

@mcc @manishearth says it’s a typo.

@mcc Yeah two is just wrong for all versions of Unicode for that string.

But then to your earlier question, the actual string seems weird (a virama on a vowel?). The Old Hindi Wiktionary entry mentioned above doesn’t have the first virama, and thus is two (modern) EGCs.

@mcc I think your post is missing some words so I am not sure what the other grapheme count is; but the relevant rule changed a couple of years ago, so this may be a mismatch in version of grapheme cluster segmentation. See PU UAX #29 for 15.1, https://www.unicode.org/reports/tr29/tr29-42.html#GB9c.

(Assuming I ran the various segmentation algorithms in my head correctly – a daring assumption, I have a cold – if the count is 3, this is a version mismatch; if the count is 4, it is EGC vs. LGC.)

happy Large boulder the size of a small boulder day to all who celebrate
@whitequark I feel like most games I enjoy function as a lesson in something or other.

@luna Yes, U+22C7 ⋇ DIVISION TIMES.

And in Unicode 18 there will be a U+1CEF3 LEIBNIZIAN MULTIPLICATION-DIVISION SIGN to go with U+1CEF1 LEIBNIZIAN DIVISION SIGN and U+1CEF2 LEIBNIZIAN MULTIPLICATION SIGN.

Astronomer Ejnar Hertzsprung was born #OTD in 1873.

Emoji Hertzsprung-Russell diagram:

🔵 🟦 🟥
🔵 🟧
⚪️ 🟨
⚪️
🟡🟡
⚬ ⚬ 🟠🟠
⚬ ⚬ ⚬ 🔴
⚬ 🔴
⚫️

the biggest problem we *already have* in open source right now, which we have oversimplified into the term "supply chain security", is the lack of understanding that putting a dependency in your project's dependency set (package.json, pyproject.toml, requirements.txt, cargo.toml, etc) is not just "downloading some code", it is *establishing an ongoing trust relationship with a set of human beings*. this fact is *way* too obscured in all the tools we use.