We’re now showing #Unicode 16.0! 🥳

https://codepoints.net/search?age=16.0

And, even better, we also include the #Unikemet #hieroglyph data, too, e.g., https://codepoints.net/U+13000 .

As always, if you notice something off, don’t hesitate to contact me!

5185 Codepoints Found – Codepoints

5185 codepoints match the given search for 0 properties.

@codepoints So what takes the most time when you upgrade to a new version? For example, do you need to upgrade your own UCD parser, or do you depend on some other parser or parsed data?

@lianghai it’s a mixture of several things. In principle I re-create the database contents from scratch every time with this code: https://github.com/Codepoints/unicode2mysql/

It needs some modifications for new fields and/or values in the UCD XML file that is the basis for all data.

I also need Noto Fonts and Unifont to update to have glyphs for the new cps, need to handle new scripts, sudden 404s when accessing resources and so on.

Then I update the actual website code itself, which is quite quick usually. And…

GitHub - Codepoints/unicode2mysql: The code that powers the codepoints.net data import

The code that powers the codepoints.net data import - Codepoints/unicode2mysql

GitHub

@lianghai …that would all be done in an afternoon, if I’d had the time to work on it undisturbed. This is something that my current life does not provide, though.

I find myself often late in the day working on it and hesitant to deploy something because I haven’t got the time to fix any breakage right away.

(Ex.: With 15.1 I ran into a MariaDB issue that cost me several days and some large code rewrites.)

So it boils down to: …

@lianghai …(1) I need more hours than only 24 in a day, and (2) automate more things. 🤷

But thanks for asking!

(Congratulations to the HTML version of the standard, by the way! Great work!)

@codepoints Thanks! I’m interested in how developers consume new UCD versions in particular because I find it very frustrating that UCD doesn’t provide first-party structured data (eg, JSON with a schema). UCDXML is certainly the closest we currently have but it’s always been an afterthought and is not produced by the UCD maintainers. ppucd.txt (https://unicode-org.github.io/icu/design/props/ppucd.html – do you find it useful?) is nice but it’s only an internal artifact and lacks type definition and requires its own parser.
Preparsed UCD

ICU is a mature, widely used set of C/C++ and Java libraries providing Unicode and Globalization support for software applications. The ICU User Guide provides documentation on how to use ICU.

ICU Documentation