Unicode 17 includes a change that may improve line breaking, backspacing, and other behavior for Khmer, Myanmar, and twelve other Brahmic scripts: Extended grapheme cluster breaks, which may be used in such processes to identify “characters”, no longer occur within sequences of a conjoiner and a consonant in these scripts. Such sequences represent conjunct forms that users see as indivisible entities.
See
https://www.unicode.org/reports/tr44/tr44-36.html#Derivation_InCB



