Today I learned that Salesforce B2C Commerce has a global setting for whether it should export valid XML or whether it should generate two invalid XML numeric entities for each half of a character using surrogate pairs in UTF-16 encoding. Which I guess means that someone maintains a broken XML parser that can only handle the invalid encoding and refused to fix it.
