the data corpus i made during my time as a phd student has been downloaded yesterday by uh, Zalgo the Devourer, I guess
@halcy mojibake! Looks like maybe Cyrillic since I saw these sequences before
@halcy but telling Cyrillic apart from CJK in UTF mojibake is hard

@mia it is a mystery

could also just be some script filling garbage into the form to see if anything interesting drops out