Just gonna throw this out here - any Ruby gem for relatively quick & accurate language detection that you know of?
@Gargron https://github.com/hashwin/scylla ? any textcat derivative is a good start
@piecritic @Gargron This looks like it might work? Also seems to be listed here, but there aren't any others in the list unfortunately https://github.com/arbox/nlp-with-ruby#language-identification