The MS Office Open XML standard has been around for about 18 years. OpenOffice / LibreOffice has had an XML-based format that's even older than that. Why do I still get emails with file attachments in .DOC format?

Am I alone in still getting old-style MS Office files from organizations in my inbox? Unfortunately antiword has long been unmaintained, so viewing them in a terminal is a challenge.

#libreoffice #openoffice #sysadmin #linux #freebsd #windows #msoffice #antiword

"#Antiword" is a command-line program that can read the proprietary "doc" format (not docx) used by old MS Word versions. Pandoc and antiword can be combined via #DocBook XML, which offers a way to convert old doc files:

antiword -x db input.doc | pandoc -f docbook …

@toxision I would try #antiword first, and if things get desperate I would fall back on “strings file.doc”, which simply extracts the runs of printable characters from a binary file of any kind.
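
A rough sketch of that "antiword first, strings as a last resort" approach as a small shell function. The name doc_text is made up for illustration, and the tr/grep branch is only an approximation of strings for systems where strings is not installed. It assumes antiword exits with an error when it cannot parse the file.

```shell
#!/bin/sh
# doc_text FILE: print readable text from an old .doc file.
# Hypothetical helper; the name is not part of antiword or any other tool.
doc_text() {
    doc_file=$1

    # First choice: antiword, if it is installed and produces any output.
    if command -v antiword >/dev/null 2>&1; then
        doc_out=$(antiword "$doc_file" 2>/dev/null)
        if [ $? -eq 0 ] && [ -n "$doc_out" ]; then
            printf '%s\n' "$doc_out"
            return 0
        fi
    fi

    # Last resort: dump runs of printable characters from the raw bytes.
    if command -v strings >/dev/null 2>&1; then
        strings "$doc_file"
    else
        # Approximation of strings: turn every non-printable byte into a
        # newline, then keep only lines of at least 4 characters.
        LC_ALL=C tr -c '[:print:]\n' '\n' < "$doc_file" | LC_ALL=C grep -E '.{4,}'
    fi
}
```

Usage would be something like "doc_text file.doc | less". One caveat: Word files often store text as 16-bit characters, which plain strings can miss; with GNU strings, "strings -e l file.doc" may recover more.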