What say we run 'file' and #siegfried against the 600k files that #ApacheTika detected as 'application/octet-stream' in the most recent #CommonCrawl crawl?

Anyone else want to join in the fun?

https://issues.apache.org/jira/browse/TIKA-3992

#filefun #fileophiles #digipres #mimedetection

[TIKA-3992] Add common missing mimes based on Common Crawl data - ASF JIRA