@asmw Very interesting. Have you tried experimenting with different page segmentation modes (PSMs)? It might be picking a different default mode depending on the file extension. It might also be worth searching the code base (on GitHub) for mentions of file formats.
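If it helps, here's a rough sketch of how the PSM experiment could be scripted. I'm assuming the OCR engine is Tesseract 4 or newer (the thread doesn't say), that the `tesseract` binary is on your PATH, and that the `--psm` flag is available (3.x builds used `-psm`). The function names and the list of modes are just my picks for illustration.

```python
import shutil
import subprocess

# Assumption: a handful of Tesseract PSM values worth comparing.
# 3 = fully automatic (default), 4 = single column, 6 = single uniform block,
# 7 = single text line, 11 = sparse text, 13 = raw line.
PSMS_TO_TRY = (3, 4, 6, 7, 11, 13)


def build_command(image_path, psm):
    """Build a tesseract CLI call that prints recognized text to stdout."""
    return ["tesseract", image_path, "stdout", "--psm", str(psm)]


def compare_psms(image_path, psms=PSMS_TO_TRY):
    """Run the same image through each PSM and collect the text output.

    Returns an empty dict if the tesseract binary is not installed.
    """
    if shutil.which("tesseract") is None:
        return {}
    results = {}
    for psm in psms:
        proc = subprocess.run(
            build_command(image_path, psm),
            capture_output=True,
            text=True,
        )
        results[psm] = proc.stdout
    return results
```

Running `compare_psms` on the same image saved under both extensions (say `scan.png` and `scan.jpg`, hypothetical names) and diffing the results per mode should show whether the extension changes anything once the PSM is pinned.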
@tarek I'm still running Windows 7 on a Gateway PC from 2010. I use Windows Media Center to record live TV. Haven't been forced to upgrade and it still gets Windows updates.