Links to any publicly available PDF with good semantic structure, tagging, and accessibility to elements like tables and images would be very helpful
although of course you may also use this thread to dunk on the format, if you wish
@ChanceyFleet I don't have any example files, but I did hear recently that Adobe Acrobat has an accessibility checker that is good.* So if you do get examples, you can run them against Acrobat and see what Adobe thinks.
* My license has expired and I'm in no hurry to renew, so I haven't tried it out on any of my own PDFs.
pdfinfo.@ChanceyFleet I wish we could all agree that PDFs are terrible and will never actually be accessible.
Sorry, had to rant.
@ChanceyFleet The @PDFassociation offers some accessible PDFs.
Their Matterhorn Protocol provides a testing model for PDFs, and the protocol doc itself conforms to PDF/UA-1. [1]
Their Tagged PDF Best Practices Guide also follows best practices. [2]
But it's definitely hard to find good examples of accessible PDFs in the wild! Which says a lot about the problems with PDF format.
Links to 1 MB PDFs:
[1] https://pdfa.org/wp-content/uploads/2021/04/Matterhorn-Protocol-1-1.pdf
[2] https://pdfa.org/wp-content/uploads/2023/07/Tagged-PDF-Best-Practice-Guide.pdf
@pkiff @ChanceyFleet Aww, they aren't THAT hard to find!
Speaking of "in the wild", our latest "PDF in the Wild" feature mentions several organizations proudly posting PDF/UA files!
https://pdfa.org/nara-updates-its-guidance-on-ocr-and-pdf-collections/