Introducing WebAccessBench, a novel benchmark for AI language models to assess #accessibility quality and WCAG conformance in generated web interfaces under realistic prompting conditions.

I did a bit of research and found that LLMs are incredibly bad at basic digital accessibility tasks. You can compare models and read the full white paper at https://conesible.de/wab.

Overall data suggests massive implications for society at large, and major discrimination of people with disabilities. #a11y

I have published a minor update to the white paper to get it ready for a wider audience, featuring a more in-depth introduction and a clearer explanation for why scoring is done the way it is: https://conesible.de/wab/
Accessibility Is Civil Rights. AI Must Stop Shipping Barriers.