Introducing WebAccessBench, a novel benchmark for AI language models to assess quality and WCAG conformance in generated web interfaces under realistic prompting conditions.

I did a bit of research and found that LLMs are incredibly bad at basic digital accessibility tasks. You can compare models and read the full white paper at conesible.de/wab.

Overall data suggests massive implications for society at large, and major discrimination of people with disabilities.

A sharepic that lists all benchmarked models and their score in a bar chart. Find them listed at https://conesible.de/wab. Beneath is a preview of the whitepaper PDF.
0

If you have a fediverse account, you can quote this note from your own instance. Search https://chaos.social/users/kc/statuses/116113343357893488 on your instance and quote it. (Note that quoting is not supported in Mastodon.)