This entire report from the Ontario government on genAI systems is worth a read, but the review of healthcare scribe accuracy is pretty devastating, imo. This has to work for the tech to be worth anything. If the notes in the chart are wrong, the whole thing falls apart.

https://www.auditor.on.ca/en/content/specialreports/specialreports/en26/2026_AI_EN.pdf

@mttaggart I was cynically thinking to myself, "what are the chances that an industry-loving institution like the Ontario government concluded 'well, we'll just choose to use Good AI and that will be fine'? Probably 100%." Then I jumped to the report's conclusions:

  • establish KPI targets to measure and track Microsoft Copilot Chat’s adoption
  • take actions to increase use of Microsoft Copilot Chat to the targeted rates and usage in the OPS
  • educate OPS staff through AI training about the dangers of using non-Microsoft browsers when accessing AI websites

So, yeah, they did an audit showing LLMs are wildly unreliable and . . . concluded they should encourage use of Microsoft's LLM products.

Their audit criteria also included "having due regard for economy".