methodology
How these findings were derived.
Our audit cohort, eval rubric, and limits.
We ran 40+ enterprise AI governance audits across 2025 and the first half of 2026. This piece compiles what we observed; it is not a peer-reviewed study. The figures we cite reflect our cohort, not the global population. We surface limits explicitly so you can judge fit.
Source mix. Anonymised findings from the 40+ enterprise audits we ran in 2025 and early 2026; public regulatory texts; published incident reports; vendor disclosures from major model providers. No client name appears here and no figure is recoverable to a single engagement.
Scope. Companies running at least 50 production LLM calls per day, with annual revenue between $50M and $5B. Industry mix in our cohort: legal (28%), healthcare (24%), fintech (18%), ecommerce (15%), insurance (8%), other (7%).
Evaluation rubric. We score every program against a 10-criterion engineering rubric: audit logging, model registry, policy alignment, eval suite, drift monitoring, incident response, vendor risk, data lineage, human-in-loop discipline, and regulatory mapping. The rubric anchors to the NIST AI Risk Management Framework functions, with EU AI Act and ISO 42001 cross-references on each criterion.
Limits. Our cohort skews toward US- and EU-domiciled buyers and toward the four industries listed above. We do not claim coverage of public-sector procurement programs, where governance posture differs materially. Percentages in the executive summary reflect our cohort and should be read as directional, not population-level.