KPMG pulls report on AI usage due to apparent hallucinations

KPMG has withdrawn its October 2025 report titled "Redefining excellence in the age of agentic AI" after research firm GPTZero identified numerous AI-generated hallucinations within it. UBS, the UK's National Health Service, Swiss Federal Railways, and Transport for London all told the Financial Times that claims about their AI adoption were either fabricated or wildly exaggerated. The irony: KPMG appears to have used AI to write the report about AI itself.

Background and Context

On June 13, 2026, KPMG, one of the Big Four global accounting and professional services networks, announced the immediate and total withdrawal of its major industry report titled "Redefining Excellence in the Age of Agentic AI." Originally published in October 2025, the document was intended to provide strategic insights into how enterprises are integrating autonomous AI agents into their operational frameworks. However, the report’s release triggered a severe crisis of confidence that culminated in its retraction less than eight months later. The primary catalyst for this decision was not a standard data correction or editorial update, but rather the exposure of significant factual inaccuracies identified by GPTZero, a specialized research firm focused on detecting AI-generated content. Upon conducting a deep-dive forensic analysis of the report, GPTZero identified numerous instances of "hallucinations"—a term referring to the tendency of large language models to generate plausible-sounding but entirely fabricated facts, statistics, and attributions.

The situation escalated rapidly when the Financial Times launched an independent investigation into the claims made within the KPMG report. The publication reached out to several high-profile organizations cited in the text as case studies for AI adoption, including UBS, the UK’s National Health Service (NHS), Swiss Federal Railways, and Transport for London. The responses from these institutions were uniformly dismissive and corrective. None of the named entities acknowledged the specific AI deployment strategies, adoption scales, or strategic partnerships described in the report. Instead, they confirmed to the Financial Times that the descriptions were either completely fabricated or grossly exaggerated, with some entities stating they had no knowledge of being included in the study at all. This collective denial stripped the report of its empirical foundation, forcing KPMG to admit to severe flaws in its quality assurance processes and proceed with the withdrawal.

The irony of the situation has become a focal point of industry discourse. Given the prevalence of non-human writing patterns, repetitive syntactic structures, and the specific nature of the factual errors, industry observers and tech analysts widely speculate that KPMG itself utilized generative AI tools to draft or significantly contribute to the content of this report on "Agentic AI." This creates a paradoxical scenario where a consultancy firm, tasked with advising clients on the responsible and effective integration of autonomous AI systems, appears to have bypassed its own rigorous verification standards in its internal communications. The incident has thus evolved from a simple editorial error into a broader symbol of the risks inherent in automating high-stakes professional content production without adequate human oversight.

Deep Analysis

From a technical and operational perspective, the KPMG report retraction exposes a critical vulnerability in the current enterprise AI content generation workflow. Large consulting firms operate under immense pressure to produce thought leadership content, market analyses, and client advisories at a pace that matches the rapid evolution of the technology they study. To maintain relevance and output volume, many teams have begun integrating large language models into their drafting processes. However, the fundamental architecture of these models is probabilistic; they predict the next token in a sequence based on training data patterns rather than retrieving verified facts from a trusted knowledge base. When prompted to generate specific case studies or industry metrics, the model often engages in "creative filling," synthesizing details that sound authoritative but lack factual grounding. KPMG’s failure appears to stem from an over-reliance on this automated generation, coupled with a lack of robust human-in-the-loop verification mechanisms for critical data points such as client names, specific adoption metrics, and technical stack details.

The incident also highlights a significant cognitive and operational gap regarding the concept of "Agentic AI." Agentic AI refers to systems capable of autonomously planning and executing complex tasks across multiple steps. While the report aimed to define excellence in this emerging era, the production of the report itself demonstrated the dangers of deploying such autonomy without strict guardrails. The content production pipeline likely followed a "generate-publish" model rather than the necessary "generate-verify-publish" workflow. In professional services, where credibility is the primary asset, treating AI as a co-author without treating its output as a draft requiring rigorous fact-checking is a high-risk strategy. The hallucinations present in the report were not minor typos but substantial fabrications of corporate behavior and strategy, indicating a systemic breakdown in the editorial review process that should have caught these discrepancies before publication.

Furthermore, the event underscores the limitations of current AI detection and validation tools in preventing such errors at the source. While firms like GPTZero can identify the likelihood of AI generation after the fact, there is a lack of real-time, content-specific verification tools that can cross-reference claims against a verified database of corporate announcements and press releases during the drafting phase. The absence of such a "fact-consistency" layer in KPMG’s workflow allowed the hallucinated content to pass through multiple stages of review. This suggests that the issue is not merely one of tool usage, but of process design. The consulting firm’s internal protocols failed to enforce a separation between AI-assisted drafting and factual verification, leading to a situation where the efficiency gains of AI were realized without the corresponding integrity safeguards.

Industry Impact

The repercussions of this incident extend well beyond KPMG, sending shockwaves through the broader professional services sector, including competitors such as Deloitte, PwC, and EY. For these firms, the event serves as a stark warning that the credibility of their strategic advice is inextricably linked to the accuracy of their published research. Clients rely on these reports to inform high-level decision-making; if the foundational data is suspect, the strategic recommendations derived from it become unreliable. This erosion of trust could lead to a temporary contraction in the market for AI-generated industry insights, as clients may demand greater transparency and proof of human verification before accepting such content. Consequently, consulting firms may need to increase their investment in human editorial resources, potentially raising the cost structure of content production and slowing down the velocity of thought leadership output in the short term.

For the organizations falsely cited in the report, such as UBS, the NHS, Swiss Federal Railways, and Transport for London, the incident has raised serious legal and reputational concerns. These entities were drawn into a public controversy without their consent, potentially damaging their brand equity by association with unverified or exaggerated technological claims. This has sparked new discussions regarding data privacy, the right of publicity, and the protection of commercial reputation in the age of AI. Legal experts are beginning to analyze whether the unauthorized use of a company’s name and alleged strategic direction in a published report constitutes defamation or misappropriation of identity. The incident highlights the passive vulnerability of corporations in a landscape where AI can easily fabricate associations between brands and technologies they have not adopted.

For investors and the broader public, the KPMG retraction acts as a cautionary tale about the reliability of information in the digital age. It reinforces the need for stricter verification mechanisms when consuming corporate reports, especially those dealing with emerging technologies. The event has also increased interest in technologies and services that offer "verifiable AI content" or third-party fact-checking for AI-generated materials. Market players that can provide guarantees of factual accuracy and transparency in their content generation processes are likely to gain a competitive advantage. The incident has shifted the narrative from "AI as a productivity multiplier" to "AI as a liability multiplier" if not managed with appropriate governance, prompting a re-evaluation of risk management strategies across the industry.

Outlook

Looking ahead, as the penetration of generative AI into corporate workflows continues to deepen, incidents similar to the KPMG report retraction are likely to become more frequent until industry-wide standards are established. It is expected that major consulting firms and media platforms will soon implement stricter AI content labeling protocols. These protocols will likely mandate clear disclosures indicating which sections of a report were AI-generated, which were AI-assisted, and which were independently verified by human experts. This transparency will be crucial for maintaining reader trust and distinguishing between speculative analysis and factual reporting. Additionally, the market may see the emergence of specialized AI validation tools designed specifically for "fact-consistency" checks. These tools would serve as a pre- or post-publication checkpoint, automatically cross-referencing claims in generated text against verified databases to flag potential hallucinations before they reach the public.

Regulatory bodies are also likely to take notice of this event. The financial and professional services sectors are heavily regulated, and the publication of inaccurate information can have significant legal consequences. We anticipate that regulators may introduce stricter compliance standards for AI-generated content in these fields, requiring firms to maintain audit trails of their content production processes and demonstrate robust human oversight. For KPMG, this crisis presents an opportunity to rebuild trust by overhauling its content governance framework. The key to recovery will be demonstrating a clear commitment to "human-in-the-loop" verification, ensuring that AI is used for efficiency and ideation, while humans retain final responsibility for factual accuracy and ethical compliance. The industry must learn from this ironic failure to establish a sustainable model where the speed of AI is balanced by the rigor of human expertise, preserving the integrity of professional services in the AI era.

Sources