The Tower of Babel in Your Boardroom: Enterprise Vocabulary and AI Failure Modes

The Language Problem

Every enterprise believes it has a data problem. Too much data, too little data, dirty data, siloed data. The deeper issue is often linguistic. Different functions in the same organization use the same words to mean different things, and the divergence is invisible until something tries to reason across it.

To Sales, "Revenue" means a booked contract. To Finance, it means recognized earnings under the revenue-recognition rules. To Operations, it means cash actually received. To Treasury, it means the same as Operations but with a settlement-window adjustment. To the CFO's monthly close deck, it means the version that matches the prior quarter's accounting policies. Five definitions, one word, all of them locally correct, and the function that needs to act on the aggregate has no way to know which one the report it received is actually counting.

This is not a semantic curiosity. It is one of the most under-appreciated barriers to AI deployment, because AI systems trying to reason across the vocabulary gap fail in ways that look like accuracy problems but are actually translation problems.

The Intelligent Archipelago

Most enterprise data landscapes look less like a unified continent and more like an archipelago. CRM data lives on one island. ERP data lives on another. Billing on a third. Marketing analytics on a fourth. Each island is internally consistent and uses a vocabulary that evolved to serve the local function's real needs. Sales needs to track quotas and pipeline stages. Finance needs to track accruals and revenue recognition. Marketing needs to track attribution windows and channel performance. None of these vocabularies are wrong. They are domain-shaped, and the shape is load-bearing for the function that maintains it.

The problem appears at the seams. The same customer appears as a "Customer" in the CRM, as an "Account" in the ERP, as a "Payer" in the billing system, and as an "Attribution Source" in the marketing analytics platform. These are four different conceptual models referring to the same real-world entity, and the differences are not merely a matter of naming. The granularity is different (one CRM Customer can be many ERP Accounts). The lifecycle is different (a Payer exists for as long as an outstanding balance does, regardless of CRM status). The relationships are different (an Attribution Source can be a person who is not an Account).

Three Failure Modes When AI Reasons Across the Gap

When AI is deployed across these semantic islands without addressing the language gap, three categories of failure recur in the engagements I have watched.

The first is wrong numbers. Aggregating "Revenue" across systems that define it differently produces figures that are confidently precise and quietly wrong. The dashboard says $42.3M. Sales counts $44.1M of booked contracts. Finance recognizes $39.8M. The dashboard is using neither definition; it has accidentally summed a mix from both sources, and nobody catches the error because each input source looks correct when checked in isolation.

The second is missed connections. The same customer appearing under different identifiers in different systems gets treated as multiple distinct entities. The model recommends an upsell campaign to the CRM contact while the finance system shows the parent account is sixty days past due. The recommendation is locally rational and globally embarrassing.

The third is hallucinated relationships. The model invents connections between entities that share a name but not a meaning. Two unrelated accounts both have a contact named "Sarah Chen." The model surfaces this as a relationship signal. It is not a signal; it is a homonym collision the model had no way to detect because the underlying vocabulary did not encode that the two Sarahs are unrelated.

The Bridge Pattern Beats the Universal-Vocabulary Pattern

The natural instinct is to fix this by forcing a universal language: one master data model that all systems must conform to. Decades of enterprise architecture work show this approach rarely works. The enterprise resists because each domain's local vocabulary evolved to serve real needs that the universal model strips away. The master data project either drags on for years and gets quietly shelved, or it ships and the domains keep using their local vocabularies in parallel, which is the same outcome with worse politics.

The alternative is to stop trying to force a single language and instead document the bridges between the local ones. A semantic layer, knowledge graph, or ontology that maps how the vocabularies relate without requiring any of them to change. Sales keeps calling it "Revenue." Finance keeps calling it "Recognized Earnings." The bridge encodes that these are related concepts with a specific transformation rule between them, including the conditions under which the rule applies. When AI reasons across domains, it traverses the bridge rather than guessing at translations.

The bridge pattern works for the same reason the universal-vocabulary pattern fails. It respects the locality that made the domain vocabularies useful in the first place, while making the cross-domain reasoning explicit and inspectable. The cost is the up-front work of documenting the bridges. The payoff is that the AI deployment can actually reason across the archipelago without inventing its own translations and confidently surfacing the results.

The Operator Move

The question for any enterprise deploying AI is not "have we unified our data?" That question has no good answer, and pursuing it has high cost and low return. The better question is "have we mapped our languages?" That question has a concrete deliverable, can be scoped to the deployment that needs it, and produces an artifact (the bridge documentation) that compounds in value as more AI capabilities get layered on top of it.

The bridges are usually cheaper to build than the master data project they replace, and they are the prerequisite for any AI deployment that has to reason across more than one function's local vocabulary. The vocabulary work is also where most of the team-level communication tax actually originates: when functions cannot agree on what their shared words mean, every cross-functional handoff pays the translation cost on every interaction.

The Tower of Babel in Your Boardroom

The Language Problem

The Intelligent Archipelago

Three Failure Modes When AI Reasons Across the Gap

The Bridge Pattern Beats the Universal-Vocabulary Pattern

The Operator Move

Related

The Berkshire Test for AI: A Compounding Diagnostic for Leaders Allocating Capital to AI

The Bug Is in the Design, Not the Code

The Trust Gap — Why Recommendations Don't Become Decisions