DOCUMENTFoundational · 04 of 05READING TIME~ 18 min

Why dimension management is a discipline now.

The conversational shift removes the human reviewer from the last mile of analytics. What that means for the categorical surface, and the discipline that has to exist on the other side.

Dimension management is emerging as a discipline now because the shift from dashboards to conversational analytics has removed the human reviewer who was, for thirty years, the unwritten quality gate at the last mile of analytics. This essay traces that shift, the upstream redistribution of quality control that follows from it, and the structural gap that dimension management exists to fill.

I. The control that was never written down.

For thirty years, the architecture diagram of a business analytics stack has shown five boxes connected by arrows. Source systems on the left. A pipeline. A warehouse. A dashboard. An executive on the right. The arrows go in one direction. Numbers flow from where they are recorded to where they are read, picking up structure as they go, and the dashboard is the artifact the organisation argues over.

The diagram is correct as far as it goes. It is also incomplete in a way that matters. Between the dashboard and the executive is a control that no architecture diagram has ever named, because the control is a person rather than a system. An analyst opens the dashboard before the executive does. The analyst applies tacit domain knowledge. A regional split that looks wrong gets investigated. A vendor name that appears twice gets flagged. A quarter total that has moved outside its expected range gets reconciled before anyone with a calendar invite sees it. The review is informal. It is not in any process document. It is, on most teams, the quality gate that protects every executive view.

This control survives because of three properties of the dashboard model. The dashboard is persistent: built once, viewed many times, which gives the analyst a stable artifact to inspect. The questions are bounded: the dashboard's filters, segments, and time windows constrain what an executive can ask, which constrains what the analyst has to anticipate. And the cadence is human: dashboards are read on weekly or monthly rhythms, leaving time for the analyst to walk the data before the meeting.

Remove any one of those properties and the control becomes impossible to operate. Remove all three and the control disappears.

II. The shift to conversational analytics.

The output artifact of analytics is changing from the dashboard to the agent answer. The pipeline upstream stays the same. The source systems are unchanged. The warehouse is unchanged. What changes is the surface on which a question gets asked and an answer gets read.

In the conversational model an executive types a question. An agent reads from the warehouse, picks the relevant tables, writes a query, renders a chart, and returns an answer. The chart did not exist before the question was asked. It will not exist after the answer is read. The artifact is generated per question and not retained. The executive does not bookmark the dashboard, because there is no dashboard to bookmark. They ask the next question.

The three properties that made the analyst's control possible all change at once. Persistence goes away because the artifact is ephemeral. The boundary on questions goes away because the agent can join, filter, and slice in ways no dashboard would have been built to support. The human cadence goes away because answers arrive in seconds, and a conversation that produces ten answers in an hour cannot be walked through by an analyst between meetings.

None of this is a critique of conversational analytics. The shift is happening because the dashboard model has costs of its own: dashboards proliferate, go stale, and answer the questions someone asked last year rather than the question being asked now. The point is narrower. The shift removes the conditions under which the informal quality control at the last mile could operate. Whatever the new model does better, it does not do this.

III. What disappears with the dashboard.

Two pieces of the old model do not survive the shift, and it is worth naming both separately because they are sometimes conflated.

The first is the dashboard as artifact. A persistent page built once and read many times is no longer the unit of analytical output. The role of the BI analyst as curator of views goes with it: curating ten dashboards for a finance team only makes sense if the finance team is reading those dashboards rather than asking ad-hoc questions. The work does not vanish. It moves: from curating dashboards to curating the upstream models, metrics, and dimensions that the agent reads from. That is a different kind of work requiring different tools and a different operating model.

The second is the human review at the last mile. This is the structurally significant change, and the one that has received the least attention in the discussion of agentic analytics. No analyst sits between the agent and the executive. The judgment that caught a misclassified vendor, a wrongly aggregated segment, a timezone-confused timestamp, that judgment is no longer in the path of the answer. The reactive correction that happened at the visualisation layer, on the analyst's screen, in the hour before the meeting, has nowhere to happen now.

The dashboard going away is a UX change. The reviewer going away is a quality-control change. The two are happening together because they were always coupled, but only the second is the subject of this essay.

IV. The consequence: no safety net.

Agent answers reach the executive without a buffer. A misclassified vendor, a label with four representations, a timestamp in the wrong timezone, none of these produce an error. The agent does not hesitate. It renders a confident chart from whatever data it receives, and the chart looks correct because the chart is always going to look correct. Correctness is a property of the data the chart is built from. The chart is not in a position to disagree with its inputs.

The failure profile is worth being specific about, because the cases that matter are not the dramatic ones.

  • The same supplier appears as five strings. AWS, aws, A.W.S., Amazon Web Services, Amazon AWS. The spend totals split across five vendors. The chart renders. The answer is wrong, and there is nothing about the chart that announces it as wrong.
  • Mid-Market is defined by employee count in one system and by revenue in another. A blended segment total combines two incompatible populations. The chart renders. The answer has no defensible meaning, and there is no way to recover the intent that produced it.
  • A product was renamed last quarter. New records carry the new label. Old records keep the old one. A filter on the new label drops half the history. The chart renders. The trend it shows is an artefact of the rename.

The pattern across all three is the same. There is no error state. There is no flag. The agent does what it was asked to do, on the data it was given. The trust problem is structural: the absence of a human checkpoint at the last mile means upstream quality must now be guaranteed, not assumed. The cost of assuming has gone from a discrepancy an analyst would have caught to a decision an executive will act on.

V. Quality control moves upstream, and distributes.

The check that used to happen at the visualisation layer does not consolidate into one new place. It distributes across the pipeline. Each class of problem is handled at the layer where it can be handled most cleanly. Two of the resulting layers can be automated. Two require a human in the loop, because the decision is irreducibly organisational rather than technical.

  • Source contracts. Schema, type, nullability, freshness. Enforced automatically at ingestion against a declared contract. The mature tooling in this category, Great Expectations, Soda, dbt tests, handles the work without a human in the loop because the questions have one right answer. A null in a non-nullable column is a null in a non-nullable column.
  • Ingestion observability. Outliers, distribution drift, volume and freshness anomalies. Caught by monitors. Tools in this category, Monte Carlo, Bigeye, Anomalo, observe the data statistically and surface what has changed. The judgement of whether a change is a problem is human, but the detection is automated.
  • Dimension management. Variant resolution, canonical definitions, naming and hierarchy decisions on categorical values. This is the layer at which the question being asked is what does this label mean and which other labels are the same as it. The detection of inconsistency can be automated. The resolution cannot, because the resolution is a decision about how the business wants to describe itself.
  • Metric governance. KPI definitions, slowly-changing-dimension policy, hierarchy versioning. Authored by humans, enforced by the agents and pipelines that read from the semantic layer. Tools in this category, Cube, MetricFlow, the dbt Semantic Layer, sit between the warehouse and the consumers and answer the question what does revenue mean here in a way that every consumer reads the same answer to.

Layers one and two catch problems that have one right answer. Layers three and four carry decisions that no algorithm can make alone. They depend on a named owner with authority over the dimension or the metric. This essay is about the third layer.

VI. Why dimensions specifically need humans in the loop.

Dimensional problems are semantic, not statistical. The companion primer identifies five forms variance takes, and four of the five cannot be resolved without a human decision. The reasoning is worth tracing in one place, because the distinction between what can be automated and what cannot is the argument for why dimension management exists as a discipline at all.

Surface variance is mechanical. The same concept recorded with different formatting: AWS versus aws versus A.W.S. versus Amazon Web Services. An algorithm can propose a canonical form and a steward can approve the proposal once, after which the resolution propagates. This is the only form of variance that is safely automatable, and it is also the form most associated with dimensional inconsistency in the general imagination. It is the smallest part of the actual problem.

Semantic variance is different strings for the same concept where the equivalence cannot be inferred from the strings themselves. Vendor, Supplier, Partner. Laptop, Notebook. The choice of which string becomes canonical requires a naming decision, and the naming decision is an organisational decision rather than a technical one. The team that uses Vendor and the team that uses Supplier both have reasons. Reconciling them is the kind of choice that ends in a meeting, not an algorithm.

Definitional variance is the same label used for different things in different parts of the organisation. Mid-Market means companies between $1M and $10M in annual revenue to the finance team, companies with 100 to 999 employees to the sales team, and self-reported on a marketing form to the demand-generation team. The label looks consistent. The underlying meaning is fragmented. No algorithm can resolve this, because the resolution is a choice about what the organisation wants the word to mean.

Granularity variance is the same concept recorded at different levels of a hierarchy. Financial Services in the CRM, Retail Banking in the warehouse, BFSI in an external report. Each value is correct at its own level. The choice of which level is canonical, and how the others roll into it, is a hierarchy decision the organisation has to make explicitly.

Temporal variance is the canonical form changing over time. A product is renamed. A segment definition is updated. The new label is applied going forward; old records retain the old form. The decision of whether to remap history, cut over at a date, or maintain both labels indefinitely is a policy decision with downstream consequences. No algorithm has the authority to make it.

An agent can surface discrepancies at machine speed. It cannot decide what Mid-Market means. The system is therefore necessarily human-in-the-loop: agents detect, flag, and queue; humans decide, approve, and certify. The architecture that follows from this constraint is described in the target state.

VII. Why existing tools fall short.

Adjacent tool categories touch the categorical surface. None of them treats it as the primary object. Each was built for a related problem and is being asked, by organisations noticing the dimensional gap for the first time, to extend into a problem it was not built for. The extension is possible in principle and structurally constrained in practice.

  • Data observability tools. Monte Carlo, Soda, Bigeye, Anomalo. Detect volume, freshness, and distribution drift. Cover structural and statistical quality control well. They do not govern categorical semantics or canonical values, because the questions they answer are statistical and the questions dimensions ask are semantic. A monitor can tell you that a new value appeared in the vendor field. It cannot tell you what that value should resolve to.
  • Master data management. Informatica, Reltio, Profisee. Resolve whether two records refer to the same entity. This is genuinely valuable work. It is also a different problem from governing the categorical attributes on those entities once resolved. MDM answers the question are these the same customer; dimension management answers the question what segment is this customer in, and what does that segment mean. The attributes on master records get populated by survivorship rules, which have no view on whether the surviving value is canonical.
  • Data catalogues. Atlan, Collibra, Alation. Document what data assets exist, what they mean, how they connect. The catalogue is the reference for what is in the warehouse. It documents the policy that governs categorical values; it does not enforce that policy at runtime. A catalogue entry saying the vendor field uses a specific reference is decorative if the operational systems and agents are not actually reading from that reference.
  • Semantic layers. Cube, MetricFlow, the dbt Semantic Layer. Define metrics and joins above the warehouse so every consumer reads the same definition of revenue. They consume dimension values; they do not curate them. The semantic layer is downstream of the dimensional layer. It depends on the dimensions being correct; it does not make them correct.

Each of these tools touches the categorical surface in the course of doing its primary job. None of them was designed to own it. And each is, at the same moment, undergoing its own transition toward the agent era: observability tools rebuilding for autonomous remediation, MDM platforms figuring out agent-driven matching, catalogues becoming semantic surfaces for LLMs to read. Asking any one of them to also solve dimensional governance is asking it to solve a problem it was not built for, at a scale it was not designed for, while undergoing its own architectural transformation.

VIII. Why horizontal, not in-house.

The instinct, when an organisation first recognises the dimensional gap, is to ask each of the existing tools to solve it locally. The warehouse team adds a values table. The catalogue team adds a controlled vocabulary feature. The semantic-layer team adds a dimension registry. Each solution is internally coherent. Together they produce the problem the discipline exists to eliminate.

Every system gets its own copy of the truth. The copies drift apart. The agent in the chat tool reads one canonical form for vendor. The model in the warehouse reads another. The dashboard tool, which still exists somewhere, reads a third. The configuration variance that the agent era introduces, where two configured systems produce structurally inconsistent output at machine speed, returns by a different door. The work each team did was good work. The composition of that work is the problem in a new form.

The architectural conclusion is unavoidable. Dimension management has to sit above the tools, not inside any one of them. A canonical reference embedded in a warehouse is unreachable to an agent operating outside it. A reference inside a catalogue is decoupled from the systems that produce and consume values. A reference inside an MDM platform inherits the entity-centric assumptions that have limited those platforms from governing categorical values for thirty years.

The reference has to be system-agnostic, API-first, and consumed by every layer: operational systems, ingestion pipelines, agents, semantic layer, governance tools. It has to be federated, so that different parts of the organisation can steward the dimensions they understand without going through a central choke point. It has to be versioned, so that records written under one definition remain interpretable when the definition changes. And it has to be readable by software in the loop, not only by humans, because the consumers writing values are no longer only humans. This is the shape of the discipline described in the target state.

IX. From reactive cleanup to runtime governance.

The shift is one of operating model rather than tooling. The old model was reactive: notice a problem in a report, patch the variant in a query, file a ticket for someone to fix it upstream, move on. The fix did not propagate. The next report had a different patch. Each analyst maintained their own private dictionary of which strings meant which things. The cost was real and the cost was diffuse, which is the profile of work organisations systematically underinvest in.

The new model is at the other end of a maturity path with five recognisable stages. Most organisations sit at stage one or two for most of their dimensions: invisible (no one owns the dimension, variance accumulates), or reactive (analysts patch in queries and the patches do not propagate). Some have reached stage three: a canonical reference exists in a wiki or spreadsheet, but no system reads it. Stage four is governed: a named steward holds the reference, pipelines validate against it, changes follow a request flow.

Stage five is runtime. The canonical reference is load-bearing. Every agent and pipeline reads from it through a synchronised local cache. Confidence travels with the value, so a consumer downstream knows whether the classification it received was made from a current reference or a stale one. The categorical surface is governed in production rather than in retrospect. The full model is in the target state.

Dimension management starts as a transition enabler, the discipline that lets organisations move from dashboards to conversational analytics safely. It becomes permanent infrastructure, the categorical governance layer that every analytical and AI system in the organisation depends on. The shift to conversational analytics is the trigger. The infrastructure that has to exist on the other side is what this essay has been about.

X. How to keep reading.

This essay traces the shift that makes dimension management a discipline. The three companion documents develop the rest of the picture. The primer establishes the category from first principles, observing dimensional data as a class with its own characteristic failure modes. The target state specifies what well-managed dimensional data looks like in operational detail: the canonical reference, the runtime, the workflow, the metrics, the maturity path. The manifesto argues why the cost of leaving dimensions unmanaged, which was tolerable for thirty years, is becoming intolerable in the agent era.

For practitioners who prefer a sequenced walk through the same argument, the Origins module in the Learn pillar covers this material in eight pages, read in sequence, with cross-references to the glossary and the rest of the curriculum.