BlogData Architecture

What Is Data Governance? A Practical Guide for Enterprise Organisations

Austin Duncan
Austin Duncan
Managing Director & Principal Data Architect
·May 18, 202612 min read

Data governance is not a compliance project. It is the set of policies, ownership structures, and technical controls that make your data trustworthy enough to act on. Here is what it actually involves — and what most implementations get wrong.

The quick answer

Data governance is the set of policies, ownership structures, standards, and technical controls that determine how data is defined, managed, accessed, and trusted across an organisation. It is not a compliance programme, a data quality project, or a technology platform — it is an organisational capability that makes data trustworthy enough to act on at scale. Organisations with mature data governance have one agreed definition of every key business metric, clear ownership of every data domain, documented data lineage so any number can be traced to its source, and access controls that restrict sensitive data without creating barriers to legitimate use. Most enterprise organisations have some governance in place, but it is fragmented, inconsistently applied, and not connected to the people who need to enforce it.

Why data governance matters now more than ever

Data governance has existed as a discipline for decades, but two developments have made it urgent rather than aspirational.

The first is AI. Every enterprise AI initiative begins with the same discovery: the data required to support the use case is not clean enough, not governed enough, or not accessible enough. AI systems that operate on ungoverned data do not produce ungoverned outputs — they produce outputs that appear authoritative but reflect whatever inconsistencies and quality issues exist in the underlying data. When a human analyst uses dirty data, they often catch the problem. When an AI system uses dirty data at machine speed, the errors propagate before anyone notices.

The second is scale. Organisations that could manage data informally at 50 people and three systems cannot do the same at 5,000 people and 200 systems. The informal governance that worked — people knowing who to ask, teams having shared understandings of what metrics mean — breaks down as organisations grow. Governance is what replaces informal knowledge with durable, scalable structure.

The four components of data governance

**1. Data ownership.** Every data domain in an enterprise needs an owner — not a technical owner who manages the pipelines, but a business owner who is accountable for the quality, definition, and appropriate use of that data. Revenue belongs to Finance. Customer belongs to Sales or Marketing. Patient belongs to Clinical Operations. Without clear ownership, data quality issues have no one accountable for fixing them, definition conflicts have no authority to resolve them, and access decisions have no one to make them. Assigning data ownership is primarily an organisational change, not a technical one. It is often the hardest part of a governance programme because it requires business leaders to accept accountability for something they previously deferred to the data team.

**2. Data definitions and the semantic layer.** Governance requires that key business concepts — revenue, customer, active user, churn, headcount — have one authoritative definition that every system and every report uses. In most enterprises, these definitions are not agreed. Revenue in the CRM is calculated differently from revenue in the ERP. Customer count in the analytics platform differs from customer count in the marketing tool. The result is that executive reports contradict each other, analysts spend time reconciling rather than analysing, and decisions are made on the basis of whichever number arrived first. The technical implementation of agreed definitions is the semantic layer — a governed set of metric definitions that sits between the raw data and the reporting layer. Every downstream consumer — BI tools, AI systems, Excel exports — gets the number from the same source with the same logic applied.

**3. Data quality standards.** Governance requires that data quality is measured, owned, and maintained — not just assumed. Quality standards define: what completeness levels are acceptable for each dataset, what timeliness requirements apply (how current does this data need to be?), what consistency checks apply across systems, and what accuracy validation is performed. The governance framework specifies who is responsible for maintaining quality in each domain and what the escalation path is when quality drops below the defined threshold. Technical implementation typically involves data quality rules applied at ingestion into the Silver layer, automated quality monitoring, and alerting to data owners when standards are breached.

**4. Data access and security.** Governance defines who can access what data, under what conditions, and for what purposes — and ensures that those rules are enforced technically and audited regularly. Access governance is particularly important for sensitive data: personally identifiable information, protected health information, financial data, and commercially sensitive data. The governance framework specifies access tiers, approval workflows, time-limited access for sensitive data, and audit log requirements. For regulated organisations — those under HIPAA, PCI DSS, SOC 2, GDPR, or financial regulatory oversight — access governance is not optional. It is a compliance obligation with specific technical and procedural requirements.

What data governance is not

**Governance is not a technology.** Data catalogues (Microsoft Purview, Alation, Collibra), Unity Catalog, data quality tools — these are technology that supports governance. But deploying a data catalogue does not create governance. Governance is the combination of policies, ownership, and processes that the technology enforces. Organisations that implement governance tools without the corresponding organisational changes — data ownership, agreed definitions, accountability structures — end up with expensive tools that nobody uses.

**Governance is not a one-time project.** Governance programmes that treat the work as a project — define the standards, document the lineage, launch the catalogue — and then stop maintenance produce governance documentation that decays. Data environments change constantly: new systems, new data domains, new use cases, new regulatory requirements. Governance needs to be maintained as an ongoing operational capability, not implemented and declared complete.

**Governance is not primarily about restriction.** Governance frameworks that focus exclusively on access control and data restriction — who cannot access what — create friction without proportionate value. The goal of governance is to make data trustworthy and accessible for legitimate use, not to restrict it. Access controls are part of that, but the more important elements are the quality standards, definitions, and ownership structures that make data worth accessing.

Common data governance failure modes

**Governance without ownership.** The most common failure mode: an organisation publishes data policies and standards but does not assign business ownership of data domains. Without ownership, there is no one accountable for enforcing the standards, no authority to resolve definition conflicts, and no one to escalate quality issues to. The governance documentation exists. The governance does not.

**Standards nobody follows.** Governance frameworks designed by the data team and imposed on the organisation without stakeholder involvement produce standards that nobody follows. Business users ignore policies they did not help create and do not understand the rationale for. Governance needs to be designed with the business, not for it.

**Tooling without process.** Deploying a data catalogue and expecting governance to follow is a common and expensive mistake. A catalogue is useful when data owners are maintaining their domain's documentation, when definitions are kept current, and when the catalogue is the first place people go to understand a dataset. Without the process discipline and ownership structures that make the catalogue accurate, it becomes a snapshot of what the organisation's data looked like at the time of the initial cataloguing exercise — which becomes less true over time.

**Governance that blocks AI.** Governance frameworks designed before the AI era often create access control structures that prevent AI systems from querying the data they need. Human-approval workflows for data access cannot scale to the query volumes AI systems generate. Effective governance for AI-enabled organisations requires policy-based access control that evaluates agent permissions programmatically, not manual approval workflows designed for human analysts.

How to build a data governance programme

Mature governance programmes follow a consistent build sequence:

**Start with a data inventory.** Before you can govern data, you need to know what you have. This means cataloguing every data source, understanding what it contains, how it connects to other systems, and who uses it. Most organisations discover sources they did not know existed during this exercise.

**Identify the highest-value, highest-risk domains.** Not all data needs the same governance maturity at the same time. Prioritise the domains where governance gaps cause the most business pain: disputed metrics in executive reporting, uncontrolled access to sensitive personal data, quality issues in the data that feeds your highest-stakes decisions.

**Assign data owners.** Work with business leadership to assign accountable owners to each prioritised domain. Define what ownership means: the owner is responsible for approving the canonical definition, maintaining quality standards, and approving access requests for sensitive data in their domain.

**Agree on canonical definitions.** For each prioritised domain, work with the data owner and relevant stakeholders to define the authoritative version of key metrics. Document the definition, the calculation logic, the approved data source, and the acceptable quality thresholds.

**Implement technical controls.** Deploy the technical infrastructure that enforces the governance policies: access controls in Unity Catalog or your platform's equivalent, data quality rules at ingestion, lineage tracking, and the semantic layer that enforces metric definitions downstream.

**Establish ongoing operations.** Define the governance cadence: quarterly definition reviews, monthly quality reviews, annual access reviews, and a defined process for how new data domains get onboarded into the governance programme.

Frequently Asked Questions

How long does it take to implement data governance?

Building foundational governance for an enterprise's highest-priority data domains — financial, customer, operational — typically takes 6–12 months. This includes the organisational work (assigning ownership, agreeing definitions, establishing the governance committee structure) and the technical work (deploying the catalogue, implementing quality checks, setting up access controls). Extending governance to the full data estate is a 2–3 year programme. The key is sequencing: start with the highest-value domains and the most visible governance gaps, demonstrate value, and expand from there.

What is the difference between data governance and data management?

Data management is the broader discipline that encompasses how data is stored, processed, moved, and maintained. Data governance is a subset of data management that specifically covers the policies, ownership, and standards that govern data — the rules about how data should be used, who is accountable for it, and what standards it must meet. Data management without governance produces technically functional data infrastructure that is not trusted or used effectively.

Do we need a data governance committee?

A cross-functional governance committee — with representation from Finance, Legal, Compliance, IT, and the business functions that own key data domains — is the organisational mechanism that makes governance decisions that no single function has the authority to make alone. What does our canonical definition of revenue mean when Finance and Sales calculate it differently? Who approves exceptions to our data retention policy? How do we handle a data quality issue that originated in a system owned by Procurement but affects reports used by the CEO? These questions require cross-functional authority. Whether you call the body a governance committee, a data council, or something else is less important than having a clear forum with the right participants and a defined mandate.

We already have Microsoft Purview deployed. Does that mean we have data governance?

Microsoft Purview (and similar tools — Alation, Collibra, Atlan) are data catalogue and governance platforms. Deploying Purview gives you the technology infrastructure for governance: a place to document data sources, track lineage, apply classification labels, and manage access policies. It does not create governance. Governance requires the organisational elements — ownership, agreed definitions, quality standards, accountability structures — that the tool enforces. Purview with active data ownership and maintained definitions is excellent governance infrastructure. Purview without those elements is an empty catalogue that decays.

How does data governance relate to GDPR and other privacy regulations?

Privacy regulations like GDPR, CCPA, and HIPAA impose specific data governance requirements: lawful basis documentation for data processing, data subject access and deletion rights, retention limits, security standards, and breach notification obligations. A mature data governance programme that includes data inventory, access control, lineage tracking, and retention management provides the foundation for privacy compliance. Organisations without governance programmes typically struggle with privacy compliance because they cannot answer basic questions: what personal data do we hold? Where is it? Who has access to it? How long have we had it? Governance programmes built for compliance reasons are narrower than full data governance but produce the same foundational capabilities.

Our data architecture consulting practice designs and implements governance frameworks that are maintainable by your team — not governance theatre that satisfies an audit and then decays. If your organisation has governance gaps that are causing visible business problems, book a free 30-minute audit and we will tell you exactly what is driving them and what a realistic remediation looks like.

Get your data architecture audit in 30 minutes.

A former Microsoft data architect audits your data foundation, identifies your top priorities, and sends you a written plan. Free. No pitch.

Book a Call →