BlogData Architecture

Data Product Thinking: How to Build Data Assets That People Actually Use

Austin Duncan
Austin Duncan
Managing Director & Principal Data Architect
·May 9, 202711 min read

A data product is not a dashboard or a table — it is a data asset designed with a specific user and a specific use case in mind, built to quality standards, maintained as a product, and measured by the outcomes it enables. Most data teams produce data outputs; few produce data products. The distinction explains most of the gap between analytics investment and analytics value.

The term "data product" has been used broadly enough to mean almost anything. In some contexts it means any analytical output. In others it means a machine learning model. In the data mesh literature, it means a domain team's managed data assets.

For this guide, a data product is a data asset designed with a specific user and a specific use case in mind, built to explicit quality standards, maintained as an ongoing product rather than a one-time deliverable, and measured by the outcomes it enables for its users.

This is a narrower definition than "anything the data team produces." It is also a more useful one, because it distinguishes data assets that are designed to be used from data assets that are designed to exist.

The Product vs Project Distinction

Most data team work operates on a project model. A stakeholder requests an analysis. The data team scopes, builds, and delivers. The project is closed. The output — a dashboard, a model, a report — continues to exist, but no one is accountable for its ongoing quality, relevance, or adoption.

The product model applies a different accountability structure. A product has: an owner (someone accountable for its quality and relevance), a user (the specific person or group the product serves), a defined use case (the decision or workflow the product enables), quality standards (what good looks like, quantified), and a lifecycle (creation, iteration, maintenance, and eventually deprecation when it is no longer needed).

Most data team output is not a product. It is a deliverable that stops being relevant when it stops being actively maintained, and that stops being actively maintained as soon as the project is "done."

The shift from project to product thinking is not a tools problem or a technology problem — it is an accountability structure and incentive design problem.

Who Is the User?

The first question in data product design is specific: who is this for? Not "the business" or "our stakeholders" — a specific role, a specific person, or a specific workflow.

"Revenue dashboard for the VP of Sales" is more specific than "sales dashboard." "Customer health score surfaced in Salesforce for account executives who manage the renewal conversation" is more specific than "customer health analytics." The specificity of the user definition determines the specificity of the design choices.

Most data teams design data assets for hypothetical users. They build a comprehensive dashboard that could serve multiple roles, with multiple filter options, showing multiple metrics at multiple levels of granularity. This maximises coverage and minimises adoption — because a dashboard designed for everyone is optimised for no one.

A data product is designed for the specific analytical need of the specific user. Other users may find it useful. That is secondary. The design is driven by the primary user's job to be done.

Defining the Use Case

The use case is the specific decision or workflow the data product enables. It should be specific enough to determine design choices.

"Making better decisions" is not a use case. "Identifying which accounts are at risk of churning before their renewal date so account executives can intervene" is a use case. The specific use case drives specific design choices: the data must be at the account level, it must include a churn risk score, the score must be current relative to the renewal timeline, it must be surfaced in the CRM rather than a separate dashboard, it must be simple enough to act on without interpretation.

The test of a well-defined use case: can you trace every design decision back to a specific requirement the use case imposes? If the data product contains elements that do not serve the defined use case, either the use case is poorly defined or the data product is being over-engineered.

Quality Standards as Product Requirements

A data product has quality requirements just as a software product has performance requirements. The quality requirements are:

**Accuracy**: the data is correct. How correct? Define the acceptable error rate. For a financial reporting data product, the acceptable error rate may be zero — any material discrepancy disqualifies the data for reporting use. For an operational priority queue, an error rate of 5% may be acceptable.

**Freshness**: the data is current enough for the use case. How current? A daily inventory management data product needs to be current as of the morning warehouse run. A real-time fraud detection data product needs to be current within seconds. Define the SLA explicitly.

**Completeness**: the data covers all records it should cover. How complete? Is a 99% coverage rate acceptable, or must coverage be 100%? Missing records in a compliance data product may be a regulatory issue. Missing records in a marketing analytics data product may be inconsequential.

**Consistency**: the data is internally consistent. Do calculations that should sum correctly actually sum correctly? Are foreign key relationships valid? Are there duplicate records where uniqueness is expected?

These quality requirements are testable. Automated tests that verify accuracy, freshness, completeness, and consistency run against the data product on each refresh. Violations trigger alerts to the product owner. Quality is monitored, not assumed.

Measuring Adoption and Outcomes

A data product that is not used produces no value. Adoption measurement is required to know whether the product is serving its intended purpose.

**Usage metrics**: how many of the intended users are using the data product? How frequently? What percentage of target users have engaged with it in the last 30 days? Low adoption despite high quality is a signal of a user experience problem, a discovery problem, or a relevance problem.

**Outcome metrics**: is the data product producing the outcomes it was designed to enable? A churn prediction model is adoption-measured by how often account executives access the churn score. It is outcome-measured by whether accounts where the score was acted on had higher retention than comparable accounts where it was not.

Outcome measurement is harder than adoption measurement but more important. Adoption measures whether the product is being used; outcome measures whether the use is producing value. A data product that is frequently accessed but never influences decisions is not a product — it is a popular decoration.

The Data Product Lifecycle

A data product has a lifecycle. It is created for a specific use case, iterated as users provide feedback and requirements evolve, maintained as underlying data sources change, and eventually deprecated when the use case is no longer relevant or a better solution has replaced it.

**Deprecation is as important as creation.** Data teams that do not deprecate accumulate technical debt in the form of maintained products that no one uses. Every active data product consumes engineering capacity for maintenance. The decision to deprecate a data product should be driven by adoption data — when a product's active usage falls below a threshold and there is no clear path to restoring relevance, it should be deprecated.

**Iteration requires user feedback mechanisms.** A data product that is deployed and never revisited is not being managed as a product. Regular feedback cycles — user interviews, usage analysis, periodic review with the primary users — surface opportunities to improve quality, relevance, and usability. The best data products improve over time through systematic user feedback.

Our data architecture and BI strategy practice designs data products for specific user needs — contact us to discuss your data product requirements.

Get your data architecture audit in 30 minutes.

A former Microsoft data architect audits your data foundation, identifies your top priorities, and sends you a written plan. Free. No pitch.

Book a Call →