Tableau Prep is the data preparation tool in the Tableau platform — purpose-built for cleaning, reshaping, and combining data before it reaches Tableau Desktop or Tableau Cloud for visualisation. It gives analysts a visual, step-by-step interface for data transformation that produces reproducible, shareable preparation flows without requiring SQL expertise or engineering support.
Tableau Prep is the visual data preparation tool in the Tableau platform. Where Tableau Desktop and Tableau Cloud focus on visualisation and exploration, Prep focuses on the upstream step: getting data into a state where meaningful analysis is possible. It addresses the cleaning, reshaping, and combining work that otherwise consumes the majority of analyst time — the time spent in Excel, Python, or SQL fixing data quality issues before the actual analysis can begin. Prep externalises this work into explicit, auditable, repeatable preparation flows that other analysts can review, modify, and run.
What Tableau Prep Solves
The data that analysts receive is rarely analysis-ready. Common problems: date fields stored as text strings in inconsistent formats; customer names with leading/trailing spaces, inconsistent capitalisation, and variant spellings of the same entity; multiple source tables that need to be joined but have non-matching key values due to encoding differences; pivoted data that needs to be unpivoted to work with Tableau's row-based data model; fields with mixed data types that need splitting into separate columns.
Addressing these problems in Tableau Desktop calculations is possible but awkward — calculations become defensive and complex, the cleaning logic is hidden inside the visualisation layer, and it cannot be shared with other users. Addressing them in SQL requires database access and SQL proficiency that many analysts do not have. Addressing them in Excel means undocumented, non-reproducible manual steps that break when data changes.
Prep addresses them in a dedicated tool with a visual interface, producing a documented, shareable flow that can be scheduled to run automatically when source data changes.
Core Prep Operations
Tableau Prep provides a visual canvas where data moves through a sequence of step nodes. Each step node applies a transformation:
**Input step** — connects to a data source: database tables, Excel files, CSV files, published Tableau data sources, or cloud storage. Input steps define the source connection and initial field selection.
**Clean step** — the primary transformation node. In a clean step, analysts can: rename fields; change data types; create calculated fields using Tableau calculation syntax; remove nulls and errors; group values (correcting variant spellings, collapsing categories); filter rows; and remove unnecessary fields. The clean step shows a visual profile of each field — value distribution, null percentage, top values — making data quality issues immediately visible.
**Join step** — merges two data streams using join conditions. Prep shows a Venn diagram visualisation of the join result, including an explicit count of records matched on both sides, matched on left only, matched on right only, and unmatched. This visual feedback makes misconfigured joins immediately apparent — if the right-only count is non-zero when you expected a clean inner join, the join condition or data has a problem.
**Union step** — appends two data streams with the same structure. Prep handles union mismatches — fields present in one stream but not the other — by showing them explicitly and allowing the analyst to map or exclude them.
**Pivot step** — reshapes data between wide format (one row per entity, many columns for measures) and long format (one row per entity-measure combination). Tableau's visualisation layer expects long format for most chart types; pivot operations convert the Excel-style wide tables that source systems typically produce.
**Aggregate step** — groups data and computes aggregate measures. Useful for pre-aggregating large datasets before publishing to Tableau, reducing the data volume that needs to be stored in extracts or queried at runtime.
**Output step** — writes the prepared data to a published Tableau data source, a .hyper extract file, or a database table. The output step defines where the prepared data goes and triggers the downstream update when the flow runs.
Flow Design Principles
Well-designed Prep flows are readable, maintainable, and robust to data changes:
**Name steps meaningfully** — each step node has a name. Default names like "Clean 1" and "Clean 2" are useless six months later. "Fix customer name encoding" and "Remove test accounts" are useful.
**One concern per step** — group related transformations in the same step, but do not combine unrelated concerns. A step that fixes date formats, removes nulls, and renames fields is doing three things; consider separating them so each step has a clear purpose.
**Filter early** — remove rows that will not be used as early in the flow as possible. Filtering early reduces the data volume processed by subsequent steps and improves flow performance.
**Profile every field in input data** — the field profile panel in clean steps shows value distributions that reveal data quality issues before they cause downstream problems. Make reviewing profiles part of the flow development process, not an afterthought.
**Document assumptions** — Prep flows can contain description text at the flow level and step level. Document the source system, the expected data format, the business rules applied, and the known limitations. Future maintainers depend on this documentation.
Tableau Prep Builder vs Tableau Prep Conductor
Tableau Prep has two components with different roles:
**Tableau Prep Builder** — the desktop application used to author and run flows. This is the tool analysts interact with when building and testing preparation flows. It is included with Creator licences.
**Tableau Prep Conductor** — the server-side component, available on Tableau Server and Tableau Cloud, that schedules and runs flows automatically. Conductor allows flows authored in Prep Builder to be published to the server and run on a schedule — nightly, hourly, or triggered by data source updates — without manual execution. Conductor is part of the Data Management Add-on for Tableau Server; it is included in Tableau Cloud.
For production use — where flows need to run automatically and produce updated published data sources on a schedule — Conductor is required. Flows run only in Prep Builder are not suitable for production use: they require manual execution and produce outputs that are only as fresh as the last manual run.
Prep vs SQL vs Python for Data Preparation
Organisations sometimes debate whether data preparation belongs in Tableau Prep, SQL, or Python. The answer depends on the analyst's skill set, the complexity of the transformation, and the governance requirements:
**SQL** is appropriate when the transformation logic belongs at the database layer — where it can be version-controlled, tested, and applied consistently across consumers. For transformations that every downstream consumer of a table should receive, SQL views or materialised tables are the right layer.
**Python/dbt** is appropriate for complex, programmatic transformations that exceed SQL's readability at scale — multi-step conditional logic, machine learning feature engineering, or transformations with dependencies across many tables.
**Tableau Prep** is appropriate when the transformation is specific to the analytics use case, when the analyst performing the transformation does not have SQL or Python access or proficiency, or when the visual profile-driven interface significantly accelerates identifying and resolving data quality issues.
Prep is not a replacement for a data engineering pipeline for data that needs broad, consistent transformation. It is a complement — the last-mile preparation tool for analysts who need to reshape or clean data for their specific analysis.
Our Tableau consulting practice designs Prep flows and data preparation architectures for organisations managing Tableau at scale — contact us to discuss data preparation strategy for your analytics environment.
A former Microsoft data architect audits your data foundation, identifies your top priorities, and sends you a written plan. Free. No pitch.
Book a Call →