BlogTableau

Tableau Prep: What It Is and How to Use It Effectively

Eric Chen
Eric Chen
Senior BI Solutions Architect
·June 11, 20269 min read

Tableau Prep is Tableau's data preparation tool — designed for analysts who need to clean, shape, and combine data before visualising it. Here is what it does well, where it falls short, and how it fits in a broader data architecture.

The quick answer

Tableau Prep is Tableau's visual data preparation tool — designed for analysts who need to clean, reshape, and combine data before visualising it in Tableau Desktop or Tableau Cloud. You connect to source data, build a visual flow of preparation steps (filter, clean, pivot, join, aggregate), and output a clean dataset to a Tableau data source or a file.

Tableau Prep is best for analysts who need to perform data preparation tasks that Tableau Desktop cannot do but who do not want to write SQL or Python. It is not a replacement for a proper data engineering pipeline — for production data transformation at scale, dbt and your data warehouse are the right tools. Prep is for analyst-owned, ad-hoc preparation work.

What Tableau Prep does

Tableau Prep Builder (the desktop application) provides a visual canvas where you build preparation flows:

**Connect**: Connects to files (Excel, CSV, JSON, Parquet), databases (SQL Server, PostgreSQL, Oracle, Snowflake, BigQuery), and Tableau Server/Cloud published data sources. You start by dragging in one or more input connections.

**Clean**: Inspect column distributions in a visual profile, rename or remove columns, change data types, find and replace values, group inconsistent values (New York / NY / New-York → New York), filter rows. The visual column profiles make it easy to spot anomalies — outliers, null clusters, inconsistent formatting — without querying.

**Pivot**: Convert wide data (one column per date) to long data (a date column and a value column), or the reverse. Pivots in Prep are more flexible than Tableau Desktop's limited pivot functionality.

**Join**: Combine two datasets on matching keys. Prep shows the join results visually — unmatched records, join type effects — making it easier to diagnose join problems than writing SQL.

**Union**: Stack datasets with the same structure (multiple monthly files into a single dataset). Prep handles schema mismatches between files automatically.

**Aggregate**: Group and summarise data — GROUP BY with SUM, COUNT, AVG, MIN, MAX. Use this to reduce row-level transaction data to a summary level appropriate for reporting.

**Script**: Apply R or Python scripts within a Prep flow for transformations that cannot be expressed visually. This makes Prep extensible for analysts with scripting skills.

**Output**: Write the prepared data to a Tableau data source (.hyper extract), a CSV or Excel file, a published data source on Tableau Server or Cloud, or a database table. The output is consumed by Tableau Desktop, Tableau Cloud, or other downstream tools.

Tableau Prep vs Tableau Desktop calculations

Many preparation tasks can be performed in Tableau Desktop using calculated fields: string manipulation, date calculations, CASE WHEN logic, bin creation. The question is where the preparation should happen.

Tasks that must be done in Prep (not Desktop):

- Pivoting data from wide to long format at scale

- Unioning multiple files from a directory

- Joining data from different databases (Desktop can join within a single connection but cross-database joins are limited)

- Aggregating to a different grain than the source data before analysis

- Cleaning messy categorical values across hundreds of rows

Tasks that can be done in Desktop calculations (and often should be, for maintainability):

- Simple string manipulation (TRIM, REPLACE, UPPER/LOWER)

- Date extraction (YEAR([Date]), MONTH([Date]))

- Conditional classification (IF [Revenue] > 100000 THEN 'Large' END)

- Simple ratio calculations

The general principle: transformations that change the shape or grain of the data belong in Prep (or in the data warehouse if at scale). Transformations that create derived analytical variables belong in Desktop calculated fields.

Tableau Prep Conductor (Server/Cloud scheduling)

Tableau Prep Builder is the desktop authoring tool. **Tableau Prep Conductor** is the server-side execution capability that schedules flows to run automatically on Tableau Server or Tableau Cloud.

With Prep Conductor, you publish a Prep flow to Server/Cloud, configure a schedule (daily at 2am, every 4 hours, etc.), and the flow runs automatically and outputs to a published data source that dashboards refresh against. This converts a one-time, analyst-driven preparation task into an automated pipeline that keeps dashboards current without manual intervention.

Prep Conductor requires a **Data Management** licence on Tableau Server or Tableau Cloud. This is an add-on to base Tableau licensing — verify that your organisation has this licence before building a Prep-based automation workflow.

Where Tableau Prep fits in a broader data architecture

Tableau Prep is appropriate when:

- An analyst needs a one-off data preparation task that is too complex for Desktop but does not justify building a proper ETL pipeline

- A data preparation workflow is analyst-owned and analyst-maintained — it does not need data engineering involvement

- The source data is in a file format (Excel, CSV) that a proper ETL/ELT pipeline would also need to handle manually

- The preparation logic is simple enough to be built and maintained in Prep without generating technical debt

Tableau Prep is **not** appropriate when:

- The preparation logic needs to be production-grade, tested, and monitored — use dbt and your data warehouse

- Multiple people need to maintain the preparation logic — Prep flows are hard to version-control and collaborate on

- The data volume is large — Prep is not designed for big data volumes; it processes data in memory on the desktop

- The same transformation logic is needed across multiple downstream tools — Prep output is primarily for Tableau; shared transformation logic belongs in the data warehouse

For organisations with a data warehouse and a transformation layer (dbt), Tableau Prep is rarely the right tool for data preparation. The transformation logic should live in dbt, and Tableau should connect to the dbt-produced Gold layer tables. Prep is most valuable in organisations that do not yet have a mature data engineering function — where analysts are doing their own data wrangling and need a tool that is more powerful than Excel but does not require engineering support.

Tableau Prep vs alternatives

**Alteryx** is Tableau Prep's closest alternative — a visual data preparation and analytics workflow tool with significantly more capability. Alteryx handles complex text mining, spatial analytics, machine learning, and predictive analytics that Prep cannot. It also integrates with a wider range of data sources. The limitation: Alteryx is significantly more expensive than Tableau Prep (which is included with Tableau Creator licences). For organisations already on Tableau Creator, Prep is the lower-friction choice for simple preparation tasks.

**Power Query (Microsoft)** is the data preparation tool embedded in Excel and Power BI. For organisations on Power BI, Power Query is the equivalent of Tableau Prep — visual data transformation before loading into the Power BI model. Similar capabilities, similar limitations: good for analyst-owned, modest-scale preparation; not a substitute for a data engineering pipeline.

**dbt** is not a Tableau Prep alternative in the user-facing sense — dbt requires SQL, a data warehouse, and a more technical user. But for any preparation task that is production-grade and part of a repeating analytics workflow, dbt in the data warehouse is the right answer over Tableau Prep. See what is dbt for the full picture.

**Python/Pandas/Spark** — for analysts who are comfortable with Python, pandas offers comparable flexibility to Prep with more programmatic control. For large-scale preparation, PySpark on Databricks or a cloud warehouse handles volumes that Prep cannot. For production data pipelines, Python-based engineering is more maintainable than Prep flows.

Best practices for Tableau Prep

**Document your flows.** Prep flows can become hard to follow over time. Use step descriptions and meaningful naming to document what each step does. This matters especially if someone else will need to maintain the flow later.

**Test outputs against expectations.** Before publishing a flow for scheduled production use, manually verify the output against known data. Row counts, total amounts, distinct counts on key dimensions. Prep flows can silently produce incorrect results (a join that drops rows unexpectedly, an aggregate that summarises at the wrong grain) — manual verification catches these before they affect dashboards.

**Prefer published data sources as inputs.** Where possible, connect Prep flows to published certified data sources on Tableau Server or Cloud rather than directly to databases or files. This keeps permissions centralised and makes it easier to manage data source changes downstream.

**Put complex logic in the database.** If a Prep flow contains more than 10-15 steps or significant complexity, that is a signal the logic belongs in the database or in dbt rather than in Prep. Technical debt in Prep flows is hard to manage.

**Keep Prep for what it is good at.** Prep excels at one-off data shaping tasks, combining files, and preparing data for a specific analytical purpose. Use it for that. Avoid building production data pipelines in Prep unless you have no engineering alternative.

For Tableau consulting that covers the full stack — from Tableau Server and Tableau Cloud management to data source architecture and Prep workflow design — see our Tableau consulting services. If you are working through how Tableau Prep should fit in your specific environment, book a free 30-minute audit for a direct assessment.

Get your data architecture audit in 30 minutes.

A former Microsoft data architect audits your data foundation, identifies your top priorities, and sends you a written plan. Free. No pitch.

Book a Call →