Question 1

What does an Azure data engineering engagement typically cost?

Accepted Answer

Costs range from roughly $15,000 for an assessment to $500,000 or more for a full enterprise platform, depending on scope. A cloud architecture assessment (mapping your current state, identifying gaps, producing a remediation roadmap) typically costs $15,000–$35,000 for a mid-market organisation, equivalent to 2–3 weeks of senior engineering time. A greenfield lakehouse build covering 3–5 data domains typically runs $80,000–$200,000. A full enterprise data platform build with 10+ data domains and complex governance requirements runs $200,000–$500,000+. Cloud migration engagements from on-premise infrastructure are scoped per environment: a mid-market organisation migrating a SQL Server data warehouse typically pays $60,000–$150,000. We provide fixed-price proposals for defined scope after an initial discovery call.

Question 2

Should we use Databricks or Snowflake for our data platform?

Accepted Answer

Both are excellent platforms — the right choice depends on your primary workloads. Databricks is stronger when you have significant ML and AI engineering requirements, need Spark for large-scale distributed processing, or are building a lakehouse on Delta Lake with complex transformation requirements. Snowflake is stronger for SQL-first analytics teams, time-series workloads, and organisations that want a fully managed platform with minimal infrastructure overhead. Many enterprise organisations run both: Databricks for ML engineering and complex Spark pipelines, Snowflake for governed SQL analytics. If you are on Azure with a Microsoft 365 footprint, Microsoft Fabric is a third option that integrates tightly with your existing investments. We work across all three and will recommend based on your actual workload requirements.

Question 3

What is dbt and do we need it?

Accepted Answer

dbt (data build tool) is a transformation framework that lets you write data transformations as SQL SELECT statements, with built-in testing and documentation, and with the project kept as plain files under version control (typically Git). It runs inside your data warehouse or lakehouse; it is not another data movement tool. For most organisations building a Silver and Gold layer in a lakehouse, dbt is the right tool for managing transformation logic: it makes transformations testable, documented, and maintainable. If your transformation requirements are simple and your team is SQL-first, dbt is almost always the right choice over custom Spark or Python. In our experience, the main reason organisations do not use dbt is that they have not been introduced to it. It has a modest learning curve but pays back quickly in maintainability.
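To make the idea concrete, here is a minimal sketch of the pattern dbt is built around: a transformation expressed as a single SELECT, materialised inside the database, then checked by tests that are themselves queries returning any failing rows. This is not dbt itself. It uses Python's built-in sqlite3 module as a stand-in warehouse, and the table names, column names, and helper functions are all hypothetical.

```python
import sqlite3

# Stand-in "warehouse" with a hypothetical raw table (illustrative data only).
conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE raw_orders (
        order_id INTEGER, customer_id INTEGER, amount REAL, status TEXT
    );
    INSERT INTO raw_orders VALUES
        (1, 10, 120.0, 'complete'),
        (2, 10,  80.0, 'complete'),
        (3, 11,  50.0, 'cancelled'),
        (4, 12,  30.0, 'complete');
""")

# The "model": the whole transformation is one SELECT statement.
CUSTOMER_REVENUE_MODEL = """
    SELECT customer_id,
           SUM(amount) AS total_revenue,
           COUNT(*)    AS order_count
    FROM raw_orders
    WHERE status = 'complete'
    GROUP BY customer_id
"""

def build_model(conn, name, select_sql):
    """Materialise a SELECT as a table inside the database."""
    conn.execute(f"DROP TABLE IF EXISTS {name}")
    conn.execute(f"CREATE TABLE {name} AS {select_sql}")

def failing_rows(conn, test_sql):
    """A test is a query that returns the rows violating an expectation."""
    return conn.execute(test_sql).fetchall()

build_model(conn, "customer_revenue", CUSTOMER_REVENUE_MODEL)

# Two expectations on the model's key column, each written as a query.
TESTS = {
    "customer_id_not_null":
        "SELECT * FROM customer_revenue WHERE customer_id IS NULL",
    "customer_id_unique":
        "SELECT customer_id FROM customer_revenue "
        "GROUP BY customer_id HAVING COUNT(*) > 1",
}
results = {name: failing_rows(conn, sql) for name, sql in TESTS.items()}

rows = conn.execute(
    "SELECT customer_id, total_revenue, order_count "
    "FROM customer_revenue ORDER BY customer_id"
).fetchall()
```

An empty list in `results` means the test passed. In a real dbt project the SELECT would live in its own `.sql` file and the expectations would be declared in YAML, but the division of labour is the same: the database does the work, and the logic stays in reviewable text files.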

Question 4

How long does a cloud data migration from on-premise take?

Accepted Answer

For a mid-market organisation, migrating a SQL Server or Oracle data warehouse to Azure typically takes 12–20 weeks for the full migration, including assessment, redesign, pipeline build, parallel operation, and cutover. The timeline is driven primarily by data complexity (number of source systems, data volume, transformation logic complexity) and governance requirements. The most common cause of delays is discovering mid-migration that source data quality is significantly lower than assumed, which is why we always start with a thorough assessment before committing to a migration timeline. Organisations that skip the assessment and go straight to migration routinely hit scope problems in weeks 6–10 that extend the timeline by months.

Question 5

How do you handle data governance in a cloud data platform?

Accepted Answer

Governance in a cloud data platform requires three things: access control (who can see what data), data lineage (where does each dataset come from and what transformations has it gone through), and data quality (what standards does each dataset meet and who is accountable for maintaining them). On Databricks, Unity Catalog provides fine-grained access control at the column level, complete data lineage tracking, and a cataloguing interface for data discovery. On Snowflake, a combination of role-based access control and Dynamic Data Masking handles access governance. For organisations with regulatory requirements — HIPAA, PCI, SOC 2 — we design governance frameworks that satisfy audit requirements with documented lineage, access logs, and data classification policies. Governance is not a feature you add after the platform is built; it needs to be designed in from the start.
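The three requirements above can be sketched in a few lines of plain Python. This is a toy model under stated assumptions, not the Unity Catalog or Snowflake API: the `Dataset` class, the `CATALOG` entries, the role names, and both helper functions are hypothetical, and a real platform enforces these rules inside the engine rather than in application code.

```python
from dataclasses import dataclass, field

@dataclass
class Dataset:
    name: str
    owner: str                                      # accountable for data quality
    upstream: list = field(default_factory=list)    # lineage: direct parents
    restricted: dict = field(default_factory=dict)  # column -> roles allowed to read it

# Hypothetical catalogue covering a Bronze -> Silver -> Gold chain.
CATALOG = {
    "bronze.patients": Dataset("bronze.patients", owner="ingest-team"),
    "silver.patients": Dataset(
        "silver.patients",
        owner="clinical-data",
        upstream=["bronze.patients"],
        restricted={"ssn": {"compliance"}},
    ),
    "gold.patient_summary": Dataset(
        "gold.patient_summary",
        owner="analytics",
        upstream=["silver.patients"],
    ),
}

def lineage(name, catalog=CATALOG):
    """Walk upstream references and list every ancestor, nearest first."""
    ancestors, queue = [], list(catalog[name].upstream)
    while queue:
        parent = queue.pop(0)
        if parent not in ancestors:
            ancestors.append(parent)
            queue.extend(catalog[parent].upstream)
    return ancestors

def read_row(dataset, row, role, catalog=CATALOG):
    """Return a copy of the row with restricted columns masked for this role."""
    rules = catalog[dataset].restricted
    return {
        col: (val if col not in rules or role in rules[col] else "***")
        for col, val in row.items()
    }

patient = {"id": 1, "ssn": "123-45-6789"}
analyst_view = read_row("silver.patients", patient, role="analyst")
compliance_view = read_row("silver.patients", patient, role="compliance")
gold_lineage = lineage("gold.patient_summary")
```

Here the analyst sees the `ssn` column masked while the compliance role sees the original value, and `gold_lineage` traces the Gold table back through Silver to Bronze. Keeping ownership, lineage, and access rules in one catalogue entry per dataset is the design point: an auditor can answer all three questions from a single place.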

Question 6

Can you support both our Azure environment and our Tableau or Power BI layer?

Accepted Answer

Yes. We work across both layers, the data platform and the BI layer, as integrated disciplines. The most common failure mode in enterprise analytics is building a technically sound data platform that is poorly connected to its BI tools: wrong connection types (live connections to tables that should be extracts), no semantic layer between the Gold layer and the BI tool, and business logic duplicated independently in the BI tool rather than governed in the platform. We design the integration between your cloud data platform and your BI layer as a deliberate architecture decision, not an afterthought. See our data architecture consulting and Tableau consulting pages for more on how we approach both sides.

Azure Data Engineering & Cloud Consulting

Capabilities

Azure Data Platform Engineering

Data Lakehouse Architecture & Build

ETL/ELT Pipeline Development

Cloud Migration from On-Premise

Databricks & Snowflake Engineering

Cloud Cost Optimisation
