The ETL vs. ELT Debate Is Over. The Answer Is Both.

After analyzing 14 production data architectures, I found that 11 used both ETL and ELT patterns strategically based on source system constraints and transformation complexity. The “ETL vs. ELT” debate was a false binary. Modern architectures use both patterns in the same pipeline, and the decision is technical (where can transformation best occur?), not ideological.

Why was the ETL versus ELT debate a false binary?

The debate was a false binary because ETL and ELT are not competing philosophies but complementary patterns suited to different technical constraints: ETL for transformations that must happen before loading (PII masking, format conversion, volume reduction) and ELT for transformations that benefit from warehouse compute (joins, aggregations, business logic).

ETL (Extract, Transform, Load) transforms data before loading it into the target system. ELT (Extract, Load, Transform) loads raw data first and transforms it within the target system. ETL reduces what enters the warehouse but requires external compute. ELT preserves raw data and uses warehouse power but increases storage and compute costs.

The ELT narrative won the marketing war around 2019 when cloud warehouses (Snowflake, BigQuery) made in-warehouse compute cheap and elastic. “Load everything raw, transform in the warehouse” became the default recommendation. dbt codified this pattern. But the narrative skipped cases where ELT is the wrong answer.

I maintain a pipeline that ingests from a legacy ERP system. The raw extract contains 47 columns of which 8 are needed. The raw data includes PII that must be tokenized before it enters the warehouse (compliance requirement). The raw records include duplicates from the source system’s replication method. Loading all of this raw and transforming in the warehouse would mean: storing 6x more data than needed, moving PII into the warehouse before masking (a compliance violation), and processing duplicates that could be eliminated at extraction time. ETL, specifically transforming during extraction, is the correct pattern here.

When does each pattern make sense?

ETL makes sense when transformations reduce data volume before loading, when compliance requires masking before data enters the warehouse, or when source-side compute is cheaper than warehouse compute. ELT makes sense when raw data preservation is valuable, when transformations require joining multiple sources already in the warehouse, or when business logic changes frequently and benefits from warehouse-speed iteration.

ETL for volume reduction: When a source produces 100GB daily but the analytical use case needs 15GB, filtering and projecting during extraction saves 85GB of storage and ingestion time. At $23 per TB per month for warehouse storage, the savings compound
ETL for compliance: When PII must be masked before entering the warehouse, the transformation must happen before loading. This is not a preference. It is a regulatory requirement in many industries. The privacy engineering discipline demands it
ELT for analytical flexibility: When the same raw data supports multiple analytical use cases, loading raw and transforming for each use case preserves optionality. The warehouse’s compute scales to handle the transformation load
ELT for rapid iteration: When business logic is evolving (new metric definitions, changing aggregation rules), transforming in the warehouse with dbt allows iteration without rebuilding extraction pipelines. According to ETL architecture principles, the choice between ETL and ELT should be driven by data characteristics, not by vendor preference

What does a hybrid architecture look like in practice?

A hybrid architecture applies ETL at the extraction layer (filtering, masking, deduplication) and ELT at the transformation layer (joining, aggregating, applying business logic), using each pattern where it provides the most value.

My standard pipeline has three stages. Stage 1 (ETL): extract from source, apply PII masking, filter to relevant records, deduplicate, and load into a raw zone. Stage 2 (ELT): within the warehouse, apply business logic transformations using dbt, joining across sources, aggregating, and creating analytical models. Stage 3 (serving): materialize final tables for consumption by dashboards and APIs.

The key design principle: transformation should happen where it is most efficient and most appropriate. PII masking is most appropriate outside the warehouse (the data should never enter unmasked). Business logic joining is most efficient inside the warehouse (where all source data is co-located). The modern data stack’s insistence on “ELT everything” was an overcorrection from the old world’s “ETL everything.” The mature position is “the right pattern for the right transformation.”

What should data teams take from this?

Data teams should stop identifying as “ETL shops” or “ELT shops” and start evaluating each transformation independently: where does it need to happen, what are the constraints, and which pattern serves the specific technical requirement.

The debate was never about architecture. It was about marketing. Cloud warehouse vendors promoted ELT because it increased warehouse compute usage (their revenue model). ETL tool vendors promoted ETL because it justified their existence. Neither side was lying. Both were selling. The engineering answer was always “it depends,” and specifically it depends on data volume, compliance requirements, source system constraints, transformation complexity, and cost optimization goals. The FinOps perspective makes hybrid architectures the obvious choice: use cheap compute where it is available and expensive warehouse compute only where it is necessary.

The ETL vs. ELT debate is over because the answer was always both. The organizations that accepted this early built more efficient, more compliant, and less expensive data platforms. Those that committed ideologically to one pattern paid for that commitment in workarounds, compliance gaps, or unnecessary costs. Pattern flexibility is not indecision. It is engineering maturity.

Why was the ETL versus ELT debate a false binary?

When does each pattern make sense?

What does a hybrid architecture look like in practice?

What should data teams take from this?

More Essays

Data Quality as a Leading Indicator of Organizational Health

The Difference Between Data and Evidence

Python’s Gravity Well: Language Choice Shapes Architecture