DP-750 Certification Practice Question #40

Question

You are modeling a `dim_customer` dimension in Unity Catalog from a CDC feed using Lakeflow Spark Declarative Pipelines `AUTO CDC ... INTO`. The business has these requirements:

- Auditors and analysts must be able to reconstruct **what a customer's address and tier were at any point in time** (full historical record).
- Each historical version must carry validity columns so the **current** row is identifiable (active record has a null end timestamp).
- Out-of-order CDC events must be sequenced correctly by an event timestamp.

Which **two** statements correctly describe the SCD design you should implement? (Choose TWO.)

Accepted Answer

SCD Type 1 overwrites and retains only the current state, so it cannot satisfy the point-in-time/audit requirement (eliminating B and D). SCD Type 3 stores only a limited set of prior values (such as "previous" columns), not unbounded history (eliminating E). SCD Type 2 maintains every version with `__START_AT`/`__END_AT` validity columns and a null end value on the active row, exactly matching the requirement. Pairing it with `SEQUENCE BY` (the SQL form of `_sequence_by`) ensures out-of-order CDC events are applied in the correct order.

More DP-750 practice questions