DP-750 Certification Practice Question #80

Question

Your team is building a CI/CD process for a Lakeflow project deployed with Databricks Asset Bundles. You are defining the automated testing strategy and want to map each test type to the right tool and scope so that the pipeline catches defects at the cheapest possible layer before promotion to production.

The team agrees on these layers:

- **Unit tests** — validate isolated transformation logic (a single function or table) with mock data, fast, no production data.
- **Integration tests** — validate that workflows and data pipelines run end-to-end against real Databricks compute.

Which TWO statements correctly describe Databricks-recommended testing practices for these layers? (Choose TWO.)

Accepted Answer

Databricks recommends `pytest` for unit-testing Python business logic, and the Lakeflow pipeline unit-testing framework runs a subset of a pipeline against a fully isolated catalog with mock data, asserting results with standard `pytest` — exactly option A. For integration tests, Databricks recommends running them after deployment (for example via `databricks bundle run`) against real compute, with tools like `chispa` for DataFrame comparison — option B. Option C contradicts the isolation principle, D confuses `bundle validate` (a YAML/configuration syntax check) with logic testing, and E is false because integration tests are explicitly automated in CI/CD pipelines.

More DP-750 practice questions