FEFreeExamDumps.in

Implementing Data Engineering Solutions Using Azure Databricks

Topic 1

Question 63

DP-750 voucher + Udemy course (lifetime access) = ₹3,500 for Indian ID card holders.

Details →

An upstream SaaS vendor frequently adds new optional columns to the JSON files they drop into a Unity Catalog volume. You ingest these files into the Delta table `bronze.events` with Auto Loader. Today, when a new column appears, your nightly batch write fails with a schema-mismatch error because Delta Lake enforces the schema on write and rejects columns that are not already in the target table. The requirement is: **new (additive) columns from the source must be automatically appended to the `bronze.events` schema and their data persisted** — without manually altering the table and without losing data — while existing column types remain unchanged. You have this Auto Loader write: ```python (spark.readStream .format("cloudFiles") .option("cloudFiles.format", "json") .option("cloudFiles.schemaLocation", "/Volumes/main/bronze/_schemas/events") .load("/Volumes/main/bronze/landing/events") .writeStream # <-- option goes here .option("checkpointLocation", "/Volumes/main/bronze/_ckpt/events") .toTable("bronze.events") ) ``` Which option should you add to the **write** so additive source columns are merged into the Delta target schema?

  • A`.option("overwriteSchema", "true")`
  • B`.option("mergeSchema", "true")`
  • C`.option("cloudFiles.schemaEvolutionMode", "failOnNewColumns")`
  • D`.option("cloudFiles.inferColumnTypes", "true")`
  • E`.option("ignoreChanges", "true")`