Implementing Data Engineering Solutions Using Azure Databricks

Topic 1

Question 12

A logistics company captures large collections of unstructured files — scanned bills of lading (PDF), dash-cam video, and IoT sensor logs — into an existing Azure Data Lake Storage Gen2 container at `abfss://[email protected]/incoming/`. These files are written **directly by external capture systems** that run outside Azure Databricks and must continue to write to that exact path. The data engineering team wants to govern access to these files from Databricks with Unity Catalog while leaving the files in place and allowing the external systems to keep reading and writing them. Which type of Unity Catalog volume should the team create, and why?

  • A. A **managed volume**, because managed volumes can register an existing cloud path and are the only volumes that allow external systems to write to the underlying files.
  • B. An **external volume** registered against the existing path within a Unity Catalog external location, because external volumes add governance to data that already resides in cloud storage and are recommended when files are also read or written by external systems.
  • C. A **managed volume**, because Databricks requires managed volumes for all unstructured data and external volumes support only tabular Delta data.
  • D. Neither: the files must first be loaded into a managed Delta table with `COPY INTO`, because Unity Catalog cannot govern raw files.
  • E. An **external volume**, but Unity Catalog will physically move the files into the metastore's managed storage location when the volume is created.
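
For reference while weighing the options, the two volume types differ in how they are declared. The sketch below only builds the Databricks SQL text for each form, so it runs anywhere. The helper `volume_ddl`, the catalog/schema/volume names, and the angle-bracket placeholders in the storage path are illustrative assumptions, not values taken from the question.

```python
from typing import Optional


def volume_ddl(catalog: str, schema: str, volume: str,
               location: Optional[str] = None) -> str:
    """Build the Databricks SQL statement that declares a Unity Catalog volume.

    Hypothetical helper for illustration only.
    No location    -> managed volume (Unity Catalog chooses the storage path).
    With location  -> external volume registered on an existing cloud path,
                      which must fall under a Unity Catalog external location.
    """
    name = f"{catalog}.{schema}.{volume}"
    if location is None:
        return f"CREATE VOLUME {name}"
    return f"CREATE EXTERNAL VOLUME {name} LOCATION '{location}'"


def volume_path(catalog: str, schema: str, volume: str) -> str:
    """Path under which files in a volume are addressed from Databricks."""
    return f"/Volumes/{catalog}/{schema}/{volume}/"


# Assumed names; the storage path uses placeholders, not the real account.
managed_sql = volume_ddl("logistics", "raw", "scratch_files")
external_sql = volume_ddl(
    "logistics", "raw", "incoming_files",
    "abfss://<container>@<storage-account>.dfs.core.windows.net/incoming/",
)

print(managed_sql)
print(external_sql)
print(volume_path("logistics", "raw", "incoming_files"))
```

On a Databricks cluster, each statement could be passed to `spark.sql(...)`. Either kind of volume is then addressed through its `/Volumes/<catalog>/<schema>/<volume>/` path.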