FEFreeExamDumps.in

DP-203 Practice Questions — Page 26

Question 251

You have an Azure Databricks resource.

You need to log actions that relate to changes in compute for the Databricks resource.

Which Databricks services should you log?

  • A. clusters
  • B. workspace
  • C. DBFS
  • D. SSH
  • E. jobs

Question 252

You are designing a highly available Azure Data Lake Storage solution that will include geo-zone-redundant storage (GZRS).

You need to monitor for replication delays that can affect the recovery point objective (RPO).

What should you include in the monitoring solution?

  • A. 5xx: Server Error errors
  • B. Average Success E2E Latency
  • C. Availability
  • D. Last Sync Time

Question 253

You configure monitoring for an Azure Synapse Analytics implementation. The implementation uses PolyBase to load data from comma-separated value (CSV) files stored in Azure Data Lake Storage Gen2 using an external table.

Files with an invalid schema cause errors to occur.

You need to monitor for an invalid schema error.

For which error should you monitor?

  • A. EXTERNAL TABLE access failed due to internal error: 'Java exception raised on call to HdfsBridge_Connect: Error [com.microsoft.polybase.client.KerberosSecureLogin] occurred while accessing external file.'
  • B. Cannot execute the query "Remote Query" against OLE DB provider "SQLNCLI11" for linked server "(null)". Query aborted-- the maximum reject threshold (0 rows) was reached while reading from an external source: 1 rows rejected out of total 1 rows processed.
  • C. EXTERNAL TABLE access failed due to internal error: 'Java exception raised on call to HdfsBridge_Connect: Error [Unable to instantiate LoginClass] occurred while accessing external file.'
  • D. EXTERNAL TABLE access failed due to internal error: 'Java exception raised on call to HdfsBridge_Connect: Error [No FileSystem for scheme: wasbs] occurred while accessing external file.'

Question 254

You have two fact tables named Flight and Weather. Queries targeting the tables will be based on the join between the following columns.

You need to recommend a solution that maximizes query performance.

What should you include in the recommendation?

  • A. In the tables, use a hash distribution of ArrivalDateTime and ReportDateTime.
  • B. In the tables, use a hash distribution of ArrivalAirportID and AirportID.
  • C. In each table, create an IDENTITY column.
  • D. In each table, create a column as a composite of the other two columns in the table.

Question 255

You have an Azure Data Factory pipeline that has the activities shown in the following exhibit.

Use the drop-down menus to select the answer choice that completes each statement based on the information presented in the graphic.

NOTE: Each correct selection is worth one point.

Hot Area:

Question 256

You have several Azure Data Factory pipelines that contain a mix of the following types of activities:

  • Wrangling data flow
  • Notebook
  • Copy
  • Jar

Which two Azure services should you use to debug the activities? Each correct answer presents part of the solution.

NOTE: Each correct selection is worth one point.

  • A. Azure Synapse Analytics
  • B. Azure HDInsight
  • C. Azure Machine Learning
  • D. Azure Data Factory
  • E. Azure Databricks

Question 257

You have an Azure Synapse Analytics dedicated SQL pool named Pool1 and a database named DB1. DB1 contains a fact table named Table1.

You need to identify the extent of the data skew in Table1.

What should you do in Synapse Studio?

  • A. Connect to the built-in pool and run sys.dm_pdw_nodes_db_partition_stats.
  • B. Connect to Pool1 and run DBCC CHECKALLOC.
  • C. Connect to the built-in pool and run DBCC CHECKALLOC.
  • D. Connect to Pool1 and query sys.dm_pdw_nodes_db_partition_stats.
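
As background on what "extent of the data skew" means, here is a minimal sketch, assuming the per-distribution row counts for a table have already been collected by some means. The function name `skew_summary`, the `spread` measure, and the sample counts are hypothetical and chosen only for illustration; a real dedicated SQL pool spreads a table across 60 distributions, while the sample uses four to stay short.

```python
from statistics import mean


def skew_summary(rows_per_distribution):
    """Summarize how unevenly rows are spread across distributions.

    rows_per_distribution: iterable of non-negative row counts,
    one entry per distribution (hypothetical input for this sketch).
    """
    counts = list(rows_per_distribution)
    if not counts:
        raise ValueError("need at least one distribution")

    largest = max(counts)
    smallest = min(counts)

    # Relative spread between the fullest and emptiest distribution:
    # 0.0 means perfectly even, values near 1.0 mean heavy skew.
    spread = (largest - smallest) / largest if largest else 0.0

    return {
        "max": largest,
        "min": smallest,
        "mean": mean(counts),
        "spread": spread,
    }


# Hypothetical row counts: one distribution holds four times the rows
# of the others, so it would dominate the run time of a scan or join.
sample_counts = [1000, 1000, 1000, 4000]
summary = skew_summary(sample_counts)
```

A query runs only as fast as its slowest distribution, so a single oversized distribution like the last one in the sample is what a skew check is meant to reveal.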

Question 258

You manage an enterprise data warehouse in Azure Synapse Analytics.

Users report slow performance when they run commonly used queries. Users do not report performance changes for infrequently used queries.

You need to monitor resource utilization to determine the source of the performance issues.

Which metric should you monitor?

  • A. Local tempdb percentage
  • B. Cache used percentage
  • C. Data IO percentage
  • D. CPU percentage

Question 259

A company purchases IoT devices to monitor manufacturing machinery. The company uses an Azure IoT Hub to communicate with the IoT devices.

The company must be able to monitor the devices in real time.

You need to design the solution.

What should you recommend?

  • A. Azure Analysis Services using Azure PowerShell
  • B. Azure Stream Analytics Edge application using Microsoft Visual Studio
  • C. Azure Analysis Services using Microsoft Visual Studio
  • D. Azure Data Factory instance using Azure Portal

Question 260

You have an Azure event hub named retailhub that has 16 partitions. Transactions are posted to retailhub. Each transaction includes the transaction ID, the individual line items, and the payment details. The transaction ID is used as the partition key.

You are designing an Azure Stream Analytics job to identify potentially fraudulent transactions at a retail store. The job will use retailhub as the input. The job will output the transaction ID, the individual line items, the payment details, a fraud score, and a fraud indicator.

You plan to send the output to an Azure event hub named fraudhub.

You need to ensure that the fraud detection solution is highly scalable and processes transactions as quickly as possible.

How should you structure the output of the Stream Analytics job? To answer, select the appropriate options in the answer area.

NOTE: Each correct selection is worth one point.

Hot Area:
