[Jan-2022] DP-201 Certification with Actual Questions from Exam4Tests [Q85-Q105]

Share

[Jan-2022] DP-201 Certification with Actual Questions from Exam4Tests

Updated DP-201 Dumps PDF - DP-201 Real Valid Brain Dumps With 207 Questions!


The content of the Microsoft DP-201 exam revolves around three main topics that comprise of specific knowledge and skills that the candidates should develop before sitting for the test. The details of these areas are outlined below:

Topic 1. Designing Azure Data Storage Solutions (40-45%)

  • Designing non-relational Cloud data stores: here the applicants should demonstrate that they are capable of designing data partitions and distribution; designing for scale (latency, throughput, and multi-region); designing a solution that utilizes Data Lake Storage Gen2, Blob storage, or Cosmos DB; selecting the proper Cosmos DB API; designing a strategy for disaster recovery; designing for high availability.
  • Suggesting an Azure data storage solution in compliance with given requirements: this requires one’s skills in choosing the proper data storage solution in order to satisfy the technical & business requirements; choosing the type of partition distribution.
  • Designing relational Cloud data stores: this subtopic focuses on such abilities as designing data partitions and distribution; designing for scale (comprising latency & throughput); designing a solution that utilizes Azure Synapse Analytics; designing a strategy for disaster recovery; designing for high availability.

 

NEW QUESTION 85
You plan to store 100 GB of data used by a line-of-business (LOB) app.
You need to recommend a data storage solution for the data. The solution must meet the following requirements:
* Minimize storage costs.
* Natively support relational queries.
* Provide a recovery time objective (RTO) of less than one minute.
What should you include in the recommendation?

  • A. Azure SQL Data Warehouse
  • B. Azure Cosmos DB
  • C. Azure SQL Database
  • D. Azure Blob storage

Answer: D

Explanation:
Explanation/Reference:
Incorrect Answers:
A: Azure Cosmos DB would require an SQL API.

 

NEW QUESTION 86
You are designing an app that will provide a data cleaning and supplementing service for customers. The app will use Azure Data Factory to run a daily process to read and write data from Azure Storage blob containers.
You need to recommend an access mechanism for the customers to grant the app access to their dat a. The solution must meet the following requirements:
Provide access for a period of three months.
Restrict the app's access to specific containers.
Minimize administrative effort.
Minimize changes to the existing access controls of the customer's Azure Storage accounts.
What should you recommend?

  • A. a shared key
  • B. anonymous public read access
  • C. a managed identity
  • D. a shared access signature (SAS)

Answer: D

Explanation:
A shared access signature (SAS) provides secure delegated access to resources in your storage account. With a SAS, you have granular control over how a client can access your data. For example:
What resources the client may access.
What permissions they have to those resources.
How long the SAS is valid.
Reference:
https://docs.microsoft.com/en-us/azure/storage/common/storage-sas-overview

 

NEW QUESTION 87
You have a large amount of sensor data stored in an Azure Data Lake Storage Gen2 account. The files are in the Parquet file format.
New sensor data will be published to Azure Event Hubs.
You need to recommend a solution to add the new sensor data to the existing sensor data in real-time. The solution must support the interactive querying of the entire dataset.
Which type of server should you include in the recommendation?

  • A. Azure Stream Analytics
  • B. Azure Databricks
  • C. Azure Cosmos DB
  • D. Azure SQL Database

Answer: A

Explanation:
Explanation
Azure Stream Analytics is a fully managed PaaS offering that enables real-time analytics and complex event processing on fast moving data streams.
By outputting data in parquet format into a blob store or a data lake, you can take advantage of Azure Stream Analytics to power large scale streaming extract, transfer, and load (ETL), to run batch processing, to train machine learning algorithms, or to run interactive queries on your historical data.
Reference:
https://azure.microsoft.com/en-us/blog/new-capabilities-in-stream-analytics-reduce-development-time-for-big-da

 

NEW QUESTION 88
A company has locations in North America and Europe. The company uses Azure SQL Database to support business apps.
Employees must be able to access the app data in case of a region-wide outage. A multi-region availability solution is needed with the following requirements:
* Read-access to data in a secondary region must be available only in case of an outage of the primary region.
* The Azure SQL Database compute and storage layers must be integrated and replicated together.
You need to design the multi-region high availability solution.
What should you recommend? To answer, select the appropriate values in the answer area.
NOTE: Each correct selection is worth one point.

Answer:

Explanation:

Explanation

Box 1: Standard
The following table describes the types of storage accounts and their capabilities:

Box 2: Geo-redundant storage
If your storage account has GRS enabled, then your data is durable even in the case of a complete regional outage or a disaster in which the primary region isn't recoverable.
Note: If you opt for GRS, you have two related options to choose from:
GRS replicates your data to another data center in a secondary region, but that data is available to be read only if Microsoft initiates a failover from the primary to secondary region.
Read-access geo-redundant storage (RA-GRS) is based on GRS. RA-GRS replicates your data to another data center in a secondary region, and also provides you with the option to read from the secondary region. With RA-GRS, you can read from the secondary region regardless of whether Microsoft initiates a failover from the primary to secondary region.

References:
https://docs.microsoft.com/en-us/azure/storage/common/storage-introduction
https://docs.microsoft.com/en-us/azure/storage/common/storage-redundancy-grs

 

NEW QUESTION 89
You are designing a solution for a company. The solution will use model training for objective classification.
You need to design the solution.
What should you recommend?

  • A. a Spark Streaming job
  • B. Power BI models
  • C. a Spark application that uses Spark MLib.
  • D. interactive Spark queries
  • E. an Azure Cognitive Services application

Answer: C

Explanation:
Spark in SQL Server big data cluster enables AI and machine learning.
You can use Apache Spark MLlib to create a machine learning application to do simple predictive analysis on an open dataset.
MLlib is a core Spark library that provides many utilities useful for machine learning tasks, including utilities that are suitable for:
Classification
Regression
Clustering
Topic modeling
Singular value decomposition (SVD) and principal component analysis (PCA) Hypothesis testing and calculating sample statistics References:
https://docs.microsoft.com/en-us/azure/hdinsight/spark/apache-spark-machine-learning-mllib-ipython

 

NEW QUESTION 90
You have an Azure Data Lake Storage Gen2 account named account1 that stores logs as shown in the following table.

You do not expect that the logs will be accessed during the retention periods.
You need to recommend a solution for account1 that meets the following requirements:
Automatically deletes the logs at the end of each retention period
Minimizes storage costs
What should you include in the recommendation? To answer, select the appropriate options in the answer area.
NOTE: Each correct selection is worth one point.

Answer:

Explanation:

Reference:
https://docs.microsoft.com/en-us/azure/storage/blobs/storage-blob-storage-tiers

 

NEW QUESTION 91
You are planning the deployment of two separate Azure Cosmos DB databases named db1 and db2.
You need to recommend a deployment strategy that meets the following requirements:
* Costs for both databases must be minimized.
* Db1 must meet an SLA of 99.99% for both reads and writes.
* Db2 must meet an SLA of 99.99% for writes and 99.999% for reads.
Which deployment strategy should you recommend for each database? To answer, select the appropriate options in the answer area.
NOTE: Each correct selection is worth one point.

Answer:

Explanation:

Explanation:
Db1: A single read/write region
Db2: A single write region and multi read regions

References:
https://docs.microsoft.com/en-us/azure/cosmos-db/high-availability

 

NEW QUESTION 92
You are designing a solution that will use Azure Table storage. The solution will log records in the following entity.

You are evaluating which partition key to use based on the following two scenarios:
* Scenario1: Minimize hotspots under heavy write workloads.
* Scenario2: Ensure that date lookups are as efficient as possible for read workloads.
Which partition key should you use for each scenario? To answer, select the appropriate options in the answer area.
NOTE: Each correct selection is worth one point.

Answer:

Explanation:

Explanation:
References:
https://docs.microsoft.com/en-us/rest/api/storageservices/designing-a-scalable-partitioning-strategy-for-azure-table-storage

 

NEW QUESTION 93
You have a data model that you plan to implement in an Azure SQL data warehouse as shown in the following exhibit.

All the dimension tables will be less than 5 GB after compression, and the fact table will be approximately 6 TB.
Which type of table should you use for each table? To answer, select the appropriate options in the answer area.
NOTE: Each correct selection is worth one point.

Answer:

Explanation:

Explanation

Box 1: Replicated
Replicated tables are ideal for small star-schema dimension tables, because the fact table is often distributed on a column that is not compatible with the connected dimension tables. If this case applies to your schema, consider changing small dimension tables currently implemented as round-robin to replicated.
Box 2: Replicated
Box 3: Replicated
Box 4: Hash-distributed
For Fact tables use hash-distribution with clustered columnstore index. Performance improves when two hash tables are joined on the same distribution column.
References:
https://azure.microsoft.com/en-us/updates/reduce-data-movement-and-make-your-queries-more-efficient-with-th
https://azure.microsoft.com/en-us/blog/replicated-tables-now-generally-available-in-azure-sql-data-warehouse/

 

NEW QUESTION 94
You need to design the encryption strategy for the tagging data and customer data.
What should you recommend? To answer, drag the appropriate setting to the correct drop targets. Each source may be used once, more than once, or not at all. You may need to drag the split bar between panes or scroll to view content.
NOTE: Each correct selection is worth one point.

Answer:

Explanation:

Explanation

All cloud data must be encrypted at rest and in transit.
Box 1: Transparent data encryption
Encryption of the database file is performed at the page level. The pages in an encrypted database are encrypted before they are written to disk and decrypted when read into memory.
Box 2: Encryption at rest
Encryption at Rest is the encoding (encryption) of data when it is persisted.
References:
https://docs.microsoft.com/en-us/sql/relational-databases/security/encryption/transparent-data-encryption?view=
https://docs.microsoft.com/en-us/azure/security/azure-security-encryption-atrest

 

NEW QUESTION 95
You need to ensure that security policies for the unauthorized detection system are met.
What should you recommend? To answer, select the appropriate options in the answer area.
NOTE: Each correct selection is worth one point.

Answer:

Explanation:

Explanation

Box 1: Blob storage
Configure blob storage for audit logs.
Scenario: Unauthorized usage of the Planning Assistance data must be detected as quickly as possible.
Unauthorized usage is determined by looking for an unusual pattern of usage.
Data used for Planning Assistance must be stored in a sharded Azure SQL Database.
Box 2: Web Apps
SQL Advanced Threat Protection (ATP) is to be used.
One of Azure's most popular service is App Service which enables customers to build and host web applications in the programming language of their choice without managing infrastructure. App Service offers auto-scaling and high availability, supports both Windows and Linux. It also supports automated deployments from GitHub, Visual Studio Team Services or any Git repository. At RSA, we announced that Azure Security Center leverages the scale of the cloud to identify attacks targeting App Service applications.
References:
https://azure.microsoft.com/sv-se/blog/azure-security-center-can-identify-attacks-targeting-azure-app-service-app

 

NEW QUESTION 96
You are designing a solution that uses Azure Cosmos DB to store and serve data.
You need to design the Azure Cosmos DB storage to meet the following requirements:
Provide high availability.
Provide a recovery point objective (RPO) of less than 15 minutes.
Provide a recovery time objective (RTO) of less than two minutes.
Minimize data loss in the event of a disaster.
What should you include in the design? To answer, select the appropriate options in the answer area.
NOTE: Each correct selection is worth one point.

Answer:

Explanation:

Reference:
https://docs.microsoft.com/en-us/azure/cosmos-db/high-availability
https://docs.microsoft.com/en-us/azure/cosmos-db/consistency-levels#consistency-levels-and-throughput

 

NEW QUESTION 97
You have a data model that you plan to implement in an Azure SQL data warehouse as shown in the following exhibit.

All the dimension tables will be less than 5 GB after compression, and the fact table will be approximately 6 TB.
Which type of table should you use for each table? To answer, select the appropriate options in the answer area.
NOTE: Each correct selection is worth one point.

Answer:

Explanation:

Explanation

Box 1: Replicated
Replicated tables are ideal for small star-schema dimension tables, because the fact table is often distributed on a column that is not compatible with the connected dimension tables. If this case applies to your schema, consider changing small dimension tables currently implemented as round-robin to replicated.
Box 2: Replicated
Box 3: Replicated
Box 4: Hash-distributed
For Fact tables use hash-distribution with clustered columnstore index. Performance improves when two hash tables are joined on the same distribution column.
References:
https://azure.microsoft.com/en-us/updates/reduce-data-movement-and-make-your-queries-more-efficient-with-th
https://azure.microsoft.com/en-us/blog/replicated-tables-now-generally-available-in-azure-sql-data-warehouse/

 

NEW QUESTION 98
You need to design the storage for the telemetry capture system.
What storage solution should you use in the design?

  • A. Azure SQL Data Warehouse
  • B. Azure Databricks
  • C. Azure Cosmos DB

Answer: C

Explanation:
Azure Cosmos DB is a globally distributed database service. You can associate any number of Azure regions with your Azure Cosmos account and your data is automatically and transparently replicated.
Scenario:
Telemetry Capture
The telemetry capture system records each time a vehicle passes in front of a sensor. The sensors run on a custom embedded operating system and record the following telemetry data:
* Time
* Location in latitude and longitude
* Speed in kilometers per hour (kmph)
* Length of vehicle in meters
You must write all telemetry data to the closest Azure region. The sensors used for the telemetry capture system have a small amount of memory available and so must write data as quickly as possible to avoid losing telemetry data.
Reference:
https://docs.microsoft.com/en-us/azure/cosmos-db/regional-presence

 

NEW QUESTION 99
A company purchases IoT devices to monitor manufacturing machinery. The company uses an IoT appliance to communicate with the IoT devices.
The company must be able to monitor the devices in real-time.
You need to design the solution.
What should you recommend?

  • A. Azure Analysis Services using Microsoft Visual Studio
  • B. Azure Stream Analytics Edge application using Microsoft Visual Studio
  • C. Azure Data Factory instance using Azure PowerShell
  • D. Azure Analysis Services using Azure PowerShell

Answer: B

Explanation:
Explanation
Azure Stream Analytics (ASA) on IoT Edge empowers developers to deploy near-real-time analytical intelligence closer to IoT devices so that they can unlock the full value of device-generated data.
References:
https://docs.microsoft.com/en-us/azure/stream-analytics/stream-analytics-edge

 

NEW QUESTION 100
Which Azure Data Factory components should you recommend using together to import the customer data from Salesforce to Data Lake Storage? To answer, select the appropriate options in the answer area.
NOTE: Each correct selection is worth one point.

Answer:

Explanation:

Explanation

Box 1: Self-hosted integration runtime
A self-hosted IR is capable of nunning copy activity between a cloud data stores and a data store in private network.
Box 2: Schedule trigger
Schedule every 8 hours
Box 3: Copy activity
Scenario:
* Customer data, including name, contact information, and loyalty number, comes from Salesforce and can be imported into Azure once every eight hours. Row modified dates are not trusted in the source table.
* Product data, including product ID, name, and category, comes from Salesforce and can be imported into Azure once every eight hours. Row modified dates are not trusted in the source table.

 

NEW QUESTION 101
Note: This question is part of a series of questions that present the same scenario. Each question in the series contains a unique solution that might meet the stated goals. Some question sets might have more than one correct solution, while others might not have a correct solution.
After you answer a question in this section, you will NOT be able to return to it. As a result, these questions will not appear in the review screen.
You are designing an HDInsight/Hadoop cluster solution that uses Azure Data Lake Gen1 Storage.
The solution requires POSIX permissions and enables diagnostics logging for auditing.
You need to recommend solutions that optimize storage.
Proposed Solution: Implement compaction jobs to combine small files into larger files.
Does the solution meet the goal?

  • A. Yes
  • B. No

Answer: A

Explanation:
Depending on what services and workloads are using the data, a good size to consider for files is 256 MB or greater. If the file sizes cannot be batched when landing in Data Lake Storage Gen1, you can have a separate compaction job that combines these files into larger ones.
Note: POSIX permissions and auditing in Data Lake Storage Gen1 comes with an overhead that becomes apparent when working with numerous small files. As a best practice, you must batch your data into larger files versus writing thousands or millions of small files to Data Lake Storage Gen1. Avoiding small file sizes can have multiple benefits, such as:
Lowering the authentication checks across multiple files
Reduced open file connections
Faster copying/replication
Fewer files to process when updating Data Lake Storage Gen1 POSIX permissions References:
https://docs.microsoft.com/en-us/azure/data-lake-store/data-lake-store-best-practices

 

NEW QUESTION 102
You need to design a solution that meets the business requirements of Health Insights.
What should you include in the recommendation?

  • A. Azure Data Factory
  • B. Azure Cosmos DB that uses the Gremlin
  • C. Azure Databricks
  • D. Azure Cosmos DB that uses the SQL API

Answer: C

Explanation:
Azure Synapse Analytics is a cloud-based enterprise data warehouse that leverages massively parallel processing (MPP) to quickly run complex queries across petabytes of data. Use SQL Data Warehouse as a key component of a big data solution.
You can access Azure Synapse Analytics (SQL DW) from Databricks using the SQL Data Warehouse connector (referred to as the SQL DW connector), a data source implementation for Apache Spark that uses Azure Blob Storage, and PolyBase in SQL DW to transfer large volumes of data efficiently between a Databricks cluster and a SQL DW instance.
Scenario: ADatum identifies the following requirements for the Health Insights application:
* The new Health Insights application must be built on a massively parallel processing (MPP) architecture that will support the high performance of joins on large fact tables Reference:
https://docs.databricks.com/data/data-sources/azure/sql-data-warehouse.html

 

NEW QUESTION 103
You need to design the storage for the Health Insights data platform.
Which types of tables should you include in the design? To answer, select the appropriate options in the answer area.
NOTE: Each correct selection is worth one point.

Answer:

Explanation:

Explanation

Box 1: Hash-distributed tables
The new Health Insights application must be built on a massively parallel processing (MPP) architecture that will support the high performance of joins on large fact tables.
Hash-distributed tables improve query performance on large fact tables.
Box 2: Round-robin distributed tables
A round-robin distributed table distributes table rows evenly across all distributions. The assignment of rows to distributions is random.
Scenario:
ADatum identifies the following requirements for the Health Insights application:
* The new Health Insights application must be built on a massively parallel processing (MPP) architecture that will support the high performance of joins on large fact tables.
References:
https://docs.microsoft.com/en-us/azure/sql-data-warehouse/sql-data-warehouse-tables-distribute
Topic 5, Data Engineer for Trey Research
Overview
You are a data engineer for Trey Research. The company is close to completing a joint project with the government to build smart highways infrastructure across North America. This involves the placement of sensors and cameras to measure traffic flow, car speed, and vehicle details.
You have been asked to design a cloud solution that will meet the business and technical requirements of the smart highway.
Solution components
Telemetry Capture
The telemetry capture system records each time a vehicle passes in front of a sensor. The sensors run on a custom embedded operating system and record the following telemetry data:
* Time
* Location in latitude and longitude
* Speed in kilometers per hour (kmph)
* Length of vehicle in meters
Visual Monitoring
The visual monitoring system is a network of approximately 1,000 cameras placed near highways that capture images of vehicle traffic every 2 seconds. The cameras record high resolution images. Each image is approximately 3 MB in size.
Requirements: Business
The company identifies the following business requirements:
* External vendors must be able to perform custom analysis of data using machine learning technologies.
* You must display a dashboard on the operations status page that displays the following metrics:
telemetry, volume, and processing latency.
* Traffic data must be made available to the Government Planning Department for the purpose of modeling changes to the highway system. The traffic data will be used in conjunction with other data such as information about events such as sporting events, weather conditions, and population statistics.
External data used during the modeling is stored in on-premises SQL Server 2016 databases and CSV files stored in an Azure Data Lake Storage Gen2 storage account.
* Information about vehicles that have been detected as going over the speed limit during the last 30
* minutes must be available to law enforcement officers. Several law enforcement organizations may respond to speeding vehicles.
* The solution must allow for searches of vehicle images by license plate to support law enforcement investigations. Searches must be able to be performed using a query language and must support fuzzy searches to compensate for license plate detection errors.
Requirements: Security
The solution must meet the following security requirements:
* External vendors must not have direct access to sensor data or images.
* Images produced by the vehicle monitoring solution must be deleted after one month. You must minimize costs associated with deleting images from the data store.
* Unauthorized usage of data must be detected in real time. Unauthorized usage is determined by looking for unusual usage patterns.
* All changes to Azure resources used by the solution must be recorded and stored. Data must be provided to the security team for incident response purposes.
Requirements: Sensor data
You must write all telemetry data to the closest Azure region. The sensors used for the telemetry capture system have a small amount of memory available and so must write data as quickly as possible to avoid losing telemetry data.

 

NEW QUESTION 104
You need to design a solution to meet the SQL Server storage requirements for CONT_SQL3.
Which type of disk should you recommend?

  • A. Standard SSD Managed Disk
  • B. Ultra SSD Managed Disk
  • C. Premium SSD Managed Disk

Answer: B

Explanation:
Explanation
CONT_SQL3 requires an initial scale of 35000 IOPS.
Ultra SSD Managed Disk Offerings

The following table provides a comparison of ultra solid-state-drives (SSD) (preview), premium SSD, standard SSD, and standard hard disk drives (HDD) for managed disks to help you decide what to use.

References:
https://docs.microsoft.com/en-us/azure/virtual-machines/windows/disks-types

 

NEW QUESTION 105
......


Exam Details and Topics

Microsoft doesn’t reveal the details of its certification tests. So, there is little information available about the exam structure. But we know that Microsoft DP-201 consists of about 40-60 questions that are to be completed within the allocated time of 180 minutes. To register for the test, the candidates must pay the fee of $165. This price is valid for the United States. Note that if you take it in another country, this cost can vary for you. You can check the actual pricing for your location on the official webpage. The exam is available in multiple languages, including English, Korean, Simplified Chinese, and Japanese.

 

Pass Your DP-201 Exam Easily With 100% Exam Passing Guarantee: https://exam-labs.exam4tests.com/DP-201-pdf-braindumps.html