Question 5 fill in the blank: a data analyst uses the case statement to consider one or more _____, then returns a value.

Skip to main content

This browser is no longer supported.

Show

Upgrade to Microsoft Edge to take advantage of the latest features, security updates, and technical support.

DP-900: Microsoft Azure Data Fundamentals Sample Questions

  • Article
  • 09/22/2022
  • 18 minutes to read

In this article

Last Updated: 3/26/2022

PLEASE COMPLETE THIS SURVEY (https://aka.ms/samplequestions)

Microsoft is exploring the possibility of providing sample questions as an exam preparation resource, and we would like your feedback. While we prefer that you complete the survey after taking the exam, you may complete it at any time. Thank You!

User Guide

These sample questions are intended to provide an overview of the style, wording, and difficulty of the questions that you are likely to experience on this exam. These questions are not the same as what you will see on the exam nor is this document illustrative of the length of the exam or its complexity (e.g., you may see additional question types, multiple case studies, and possibly labs). These questions are examples only to provide insight into what to expect on the exam and help you determine if additional preparation is required.

In the first section, you will find the questions without answers so that you can test your knowledge. In the second section, the answer, a rationale, and a URL that will link you to additional information is provided immediately below each question.

Questions

Question # 1 (Sentence completion)

Select the answer that correctly completes the sentence.

Objects in which things about data should be captured and stored are called: ____________.

A. tables
B. entities
C. rows
D. columns

Question # 2 (Sentence completion)

Select the answer that correctly completes the sentence.

You need to process data that is generated continuously and near real-time responses are required.

You should use _________.

A. batch processing
B. scheduled data processing
C. buffering and processing
D. streaming data processing

Question # 3 (Matching)

You are evaluating data processing approaches.

Match the data processing approaches on the left to the requirements on the right.

Data processing approachesDescriptions
A. Extract, Transform, Load (ETL)

B. Extract, Load, Transform (ELT)

_____ 1. Optimize data privacy.

_____ 2. Provide support for Azure Data Lake.

_____ 3. Manage large volumes of data.

Question # 4 (Multiple choice)

Select the answer that correctly completes the sentence.

The technique that provides recommended actions that you should take to achieve a goal or target is called _____________ analytics.

A. descriptive
B. diagnostic
C. predictive
D. prescriptive

Question # 5 (Matching)

Match the data processing objects on the left to the requirements on the right.

Data processing objectsDescriptions
A. Tables

B. Indexes

C. Views

D. Keys

_____ 1. Create relationships.

_____ 2. Improve processing speed for data searches.

_____ 3. Store instances of entities as rows.

_____ 4. Display data from predefined queries.

Question # 6 (Sentence completion)

Select the answer that correctly completes the sentence.

The process of splitting an entity into more than one table to reduce data redundancy is called: _____________.

A. deduplication
B. denormalization
C. normalization
D. optimization

Question # 7 (Sentence completion)

Select the answer that correctly completes the sentence.

Azure SQL Database is an example of ________________ -as-a-service.

A. platform
B. infrastructure
C. software
D. application

Question # 8 (Matching)

You need to query an Azure SQL database.

Match the query tools on the left to the correct scenarios on the right.

Query ToolsDescriptions
A. Azure Data Studio

B. Azure Query editor

C. SQL Server Data Tools

_____ 1. Query data while working within a Visual Studio project.

_____ 2. Query data located in a non-Microsoft platform.

_____ 3. Query data from within the Azure portal.

Question # 9 (Sentence completion)

Select the answer that correctly completes the sentence.

The act of increasing or decreasing the resources that are available for a service is called: _____________.

A. computing
B. provisioning
C. networking
D. scaling

Question # 10 (Matching)

You are creating queries to retrieve data from an Azure SQL database.

Match the SQL clauses or functions on the left to the requirements on the right.

SQL clausesDescriptions
A. JOIN

B. WHERE

C. SUM

D. COUNT

_____ 1. Filter records.

_____ 2. Combine rows from multiple tables.

_____ 3. Calculate the total value of a numeric column.

_____ 4. Determine the number of rows retrieved.

Question # 11 (Multiple choice)

What are three characteristics of non-relational data? Each correct answer presents a complete solution.

A. Forced schema on data structures
B. Flexible storage of ingested data
C. Entities are self-describing
D. Entities may have different fields
E. Each row has the exact same columns

Question # 12 (Sentence completion)

Select the answer that correctly completes the sentence.

You have data that consists of JSON-based documents.

You need to store the data in an Azure environment that supports efficient non-key, field-based searching.

You should use _______________________ as the data store.

A. Azure Table Storage
B. Azure Blob Storage
C. Azure File Storage
D. Azure Cosmos DB

Question # 13 (Multiple Choice)

You need to create a graph database.

Which Azure data service should you use?

A. Azure Table
B. Azure Cosmos DB
C. Azure Blob
D. Azure File

Question # 14 (Sentence completion)

Select the answer that correctly completes the sentence.

You use Azure Table Storage as a non-relational data store.

You need to optimize data retrieval. You should use ______________________________ as query criteria.

A. only partition keys
B. only row keys
C. partition keys and row keys
D. only properties

Question # 15 (Multiple Choice)

You need to use JavaScript Object Notation (JSON) files to provision Azure storage.

What should you use?

A. Azure portal
B. Azure command-line interface (CLI)
C. Azure PowerShell
D. Azure Resource Manager (ARM) templates

Question # 16 (Multiple Choice)

For which reason should you deploy a data warehouse?

A. Record daily sales transactions.
B. Perform sales trend analyses.
C. Print sales orders.
D. Search status of sales orders.

Question # 17 (Multiple Choice)

Which two Azure data services support Apache Spark clusters? Each correct answer presents a complete solution.

A. Azure Synapse Analytics
B. Azure Cosmos DB
C. Azure Databricks
D. Azure Data Factory

Question # 18 (Multiple Choice)

You design a data ingestion and transformation solution by using Azure Data Factory service.

You need to get data from an Azure SQL database.

Which two resources should you use? Each correct answer presents part of the solution.

A. Linked service
B. Copy data activity
C. Dataset
D. Azure Databricks notebook

Question # 19 (Multiple Choice)

Which Azure Data Factory component should you use to represent data that you want to ingest for processing?

A. Linked services
B. Datasets
C. Pipelines
D. Notebooks

Question # 20 (Multiple Choice)

You are designing reports by using Microsoft Power BI.

For which three scenarios can you create Power BI reports as paginated reports? Each correct answer presents a complete solution.

A. a report that has a table visual with an ability to print all the data in the table
B. a report that has a table visual with an ability to see all the data in the table
C. a report with a repeatable header and footer
D. a report that is formatted to fit well on a printed page
E. a report that uses only Power BI visuals

Questions and Answers

Question # 1 (Sentence completion)

Select the answer that correctly completes the sentence.

Objects in which things about data should be captured and stored are called: ____________.

A. tables
B. entities
C. rows
D. columns

ItemDescription
Answer: B. Entities
Objective: 1.1 Describe types of core data workloads
Rationale: An entity is a thing in which information needs to be known or held. A table is the object that stores a collection of entities. A row represents a single instance of an entity. A column defines a specific property of an entity.
URL: https://learn.microsoft.com/training/modules/describe-concepts-of-relational-data/2-explore-characteristics

Question # 2 (Sentence completion)

Select the answer that correctly completes the sentence.

You need to process data that is generated continuously and near real-time responses are required.

You should use _________.

A. batch processing
B. scheduled data processing
C. buffering and processing
D. streaming data processing

ItemDescription
Answer: D. Streaming data processing
Objective: 1.1 Describe types of core data workloads
Rationale: When data is generated on a continual basis and insights must you must be able see data insights immediately, process data as it arrives rather than storing and processing data later as a group. Batch processing, scheduled processing, and buffering all collect and store data for later processing.
URL: https://learn.microsoft.com/training/modules/explore-core-data-concepts/4-describe-difference

Question # 3 (Matching)

You are evaluating data processing approaches.

Match the data processing approaches on the left to the requirements on the right.

Data processing approachesDescriptions
A. Extract, Transform, Load (ETL)

B. Extract, Load, Transform (ELT)

_____ 1. Optimize data privacy.

_____ 2. Provide support for Azure Data Lake.

_____ 3. Manage large volumes of data.

ItemDescription
Answer: Extract, Transform, Load (A) matches description 1: Optimize data privacy.
Extract, Load, Transform (B) matches description 2: Provide support for Azure Data Lake, and description 3: Manage large volumes of data.
Objective: 1.2 Describe data analytics core concepts
Rationale: Extract, Transform, Load (ETL) is the correct approach when you need to filter sensitive data before loading the data into an analytical model. It is suitable for simple data models that do not require Azure Data Lake support.
Extract, Load, Transform (ELT) is the correct approach because it supports Azure Data Lake as the data store and manages large volumes of data.
URL: https://learn.microsoft.com/training/modules/explore-concepts-of-data-analytics/2-describe-data-ingestion-process

Question # 4 (Multiple choice)

Select the answer that correctly completes the sentence.

The technique that provides recommended actions that you should take to achieve a goal or target is called _____________ analytics.

A. descriptive
B. diagnostic
C. predictive
D. prescriptive

ItemDescription
Answer: D. Prescriptive
Objective: Exam objective number and text
Rationale: Prescriptive analytics helps you define actions (prescription) that you should perform to achieve what you need (or overcome a problem).

Descriptive analytics help you understand what has happened by looking at historical data. You can further analyze information by using diagnostic analytics.

Predictive analytics helps to forecast the future values using historical values.

URL: https://learn.microsoft.com/training/modules/explore-concepts-of-data-analytics/4-explore

Question # 5 (Matching)

Match the data processing objects on the left to the requirements on the right.

Data processing objectsDescriptions
A. Tables

B. Indexes

C. Views

D. Keys

_____ 1. Create relationships.

_____ 2. Improve processing speed for data searches.

_____ 3. Store instances of entities as rows.

_____ 4. Display data from predefined queries.

ItemDescription
Answer: Keys (D) matches description 1: Create relationships.

Indexes (B) matches description 2: Improve processing speed for data searches.

Tables (A) matches description 3: Store instances of entities as rows.

Views (C) matches description 4: Display data from predefined queries.

Objective: 2.1 Describe relational data workloads
Rationale: Tables use rows to store data or instances of entities.

Indexes are a subset of data used to speed up data searches. Views are virtual tables based on the results of a query.

You use primary keys and foreign keys to create

relationships between tables.

URL: https://learn.microsoft.com/training/modules/describe-concepts-of-relational-data/2-explore-characteristics

https://learn.microsoft.com/training/modules/describe-concepts-of-relational-data/3-explore-structures

Question # 6 (Sentence completion)

Select the answer that correctly completes the sentence.

The process of splitting an entity into more than one table to reduce data redundancy is called: _____________.

A. deduplication
B. denormalization
C. normalization
D. optimization

ItemDescription
Answer: C. Normalization
Objective: 2.2 Describe relational Azure Data Services
Rationale: Normalization is the process of splitting a logical entity into multiple tables and designing the relationship between tables. Normalization is often used to improve query completion speeds for transaction processing databases.

Deduplication is the process of removing duplicate data from tables.

Denormalization is the process of combining tables. This process is often used to store data in databases which are used for data mining and reporting.

Optimization is the process of modifying fields and database structure to improve overall performance.

URL: https://learn.microsoft.com/training/modules/describe-concepts-of-relational-data/2-explore-characteristics

Question # 7 (Sentence completion)

Select the answer that correctly completes the sentence.

Azure SQL Database is an example of ________________ -as-a-service.

A. platform
B. infrastructure
C. software
D. application

ItemDescription
Answer: A. Platform
Objective: 2.2 Describe relational Azure Data Services
Rationale: Azure SQL Database is an example of platform-as-a-service.

Infrastructure-as-a-service includes technologies such as virtual machines and virtual networks.

Software-as-a-service describes a method of software delivery in which users license software online by subscription. Microsoft offers subscriptions to license Office products like Microsoft Excel.

Application-as-a-service describes applications that are hosted in the cloud and offered to users by subscription. This is like software-as-a-service but often relates to custom line-of-business solutions.

There are no Azure data services that use the software-as-a-service or application-as-a-service model.

URL: https://learn.microsoft.com/training/modules/explore-relational-data-offerings/2-azure-data-services

Question # 8 (Matching)

You need to query an Azure SQL database.

Match the query tools on the left to the correct scenarios on the right.

Query ToolsDescriptions
A. Azure Data Studio

B. Azure Query editor

C. SQL Server Data Tools

_____ 1. Query data while working within a Visual Studio project.

_____ 2. Query data located in a non-Microsoft platform.

_____ 3. Query data from within the Azure portal.

ItemDescription
Answer: SQL Server Data Tools (C) matches description 1: Query data while working within a Visual Studio project.

Azure Data Studio (A) matches description 2: Query data located in a non-Microsoft platform.

Azure Query editor (B) matches description 3: Query data from within the Azure portal.

Objective: 2.3 Identify basic management tasks for relational data
Rationale: Azure Data Studio is a cross-platform database tool that you can use with both on-premises and cloud data platforms on Windows, MacOS and Linux.

Azure Query editor is available in the Azure portal. You can use this tool for querying Azure SQL databases.

SQL Server Data Tools is available in Visual Studio. You can use this tool to connect to and query on-premises and cloud data services.

URL: https://learn.microsoft.com/training/modules/query-relational-data/3-sql-database?ns-enrollment-type=LearningPath&ns-enrollment-id=learn.wwl.azure-data-fundamentals-explore-relational-data

https://learn.microsoft.com/sql/azure-data-studio/download-azure-data-studio?view=sql-server-ver15

https://learn.microsoft.com/sql/ssms/download-sql-server-management-studio-ssms?view=sql-server-ver15

Question # 9 (Sentence completion)

Select the answer that correctly completes the sentence.

The act of increasing or decreasing the resources that are available for a service is called: _____________.

A. computing
B. provisioning
C. networking
D. scaling

ItemDescription
Answer: D. Scaling
Objective: 2.3 Identify basic management tasks for relational data
Rationale: Scaling is the act of increasing or decreasing the resources used by a service. Computing is the act of processing data. Provisioning is the act of running series of tasks to create and configure a service. Networking is the act of providing connectivity to a data resource.
URL: https://learn.microsoft.com/training/modules/explore-provision-deploy-relational-database-offerings-azure/2-describe-provision-relational-data-services

Question # 10 (Matching)

You are creating queries to retrieve data from an Azure SQL database.

Match the SQL clauses or functions on the left to the requirements on the right.

SQL clausesDescriptions
A. JOIN

B. WHERE

C. SUM

D. COUNT

_____ 1. Filter records.

_____ 2. Combine rows from multiple tables.

_____ 3. Calculate the total value of a numeric column.

_____ 4. Determine the number of rows retrieved.

ItemDescription
Answer: WHERE (B) matches description 1: Filter records.

JOIN (A) matches description 2: Combine rows from multiple tables.

SUM (C) matches description 3: Calculate the total value of a numeric column.

COUNT (D) matches description 4: Determine the number of rows retrieved.

Objective: 2.4 Describe query techniques for data using SQL language
Rationale: Use the JOIN clause for combining rows retrieved from multiple tables based on a condition.

Use the WHERE clause for filtering out rows.

Use the SUM function for calculating the total sum of a numeric column.

Use the COUNT function returns the number of rows that matches a specified criterion.

URL: https://learn.microsoft.com/training/modules/query-relational-data/6-exercise-perform-query?ns-enrollment-type=LearningPath&ns-enrollment-id=learn.wwl.azure-data-fundamentals-explore-relational-data

Question # 11 (Multiple choice)

What are three characteristics of non-relational data? Each correct answer presents a complete solution.

A. Forced schema on data structures
B. Flexible storage of ingested data
C. Entities are self-describing
D. Entities may have different fields
E. Each row has the exact same columns

ItemDescription
Answer: B. Flexible storage of ingested data. AND

C. Entities are self-describing. AND

D. Entities may have different fields.

Objective: 3.1 Describe non-relational data workloads
Rationale: Characteristics of non-relational data are flexible storage of ingested data, self-describing entities, and the ability to have different fields for entities. Forced schemas are a feature of relational tables. Forcing each row to maintain the same columns describes a relational table.
URL: https://learn.microsoft.com/training/modules/explore-concepts-of-non-relational-data/2-explore-characteristics

Question # 12 (Sentence completion)

Select the answer that correctly completes the sentence.

You have data that consists of JSON-based documents.

You need to store the data in an Azure environment that supports efficient non-key, field-based searching.

You should use _______________________ as the data store.

A. Azure Table Storage
B. Azure Blob Storage
C. Azure File Storage
D. Azure Cosmos DB

ItemDescription
Answer: D. Azure Cosmos DB
Objective: 3.1 Describe non-relational data workloads
Rationale: Azure Cosmos DB is a non-relational document DB that supports high latency for both reading and writing. It adds indexes automatically on elements. Azure Table Storage is a NoSQL key-value storage that supports semi-structured data with dynamic column. This technique optimizes both data retrieval and writing but performance is not good when searching on non-partition key and non-key values. Azure File Storage and Blob Storage are for files and BLOBs, not for searchable JSON documents.
URL: https://learn.microsoft.com/training/modules/explore-non-relational-data-offerings-azure/5-explore-azure-cosmos-database?ns-enrollment-type=LearningPath&ns-enrollment-id=learn.wwl.azure-data-fundamentals-explore-non-relational-data

Question # 13 (Multiple Choice)

You need to create a graph database.

Which Azure data service should you use?

A. Azure Table
B. Azure Cosmos DB
C. Azure Blob
D. Azure File

ItemDescription
Answer: B. Azure Cosmos DB
Objective: 3.2 Describe non-relational data offerings on Azure
Rationale: Only Azure Cosmos DB supports creating graph databases. Azure Table Storage, Azure Blob Storage, and Azure File Storage do not support graph databases.
URL: https://learn.microsoft.com/training/modules/explore-non-relational-data-offerings-azure/5-explore-azure-cosmos-database

Question # 14 (Sentence completion)

Select the answer that correctly completes the sentence.

You use Azure Table Storage as a non-relational data store.

You need to optimize data retrieval. You should use ______________________________ as query criteria.

A. only partition keys
B. only row keys
C. partition keys and row keys
D. only properties

ItemDescription
Answer: C. partition keys and row keys
Objective: 3.2 Describe non-relational data offerings on Azure
Rationale: You can optimize data read performance with Azure Table Storage when the search completes by using both a partition key and arow key. Azure Table Storage does not support search on properties.
URL: https://learn.microsoft.com/training/modules/explore-non-relational-data-offerings-azure/2-explore-azure-table-storage?ns-enrollment-type=LearningPath&ns-enrollment-id=learn.wwl.azure-data-fundamentals-explore-non-relational-data

Question # 15 (Multiple Choice)

You need to use JavaScript Object Notation (JSON) files to provision Azure storage.

What should you use?

A. Azure portal
B. Azure command-line interface (CLI)
C. Azure PowerShell
D. Azure Resource Manager (ARM) templates

ItemDescription
Answer: D. Azure Resource Manager (ARM) templates
Objective: 3.3 Identify basic management tasks for non-relational data
Rationale: Azure Resource Manager templates use JSON to create text files to standardize Azure Storage provisioning. Azure portal, Azure CLI, and Azure PowerShell do not use JSON text files.
URL: https://learn.microsoft.com/training/modules/explore-provision-deploy-non-relational-data-services-azure/2-describe-provision-non-relational-data-services

Question # 16 (Multiple Choice)

For which reason should you deploy a data warehouse?

A. Record daily sales transactions.
B. Perform sales trend analyses.
C. Print sales orders.
D. Search status of sales orders.

ItemDescription
Answer: B. Perform sales trend analyses
Objective: 4.1, Describe analytics workloads
Rationale: You typically create data warehouses to support analytical queries that involve high volume data and generating aggregated values.

Recording daily sales, printing reports, and searching sales orders are typical operations seen in online data processing (OLTP) databases and not in data warehouses.

URL: https://learn.microsoft.com/training/modules/examine-components-of-modern-data-warehouse/1-introduction
https://learn.microsoft.com/training/modules/examine-components-of-modern-data-warehouse/2-describe-warehousing

Question # 17 (Multiple Choice)

Which two Azure data services support Apache Spark clusters? Each correct answer presents a complete solution.

A. Azure Synapse Analytics
B. Azure Cosmos DB
C. Azure Databricks
D. Azure Data Factory

ItemDescription
Answer: A. Azure Synapse Analytics AND
C. Azure Databricks
Objective: 4.2 Describe the components of a modern data warehouse
Rationale: Azure Synapse Analytics and Azure Databricks both support using Apache Spark clusters to process data. Azure Cosmos DB is for non-relational data. You can use Azure Data Factory for data integration and migration.
URL: https://learn.microsoft.com/training/modules/explore-provision-deploy-non-relational-data-services-azure/2-describe-provision-non-relational-data-services

Question # 18 (Multiple Choice)

You design a data ingestion and transformation solution by using Azure Data Factory service.

You need to get data from an Azure SQL database.

Which two resources should you use? Each correct answer presents part of the solution.

A. Linked service
B. Copy data activity
C. Dataset
D. Azure Databricks notebook

ItemDescription
Answer: A. Linked service AND
B. Dataset
Objective: 4.3 Describe data ingestion and processing on Azure
Rationale: To get data from a source, you need to create a linked service for Azure Data Factory. The linked service contains details about the data source including the server name and credentials. You must also define a dataset to describe the expected data structure. A dataset stores data retrieved from a data source.

You can use the Copy data activity for transferring data from one source to another (destination).

You can use an Azure Databricks notebook for processing data using spark clusters with given instructions.

URL: https://learn.microsoft.com/training/modules/explore-data-ingestion-azure/2-describe-common-practices-for-data-loading?ns-enrollment-type=LearningPath&ns-enrollment-id=learn.wwl.azure-data-fundamentals-explore-data-warehouse-analytics

Question # 19 (Multiple Choice)

Which Azure Data Factory component should you use to represent data that you want to ingest for processing?

A. Linked services
B. Datasets
C. Pipelines
D. Notebooks

ItemDescription
Answer: B. Datasets
Objective: 4.3 Describe data ingestion and processing on Azure
Rationale: A dataset represents the data that you want to ingest for processing. A dataset can also represent output data from a process.

You use a pipeline to perform tasks and processes.

You use a linked service to connect to a source or destination.

A notebook can contain cells that read data, process data, and write the results out to a data store.

URL: https://learn.microsoft.com/training/modules/explore-data-ingestion-azure/2-describe-common-practices-for-data-loading

Question # 20 (Multiple Choice)

You are designing reports by using Microsoft Power BI.

For which three scenarios can you create Power BI reports as paginated reports? Each correct answer presents a complete solution.

A. a report that has a table visual with an ability to print all the data in the table
B. a report that has a table visual with an ability to see all the data in the table
C. a report with a repeatable header and footer
D. a report that is formatted to fit well on a printed page
E. a report that uses only Power BI visuals

ItemDescription
Answer: A. a report that has a table visual with an ability to print all the data in the table AND
C. a report with a repeatable header and footer AND
D. a report that is formatted to fit well on a printed page
Objective: 4.4 Describe data visualization in Power BI
Rationale: When a Power BI report that has a table visual contains multiple rows, printed, only records that can are displayed will be printed.

All records print if you design the report by using Report Builder as a paginated report, all records print.

Only paginated report supports repeatable headers and footers.

You cannot create paginated reports by using Power BI visuals. You must use Report Builder instead.

URL: https://learn.microsoft.com/training/modules/get-started-with-power-bi/1-introduction?ns-enrollment-type=LearningPath&ns-enrollment-id=learn.wwl.azure-data-fundamentals-explore-data-warehouse-analytics

What do data analysts use to label the type of data contained in each column in a spreadsheet?

These column labels are usually called attributes. An attribute is a characteristic or quality of data used to label a column in a table.

Which of the following tasks would a data analyst be responsible for select all that apply?

Data analysts are responsible for analyzing data using statistical techniques, implementing and maintaining databases, gathering data from primary and secondary sources, identifying, analyzing and interpreting trends from the data.
What process do data analysts use to keep project-related files together and organize them into subfolders? Data analysts use archiving to separate current from past work.

During which phase of data analysis would a data analyst use spreadsheet?

During which phase would a data analyst use spreadsheets or query languages to transform data in order to draw conclusions? The analyze step involves using data analytics tools such as spreadsheets and query languages to transform data in order to draw conclusions and make informed decisions.