DP-100 Practice Exam — 493 Free Microsoft Questions

Join Us Among the Stars

Sign Up & unlock 100% of Exam Questions

Log in / Sign up

No Strings Attached!

493Practice Questions

3Study Modes

Free

TopicSort

Question 1

Explore data, and run experiments

Note: This question is part of a series of questions that present the same scenario. Each question in the series contains a unique solution that might meet the stated goals. Some question sets might have more than one correct solution, while others might not have a correct solution.
After you answer a question in this section, you will NOT be able to return to it. As a result, these questions will not appear in the review screen.
You are analyzing a numerical dataset which contains missing values in several columns.
You must clean the missing values using an appropriate operation without affecting the dimensionality of the feature set.
You need to analyze a full dataset to include all values.
Solution: Calculate the column median value and use the median value as the replacement for any missing value in the column.
Does the solution meet the goal?

A Yes
B No

Question 2

Design and prepare a machine learning solution

Question 3

Explore data, and run experiments

Question 4

Design and prepare a machine learning solution

Question 5

Train and deploy models

Question 6

Design and prepare a machine learning solution

Question 7

Explore data, and run experiments

Question 8

Design and prepare a machine learning solution

Question 9

Design and prepare a machine learning solution

Question 10

Design and prepare a machine learning solution

Question 11

Explore data, and run experiments

Question 12

Design and prepare a machine learning solution

Question 13

Design and prepare a machine learning solution

Question 14

Design and prepare a machine learning solution

Question 16

Explore data, and run experiments

Question 17

Design and prepare a machine learning solution

Question 18

Design and prepare a machine learning solution

Question 19

Design and prepare a machine learning solution

Question 20

Train and deploy models

Question 21

Train and deploy models

Question 22

Explore data, and run experiments

Question 23

Explore data, and run experiments

Question 24

Design and prepare a machine learning solution

Question 25

Train and deploy models

Question 26

Explore data, and run experiments

Page 1 of 20 • Questions 1-25 of 493

1 2 3 4 5

→

Know a question that should be here? Contribute to this exam

You plan to build a team data science environment. Data for training models in machine learning pipelines will be over 20 GB in size.
You have the following requirements:
✑ Models must be built using Caffe2 or Chainer frameworks.
✑ Data scientists must be able to use a data science environment to build the machine learning pipelines and train models on their personal devices in both connected and disconnected network environments.
Personal devices must support updating machine learning pipelines when connected to a network.
You need to select a data science environment.
Which environment should you use?

A Azure Machine Learning Service
B Azure Machine Learning Studio
C Azure Databricks
D Azure Kubernetes Service (AKS)

You are solving a classification task.
You must evaluate your model on a limited data sample by using k-fold cross-validation. You start by configuring a k parameter as the number of splits.
You need to configure the k parameter for the cross-validation.
Which value should you use?

A k=0.5
B k=0.01
C k=5
D k=1

You are performing feature engineering on a dataset.
You must add a feature named CityName and populate the column value with the text London.
You need to add the new feature to the dataset.
Which Azure Machine Learning Studio module should you use?

A Edit Metadata
B Filter Based Feature Selection
C Execute Python Script
D Latent Dirichlet Allocation

You are building a machine learning model for translating English language textual content into French language textual content.
You need to build and train the machine learning model to learn the sequence of the textual content.
Which type of neural network should you use?

A Multilayer Perceptions (MLPs)
B Convolutional Neural Networks (CNNs)
C Recurrent Neural Networks (RNNs)
D Generative Adversarial Networks (GANs)

Question 6

Design and prepare a machine learning solution

Question 7

Explore data, and run experiments

Question 8

Design and prepare a machine learning solution

Question 9

Design and prepare a machine learning solution

Question 10

Design and prepare a machine learning solution

Question 11

Explore data, and run experiments

Question 12

Design and prepare a machine learning solution

Question 13

Design and prepare a machine learning solution

Question 14

Design and prepare a machine learning solution

Question 16

Explore data, and run experiments

Question 17

Design and prepare a machine learning solution

Question 18

Design and prepare a machine learning solution

Question 19

Design and prepare a machine learning solution

Question 20

Train and deploy models

Question 21

Train and deploy models

Question 22

Explore data, and run experiments

Question 23

Explore data, and run experiments

Question 24

Design and prepare a machine learning solution

Question 25

Train and deploy models

Question 26

Explore data, and run experiments

You are developing deep learning models to analyze semi-structured, unstructured, and structured data types.
You have the following data available for model building:
✑ Video recordings of sporting events
✑ Transcripts of radio commentary about events
✑ Logs from related social media feeds captured during sporting events
You need to select an environment for creating the model.
Which environment should you use?

A Azure Cognitive Services
B Azure Data Lake Analytics
C Azure HDInsight with Spark MLib
D Azure Machine Learning Studio

Note: This question is part of a series of questions that present the same scenario. Each question in the series contains a unique solution that might meet the stated goals. Some question sets might have more than one correct solution, while others might not have a correct solution.
After you answer a question in this section, you will NOT be able to return to it. As a result, these questions will not appear in the review screen.
You are analyzing a numerical dataset which contains missing values in several columns.
You must clean the missing values using an appropriate operation without affecting the dimensionality of the feature set.
You need to analyze a full dataset to include all values.
Solution: Replace each missing value using the Multiple Imputation by Chained Equations (MICE) method.
Does the solution meet the goal?

A Yes
B No

Note: This question is part of a series of questions that present the same scenario. Each question in the series contains a unique solution that might meet the stated goals. Some question sets might have more than one correct solution, while others might not have a correct solution.
After you answer a question in this section, you will NOT be able to return to it. As a result, these questions will not appear in the review screen.
You are a data scientist using Azure Machine Learning Studio.
You need to normalize values to produce an output column into bins to predict a target column.
Solution: Apply a Quantiles binning mode with a PQuantile normalization.
Does the solution meet the goal?

A Yes
B No

You need to implement a scaling strategy for the local penalty detection data.
Which normalization type should you use?

A Streaming
B Weight
C Batch
D Cosine

Note: This question is part of a series of questions that present the same scenario. Each question in the series contains a unique solution that might meet the stated goals. Some question sets might have more than one correct solution, while others might not have a correct solution.
After you answer a question in this section, you will NOT be able to return to it. As a result, these questions will not appear in the review screen.
You are a data scientist using Azure Machine Learning Studio.
You need to normalize values to produce an output column into bins to predict a target column.
Solution: Apply an Equal Width with Custom Start and Stop binning mode.
Does the solution meet the goal?

A Yes
B No

You use Azure Machine Learning Studio to build a machine learning experiment.
You need to divide data into two distinct datasets.
Which module should you use?

A Assign Data to Clusters
B Load Trained Model
C Partition and Sample
D Tune Model-Hyperparameters

You need to implement a model development strategy to determine a user's tendency to respond to an ad.
Which technique should you use?

A Use a Relative Expression Split module to partition the data based on centroid distance.
B Use a Relative Expression Split module to partition the data based on distance travelled to the event.
C Use a Split Rows module to partition the data based on distance travelled to the event.
D Use a Split Rows module to partition the data based on centroid distance.

Note: This question is part of a series of questions that present the same scenario. Each question in the series contains a unique solution that might meet the stated goals. Some question sets might have more than one correct solution, while others might not have a correct solution.
After you answer a question in this section, you will NOT be able to return to it. As a result, these questions will not appear in the review screen.
You are using Azure Machine Learning Studio to perform feature engineering on a dataset.
You need to normalize values to produce a feature column grouped into bins.
Solution: Apply an Entropy Minimum Description Length (MDL) binning mode.
Does the solution meet the goal?

A Yes
B No

You are building a regression model for estimating the number of calls during an event.
You need to determine whether the feature values achieve the conditions to build a Poisson regression model.
Which two conditions must the feature set contain? Each correct answer presents part of the solution.
NOTE: Each correct selection is worth one point.

A The label data must be a negative value.
B The label data must be whole numbers.
C The label data must be non-discrete.
D The label data must be a positive value.
E The label data can be positive or negative.

HOTSPOT -
You have a Python data frame named salesData in the following format:

Question Image

The data frame must be unpivoted to a long data format as follows:

Question Image

You need to use the pandas.melt() function in Python to perform the transformation.
How should you complete the code segment? To answer, select the appropriate options in the answer area.
NOTE: Each correct selection is worth one point.
Hot Area:

Question Image

HOTSPOT -
You are creating a machine learning model in Python. The provided dataset contains several numerical columns and one text column. The text column represents a product's category. The product category will always be one of the following:
✑ Bikes
✑ Cars
✑ Vans
✑ Boats
You are building a regression model using the scikit-learn Python package.
You need to transform the text data to be compatible with the scikit-learn Python package.
How should you complete the code segment? To answer, select the appropriate options in the answer area.
NOTE: Each correct selection is worth one point.
Hot Area:

Question Image

You are performing feature engineering on a dataset.
You must add a feature named CityName and populate the column value with the text London.
You need to add the new feature to the dataset.
Which Azure Machine Learning Studio module should you use?

A Extract N-Gram Features from Text
B Edit Metadata
C Preprocess Text
D Apply SQL Transformation

You are performing a filter-based feature selection for a dataset to build a multi-class classifier by using Azure Machine Learning Studio.
The dataset contains categorical features that are highly correlated to the output label column.
You need to select the appropriate feature scoring statistical method to identify the key predictors.
Which method should you use?

A Kendall correlation
B Spearman correlation
C Chi-squared
D Pearson correlation

You are a data scientist building a deep convolutional neural network (CNN) for image classification.
The CNN model you build shows signs of overfitting.
You need to reduce overfitting and converge the model to an optimal fit.
Which two actions should you perform? Each correct answer presents a complete solution.
NOTE: Each correct selection is worth one point.

A Add an additional dense layer with 512 input units.
B Add L1/L2 regularization.
C Use training data augmentation.
D Reduce the amount of training data.
E Add an additional dense layer with 64 input units.

DRAG DROP -
You have a model with a large difference between the training and validation error values.
You must create a new model and perform cross-validation.
You need to identify a parameter set for the new model using Azure Machine Learning Studio.
Which module you should use for each step? To answer, drag the appropriate modules to the correct steps. Each module may be used once or more than once, or not at all. You may need to drag the split bar between panes or scroll to view content.
NOTE: Each correct selection is worth one point.
Select and Place:

Question Image

HOTSPOT -
You are analyzing the asymmetry in a statistical distribution.
The following image contains two density curves that show the probability distribution of two datasets.

Question Image

Use the drop-down menus to select the answer choice that answers each question based on the information presented in the graphic.
NOTE: Each correct selection is worth one point.
Hot Area:

Question Image

You are with a time series dataset in Azure Machine Learning Studio.
You need to split your dataset into training and testing subsets by using the Split Data module.
Which splitting mode should you use?

A Recommender Split
B Regular Expression Split
C Relative Expression Split
D Split Rows with the Randomized split parameter set to true

DRAG DROP -
You configure a Deep Learning Virtual Machine for Windows.
You need to recommend tools and frameworks to perform the following:
✑ Build deep neural network (DNN) models
✑ Perform interactive data exploration and visualization
Which tools and frameworks should you recommend? To answer, drag the appropriate tools to the correct tasks. Each tool may be used once, more than once, or not at all. You may need to drag the split bar between panes or scroll to view content.
NOTE: Each correct selection is worth one point.
Select and Place:

Question Image

You plan to use a Data Science Virtual Machine (DSVM) with the open source deep learning frameworks Caffe2 and PyTorch.
You need to select a pre-configured DSVM to support the frameworks.
What should you create?

A Data Science Virtual Machine for Windows 2012
B Data Science Virtual Machine for Linux (CentOS)
C Geo AI Data Science Virtual Machine with ArcGIS
D Data Science Virtual Machine for Windows 2016
E Data Science Virtual Machine for Linux (Ubuntu)

HOTSPOT -
You are tuning a hyperparameter for an algorithm. The following table shows a data set with different hyperparameter, training error, and validation errors.

Question Image

Use the drop-down menus to select the answer choice that answers each question based on the information presented in the graphic.
Hot Area:

Question Image

Join Us Among the Stars

DP-100Preview

About the DP-100 Exam

Mode Selection

Question 1

Question 2

Question 3

Question 4

Question 5

Question 6

Question 7

Question 8

Question 9

Question 10

Question 11

Question 12

Question 13

Question 14

Question 16

Question 17

Question 18

Question 19

Question 20

Question 21

Question 22

Question 23

Question 24

Question 25

Question 26

Question 6

Question 7

Question 8

Question 9

Question 10

Question 11

Question 12

Question 13

Question 14

Question 16

Question 17

Question 18

Question 19

Question 20

Question 21

Question 22

Question 23

Question 24

Question 25

Question 26