Month End Sale Limited Time 70% Discount Offer - Ends in 0d 00h 00m 00s - Coupon code: 70special

EMC D-DS-FN-23 Dell Data Science Foundations Exam Practice Test

Page: 1 / 6
Total 59 questions

Dell Data Science Foundations Questions and Answers

Testing Engine

  • Product Type: Testing Engine
$37.5  $124.99

PDF Study Guide

  • Product Type: PDF Study Guide
$33  $109.99
Question 1

What type of variable is the dependent variable from a logistic regression?

Options:

A.

Categorical

B.

Continuous

C.

Ratio

D.

Interval

Question 2

In a user-defined aggregate function, what is FFUNC?

Options:

A.

Optional final calculation function

B.

Window function

C.

State transition function

D.

Segment-level calculation function

Question 3

What is a key consideration when preparing a presentation intended for analysts?

Options:

A.

Describe how to implement the model

B.

Provide talking points to promote or evangelize the project

C.

Emphasize the business benefits of implementing the model

D.

Focus on clean simple-to-understand visuals

Question 4

After running a density plot you realize that the data has a long tail to the right. What can you do to make the dataset more normally distributed?

Options:

A.

Use a scatter plot to obtain a better picture

B.

Use a histogram to obtain a better picture

C.

Apply a square transformation

D.

Apply a logarithmic transformation

Question 5

In time series analysis, what statement describes a MA(q) process?

Options:

A.

Current deviation from the time series mean depends on the q previous deviations

B.

Current deviation from the time series mean depends on the quotient q

C.

Current time series value depends on the q previous values

D.

Current time series value depends on the fitted polynomial of order q

Question 6

In hypothesis testing, when does a Type I error occur?

Options:

A.

Null hypothesis is rejected when it is actually false

B.

Null hypothesis is rejected when it is actually true

C.

Null hypothesis is accepted when it is actually false

D.

Null hypothesis is accepted when it is actually true

Question 7

What does a branch represent in a decision tree?

Options:

A.

Outcome of a test on a variable

B.

Outcome of all prior decisions

C.

Root of the tree

D.

Class label for the decision tree

Question 8

You build a decision tree to classify five different types of customers based on their browsing history from a sample of 500. The resulting decision tree has 17 layers. One of the leaf nodes has only three customers.

What do you conclude?

Options:

A.

The decision tree needs to be rebuilt without the three customers

B.

The decision tree needs to be rebuilt to see if the results change

C.

The sample size is too small, so the classes may not be accurate

D.

Due to large number of layers, there may be an overfitting problem

Question 9

Which component of a final presentation provides a succinct overview of the business situation that was the impetus to initiate the project?

Options:

A.

Model description

B.

Approach

C.

Project goals

D.

Recommendations

Question 10

When should you consider using multinomial logistic regression over binary logistic regression?

Options:

A.

Dependent variable is continuous or dichotomous

B.

Dependent variable is continuous or categorical

C.

Dependent variable has more than two categories

D.

Dependent variable is continuous only

Question 11

What is part of the model output for a linear regression?

Options:

A.

The assignment of each input datum to a cluster

B.

Coefficients indicating relative impact of the input variables on the outcome

C.

The set of all rules X -> Y with minimum support and confidence

D.

Probability score for each possible class label

Question 12

What does “MAD” in MADlib stand for?

Options:

A.

Magnetic Association Design

B.

Magnetic Agile Deep

C.

Multiple Agile Development

D.

Multiple Access Design

Question 13

After which phase of the data analytics lifecycle should you determine if the model needs any recalibration?

Options:

A.

Model planning

B.

Data preparation

C.

Discovery

D.

Operationalize

Question 14

You have been given a task to improve sales force compensation of your organization. As a result of a study, your team decides to classify personnel as follows:

● Did not meet quota

● Met quota

● Exceeded 150% of quota

In which data analytics lifecycle phase should you define these categories for analysis purposes?

Options:

A.

Model building

B.

Communicate results

C.

Operationalize

D.

Model planning

Question 15

Refer to the exhibit.

What is the approximate R-squared value for a linear regression model fitted to the data associated with this scatterplot?

Options:

A.

4

B.

0.96

C.

0.25

D.

16

Question 16

Which visualization technique should be avoided?

Options:

A.

Using a small number of contrasting colors to draw distinctions

B.

Using tables of numbers to present all of the data visually

C.

Achieving a high data-ink ratio

D.

Using visuals to illustrate key points

Question 17

On which type of data should you run K-means clustering?

Options:

A.

Ordinal

B.

Numeric

C.

Text

D.

Nominal

Page: 1 / 6
Total 59 questions