Month End Sale - Limited Time 70% Discount Offer - Ends in 0d 00h 00m 00s - Coupon code: mxmas70

Home > EMC > Data Science > D-DS-FN-23

D-DS-FN-23 Dell Data Science Foundations Question and Answers

Question # 4

What is part of the model output for a linear regression?

A.

The assignment of each input datum to a cluster

B.

Coefficients indicating relative impact of the input variables on the outcome

C.

The set of all rules X -> Y with minimum support and confidence

D.

Probability score for each possible class label

Full Access
Question # 5

When building a K-means clustering model, you notice that the clusters did not segment on variables that you expected. What should you do?

A.

Decrease the value of K

B.

Multiply each variable by its standard deviation

C.

Add the WSS to each variable

D.

Check that the data was properly scaled

Full Access
Question # 6

What is a recommended use case for regular expressions?

A.

Linear regression

B.

Decision trees

C.

Logistic regression

D.

In-database text analysis

Full Access
Question # 7

In a user-defined aggregate function, what is FFUNC?

A.

Optional final calculation function

B.

Window function

C.

State transition function

D.

Segment-level calculation function

Full Access
Question # 8

Match each task to its description.

Full Access
Question # 9

You have the data from a popular e-commerce website. You are exploring the time spent (in seconds) on the website by 100,000 customers across 14 different product categories.

What visualization can be used to represent the relationship between time spent and product category?

A.

Rug plot

B.

Scatter plot

C.

Box and whisker plot

D.

Hexbin plot

Full Access
Question # 10

A logistic regression model is built to determine the probability of a credit card borrower defaulting on a credit loan. A threshold value of 0.3 is selected. Which statement can be used to predict a borrower will default?

A.

If probability > 0.1, then predict the borrower will default

B.

If probability < 0.1, then predict the borrower will default

C.

If probability > 0.3, then predict the borrower will default

D.

If probability < 0.3, then predict the borrower will default

Full Access
Question # 11

Refer to the exhibit.

What is the approximate R-squared value for a linear regression model fitted to the data associated with this scatterplot?

A.

4

B.

0.96

C.

0.25

D.

16

Full Access
Question # 12

After which phase of the data analytics lifecycle should you determine if the model needs any recalibration?

A.

Model planning

B.

Data preparation

C.

Discovery

D.

Operationalize

Full Access
Question # 13

What is the similarity between the matrix and array data structures in R?

A.

Both structures can contain only integers

B.

Both structures can only contain one data type

C.

Both structures can store multiple data types

D.

Both structures must be 2-dimensional

Full Access
Question # 14

When should you consider using multinomial logistic regression over binary logistic regression?

A.

Dependent variable is continuous or dichotomous

B.

Dependent variable is continuous or categorical

C.

Dependent variable has more than two categories

D.

Dependent variable is continuous only

Full Access
Question # 15

What is the purpose of applying the naïve Bayes conditional independence assumption?

A.

To simplify the probability calculations

B.

To calculate the probability of rare events

C.

To minimize rounding errors in probability calculations

D.

To accurately calculate each probability

Full Access
Question # 16

After running a density plot you realize that the data has a long tail to the right. What can you do to make the dataset more normally distributed?

A.

Use a scatter plot to obtain a better picture

B.

Use a histogram to obtain a better picture

C.

Apply a square transformation

D.

Apply a logarithmic transformation

Full Access
Question # 17

What is a business driver for Big Data analytics adoption?

A.

Implement the latest technology and tools

B.

Maintain existing data silos

C.

Identify new business opportunities

D.

Ensure the analysts work in isolation

Full Access