Labour Day Sale - Limited Time 70% Discount Offer - Ends in 0d 00h 00m 00s - Coupon code: mxmas70

Home > Databricks > Databricks Certification > Databricks-Certified-Professional-Data-Scientist

Databricks-Certified-Professional-Data-Scientist Databricks Certified Professional Data Scientist Exam Question and Answers

Question # 4

Suppose that we are interested in the factors that influence whether a political candidate wins an election. The outcome (response) variable is binary (0/1); win or lose. The predictor variables of interest are the amount of money spent on the campaign, the amount of time spent campaigning negatively and whether or not the candidate is an incumbent.

Above is an example of

A.

Linear Regression

B.

Logistic Regression

C.

Recommendation system

D.

Maximum likelihood estimation

E.

Hierarchical linear models

Full Access
Question # 5

Digit recognition, is an example of.....

A.

Classification

B.

Clustering

C.

Unsupervised learning

D.

None of the above

Full Access
Question # 6

Suppose a man told you he had a nice conversation with someone on the train. Not knowing anything about this conversation, the probability that he was speaking to a woman is 50% (assuming the train had an equal number of men and women and the speaker was as likely to strike up a conversation with a man as with a woman). Now suppose he also told you that his conversational partner had long hair. It is now more

likely he was speaking to a woman, since women are more likely to have long hair than men.____________

can be used to calculate the probability that the person was a woman.

A.

SVM

B.

MLE

C.

Bayes' theorem

D.

Logistic Regression

Full Access
Question # 7

You are designing a recommendation engine for a website where the ability to generate more personalized recommendations by analyzing information from the past activity of a specific user, or the history of other users deemed to be of similar taste to a given user. These resources are used as user profiling and helps the site recommend content on a user-by-user basis. The more a given user makes use of the system, the better the recommendations become, as the system gains data to improve its model of that user. What kind of this recommendation engine is ?

A.

Naive Bayes classifier

B.

Collaborative filtering

C.

Logistic Regression

D.

Content-based filtering

Full Access
Question # 8

Select the correct statement which applies to Supervised learning

A.

We asks the machine to learn from our data when we specify a target variable.

B.

Lesser machine's task to only divining some pattern from the input data to get the target variable

C.

Instead of telling the machine Predict Y for our data X, we're asking What can you tell me about X?

Full Access
Question # 9

In statistics, maximum-likelihood estimation (MLE) is a method of estimating the parameters of a statistical model. When applied to a data set and given a statistical model, maximum-likelihood estimation provides estimates for the model's parameters and the normalizing constant usually ignored in MLEs because

A.

The normalizing constant is always very close to 1

B.

The normalizing constant only has a small impact on the maximum likelihood

C.

The normalizing constant is often zero and can cause division by zero

D.

The normalizing constant doesn't impact the maximizing value

Full Access
Question # 10

You are working on a Data Science project and during the project you have been gibe a responsibility to interview all the stakeholders in the project. In which phase of the project you are?

A.

Discovery

B.

Data Preparations

C.

Creating Models

D.

Executing Models

E.

Creating visuals from the outcome

F.

Operationnalise the models

Full Access
Question # 11

Which of the following metrics are useful in measuring the accuracy and quality of a recommender system?

A.

Cluster Density

B.

Support Vector Count

C.

Mean Absolute Error

D.

Sum of Absolute Errors

Full Access
Question # 12

What is the probability that the total of two dice will be greater than 8, given that the first die is a 6?

A.

1/3

B.

2/3

C.

1/6

D.

2/6

Full Access
Question # 13

Under which circumstance do you need to implement N-fold cross-validation after creating a regression model?

A.

The data is unformatted.

B.

There is not enough data to create a test set.

C.

There are missing values in the data.

D.

There are categorical variables in the model.

Full Access
Question # 14

What are the advantages of the Hashing Features?

A.

Requires the less memory

B.

Less pass through the training data

C.

Easily reverse engineer vectors to determine which original feature mapped to a vector location

Full Access
Question # 15

Which of the following is a Continuous Probability Distributions?

A.

Binomial probability distribution

B.

Negative binomial distribution

C.

Poisson probability distribution

D.

Normal probability distribution

Full Access
Question # 16

Assume some output variable "y" is a linear combination of some independent input variables "A" plus some independent noise "e". The way the independent variables are combined is defined by a parameter vector B y=AB+e where X is an m x n matrix. B is a vector of n unknowns, and b is a vector of m values. Assuming that m is not equal to n and the columns of X are linearly independent, which expression correctly solves for B?

A.

Option A

B.

Option B

C.

Option C

D.

Option D

Full Access
Question # 17

Which is an example of supervised learning?

A.

PCA

B.

k-means clustering

C.

SVD

D.

EM

E.

SVM

Full Access
Question # 18

You are working as a data science consultant for a gaming company. You have three member team and all other stake holders are from the company itself like project managers and project sponsored, data team etc. During the discussion project managed asked you that when can you tell me that the model you are using is robust enough, after which step you can consider answer for this question?

A.

Data Preparation

B.

Discovery

C.

Operationalize

D.

Model planning

E.

Model building

Full Access
Question # 19

What describes a true property of Logistic Regression method?

A.

It handles missing values well.

B.

It works well with discrete variables that have many distinct values.

C.

It is robust with redundant variables and correlated variables.

D.

It works well with variables that affect the outcome in a discontinuous way.

Full Access
Question # 20

Question-3: In machine learning, feature hashing, also known as the hashing trick (by analogy to the kernel trick), is a fast and space-efficient way of vectorizing features (such as the words in a language), i.e., turning arbitrary features into indices in a vector or matrix. It works by applying a hash function to the features and using their hash values modulo the number of features as indices directly, rather than looking the indices up in an associative array. So what is the primary reason of the hashing trick for building classifiers?

A.

It creates the smaller models

B.

It requires the lesser memory to store the coefficients for the model

C.

It reduces the non-significant features e.g. punctuations

D.

Noisy features are removed

Full Access