Databricks Databricks-Certified-Professional-Data-Scientist - Databricks Certified Professional Data Scientist Exam

Databricks Databricks-Certified-Professional-Data-Scientist Premium Access Download Demo

Page: 1 / 5
Total 138 questions

You have data of 10.000 people who make the purchasing from a specific grocery store. You also have their income detail in the data. You have created 5 clusters using this data. But in one of the cluster you see that only 30 people are falling as below 30, 2400, 2600, 2700, 2270 etc."

What would you do in this case?

You will be increasing number of clusters.

You will be decreasing the number of clusters.

You will remove that 30 people from dataset

You will be multiplying standard deviation with the 100

Question # 2

Refer to Exhibit

In the exhibit, the x-axis represents the derived probability of a borrower defaulting on a loan. Also in the exhibit, the pink represents borrowers that are known to have not defaulted on their loan, and the blue represents borrowers that are known to have defaulted on their loan. Which analytical method could produce the probabilities needed to build this exhibit?

Linear Regression

Logistic Regression

Discriminant Analysis

Association Rules

Question # 3

In which of the scenario you can use the regression to predict the values

Samsung can use it for mobile sales forecast

Mobile companies can use it to forecast manufacturing defects

Probability of the celebrity divorce

Only 1 and 2

All 1 ,2 and 3

Question # 4

In which phase of the data analytics lifecycle do Data Scientists spend the most time in a project?

Discovery

Data Preparation

Model Building

Communicate Results

Question # 5

Which of the following is a correct example of the target variable in regression (supervised learning)?

Nominal values like true, false

Reptile, fish, mammal, amphibian, plant, fungi

Infinite number of numeric values, such as 0.100, 42.001, 1000.743..

All of the above

Question # 6

In statistics, maximum-likelihood estimation (MLE) is a method of estimating the parameters of a statistical model. When applied to a data set and given a statistical model, maximum-likelihood estimation provides estimates for the model's parameters and the normalizing constant usually ignored in MLEs because

The normalizing constant is always very close to 1

The normalizing constant only has a small impact on the maximum likelihood

The normalizing constant is often zero and can cause division by zero

The normalizing constant doesn't impact the maximizing value

Question # 7

Suppose that we are interested in the factors that influence whether a political candidate wins an election. The outcome (response) variable is binary (0/1); win or lose. The predictor variables of interest are the amount of money spent on the campaign, the amount of time spent campaigning negatively and whether or not the candidate is an incumbent.

Above is an example of

Linear Regression

Logistic Regression

Recommendation system

Maximum likelihood estimation

Hierarchical linear models

Question # 8

A data scientist is asked to implement an article recommendation feature for an on-line magazine.

The magazine does not want to use client tracking technologies such as cookies or reading history. Therefore, only the style and subject matter of the current article is available for making recommendations. All of the magazine's articles are stored in a database in a format suitable for analytics.

Which method should the data scientist try first?

K Means Clustering

Naive Bayesian

Logistic Regression

Association Rules

Question # 9

RMSE is a useful metric for evaluating which types of models?

Logistic regression

Naive Bayes classifier

Linear regression

All of the above

Question # 10

What type of output generated in case of linear regression?

Continuous variable

Discrete Variable

Any of the Continuous and Discrete variable

Values between 0 and 1

Summer Sale Special Limited Time 70% Discount Offer - Ends in 0d 00h 00m 00s - Coupon code: xmas50

Databricks Databricks-Certified-Professional-Data-Scientist - Databricks Certified Professional Data Scientist Exam

The Answer Is:

Explanation:

The Answer Is:

The Answer Is:

Explanation:

The Answer Is:

The Answer Is:

Explanation:

The Answer Is:

Explanation:

The Answer Is:

Explanation:

The Answer Is:

Explanation:

The Answer Is:

Explanation:

The Answer Is:

Explanation: