Databricks Databricks-Certified-Professional-Data-Scientist - Databricks Certified Professional Data Scientist Exam
Total 138 questions
Which of the following steps you will be using in the discovery phase?
In which of the following scenario we can use naTve Bayes theorem for classification
You are working as a data science consultant for a gaming company. You have three member team and all other stake holders are from the company itself like project managers and project sponsored, data team etc. During the discussion project managed asked you that when can you tell me that the model you are using is robust enough, after which step you can consider answer for this question?
Refer to the exhibit.
You are building a decision tree. In this exhibit, four variables are listed with their respective values of info-gain.
Based on this information, on which attribute would you expect the next split to be in the decision tree?
What is the probability that the total of two dice will be greater than 8, given that the first die is a 6?
What is one modeling or descriptive statistical function in MADlib that is typically not provided in a standard relational database?
A website is opened 3 times by a user. What is the probability of he clicks 2 times the advertisement, is best calculated by
The method based on principal component analysis (PCA) evaluates the features according to
Projecting a multi-dimensional dataset onto which vector has the greatest variance?
Refer to the Exhibit.
In the Exhibit, the table shows the values for the input Boolean attributes "A", "B", and "C". It also shows the values for the output attribute "class". Which decision tree is valid for the data?