New Year Sale Special Limited Time 70% Discount Offer - Ends in 0d 00h 00m 00s - Coupon code: xmas50

CertNexus AIP-210 - CertNexus Certified Artificial Intelligence Practitioner (CAIP)

Page: 1 / 3
Total 92 questions

Which of the following equations best represent an LI norm?

A.

|x| + |y|

B.

|x|+|y|^2

C.

|x|-|y|

D.

|x|^2+|y|^2

Which of the following is a common negative side effect of not using regularization?

A.

Overfitting

B.

Slow convergence time

C.

Higher compute resources

D.

Low test accuracy

Which two encoders can be used to transform categorical data into numerical features? (Select two.)

A.

Count Encoder

B.

Log Encoder

C.

Mean Encoder

D.

Median Encoder

E.

One-Hot Encoder

We are using the k-nearest neighbors algorithm to classify the new data points. The features are on different scales.

Which method can help us to solve this problem?

A.

Log transformation

B.

Normalization

C.

Square-root transformation

D.

Standardization

When should you use semi-supervised learning? (Select two.)

A.

A small set of labeled data is available but not representative of the entire distribution.

B.

A small set of labeled data is biased toward one class.

C.

Labeling data is challenging and expensive.

D.

There is a large amount of labeled data to be used for predictions.

E.

There is a large amount of unlabeled data to be used for predictions.

Normalization is the transformation of features:

A.

By subtracting from the mean and dividing by the standard deviation.

B.

Into the normal distribution.

C.

So that they are on a similar scale.

D.

To different scales from each other.

Which of the following statements are true regarding highly interpretable models? (Select two.)

A.

They are usually binary classifiers.

B.

They are usually easier to explain to business stakeholders.

C.

They are usually referred to as "black box" models.

D.

They are usually very good at solving non-linear problems.

E.

They usually compromise on model accuracy for the sake of interpretability.

Which of the following metrics is being captured when performing principal component analysis?

A.

Kurtosis

B.

Missingness

C.

Skewness

D.

Variance

Below are three tables: Employees, Departments, and Directors.

Employee_Table

Department_Table

Director_Table

ID

Firstname

Lastname

Age

Salary

DeptJD

4566

Joey

Morin

62

$ 122,000

1

1230

Sam

Clarck

43

$ 95,670

2

9077

Lola

Russell

54

$ 165,700

3

1346

Lily

Cotton

46

$ 156,000

4

2088

Beckett

Good

52

$ 165,000

5

Which SQL query provides the Directors' Firstname, Lastname, the name of their departments, and the average employee's salary?

A.

SELECT m.Firstname, m.Lastname, d.Name, AVG(e.Saiary) as Dept_avg_SaiaryFROM Employee_Table as eLEFT JOIN Department_Table as d on e.Dept = d.NameLEFT JOIN Directorjable as m on d.ID = m.DeptJDGROUP BY m.Firstname, m.Lastname, d.Name

B.

SELECT m.Firstname, m.Lastname, d.Name, AVG(e.Salary) as Dept_avg_SalaryFROM Employee_Table as eRIGHT JOIN Departmentjable as d on e.Dept = d.NameINNER JOIN Directorjable as m on d.ID = m.DeptJDGROUP BY d.Name

C.

SELECT m.Firstname, m.Lastname, d.Name, AVG(e.Salary) as Dept_avg_SalaryFROM Employee_Table as eRIGHT JOIN Department_Table as d on e.Dept = d.NameINNER JOIN Directorjable as m on d.ID = m.DeptJDGROUP BY e.Salary

D.

SELECT m.Firstname, m.Lastname, d.Name, AVG(e.Salary) as Dept_avg_SalaryFROM Employee_Table as eRIGHT JOIN Department_Table as d on e.Dept = d.NameINNER JOIN Directorjable as m on d.ID = m.DeptIDGROUP BY m.Firstname, m.Lastname, d.Name

Your dependent variable data is a proportion. The observed range of your data is 0.01 to 0.99. The instrument used to generate the dependent variable data is known to generate low quality data for values close to 0 and close to 1. A colleague suggests performing a logit-transformation on the data prior to performing a linear regression. Which of the following is a concern with this approach?

Definition of logit-transformation

If p is the proportion: logit(p)=log(p/(l-p))

A.

After logit-transformation, the data may violate the assumption of independence.

B.

Noisy data could become more influential in your model.

C.

The model will be more likely to violate the assumption of normality.

D.

Values near 0.5 before logit-transformation will be near 0 after.