Summer Sale Limited Time 65% Discount Offer - Ends in 0d 00h 00m 00s - Coupon code: ecus65

SAS Institute A00-240 - SAS Statistical Business Analysis SAS9: Regression and Model

Page: 1 / 3
Total 99 questions

Refer to the exhibit.

Output from a multiple linear regression analysis is shown.

What is the most appropriate statement concerning collinearity between the input variables?

A.

Collinearity is a problem since all variance inflation values are less than 10.

B.

Collinearity is not a problem since all variance inflation values are less than 10.

C.

Collinearity is not a problem since all Pr>|t| values are less than 0.05.

D.

Collinearity is a problem since all Pr>|t| values are less than 0.05.

An analyst has a sufficient volume of data to perform a 3-way partition of the data into training, validation, and test sets to perform honest assessment during the model building process.

What is the purpose of the training data set?

A.

To provide an unbiased measure of assessment for the final model.

B.

To compare models and select and fine-tune the final model.

C.

To reduce total sample size to make computations more efficient.

D.

To build the predictive models.

Refer to the exhibit:

Based upon the comparative ROC plot for two competing models, which is the champion model and why?

A.

Candidate 1, because the area outside the curve is greater

B.

Candidate 2, because the area under the curve is greater

C.

Candidate 1, because it is closer to the diagonal reference curve

D.

Candidate 2, because it shows less over fit than Candidate 1

Which SAS program will detect collinearity in a multiple regression application?

A.

Option A

B.

Option B

C.

Option C

D.

Option D

Customers were surveyed to assess their intent to purchase a product. An analyst divided the customers into groups defined by the company's pre-assigned market segments and tested for difference in the customers' average intent to purchase. The following is the output from the GLM procedure:

What percentage of customers' intent to purchase is explained by market segment?

Click the calculator button to display a calculator if needed.

A.

<0.01%

B.

35%

C.

65%

D.

76%

The selection criterion used in the forward selection method in the GLMSELECT procedure is:

A.

RSQ

B.

MSE

C.

R-squared

D.

AIC

A non-contributing predictor variable (Pr > |t| =0.658) is added to an existing multiple linear regression model.

What will be the result?

A.

An increase in R-Square

B.

A decrease in R-Square

C.

A decrease in Mean Square Error

D.

No change in R-Square

While building a predictive model, median imputations are performed while preparing the training data.

How should the imputations be addressed in the validation data?

A.

The imputed values are irrelevant to the validation data, and are not used.

B.

The imputed values must be applied directly to the validation data without recalculation.

C.

The imputed values must be recalculated using the validation data.

D.

The imputed values must be recalculated using both the training and the validation data.

In order to perform honest assessment on a predictive model, what is an acceptable division between training, validation, and testing data?

A.

Training: 50% Validation: 0% Testing: 50%

B.

Training: 100% Validation: 0% Testing: 0%

C.

Training: 0% Validation: 100% Testing: 0%

D.

Training: 50% Validation: 50% Testing: 0%

A marketing manager attempts to determine those customers most likely to purchase additional products as the result of a nation-wide marketing campaign.

The manager possesses a historical dataset (CAMPAIGN) of a similar campaign from last year.

It has the following characteristics:

    Target variable Respond (0, 1)

    Continuous predictor Income

    Categorical predictor Homeowner(Y, N)

Which SAS program performs this analysis?

A.

Option A

B.

Option B

C.

Option C

D.

Option D