Which SAS program will divide the original data set into 60% training and 40% validation data sets, stratified by county?

A. Option A
B. Option B
C. Option C
D. Option D
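
As a language-neutral illustration of what a stratified 60/40 split does, here is a minimal Python sketch. It is not the SAS program the question asks for; the function name `stratified_split`, the `county` key, and the sample rows are all hypothetical. The point is that the 60% rate is applied within each county rather than to the data set as a whole.

```python
import random
from collections import defaultdict

def stratified_split(rows, key, train_frac=0.6, seed=0):
    """Split rows into training/validation, applying train_frac within each stratum."""
    rng = random.Random(seed)
    strata = defaultdict(list)
    for row in rows:
        strata[row[key]].append(row)

    train, valid = [], []
    for members in strata.values():
        rng.shuffle(members)                      # random order within the stratum
        cut = round(len(members) * train_frac)    # 60% of this stratum
        train.extend(members[:cut])
        valid.extend(members[cut:])
    return train, valid

# Hypothetical data: 10 rows in county "A", 20 rows in county "B".
rows = [{"id": i, "county": "A" if i < 10 else "B"} for i in range(30)]
train, valid = stratified_split(rows, key="county")
```

With these made-up rows, each county contributes 60% of its own observations to training, so both partitions keep the original county mix.
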
In order to perform honest assessment on a predictive model, what is an acceptable division between training, validation, and testing data?
A. Training: 50% Validation: 0% Testing: 50%
B. Training: 100% Validation: 0% Testing: 0%
C. Training: 0% Validation: 100% Testing: 0%
D. Training: 50% Validation: 50% Testing: 0%
An analyst has a sufficient volume of data to perform a 3-way partition of the data into training, validation, and test sets to perform honest assessment during the model building process. What is the purpose of the training data set?
A. To provide an unbiased measure of assessment for the final model.
B. To compare models and select and fine-tune the final model.
C. To reduce total sample size to make computations more efficient.
D. To build the predictive models.
Customers were surveyed to assess their intent to purchase a product. An analyst divided the customers into groups defined by the company's pre-assigned market segments and tested for differences in the customers' average intent to purchase. The following is the output from the GLM procedure:

What percentage of the variation in customers' intent to purchase is explained by market segment?
A. <0.01%
B. 35%
C. 65%
D. 76%
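
The proportion of variation explained by a grouping variable is the model sum of squares divided by the corrected total sum of squares (the R-square value). Since the GLM output is not reproduced here, the Python sketch below uses small made-up data to show the arithmetic; the function name and the segment values are hypothetical and unrelated to the answer options.

```python
def r_squared_by_group(groups):
    """R-square for a one-way layout: SS(model) / SS(corrected total)."""
    values = [v for members in groups.values() for v in members]
    grand_mean = sum(values) / len(values)

    # Total variation of every observation around the grand mean.
    ss_total = sum((v - grand_mean) ** 2 for v in values)

    # Variation of the group means around the grand mean, weighted by group size.
    ss_model = sum(
        len(members) * (sum(members) / len(members) - grand_mean) ** 2
        for members in groups.values()
    )
    return ss_model / ss_total

# Hypothetical intent scores for two market segments.
r2 = r_squared_by_group({"segment_a": [1, 3], "segment_b": [2, 4]})
```

For this toy data the model sum of squares is 1 and the total is 5, so the segments explain 20% of the variation.
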
Including redundant input variables in a regression model can:
A. Stabilize parameter estimates and increase the risk of overfitting.
B. Destabilize parameter estimates and increase the risk of overfitting.
C. Stabilize parameter estimates and decrease the risk of overfitting.
D. Destabilize parameter estimates and decrease the risk of overfitting.
Identify the correct SAS program for fitting a multiple linear regression model with dependent variable (y) and four predictor variables (x1-x4).

A. Option A
B. Option B
C. Option C
D. Option D
Which of the following describes a concordant pair of observations in the LOGISTIC procedure?
A. An observation with the event has an equal probability as another observation with the event.
B. An observation with the event has a lower predicted probability than the observation without the event.
C. An observation with the event has an equal predicted probability as the observation without the event.
D. An observation with the event has a higher predicted probability than the observation without the event.
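
Pair classification compares every observation that had the event with every observation that did not. The Python sketch below is a simplified illustration that compares predicted probabilities exactly; the LOGISTIC procedure may group nearly equal probabilities when counting ties, so its counts can differ. The function name and the probabilities are hypothetical.

```python
def classify_pairs(event_probs, nonevent_probs):
    """Count concordant, discordant, and tied (event, non-event) pairs."""
    counts = {"concordant": 0, "discordant": 0, "tied": 0}
    for p_event in event_probs:
        for p_nonevent in nonevent_probs:
            if p_event > p_nonevent:
                counts["concordant"] += 1   # model ranks the event case higher
            elif p_event < p_nonevent:
                counts["discordant"] += 1   # model ranks the event case lower
            else:
                counts["tied"] += 1         # model cannot separate the pair
    return counts

# Hypothetical predicted probabilities of the event.
counts = classify_pairs(event_probs=[0.9, 0.4], nonevent_probs=[0.3, 0.4, 0.7])
```

Two event cases and three non-event cases give six pairs in total, which is the denominator for the percent concordant statistic.
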
Refer to the confusion matrix:

Calculate the accuracy and error rate (0 = negative outcome, 1 = positive outcome).
A. Accuracy = 58/102, Error Rate = 23/48
B. Accuracy = 83/102, Error Rate = 67/102
C. Accuracy = 25/150, Error Rate = 44/150
D. Accuracy = 83/150, Error Rate = 67/150
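
Accuracy is the share of correctly classified cases (true negatives plus true positives), and the error rate is its complement. Because the confusion matrix is not reproduced here, this Python sketch uses made-up cell counts; the function name and the numbers are hypothetical and do not correspond to the matrix in the question.

```python
def accuracy_and_error(tn, fp, fn, tp):
    """Return (accuracy, error rate) from the four confusion-matrix cells."""
    total = tn + fp + fn + tp
    correct = tn + tp          # cells on the diagonal
    wrong = fp + fn            # cells off the diagonal
    return correct / total, wrong / total

# Hypothetical counts: 100 cases in total.
accuracy, error_rate = accuracy_and_error(tn=50, fp=10, fn=15, tp=25)
```

Both rates share the same denominator, the total number of cases, and they always sum to 1.
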
This question will ask you to provide a segment of missing code.
The following code is used to create missing value indicator variables for input variables, fred1 to fred7.

Which segment of code would complete the task?
A. Option A
B. Option B
C. Option C
D. Option D
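
For readers who want to see the logic outside SAS, here is a minimal Python sketch of missing value indicators for fred1 to fred7. It is not the missing SAS segment; the `mi_` prefix, the function name, and the sample record are assumptions made for illustration. Each indicator is 1 when the corresponding input is missing and 0 otherwise.

```python
# Input variable names fred1 through fred7.
INPUTS = [f"fred{i}" for i in range(1, 8)]

def add_missing_indicators(record):
    """Return a copy of record with one 0/1 missing-value flag per input."""
    flagged = dict(record)
    for name in INPUTS:
        # None stands in for a missing value in this sketch.
        flagged[f"mi_{name}"] = 1 if record.get(name) is None else 0
    return flagged

# Hypothetical observation with fred2 and fred6 missing.
record = {"fred1": 3.2, "fred2": None, "fred3": 0.0, "fred4": 7.1,
          "fred5": 1.5, "fred6": None, "fred7": 2.0}
flagged = add_missing_indicators(record)
```

Note that a value of 0.0 is not treated as missing; only an absent or `None` value sets the flag.
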