A new test to diagnose a disease is evaluated on 1152 people, and 106 people have the disease, and 1046 people do not have the disease. The test results are summarized below:

In this sample, how many cases are false positives and false negatives?
A. 33 false positives and 81 false negatives
B. 81 false positives and 73 false negatives
C. 73 false positives and 81 false negatives
D. 81 false positives and 33 false negatives
What is the goal of the backpropagation algorithm?
A. to randomize the trajectory of the neural network parameters during training
B. to smooth the gradient of the loss function in order to avoid getting trapped in small local minimas
C. to scale the gradient descent step in proportion to the gradient magnitude
D. to compute the gradient of the loss function with respect to the neural network parameters
A neural network is composed of a first affine transformation (affine1) followed by a ReLU non-linearity, followed by a second affine transformation (affine2). Which two explicit functions are implemented by this neural network? (Choose two.)
A. y = affine1(ReLU(affine2(x)))
B. y = max(affine1(x), affine2(x))
C. y = affine2(ReLU(affine1(x)))
D. y = affine2(max(affine1(x), 0))
E. y = ReLU(affine1(x), affine2(x))
Determine the number of bigrams and trigrams in the sentence. "Data is the new oil".
A. 3 bigrams, 3 trigrams
B. 4 bigrams, 4 trigrams
C. 3 bigrams, 4 trigrams
D. 4 bigrams, 3 trigrams
What is used to scale large positive values during data cleaning?
A. division by random numbers
B. square
C. logarithm
D. subtract median
What is an example of a supervised machine learning algorithm that can be applied to a continuous numeric response variable?
A. linear regression
B. k-means
C. local outlier factor (LOF)
D. naive Bayes
A neural network is trained for a classification task. During training, you monitor the loss function for the train dataset and the validation dataset, along with the accuracy for the validation dataset. The goal is to get an accuracy of 95%.

From the graph, what modification would be appropriate to improve the performance of the model?
A. increase the depth of the neural network
B. insert a dropout layer in the neural network architecture
C. increase the proportion of the train dataset by moving examples from the validation dataset to the train dataset
D. restart the training with a higher learning rate
What are the various components that make up a time series data?
A. trend, noise, covariance
B. trend, noise, kurtosis
C. trend, seasonality, causation
D. trend, seasonality, noise
Which two properties hold true for standardized variables (also known as z-score normalization)? (Choose two.)
A. standard deviation = 0.5
B. expected value = 0
C. expected value = 0.5
D. expected value = 1
E. standard deviation = 1
A data analyst creates a term-document matrix for the following sentence:
I saw a cat, a dog and another cat.
Assuming they used a binary vectorizer, what is the resulting weight for the word cat?
A. 0
B. 1
C. 3
D. 2