What is an appropriate data visualization to use in a presentation for an analyst audience?
A. Pie chart
B. Area chart
C. Stacked bar chart
D. ROC curve
When creating a project sponsor presentation, what is the main objective?
A. Show that you met the project goals
B. Show how you met the project goals
C. Show how well the model will meet the SLA (service level agreement)
D. Clearly describe the methods and techniques used
Which word or phrase completes the statement? A Data Scientist would consider that a RDBMS is to a Table as R is to a ______________ .
A. Data frame
B. List
C. Matrix
D. Array
What are the characteristics of Big Data?
A. Data volume,processing complexity,and data structure variety.
B. Data volume,business importance,and data structure variety.
C. Data type,processing complexity,and data structure variety.
D. Data volume,processing complexity,and business importance.
Which word or phrase completes the statement?
Theater actor is to "Artistic and Expressive" as Data Scientist is to ________________
A. "Communicative and Collaborative"
B. "Introverted and Technical"
C. "Logical and Steadfast"
D. "Independent and Intelligent"
Your customer provided you with 2, 000 unlabeled records and asked you to separate them into three groups. What is the correct analytical method to use?
A. K-means clustering
B. Linear regression
C. Naive Bayesian classification
D. Logistic regression
You are using k-means clustering to classify heart patients for a hospital. You have chosen Patient Sex, Height, Weight, Age and Income as measures and have used 3 clusters. When you create a pair-wise plot of the clusters, you notice that there is significant overlap between the clusters. What should you do?
A. Identify additional measures to add to the analysis
B. Remove one of the measures
C. Decrease the number of clusters
D. Increase the number of clusters
Refer to the exhibit.
You are assigned to do an end of the year sales analysis of 1, 000 different products, based on the
transaction table. Which column in the end of year report requires the use of a window function?
A. Total Sales to Date
B. Daily Sales
C. Average Daily Price
D. Maximum Price
Refer to the exhibit.
You have run a linear regression model against your data, and have plotted true outcome versus predicted
outcome. The R-squared of your model is 0.75. What is your assessment of the model?
A. The R-squared may be biased upwards by the extreme-valued outcomes. Remove them and refit to get a better idea of the model's quality over typical data.
B. The R-squared is good. The model should perform well.
C. The extreme-valued outliers may negatively affect the model's performance. Remove them to see if the R-squared improves over typical data.
D. The observations seem to come from two different populations,but this model fits them both equally well.
Refer to the exhibit Consider the training data set shown in the exhibit. What are the classification (Y = 0 or
1) and the probability of the classification for the tupleX(0, 0, 1) using Naive Bayesian classifier?
A. Classification Y = 1,Probability = 4/54
B. Classification Y = 0,Probability = 1/54
C. Classification Y = 1,Probability = 1/54
D. Classification Y = 0,Probability = 4/54