E20-007 Exam Questions
162 real E20-007 exam questions with expert-verified answers and explanations. Page 2 of 4.
- Question #55
How does Pig's use of a schema differ from that of a traditional RDBMS?
- Question #56
You are provided four different datasets. Initial analysis on these datasets show that they have identical mean, variance and correlation values. What should your next step in the...
- Question #57
You are asked to create a model to predict the total number of monthly subscribers for a specific magazine. You are provided with 1 year's worth of subscription and payment data, u...
- Question #58
Which word or phrase completes the statement? Structured data is to OLAP data as quasi- structured data is to____
- Question #59
What describes a true property of Logistic Regression method?
- Question #60
You have been assigned to do a study of the daily revenue effect of a pricing model of online transactions. You have tested all the theoretical models in the previous model plannin...
- Question #61
What is a core deliverable at the end of the analytic project?
- Question #62
You have been assigned to run a logistic regression model for each of 100 countries, and all the data is currently stored in a PostgreSQL database. Which tool/library would you use...
- Question #63
Your organization has a website where visitors randomly receive one of two coupons. It is also possible that visitors to the website will not receive a coupon. You have been asked...
- Question #64
Imagine you are trying to hire a Data Scientist for your team. In addition to technical ability and quantitative background, which additional essential trait would you look for in...
- Question #65
What describes the use of UNION clause in a SQL statement?
- Question #66
You have run the association rules algorithm on your data set, and the two rules {banana, apple} => {grape} and {apple, orange}=> {grape} have been found to be relevant. What else...
- Question #67
When would you use a Wilcoxson Rank Sum test?
- Question #68
In the MapReduce framework, what is the purpose of the Reduce function?
- Question #69
Which of the following is an example of quasi-structured data?
- Question #70
A Data Scientist is assigned to build a model from a reporting data warehouse. The warehouse contains data collected from many sources and transformed through a complex, multi-stag...
- Question #71
Which word or phrase completes the statement? Emphasis color is to standard color as _______ .
- Question #72
Which activity might be performed in the Operationalize phase of the Data Analytics Lifecycle?
- Question #73
Refer to the exhibit. You are asked to write a report on how specific variables impact your client's sales using a data set provided to you by the client. The data includes 15 vari...
- Question #74
Refer to the Exhibit. In the Exhibit, the table shows the values for the input Boolean attributes "A", "B", and "C". It also shows the values for the output attribute "class". Whic...
- Question #75
Refer to the Exhibit. In the Exhibit, the table shows the values for the input Boolean attributes "A", "B", and "C". It also shows the values for the output attribute "class". Whic...
- Question #76
Refer to the exhibit. You are building a decision tree. In this exhibit, four variables are listed with their respective values of info-gain. Based on this information, on which at...
- Question #77
Refer to the exhibit. You are assigned to do an end of the year sales analysis of 1, 000 different products, based on the transaction table. Which column in the end of year report...
- Question #78
Refer to the exhibit. After analyzing a dataset, you report findings to your team: 1. Variables A and C are significantly and positively impacting the dependent variable. 2. Variab...
- Question #79
Refer to the Exhibit. You are working on creating an OLAP query that outputs several rows of with summary rows of subtotals and grand totals in addition to regular rows that may co...
- Question #80
Refer to the exhibit. Click on the calculator icon in the upper left corner. You are given a list of pre-defined association rules:
- Question #81
Refer to the exhibit. You have run a linear regression model against your data, and have plotted true outcome versus predicted outcome. The R-squared of your model is 0.75. What is...
- Question #82
Refer to the exhibit. You are using K-means clustering to classify customer behavior for a large retailer. You need to determine the optimum number of customer groups. You plot the...
- Question #83
Refer to the exhibit. You are using k-means clustering to discover groupings within a data set. You plot within-sum-of- squares (wss) of multiple cluster sizes. Based on the exhibi...
- Question #84
Refer to the exhibit Consider the training data set shown in the exhibit. What are the classification (Y = 0 or 1) and the probability of the classification for the tupleX(0, 0, 1)...
- Question #85
Refer to the exhibit. In the exhibit, a correlogram is provided based on an autocorrelation analysis of a sample dataset. What can you conclude from only this exhibit?
- Question #86
Refer to the exhibit. Which type of data issue would you suspect based on the exhibit?
- Question #88
Refer to the exhibit. Click on the calculator icon in the upper left corner. An analyst is searching a corpus of documents for the topic "solid state disk". In the Exhibit, Table A...
- Question #90
Refer to the exhibit. What provides the decision tree for predicting whether or not someone is a good or bad credit risk. What would be the assigned probability, p(good), of a sing...
- Question #91
Refer to the exhibit. You ran a linear regression, and the final output is seen in the exhibit. Based only on the information in the exhibit and an acceptable confidence level of 9...
- Question #92
Refer to the exhibit. The exhibit shows four graphs labeled as Fig A thorough Fig D. Which figure represents the entropy function relative to a Boolean classification and is repres...
- Question #94
Refer to the exhibit. The graph represents an ROC space with four classifiers labelled A through
- Question #95
Refer to the exhibit. Consider the training data set shown in the exhibit. What are the classification (Y = 0 or 1) and the probability of the classification for the tuple X(1, 0,...
- Question #96
Refer to the exhibit. You have scored your Naive bayesian classifier model on a hold out test data for cross validation and determined the way the samples scored and tabluated them...
- Question #99
You are using MADlib for Linear Regression analysis. Which value does the statement return? SELECT (linregr(depvar, indepvar)).r2 FROM zeta1;
- Question #100
Refer to the exhibit. You have scored your Naive bayesian classifier model on a hold out test data for cross validation and determined the way the samples scored and tabulated them...
- Question #101
A data scientist plans to classify the sentiment polarity of 10, 000 product reviews collected from the Internet. What is the most appropriate model to use? Suppose labeled trainin...
- Question #102
In which lifecycle stage are test and training data sets created?
- Question #103
When creating a presentation for a technical audience, what is the main objective?
- Question #104
Your company has 3 different sales teams. Each team's sales manager has developed incentive offers to increase the size of each sales transaction. Any sales manager whose incentive...
- Question #105
In data visualization, what is used to focus the audience on a key part of a chart?
- Question #106
When would you use GROUP BY ROLLUP clause in your OLAP query?
- Question #107
Which type of numeric value does a logistic regression model estimate?
- Question #108
Your colleague, who is new to Hadoop, approaches you with a question. They want to know how best to access their data. This colleague has a strong background in data flow languages...
- Question #109
The web analytics team uses Hadoop to process access logs. They now want to correlate this data with structured user data residing in a production single-instance JDBC database. Th...