E20-007 Exam Questions
162 real E20-007 exam questions with expert-verified answers and explanations. Page 1 of 4.
- Question #1
What is an appropriate data visualization to use in a presentation for an analyst audience?
- Question #2
What is an example of a null hypothesis?
- Question #3
You have fit a decision tree classifier using 12 input variables. The resulting tree used 7 of the 12 variables, and is 5 levels deep. Some of the nodes contain only 3 data points....
- Question #4
If your intention is to show trends over time, which chart type is the most appropriate way to depict the data?
- Question #5
You are analyzing a time series and want to determine its stationarity. You also want to determine the order of autoregressive models. How are the autocorrelation functions used?
- Question #7
What is one modeling or descriptive statistical function in MADlib that is typically not provided in a standard relational database?
- Question #8
In which phase of the data analytics lifecycle do Data Scientists spend the most time in a project?
- Question #9
You are testing two new weight-gain formulas for puppies. The test gives the results: - Control group: 1% weight gain - Formula A. 3% weight gain - Formula B. 4% weight gain - A on...
- Question #10
Data visualization is used in the final presentation of an analytics project. For what else is this technique commonly used?
- Question #11
Which functionality do regular expressions provide?
- Question #12
When creating a project sponsor presentation, what is the main objective?
- Question #13
The average purchase size from your online sales site is $17, 200. The customer experience team believes a certain adjustment of the website will increase sales. A pilot study on a...
- Question #14
Which word or phrase completes the statement? Business Intelligence is to monitoring trends as Data Science is to ________ trends.
- Question #15
Consider a scale that has five (5) values that range from "not important" to "very important". Which data classification best describes this data?
- Question #16
Which key role for a successful analytic project can provide business domain expertise with a deep understanding of the data and key performance indicators?
- Question #17
On analyzing your time series data you suspect that the data represented as y1, y2, y3, ... , yn-1, yn may have a trend component that is quadratic in nature. Which pattern of data...
- Question #19
You have used k-means clustering to classify behavior of 100, 000 customers for a retail store. You decide to use household income, age, gender and yearly purchase amount as measur...
- Question #20
What does R code nv <- v[v < 1000] do?
- Question #21
For which class of problem is MapReduce most suitable?
- Question #22
Which activity is performed in the Operationalize phase of the Data Analytics Lifecycle?
- Question #23
Since R factors are categorical variables, they are most closely related to which data classification level?
- Question #24
In which phase of the analytic lifecycle would you expect to spend most of the project time?
- Question #25
You are building a logistic regression model to predict whether a tax filer will be audited within the next two years. Your training set population is 1000 filers. The audit rate i...
- Question #26
Refer to exhibit.You are asked to write a report on how specific variables impact your client's sales using a data set provided to you by the client. The data includes 15 variables...
- Question #27
You have two tables of customers in your database. Customers in cust_table_1 were sent an e- mail promotion last year, and customers in cust_table_2 received a newsletter last year...
- Question #28
In which lifecycle stage are initial hypotheses formed?
- Question #29
You are given 10, 000, 000 user profile pages of an online dating site in XML files, and they are stored in HDFS. You are assigned to divide the users into groups based on the cont...
- Question #30
The Marketing department of your company wishes to track opinion on a new product that was recently introduced. Marketing would like to know how many positive and negative reviews...
- Question #31
Which word or phrase completes the statement? A Data Scientist would consider that a RDBMS is to a Table as R is to a ______________ .
- Question #32
Which word or phrase completes the statement? Unix is to bash as Hadoop is to:
- Question #33
A call center for a large electronics company handles an average of 35, 000 support calls a day. The head of the call center would like to optimize the staffing of the call center...
- Question #34
Consider the example of an analysis for fraud detection on credit card usage. You will need to ensure higher-risk transactions that may indicate fraudulent credit card activity are...
- Question #36
You are studying the behavior of a population, and you are provided with multidimensional data at the individual level. You have identified four specific individuals who are valuab...
- Question #37
Which R data structure allows elements to have different data types?
- Question #38
Which key role for a successful analytic project can consult and advise the project team on the value of end results and how these will be used on a day-to-day basis?
- Question #39
A disk drive manufacturer has a defect rate of less than 1.0% with 98% confidence. A quality assurance team samples 1000 disk drives and finds 14 defective units. Which action shou...
- Question #40
What is required in a presentation for project sponsors?
- Question #41
A data scientist wants to predict the probability of death from heart disease based on three risk factors: age, gender, and blood cholesterol level. What is the most appropriate me...
- Question #42
What are the characteristics of Big Data?
- Question #44
What is an appropriate data visualization to use in a presentation for a project sponsor?
- Question #45
You have been assigned to do a study of the daily revenue effect of a pricing model of online transactions. All the data currently available to you has been loaded into your analyt...
- Question #46
Which SQL OLAP extension provides all possible grouping combinations?
- Question #47
What is the primary bottleneck in text classification?
- Question #48
Which characteristic applies only to Business Intelligence as opposed to Data Science?
- Question #49
You have been assigned to run a linear regression model for each of 5, 000 distinct districts, and all the data is currently stored in a PostgreSQL database. Which tool/library wou...
- Question #50
Your customer provided you with 2, 000 unlabeled records and asked you to separate them into three groups. What is the correct analytical method to use?
- Question #51
You are performing a market basket analysis using the Apriori algorithm. Which measure is a ratio describing the how many more times two items are present together than would be ex...
- Question #52
In which lifecycle stage are appropriate analytical techniques determined?
- Question #53
What is Hadoop?
- Question #54
You are using k-means clustering to classify heart patients for a hospital. You have chosen Patient Sex, Height, Weight, Age and Income as measures and have used 3 clusters. When y...