DATABRICKS-CERTIFIED-PROFESSIONAL-DATA-SCIENTIST Exam Questions
138 real DATABRICKS-CERTIFIED-PROFESSIONAL-DATA-SCIENTIST exam questions with expert-verified answers and explanations. Page 1 of 3.
- Question #1
Suppose you have been given two Random Variables X and Y, whose joint distribution is already known, the marginal distribution of X is simply the probability distribution of X aver...
- Question #2
Suppose that the probability that a pedestrian will be tul by a car while crossing the toad at a pedestrian crossing without paying attention to the traffic light is lo be computed...
- Question #3
You have modeled the datasets with 5 independent variables called A,B,C,D and E having relationships which is not dependent each other, and also the variable A,B and C are continuo...
- Question #4
RMSE measures error of a predicted
- Question #5
Suppose you have made a model for the rating system, which rates between 1 to 5 stars. And you calculated that RMSE value is 1.0 then which of the following is correct
- Question #6
You are creating a regression model with the input income, education and current debt of a customer, what could be the possible output from this model.
- Question #7
In which of the scenario you can use the regression to predict the values
- Question #8
RMSE is a good measure of accuracy, but only to compare forecasting errors of different models for a______, as it is scale-dependent.
- Question #9
You are creating a Classification process where input is the income, education and current debt of a customer, what could be the possible output of this process.
- Question #10
Let's say you have two cases as below for the movie ratings 1. You recommend to a user a movie with four stars and he really doesn't like it and he'd rate it two stars 2. You recom...
- Question #11
RMSE is a useful metric for evaluating which types of models?
- Question #12
Select the correct statement which applies to logistic regression
- Question #13
Suppose that we are interested in the factors that influence whether a political candidate wins an election. The outcome (response) variable is binary (0/1); win or lose. The predi...
- Question #14
A researcher is interested in how variables, such as GRE (Graduate Record Exam scores), GPA (grade point average) and prestige of the undergraduate institution, effect admission in...
- Question #15
In unsupervised learning which statements correctly applies
- Question #16
Select the correct statement which applies to Supervised learning
- Question #17
Which of the following is a correct example of the target variable in regression (supervised learning)?
- Question #18
Select the correct algorithm of unsupervised algorithm
- Question #19
Classification and regression are examples of___________.
- Question #20
Reducing the data from many features to a small number so that we can properly visualize it in two or three dimensions. It is done in_______
- Question #21
If you are trying to predict or forecast a discrete target value, then which is the correct options
- Question #22
Select the correct option from the below
- Question #23
Select the correct statement which applies to K-Nearest Neighbors
- Question #24
Select the statement which applies correctly to the Naive Bayes
- Question #25
Which of the following technique can be used to the design of recommender systems?
- Question #26
You are working on a problem where you have to predict whether the claim is done valid or not. And you find that most of the claims which are having spelling errors as well as corr...
- Question #27
Scenario: Suppose that Bob can decide to go to work by one of three modes of transportation, car, bus, or commuter train. Because of high traffic, if he decides to go by car. there...
- Question #28
In which of the following scenario you should apply the Bay's Theorem
- Question #29
Marie is getting married tomorrow, at an outdoor ceremony in the desert. In recent years, it has rained only 5 days each year. Unfortunately, the weatherman has predicted rain for...
- Question #30
Your company has organized an online campaign for feedback on product quality and you have all the responses for the product reviews, in the response form people have check box as...
- Question #31
Suppose you have been given a relatively high-dimension set of independent variables and you are asked to come up with a model that predicts one of Two possible outcomes like "YES"...
- Question #32
A bio-scientist is working on the analysis of the cancer cells. To identify whether the cell is cancerous or not, there has been hundreds of tests are done with small variations to...
- Question #33
Find out the classifier which assumes independence among all its features?
- Question #34
Digit recognition, is an example of.....
- Question #35
You are using one approach for the classification where to teach the agent not by giving explicit categorizations, but by using some sort of reward system to indicate success, wher...
- Question #36
Select the correct statement which applies to Principal component analysis (PCA)
- Question #37
Select the correct objectives of principal component analysis
- Question #38
Select the sequence of the developing machine learning applications
- Question #39
What type of output generated in case of linear regression?
- Question #40
In which of the scenario you can use the linear regression model?
- Question #41
Which of the following statement true with regards to Linear Regression Model?
- Question #42
What describes a true property of Logistic Regression method?
- Question #43
A data scientist is asked to implement an article recommendation feature for an on-line magazine. The magazine does not want to use client tracking technologies such as cookies or...
- Question #44
Which of the following statement is true for the R square value in the regression model?
- Question #45
Which technique you would be using to solve the below problem statement? "What is the probability that individual customer will not repay the loan amount?"
- Question #46
What is one modeling or descriptive statistical function in MADlib that is typically not provided in a standard relational database?
- Question #47
You are creating a model for the recommending the book at Amazon.com, so which of the following recommender system you will use you don't have cold start problem?
- Question #48
Clustering is a type of unsupervised learning with the following goals
- Question #49
Assume some output variable "y" is a linear combination of some independent input variables "A" plus some independent noise "e". The way the independent variables are combined is d...
- Question #50
Refer to Exhibit In the exhibit, the x-axis represents the derived probability of a borrower defaulting on a loan. Also in the exhibit, the pink represents borrowers that are known...