DATABRICKS-CERTIFIED-PROFESSIONAL-DATA-SCIENTIST Exam Questions

138 real DATABRICKS-CERTIFIED-PROFESSIONAL-DATA-SCIENTIST exam questions with expert-verified answers and explanations. Page 3 of 3.

Question #101
Refer to the exhibit. You are building a decision tree. In this exhibit, four variables are listed with their respective values of info-gain. Based on this information, on which at...
Question #102
You have collected the 100's of parameters about the 1000's of websites e.g. daily hits, average time on the websites, number of unique visitors, number of returning visitors etc....
Question #103
Which of the below best describe the Principal component analysis
Question #104
You are working in a data analytics company as a data scientist, you have been given a set of various types of Pizzas available across various premium food centers in a country. Th...
Question #105
Feature Hashing approach is "SGD-based classifiers avoid the need to predetermine vector size by simply picking a reasonable size and shoehorning the training data into vectors of...
Question #106
What are the advantages of the Hashing Features?
Question #107
In machine learning, feature hashing, also known as the hashing trick (by analogy to the kernel trick), is a fast and space-efficient way of vectorizing features (such as the words...
Question #108
Suppose A, B , and C are events. The probability of A given B , relative to P(|C), is the same as the probability of A given B and C (relative to P ). That is,
Question #109
What is the considerable difference between L1 and L2 regularization?
Question #110
Regularization is a very important technique in machine learning to prevent overfitting. Mathematically speaking, it adds a regularization term in order to prevent the coefficients...
Question #111
Select the correct option which applies to L2 regularization
Question #112
Regularization is a very important technique in machine learning to prevent over fitting. And Optimizing with a L1 regularization term is harder than with an L2 regularization term...
Question #113
Logistic regression is a model used for prediction of the probability of occurrence of an event. It makes use of several variables that may be......
Question #114
Spam filtering of the emails is an example of
Question #115
Select the choice where Regression algorithms are not best fit
Question #116
Question-13. Which of the following is not the Classification algorithm?
Question #117
You are working in an ecommerce organization, where you are designing and evaluating a recommender system, you need to select which of the following metric wilt always have the lar...
Question #118
Under which circumstance do you need to implement N-fold cross-validation after creating a regression model?
Question #119
A data scientist wants to predict the probability of death from heart disease based on three risk factors: age, gender, and blood cholesterol level. What is the most appropriate me...
Question #120
You are analyzing data in order to build a classifier model. You discover non-linear data and discontinuities that will affect the model. Which analytical method would you recommen...
Question #121
Your customer provided you with 2. 000 unlabeled records three groups. What is the correct analytical method to use?
Question #122
What describes a true limitation of Logistic Regression method?
Question #123
What are the advantages of the mutual information over the Pearson correlation for text classification problems?
Question #124
The figure below shows a plot of the data of a data matrix M that is 1000 x 2. Which line represents the first principal component?
Question #125
Question-18. What is the best way to ensure that the k-means algorithm will find a good clustering of a collection of vectors?
Question #126
A website is opened 3 times by a user. What is the probability of he clicks 2 times the advertisement, is best calculated by
Question #127
Suppose a man told you he had a nice conversation with someone on the train. Not knowing anything about this conversation, the probability that he was speaking to a woman is 50% (a...
Question #128
Which of the following could be features?
Question #129
Refer to image below
Question #130
A fruit may be considered to be an apple if it is red, round, and about 3" in diameter. A naive Bayes classifier considers each of these features to contribute independently to the...
Question #131
Select the correct statement regarding the naive Bayes classification
Question #132
In which of the following scenario we can use Naive Bayes theorem for classification
Question #133
Which of the following are advantages of the Support Vector machines?
Question #134
Support vector machines (SVMs) are a set of supervised learning methods used for
Question #135
Select the correct problems which can be solved using SVMs
Question #136
Which is an example of supervised learning?
Question #137
Which of the following are point estimation methods?
Question #138
In statistics, maximum-likelihood estimation (MLE) is a method of estimating the parameters of a statistical model. When applied to a data set and given a statistical model, maximu...

PreviousPage 3 of 3