DY0-001 Exam Questions
92 real DY0-001 exam questions with expert-verified answers and explanations. Page 1 of 2.
- Question #1
A statistician notices gaps in data associated with age-related illnesses and wants to further aggregate these observations. Which of the following is the best technique to achieve...
- Question #2
A data scientist needs to analyze a company's chemical businesses and is using the master database of the conglomerate company. Nothing in the data differentiates the data observat...
- Question #3
A data scientist built several models that perform about the same but vary in the number of features. Which of the following models should the data scientist recommend for producti...
- Question #4
A data analyst wants to use compression on an analyzed data set and send it to a new destination for further processing. Which of the following issues will most likely occur?
- Question #5
The most likely concern with a one-feature, machine-learning model is high error due to:
- Question #7
A data scientist is clustering a data set but does not want to specify the number of clusters present. Which of the following algorithms should the data scientist use?
- Question #8
A data analyst wants to find the latitude and longitude of a mailing address. Which of the following is the best method to use?
- Question #9
Which of the following describes the appropriate use case for PCA?
- Question #10
A data scientist observes findings that indicate that as electrical grids in a country become more and more connected over time, the frequency of brownouts and blackouts in total d...
- Question #11
A data scientist is merging two tables. Table 1 contains employee IDs and roles. Table 2 contains employee IDs and team assignments. Which of the following is the best technique to...
- Question #12
Which of the following is a classic example of a constrained optimization problem?
- Question #13
A data scientist wants to digitize historical hard copies of documents. Which of the following is the best method for this task?
- Question #14
A data scientist trained a model for departments to share. The departments must access the model using HTTP requests. Which of the following approaches is appropriate?
- Question #15
Given the following: Which of the following time series models best represents this process?
- Question #16
Which of the following methods should a data scientist use just before switching to a potential replacement model?
- Question #17
A data scientist is presenting the recommendations from a months-long modeling and experiment process to the company's Chief Executive Officer. Which of the following is the best se...
- Question #18
A data scientist is developing a model to predict the outcome of a vote for a national mascot. The choice is between tigers and lions. The full data set represents feedback from in...
- Question #19
A data scientist is working with a data set that covers a two-year period for a large number of machines. The data set contains: - Machine system ID numbers - Sensor measurement va...
- Question #20
A data scientist is standardizing a large data set that contains website addresses. A specific string inside some of the web addresses needs to be extracted. Which of the following...
- Question #21
A model's results show increasing explanatory value as additional independent variables are added to the model. Which of the following is the most appropriate statistic?
- Question #22
A team is building a spam detection system. The team wants a probability-based identification method without complex, in-depth training from the historical data set. Which of the f...
- Question #23
A data scientist is using the following confusion matrix to assess model performance: The model is predicting whether a delivery truck will be able to make 200 scheduled delivery s...
- Question #24
The following graphic shows the results of an unsupervised, machine-learning clustering model: k is the number of clusters, and n is the processing time required to run the model....
- Question #25
Under perfect conditions, E. coli bacteria would cover the entire earth in a matter of days. Which of the following types of models is the best for explaining this type of growth?
- Question #26
Which of the following problem-solving approaches is a set of guidelines to handle highly variable and not fully apparent situations?
- Question #27
A data analyst is examining the correlation matrix of a new data set to identify issues that could adversely impact model performance. Which of the following is the analyst most li...
- Question #28
A data scientist is designing a real-time machine-learning model that classifies a user based on initial behavior. The run times of these models are provided in the following table...
- Question #29
A movie production company would like to find the actors appearing in its top movies using data from the tables below. The resulting data must show all movies in Table 1, enriched...
- Question #30
A data scientist is preparing to brief a non-technical audience that is focused on analysis and results. During the modeling process, the data scientist produced the following arti...
- Question #31
A data scientist has built a model that provides the likelihood of an error occurring in a factory. The historical accuracy of the model is 90%. At a specific factory, the model is...
- Question #32
An analyst is examining data from an array of temperature sensors and sees that one sensor consistently returns values that are much higher than the values from the other sensors....
- Question #33
A data scientist would like to model a complex phenomenon using a large data set composed of categorical, discrete, and continuous variables. After completing exploratory data anal...
- Question #34
Which of the following is best solved with graph theory?
- Question #35
Given these business requirements: - Needs to most efficiently move 3,000 boxes across a river - Has one boat that holds eight boxes, travels at ten nautical miles per hour, and ha...
- Question #36
A data scientist is analyzing a data set with categorical features and would like to make those features more useful when building a model. Which of the following data transformati...
- Question #37
Which of the following measures would a data scientist most likely use to calculate the similarity of two text strings?
- Question #38
Which of the following issues should a data scientist be most concerned about when generating a synthetic data set?
- Question #39
A data scientist is performing a linear regression and wants to construct a model that explains the most variation in the data. Which of the following should the data scientist max...
- Question #42
In a research project, Professor Smith is analyzing a large corpus of scientific articles. He wants to remove common words like "the," "is," and "a," which do not contribute much t...
- Question #43
One of the main differences between administrative and transactional data is ______.
- Question #44
For an imbalanced data set, why can accuracy be considered a misleading metric?
- Question #45
Xiaojing frequently watches romantic comedies. A movie recommender system uses this information to suggest other romantic comedies to her. Which of these approaches is the system u...
- Question #46
A data scientist is building an inferential model with a single predictor variable. A scatter plot of the independent variable against the real-number dependent variable shows a st...
- Question #47
A data scientist wants to evaluate the performance of various nonlinear models. Which of the following is best suited for this task?
- Question #48
Which of the following is the layer that is responsible for the depth in deep learning?
- Question #49
Which of the following modeling tools is appropriate for solving a scheduling problem?
- Question #50
Which of the following environmental changes is most likely to resolve a memory constraint error when running a complex model using distributed computing?
- Question #51
A data analyst wants to save a newly analyzed data set to a local storage option. The data set must meet the following requirements: - Be minimal in size - Have the ability to be i...
- Question #52
Which of the following is a key difference between KNN and k-means machine-learning techniques?
- Question #53
A data scientist needs to: - Build a predictive model that gives the likelihood that a car will get a flat tire. - Provide a data set of cars that had flat tires and cars that did...