MLS-C01 Exam Questions
388 real MLS-C01 exam questions with expert-verified answers and explanations. Page 2 of 8.
- Question #51: Modeling
A company wants to classify user behavior as either fraudulent or normal. Based on internal research, a Machine Learning Specialist would like to build a binary classifier based on...
Machine Learning Models, Classification, Recall, Model Selection
- Question #52: Machine Learning Implementation and Operations
A Machine Learning Specialist kicks off a hyperparameter tuning job for a tree-based ensemble model using Amazon SageMaker with Area Under the ROC Curve (AUC) as the objective metr...
Hyperparameter Tuning, Amazon SageMaker, MLOps, Cost Optimization
- Question #53: Data Engineering
A Machine Learning Specialist is creating a new natural language processing application that processes a dataset composed of 1 million sentences. The aim is to then run Word2Vec t...
NLP, Text Preprocessing, Word Embeddings, Data Preparation
- Question #54: Modeling
A Data Scientist is evaluating different binary classification models. A false positive result is 5 times more expensive (from a business perspective) than a false negative result....
Confusion Matrix, Classification Metrics, Cost-sensitive Learning, Model Evaluation
- Question #55: Modeling
A Data Scientist uses logistic regression to build a fraud detection model. While the model accuracy is 99%, 90% of the fraud cases are not detected by the model. What action will...
Classification Threshold, Recall, Logistic Regression, Imbalanced Data
- Question #56: Modeling
A Machine Learning Specialist is building a model to predict future employment rates based on a wide range of economic factors. While exploring the data, the Specialist notices that...
Feature Scaling, Data Preprocessing, Normalization, Standardization
- Question #57: Data Engineering
A Machine Learning Specialist must build out a process to query a dataset on Amazon S3 using Amazon Athena. The dataset contains more than 800,000 records stored as plaintext CSV f...
Amazon Athena, Apache Parquet, Query Optimization, Data Storage Formats
- Question #58: Data Engineering
A Machine Learning Specialist is developing a daily ETL workflow containing multiple ETL jobs. The workflow consists of the following processes: - Start the workflow as soon as dat...
ETL Workflow, AWS Step Functions, AWS Glue, Data Orchestration
- Question #59: Modeling
An agency collects census information within a country to determine healthcare and social program needs by province and city. The census form collects responses for approximately 5...
Dimensionality Reduction, Clustering, Unsupervised Learning, Algorithm Selection
- Question #60: Modeling
A large consumer goods manufacturer has the following products on sale: - 34 different toothpaste variants - 48 different toothbrush variants - 43 different mouthwash variants The...
Time Series Forecasting, DeepAR, Cold Start Problem, AWS SageMaker
- Question #61: ML Implementation and Operations
A Machine Learning Specialist uploads a dataset to an Amazon S3 bucket protected with server-side encryption using AWS KMS. How should the ML Specialist define the Amazon SageMake...
IAM Roles, S3 Permissions, KMS Encryption, SageMaker Access
- Question #62: Modeling
A company is interested in building a fraud detection model. Currently, the Data Scientist does not have a sufficient amount of information due to the low number of fraud cases. Wh...
Imbalanced Datasets, Oversampling, SMOTE, Data Preprocessing
- Question #63: Modeling
A Machine Learning Engineer is preparing a data frame for a supervised learning task with the Amazon SageMaker Linear Learner algorithm. The ML Engineer notices the target label cl...
Missing Value Imputation, Data Preprocessing, Bias Reduction, Feature Engineering
- Question #64: Data Engineering
A Machine Learning Specialist has completed a proof of concept for a company using a small data sample, and now the Specialist is ready to implement an end-to-end solution in AWS u...
Data Ingestion, ETL, Amazon S3, Amazon SageMaker
- Question #65: Modeling
A Machine Learning Specialist receives customer data for an online shopping website. The data includes demographics, past visits, and locality information. The Specialist must deve...
Recommendation Systems, Collaborative Filtering, Machine Learning Algorithms, Customer Behavior Analysis
- Question #66: Modeling
A Machine Learning Specialist is working with a large company to leverage machine learning within its products. The company wants to group its customers into categories based on wh...
Classification, Supervised Learning, Churn Prediction, ML Problem Types
- Question #67: Modeling
The displayed graph is from a forecasting model for testing a time series. Considering the graph only, which conclusion should a Machine Learning Specialist make about the behavior...
Time Series Forecasting, Model Evaluation, Trend Analysis, Seasonality Analysis
- Question #68: Modeling
A company wants to classify user behavior as either fraudulent or normal. Based on internal research, a Machine Learning Specialist would like to build a binary classifier based on...
Model Selection, Binary Classification, Support Vector Machine (SVM), Non-linear Kernels
- Question #69: Machine Learning Implementation and Operations
A company has collected customer comments on its products, rating them as safe or unsafe, using decision trees. The training dataset has the following features: id, date, full revi...
Missing Data Handling, Text Preprocessing, Data Imputation, Test Data Preparation
- Question #70: Modeling
An insurance company needs to automate claim compliance reviews because human reviews are expensive and error-prone. The company has a large set of claims and a compliance label fo...
Natural Language Processing, Feature Extraction, Text Embeddings, SageMaker Built-in Algorithms
- Question #71: Machine Learning Implementation and Operations
A company is running a machine learning prediction service that generates 100 TB of predictions every day. A Machine Learning Specialist must generate a visualization of the daily...
MLOps, Machine Learning Monitoring, Big Data Processing, Data Visualization
- Question #72: Data Engineering
A Machine Learning Specialist is preparing data for training on Amazon SageMaker. The Specialist is using one of the SageMaker built-in algorithms for the training. The dataset is...
SageMaker data formats, Data optimization, Training performance, RecordIO Protobuf
- Question #73: Modeling
A Machine Learning Specialist is required to build a supervised image-recognition model to identify a cat. The ML Specialist performs some tests and records the following results f...
Data Augmentation, Image Classification, Model Robustness, Test Error Analysis
- Question #74: Data Engineering
A Machine Learning Specialist needs to be able to ingest streaming data and store it in Apache Parquet files for exploration and analysis. Which of the following services would bot...
Streaming Data, Kinesis Firehose, Apache Parquet, Data Ingestion
- Question #75: Modeling
A Data Scientist is developing a machine learning model to classify whether a financial transaction is fraudulent. The labeled data available for training consists of 100,000 non-...
XGBoost, Class Imbalance, Hyperparameter Tuning, Evaluation Metrics
- Question #76: Machine Learning Implementation and Operations
A Machine Learning Specialist is assigned a TensorFlow project using Amazon SageMaker for training, and needs to continue working for an extended period with no Wi-Fi access. Which...
Offline ML Development, SageMaker Environments, Docker, TensorFlow
- Question #77: Data Engineering
A Machine Learning Specialist at a company sensitive to security is preparing a dataset for model training. The dataset is stored in Amazon S3 and contains Personally Identifiable...
S3, VPC Endpoint, Security, Private Networking
- Question #78: Modeling
During mini-batch training of a neural network for a classification problem, a Data Scientist notices that training accuracy oscillates. What is the MOST likely cause of this issue...
Neural Networks, Hyperparameter Tuning, Learning Rate, Training Stability
- Question #79: ML Implementation and Operations
An employee found a video clip with audio on a company's social media feed. The language used in the video is Spanish. English is the employee's first language, and they do not und...
NLP, AWS AI Services, Sentiment Analysis, Speech-to-Text
- Question #80: Machine Learning Implementation and Operations
A Machine Learning Specialist is packaging a custom ResNet model into a Docker container so the company can leverage Amazon SageMaker for training. The Specialist is using Amazon E...
Docker containers, GPU acceleration, NVIDIA-Docker, SageMaker
- Question #81: Modeling
A Machine Learning Specialist is building a logistic regression model that will predict whether or not a person will order a pizza. The Specialist is trying to build the optimal mo...
ROC Curve, Model Evaluation, Binary Classification, Classification Thresholds
- Question #82: Modeling
A Data Scientist is building a model to predict customer churn using a dataset of 100 continuous numerical features. The Marketing team has not provided any insight about which fea...
Feature Selection, Regularization, Overfitting, Model Interpretability
- Question #83: Data Engineering
An aircraft engine manufacturing company is measuring 200 performance metrics in a time-series. Engineers want to detect critical manufacturing defects in near-real time during te...
Streaming Data, Real-time Analytics, AWS Kinesis, Data Ingestion
- Question #84: Machine Learning Implementation and Operations
A Machine Learning team runs its own training algorithm on Amazon SageMaker. The training algorithm requires external assets. The team needs to submit both its own algorithm code a...
Amazon SageMaker, Custom Algorithms, Containerization, Data Storage
- Question #85: Machine Learning Implementation and Operations
A Machine Learning Specialist wants to determine the appropriate setting for an endpoint automatic scaling SageMakerVariantInvocationsPerInstance configuration. The Specialist has...
SageMaker Endpoint Scaling, Automatic Scaling, Invocation Metrics, Safety Factor
- Question #86: Modeling
A company uses a long short-term memory (LSTM) model to evaluate the risk factors of a particular energy sector. The model reviews multi-page text documents to analyze each sentenc...
Natural Language Processing, Word Embeddings, Deep Learning, Model Optimization
- Question #87: Data Engineering
A Machine Learning Specialist needs to move and transform data in preparation for training. Some of the data needs to be processed in near-real time, and other data can be moved ho...
Data Ingestion, Real-time Data Processing, Batch Data Processing, Data Orchestration
- Question #88: ML Implementation and Operations
A Machine Learning Specialist previously trained a logistic regression model using scikit-learn on a local machine, and the Specialist now wants to deploy it to production for infe...
SageMaker Model Deployment, Custom Docker Containers, Model Inference, Scikit-learn
- Question #89: Data Engineering
A trucking company is collecting live image data from its fleet of trucks across the globe. The data is growing rapidly and approximately 100 GB of new data is generated every day....
Data Lake, Amazon S3, Data Storage for ML, IAM Access Control
- Question #90: Modeling
A credit card company wants to build a credit scoring model to help predict whether a new credit card applicant will default on a credit card payment. The company has collected dat...
Feature Engineering, Dimensionality Reduction, Principal Component Analysis, Autoencoders
- Question #91: Modeling
A Data Scientist is training a multilayer perceptron (MLP) on a dataset with multiple classes. The target class of interest is unique compared to the other classes within the datas...
Class Imbalance, Recall Optimization, Neural Networks, Loss Functions
- Question #92: Modeling
A Machine Learning Specialist works for a credit card processing company and needs to predict which transactions may be fraudulent in near-real time. Specifically, the Specialist m...
Machine Learning Problem Types, Binary Classification, Fraud Detection, Problem Framing
- Question #93: Modeling
A real estate company wants to create a machine learning model for predicting housing prices based on a historical dataset. The dataset contains 32 features. Which model will meet...
Regression Models, Supervised Learning, Model Selection, Predictive Modeling
- Question #94: Modeling
A Machine Learning Specialist is applying a linear least squares regression model to a dataset with 1,000 records and 50 features. Prior to training, the ML Specialist notices that...
Linear Regression, Multicollinearity, Singular Matrix, Model Training Issues
- Question #95: Modeling
Given the following confusion matrix for a movie classification model, what is the true class frequency for Romance and the predicted class frequency for Adventure?
Confusion Matrix, Classification Metrics, Model Evaluation, Frequency Calculation
- Question #96: ML Implementation and Operations
A Machine Learning Specialist wants to bring a custom algorithm to Amazon SageMaker. The Specialist implements the algorithm in a Docker container supported by Amazon SageMaker. Ho...
SageMaker Custom Containers, Docker ENTRYPOINT, ML Training Jobs, Container Packaging
- Question #97: Exploratory Data Analysis
A Data Scientist needs to analyze employment data. The dataset contains approximately 10 million observations on people across 10 different features. During the preliminary analysi...
Feature Transformation, Data Preprocessing, Skewed Data, Logarithmic Transformation
- Question #98: Modeling
A web-based company wants to improve its conversion rate on its landing page. Using a large historical dataset of customer visits, the company has repeatedly trained a multi-class...
Overfitting, Regularization, Deep Learning, Model Generalization
- Question #99: Exploratory Data Analysis
A Machine Learning Specialist is given a structured dataset on the shopping habits of a company's customer base. The dataset contains thousands of columns of data and hundreds of n...
t-SNE, Dimensionality Reduction, Data Visualization, Exploratory Data Analysis
- Question #100: Machine Learning Implementation and Operations
A Machine Learning Specialist is planning to create a long-running Amazon EMR cluster. The EMR cluster will have 1 master node, 10 core nodes, and 20 task nodes. To save on costs,...
Amazon EMR, Spot Instances, Cost Optimization, Cluster Architecture