Weekend Sale Limited Time 70% Discount Offer - Ends in 0d 00h 00m 00s - Coupon code: xmas50

Databricks Databricks-Machine-Learning-Professional - Databricks Certified Machine Learning Professional

In a continuous integration, continuous deployment (CI/CD) process for machine learning pipelines, which of the following events commonly triggers the execution of automated testing?

A.

The launch of a new cost-efficient SQL endpoint

B.

CI/CD pipelines are not needed for machine learning pipelines

C.

The arrival of a new feature table in the Feature Store

D.

The launch of a new cost-efficient job cluster

E.

The arrival of a new model version in the MLflow Model Registry

Which of the following is a reason for using Jensen-Shannon (JS) distance over a Kolmogorov-Smirnov (KS) test for numeric feature drift detection?

A.

All of these reasons

B.

JS is not normalized or smoothed

C.

None of these reasons

D.

JS is more robust when working with large datasets

E.

JS does not require any manual threshold or cutoff determinations

Which of the following machine learning model deployment paradigms is the most common for machine learning projects?

A.

On-device

B.

Streaming

C.

Real-time

D.

Batch

E.

None of these deployments

A machine learning engineer is in the process of implementing a concept drift monitoring solution. They are planning to use the following steps:

1. Deploy a model to production and compute predicted values

2. Obtain the observed (actual) label values

3. _____

4. Run a statistical test to determine if there are changes over time

Which of the following should be completed as Step #3?

A.

Obtain the observed values (actual) feature values

B.

Measure the latency of the prediction time

C.

Retrain the model

D.

None of these should be completed as Step #3

E.

Compute the evaluation metric using the observed and predicted values

A machine learning engineer has developed a model and registered it using the FeatureStoreClient fs. The model has model URI model_uri. The engineer now needs to perform batch inference on customer-level Spark DataFrame spark_df, but it is missing a few of the static features that were used when training the model. The customer_id column is the primary key of spark_df and the training set used when training and logging the model.

Which of the following code blocks can be used to compute predictions for spark_df when the missing feature values can be found in the Feature Store by searching for features by customer_id?

A.

df = fs.get_missing_features(spark_df, model_uri)

fs.score_model(model_uri, df)

B.

fs.score_model(model_uri, spark_df)

C.

df = fs.get_missing_features(spark_df, model_uri)

fs.score_batch(model_uri, df)

df = fs.get_missing_features(spark_df)

D.

fs.score_batch(model_uri, df)

E.

fs.score_batch(model_uri, spark_df)

Which of the following MLflow operations can be used to automatically calculate and log a Shapley feature importance plot?

A.

mlflow.shap.log_explanation

B.

None of these operations can accomplish the task.

C.

mlflow.shap

D.

mlflow.log_figure

E.

client.log_artifact

A data scientist set up a machine learning pipeline to automatically log a data visualization with each run. They now want to view the visualizations in Databricks.

Which of the following locations in Databricks will show these data visualizations?

A.

The MLflow Model RegistryModel paqe

B.

The Artifacts section of the MLflow Experiment page

C.

Logged data visualizations cannot be viewed in Databricks

D.

The Artifacts section of the MLflow Run page

E.

The Figures section of the MLflow Run page

A machine learning engineer is manually refreshing a model in an existing machine learning pipeline. The pipeline uses the MLflow Model Registry model "project". The machine learning engineer would like to add a new version of the model to "project".

Which of the following MLflow operations can the machine learning engineer use to accomplish this task?

A.

mlflow.register_model

B.

MlflowClient.update_registered_model

C.

mlflow.add_model_version

D.

MlflowClient.get_model_version

E.

The machine learning engineer needs to create an entirely new MLflow Model Registry model

A machine learning engineer wants to move their model versionmodel_versionfor the MLflow Model Registry modelmodelfrom the Staging stage to the Production stage using MLflow Clientclient.

Which of the following code blocks can they use to accomplish the task?

A)

B)

C)

D)

E)

A.

Option A

B.

Option B

C.

Option C

D.

Option D

E.

option E

Which of the following is a simple statistic to monitor for categorical feature drift?

A.

Mode

B.

None of these

C.

Mode, number of unique values, and percentage of missing values

D.

Percentage of missing values

E.

Number of unique values