Databricks Databricks-Machine-Learning-Professional - Databricks Certified Machine Learning Professional

A machine learning engineer has developed a random forest model using scikit-learn, logged the model using MLflow as random_forest_model, and stored its run ID in the run_id Python variable. They now want to deploy that model by performing batch inference on a Spark DataFrame spark_df.

Which of the following code blocks can they use to create a function called predict that they can use to complete the task?

A)

B)

It is not possible to deploy a scikit-learn model on a Spark DataFrame.

C)

D)

E)

A.

Option A

B.

Option B

C.

Option C

D.

Option D

E.

Option E
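
For reference, batch inference with an MLflow-logged model on a Spark DataFrame is commonly done through a Spark UDF. The following is a minimal sketch, not a reconstruction of any of the missing option images. It assumes the model was logged under the artifact path random_forest_model, as the question states. The helper names model_uri_for_run and make_predict_udf are hypothetical. MLflow is imported lazily so the URI helper can be used without it installed.

```python
def model_uri_for_run(run_id, artifact_path="random_forest_model"):
    """Build a runs:/ URI pointing at a model logged under a given run."""
    return f"runs:/{run_id}/{artifact_path}"


def make_predict_udf(spark, run_id):
    """Wrap the logged model as a Spark UDF (requires mlflow and pyspark)."""
    import mlflow.pyfunc  # imported here so the URI helper has no hard dependency

    return mlflow.pyfunc.spark_udf(spark, model_uri=model_uri_for_run(run_id))


# Intended use on a cluster with an active SparkSession named `spark`:
#   predict = make_predict_udf(spark, run_id)
#   scored_df = spark_df.withColumn("prediction", predict(*spark_df.columns))
```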

A data scientist has created a Python function compute_features that returns a Spark DataFrame with the following schema:

The resulting DataFrame is assigned to the features_df variable. The data scientist wants to create a Feature Store table using features_df.

Which of the following code blocks can they use to create and populate the Feature Store table using the Feature Store Client fs?

A.

B.

C.

features_df.write.mode("fs").path("new_table")

D.

E.

features_df.write.mode("feature").path("new_table")
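
As a point of comparison, the Databricks Feature Store client exposes a create_table method that can create a table and populate it in one call when a DataFrame is passed as df. The sketch below assumes that behavior. The schema is not shown in the question, so the primary key column name used in the usage comment is purely hypothetical, as is the helper name create_feature_table.

```python
def create_feature_table(fs, features_df, table_name, primary_keys, description=""):
    """Create a Feature Store table and write features_df into it.

    `fs` is expected to be a FeatureStoreClient (or any object with a
    compatible create_table method). Passing `df` lets the client take
    the table schema from the DataFrame and insert its rows.
    """
    return fs.create_table(
        name=table_name,
        primary_keys=primary_keys,
        df=features_df,
        description=description,
    )


# Intended use on Databricks (the key column "customer_id" is an assumption):
#   from databricks.feature_store import FeatureStoreClient
#   fs = FeatureStoreClient()
#   create_feature_table(fs, features_df, "new_table", ["customer_id"])
```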

A machine learning engineer wants to programmatically create a new Databricks Job whose schedule depends on the result of some automated tests in a machine learning pipeline.

Which of the following Databricks tools can be used to programmatically create the Job?

A.

MLflow APIs

B.

AutoML APIs

C.

MLflow Client

D.

Jobs cannot be created programmatically

E.

Databricks REST APIs
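
To make the scenario concrete, here is a rough sketch of creating a job through the Databricks Jobs REST API (the /api/2.1/jobs/create endpoint), with the schedule chosen from a test result. The workspace host, token, notebook path, cluster ID, and cron expression are all placeholders, and build_create_job_request is a hypothetical helper. The request is only constructed here, not sent.

```python
import json
import urllib.request


def build_create_job_request(host, token, job_name, notebook_path, cluster_id, cron=None):
    """Build (but do not send) a POST request that creates a Databricks Job."""
    payload = {
        "name": job_name,
        "tasks": [
            {
                "task_key": "main",
                "notebook_task": {"notebook_path": notebook_path},
                "existing_cluster_id": cluster_id,
            }
        ],
    }
    # Only attach a schedule when the caller decided one is appropriate.
    if cron is not None:
        payload["schedule"] = {"quartz_cron_expression": cron, "timezone_id": "UTC"}
    return urllib.request.Request(
        url=f"{host.rstrip('/')}/api/2.1/jobs/create",
        data=json.dumps(payload).encode("utf-8"),
        headers={"Authorization": f"Bearer {token}", "Content-Type": "application/json"},
        method="POST",
    )


# Example: schedule nightly only if the pipeline's automated tests passed.
tests_passed = True  # placeholder for the real test outcome
request = build_create_job_request(
    "https://example.cloud.databricks.com",
    "<token>",
    "retrain-model",
    "/Repos/ml/retrain",
    "<cluster-id>",
    cron="0 0 2 * * ?" if tests_passed else None,
)
# urllib.request.urlopen(request) would submit it to a real workspace.
```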

A data scientist has written a function to track the runs of their random forest model. The data scientist is changing the number of trees in the forest across each run.

Which of the following MLflow operations is designed to log single values like the number of trees in a random forest?

A.

mlflow.log_artifact

B.

mlflow.log_model

C.

mlflow.log_metric

D.

mlflow.log_param

E.

There is no way to store values like this.
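
For orientation, MLflow distinguishes between parameters (inputs to a run, such as a hyperparameter) and metrics (numeric results of a run). The sketch below shows both calls side by side. The function name log_forest_run and the accuracy value are illustrative assumptions. The tracking argument exists only so the function can be exercised without MLflow installed, and by default the real mlflow module is used.

```python
def log_forest_run(n_estimators, accuracy, tracking=None):
    """Record one training run: a hyperparameter and a resulting score."""
    if tracking is None:
        import mlflow as tracking  # default to the real MLflow module

    with tracking.start_run():
        # A run input that is set before training: logged as a parameter.
        tracking.log_param("n_estimators", n_estimators)
        # A numeric outcome measured after training: logged as a metric.
        tracking.log_metric("accuracy", accuracy)


# Intended use with MLflow installed:
#   for n in (50, 100, 200):
#       log_forest_run(n, accuracy=evaluate(n))  # evaluate() is hypothetical
```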

A machine learning engineer is attempting to create a webhook that will trigger a Databricks Job job_id when a model version for model model transitions into any MLflow Model Registry stage.

They have the following incomplete code block:

Which of the following lines of code can be used to fill in the blank so that the code block accomplishes the task?

A.

"MODEL_VERSION_CREATED"

B.

"MODEL_VERSION_TRANSITIONED_TO_PRODUCTION"

C.

"MODEL_VERSION_TRANSITIONED_TO_STAGING"

D.

"MODEL_VERSION_TRANSITIONED_STAGE"

E.

"MODEL_VERSION_TRANSITIONED_TO_STAGING", "MODEL_VERSION_TRANSITIONED_TO_PRODUCTION"

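Since the incomplete code block from the question is not available, the following is only a sketch of what a job-triggering registry webhook definition might look like when sent to the Databricks registry webhooks REST endpoint (POST /api/2.0/mlflow/registry-webhooks/create). The event name shown is the one Databricks documents for stage transitions of any kind. The helper name, workspace URL, and token are placeholders.

```python
def build_webhook_payload(model_name, job_id, workspace_url, access_token):
    """Describe a webhook that runs a Databricks Job on any stage transition."""
    return {
        "model_name": model_name,
        # Fires whenever a model version changes stage, regardless of target stage.
        "events": ["MODEL_VERSION_TRANSITIONED_STAGE"],
        "description": "Trigger a job when a model version changes stage",
        "status": "ACTIVE",
        "job_spec": {
            "job_id": job_id,
            "workspace_url": workspace_url,
            "access_token": access_token,
        },
    }


payload = build_webhook_payload(
    "model", "123", "https://example.cloud.databricks.com", "<token>"
)
```
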
A data scientist has developed a scikit-learn random forest model model, but they have not yet logged model with MLflow. They want to obtain the input schema and the output schema of the model so they can document what type of data is expected as input.

Which of the following MLflow operations can be used to perform this task?

A.

mlflow.models.schema.infer_schema

B.

mlflow.models.signature.infer_signature

C.

mlflow.models.Model.get_input_schema

D.

mlflow.models.Model.signature

E.

There is no way to obtain the input schema and the output schema of an unlogged model.

A machine learning engineer has registered a sklearn model in the MLflow Model Registry using the sklearn model flavor with URI model_uri.

Which of the following operations can be used to load the model as an sklearn object for batch deployment?

A.

mlflow.spark.load_model(model_uri)

B.

mlflow.pyfunc.read_model(model_uri)

C.

mlflow.sklearn.read_model(model_uri)

D.

mlflow.pyfunc.load_model(model_uri)

E.

mlflow.sklearn.load_model(model_uri)

Which of the following tools can assist in real-time deployments by packaging software with its own application, tools, and libraries?

A.

Cloud-based compute

B.

None of these tools

C.

REST APIs

D.

Containers

E.

Autoscaling clusters