Snowflake DSA-C02 - SnowPro Advanced: Data Scientist Certification Exam

Snowflake DSA-C02 Premium Access Download Demo

Page: 1 / 2
Total 65 questions

Question # 1

Select the correct mappings:

I. W Weights or Coefficients of independent variables in the Linear regression model --> Model Pa-rameter

II. K in the K-Nearest Neighbour algorithm --> Model Hyperparameter

III. Learning rate for training a neural network --> Model Hyperparameter

IV. Batch Size --> Model Parameter

I,II

I,II,III

III,IV

II,III,IV

Explanation:

Explanation

Hyperparameters in Machine learning are those parameters that are explicitly defined by the user to control the learning process. These hyperparameters are used to improve the learning of the model, and their values are set before starting the learning process of the model.

What are hyperparameters?

In Machine Learning/Deep Learning, a model is represented by its parameters. In contrast, a training process involves selecting the best/optimal hyperparameters that are used by learning algorithms to provide the best result. So, what are these hyperparameters? The answer is, "Hyperparameters are defined as the parameters that are explicitly defined by the user to control the learning process."

Here the prefix "hyper" suggests that the parameters are top-level parameters that are used in con-trolling the learning process. The value of the Hyperparameter is selected and set by the machine learning engineer before the learning algorithm begins training the model. Hence, these are external to the model, and their values cannot be changed during the training process.

Some examples of Hyperparameters in Machine Learning

Â· The k in kNN or K-Nearest Neighbour algorithm

Â· Learning rate for training a neural network

Â· Train-test split ratio

Â· Batch Size

Â· Number of Epochs

Â· Branches in Decision Tree

Â· Number of clusters in Clustering Algorithm

Model Parameters:

Model parameters are configuration variables that are internal to the model, and a model learns them on its own. For example, W Weights or Coefficients of independentvariables in the Linear regression model. or Weights or Coefficients of independent variables in SVM, weight, and biases of a neural network, cluster centroid in clustering. Some key points for model parameters are as follows:

They are used by the model for making predictions.

Â· They are learned by the model from the data itself

Â· These are usually not set manually.

Â· These are the part of the model and key to a machine learning Algorithm.

Model Hyperparameters:

Hyperparameters are those parameters that are explicitly defined by the user to control the learning process. Some key points for model parameters are as follows:

These are usually defined manually by the machine learning engineer.

One cannot know the exact best value for hyperparameters for the given problem. The best value can be determined either by the rule of thumb or by trial and error.

Some examples of Hyperparameters are the learning rate for training a neural network, K in the KNN algorithm.

Question # 2

Which of the following process best covers all of the following characteristics?

Â· Collecting descriptive statistics like min, max, count and sum.

Â· Collecting data types, length and recurring patterns.

Â· Tagging data with keywords, descriptions or categories.

Â· Performing data quality assessment, risk of performing joins on the data.

Â· Discovering metadata and assessing its accuracy.

Identifying distributions, key candidates, foreign-key candidates,functional dependencies, embedded value dependencies, and performing inter-table analysis.

Data Visualization

Data Virtualization

Data Profiling

Data Collection

Question # 3

Which of the Following is not type of Windows function in Snowflake?

Rank-related functions.

Window frame functions.

Aggregation window functions.

Association functions.

Question # 4

Which of the learning methodology applies conditional probability of all the variables with respec-tive the dependent variable?

Reinforcement learning

Unsupervised learning

Artificial learning

Supervised learning

Question # 5

How do you handle missing or corrupted data in a dataset?

Drop missing rows or columns

Replace missing values with mean/median/mode

Assign a unique category to missing values

All of the above

Question # 6

To return the contents of a DataFrame as a Pandas DataFrame, Which of the following method can be used in SnowPark API?

REPLACE_TO_PANDAS

SNOWPARK_TO_PANDAS

CONVERT_TO_PANDAS

TO_PANDAS

Question # 7

What Can Snowflake Data Scientist do in the Snowflake Marketplace as Consumer?

Discover and test third-party data sources.

Receive frictionless access to raw data products from vendors.

Combine new datasets with your existing data in Snowflake to derive new business in-sights.

Use the business intelligence (BI)/ML/Deep learning tools of her choice.

Question # 8

The most widely used metrics and tools to assess a classification model are:

Confusion matrix

Cost-sensitive accuracy

Area under the ROC curve

All of the above

Question # 9

Mark the correct steps for saving the contents of a DataFrame to aSnowflake table as part of Moving Data from Spark to Snowflake?

Step 1.Use the PUT() method of the DataFrame to construct a DataFrameWriter.

Step 2.Specify SNOWFLAKE_SOURCE_NAME using the NAME() method.

Step 3.Use the dbtable option to specify the table to which data is written.

Step 4.Specify the connector options using either the option() or options() method.

Step 5.Use the save() method to specify the save mode for the content.

Step 1.Use the PUT() method of the DataFrame to construct a DataFrameWriter.

Step 2.Specify SNOWFLAKE_SOURCE_NAME using the format() method.

Step 3.Specify the connector options using either the option() or options() method.

Step 4.Use the dbtable option to specify the table to which data is written.

Step 5.Use the save() method to specify the save mode for the content.

Step 1.Use the write() method of the DataFrame to construct a DataFrameWriter.

Step 2.Specify SNOWFLAKE_SOURCE_NAME using the format() method.

Step 3.Specify the connector options using either the option() or options() method.

Step 4.Use the dbtable option to specify the table to which data is written.

Step 5.Use the mode() method to specify the save mode for the content.

(Correct)

Step 1.Use the writer() method of the DataFrame to construct a DataFrameWriter.

Step 2.Specify SNOWFLAKE_SOURCE_NAME using the format() method.

Step 3.Use the dbtable option to specify the table to which data is written.

Step 4.Specify the connector options using either the option() or options() method.

Step 5.Use the save() method to specify the save mode for the content.

Question # 10

You previously trained a model using a training dataset. You want to detect any data drift in the new data collected since the model was trained.

What should you do?

Create a new dataset using the new data and a timestamp column and create a data drift monitor that uses the training dataset as a baseline and the new dataset as a target.

Create a new version of the dataset using only the new data and retrain the model.

Add the new data to the existing dataset and enable Application Insights for the service where the model is deployed.

Retrained your training dataset after correcting data outliers & no need to introduce new data.

Spring Sale Special Limited Time 70% Discount Offer - Ends in 0d 00h 00m 00s - Coupon code: xmas50

Snowflake DSA-C02 - SnowPro Advanced: Data Scientist Certification Exam

The Answer Is:

Explanation:

The Answer Is:

Explanation:

The Answer Is:

Explanation:

The Answer Is:

Explanation:

The Answer Is:

The Answer Is:

Explanation:

The Answer Is:

Explanation:

The Answer Is:

The Answer Is:

Explanation:

The Answer Is:

Explanation: