Summer Sale Limited Time 65% Discount Offer - Ends in 0d 00h 00m 00s - Coupon code: ecus65

Snowflake DSA-C02 - SnowPro Advanced: Data Scientist Certification Exam

Page: 1 / 2
Total 65 questions

Select the correct mappings:

I. W Weights or Coefficients of independent variables in the Linear regression model --> Model Pa-rameter

II. K in the K-Nearest Neighbour algorithm --> Model Hyperparameter

III. Learning rate for training a neural network --> Model Hyperparameter

IV. Batch Size --> Model Parameter

A.

I,II

B.

I,II,III

C.

III,IV

D.

II,III,IV

Which of the following process best covers all of the following characteristics?

· Collecting descriptive statistics like min, max, count and sum.

· Collecting data types, length and recurring patterns.

· Tagging data with keywords, descriptions or categories.

· Performing data quality assessment, risk of performing joins on the data.

· Discovering metadata and assessing its accuracy.

Identifying distributions, key candidates, foreign-key candidates,functional dependencies, embedded value dependencies, and performing inter-table analysis.

A.

Data Visualization

B.

Data Virtualization

C.

Data Profiling

D.

Data Collection

Which of the Following is not type of Windows function in Snowflake?

A.

Rank-related functions.

B.

Window frame functions.

C.

Aggregation window functions.

D.

Association functions.

Which of the learning methodology applies conditional probability of all the variables with respec-tive the dependent variable?

A.

Reinforcement learning

B.

Unsupervised learning

C.

Artificial learning

D.

Supervised learning

How do you handle missing or corrupted data in a dataset?

A.

Drop missing rows or columns

B.

Replace missing values with mean/median/mode

C.

Assign a unique category to missing values

D.

All of the above

To return the contents of a DataFrame as a Pandas DataFrame, Which of the following method can be used in SnowPark API?

A.

REPLACE_TO_PANDAS

B.

SNOWPARK_TO_PANDAS

C.

CONVERT_TO_PANDAS

D.

TO_PANDAS

What Can Snowflake Data Scientist do in the Snowflake Marketplace as Consumer?

A.

Discover and test third-party data sources.

B.

Receive frictionless access to raw data products from vendors.

C.

Combine new datasets with your existing data in Snowflake to derive new business in-sights.

D.

Use the business intelligence (BI)/ML/Deep learning tools of her choice.

The most widely used metrics and tools to assess a classification model are:

A.

Confusion matrix

B.

Cost-sensitive accuracy

C.

Area under the ROC curve

D.

All of the above

Mark the correct steps for saving the contents of a DataFrame to aSnowflake table as part of Moving Data from Spark to Snowflake?

A.

Step 1.Use the PUT() method of the DataFrame to construct a DataFrameWriter.

Step 2.Specify SNOWFLAKE_SOURCE_NAME using the NAME() method.

Step 3.Use the dbtable option to specify the table to which data is written.

Step 4.Specify the connector options using either the option() or options() method.

Step 5.Use the save() method to specify the save mode for the content.

B.

Step 1.Use the PUT() method of the DataFrame to construct a DataFrameWriter.

Step 2.Specify SNOWFLAKE_SOURCE_NAME using the format() method.

Step 3.Specify the connector options using either the option() or options() method.

Step 4.Use the dbtable option to specify the table to which data is written.

Step 5.Use the save() method to specify the save mode for the content.

C.

Step 1.Use the write() method of the DataFrame to construct a DataFrameWriter.

Step 2.Specify SNOWFLAKE_SOURCE_NAME using the format() method.

Step 3.Specify the connector options using either the option() or options() method.

Step 4.Use the dbtable option to specify the table to which data is written.

Step 5.Use the mode() method to specify the save mode for the content.

(Correct)

D.

Step 1.Use the writer() method of the DataFrame to construct a DataFrameWriter.

Step 2.Specify SNOWFLAKE_SOURCE_NAME using the format() method.

Step 3.Use the dbtable option to specify the table to which data is written.

Step 4.Specify the connector options using either the option() or options() method.

Step 5.Use the save() method to specify the save mode for the content.

You previously trained a model using a training dataset. You want to detect any data drift in the new data collected since the model was trained.

What should you do?

A.

Create a new dataset using the new data and a timestamp column and create a data drift monitor that uses the training dataset as a baseline and the new dataset as a target.

B.

Create a new version of the dataset using only the new data and retrain the model.

C.

Add the new data to the existing dataset and enable Application Insights for the service where the model is deployed.

D.

Retrained your training dataset after correcting data outliers & no need to introduce new data.