Weekend Sale Limited Time 70% Discount Offer - Ends in 0d 00h 00m 00s - Coupon code: xmas50

EMC E20-065 - Advanced Analytics Specialist Exam for Data Scientists

Page: 2 / 2
Total 66 questions

What do lemmatization and stemming have in common?

A.

Use WordNet

B.

Remove common words in a natural language

C.

Reduce the high dimensionality in text

D.

Use a set of heuristics

Given an input vector of features, a Random Forests model performs a classification task and ends in a tie.

How does the model handle this outcome?

A.

The model will be rebuilt

B.

A winner is chosen at random

C.

The tree that caused the tie is discarded

D.

One more tree is added to the forest

What is an effective use of color in visualization?

A.

Use self-explanatory colors so a legend is unnecessary

B.

Maximize use of color to make a more lasting impression

C.

Use high contrast colors such as red and blue

D.

Minimize use of color except for emphasis

What is the maximum number of edges in an undirected graph of 10 nodes?

A.

45

B.

90

C.

100

D.

9

What is a property of a good color model for ordinal data?

A.

Uses a rainbow-like color map for distinction of categories

B.

Uses a rainbow-like color map for ease of display and printing

C.

Uses perceptually ordinal colors with just-noticeable increments

D.

Uses perceptually ordinal colors with linear, perceptual increments

What is a characteristic of lemmatization?

A.

Can be performed by calling the synset () function on a lemma in LNTK

B.

Can be performed by calling the lemma() function on a synset in LNTK

C.

Reduces words of variant forms to their base forms based on a set of heuristics

D.

Reduces words of variant forms to their base forms based on a dictionary

After a client submits a job request to the YARN ResourceManager, what happens next?

A.

The scheduler allocates a container to run an ApplicationMaster

B.

The ResourceManager allocates containers to run map and reduce tasks

C.

The Resource Manager requests load data from the NodeManagers

D.

The ApplicationManager starts an ApplicationMaster

What is a random subspace of features, as used by Random Forests?

A.

A random subset of features that are chosen at each split in the decision tree

B.

Filtration of data that does not meet a pre-defined weighting thrsehold

C.

The creation of out-of-bag (OOB) data that is used to select features

D.

Removal of highly correlated variables to randomize the features

How does Latent Dinchlet Allocation (LDA) interpret a document?

A.

As a single-predefined topic

B.

As a mixture of pre-defined topics

C.

As having a mixture of sentiments

D.

As having a single pre-defined sentiment