New Year Sale Special Limited Time 70% Discount Offer - Ends in 0d 00h 00m 00s - Coupon code: xmas50

CompTIA DA0-001 - CompTIA Data+ Certification Exam

Page: 7 / 12
Total 396 questions

Given the following data:

CustomerID

ItemBought

Date

Tre_234

Sofa

2022-09-08

216_Tre

Shoes

08/02/2021

215/Tre

Blanket

2021/06/20

045/Tre

Mug

12-26-2021

Tre-345

Lamp

31/08/2022

TREJD19

Bucket

2022'08/01

Which of the following best describes the main issue in the data set?

A.

Inconsistent data

B.

Data mismatch

C.

Invalid data

D.

Redundant data

Which of the following is used for calculations and pivot tables?

A.

IBM SPSS

B.

SAS

C.

Microsoft Excel

D.

Domo

A data analyst has been asked to derive a new variable labeled “Promotion_flag” based on the total quantity sold by each salesperson. Given the table below:

Which of the following functions would the analyst consider appropriate to flag “Yes” for every salesperson who has a number above 1,000,000 in the Quantity_sold column?

A.

Date

B.

Mathematical

C.

Logical

D.

Aggregate

An e-commerce company recently tested a new website layout. The website was tested by a test group of customers, and an old website was presented to a control group. The table below shows the percentage of users in each group who made purchases on the websites:

Which of the following conclusions is accurate at a 95% confidence interval?

A.

In Germany, the increase in conversion from the new layout was not significant.

B.

In France, the increase in conversion from the new layout was not significant.

C.

In general, users who visit the new website are more likely to make a purchase.

D.

The new layout has the lowest conversion rates in the United Kingdom.

Which of the following types of analysis would be best for an analyst to use to examine the relationships between authors who cited other authors in a library of research papers?

A.

Linguistic analysis

B.

Trend analysis

C.

Link analysis

D.

Performance analysis

During data cleansing, an analyst conducts measures of central tendency on a data set. Which of the following data is the analyst attempting to identify?

A.

Duplicate

B.

Missing

C.

Outlying

D.

Invalid

The total values in this month's revenue report are twice as much as last month's. Which of the following most likely occurred during the ETL process?

A.

The data cleansing processes failed to execute.

B.

The database connectivity failed.

C.

The report included the previous month's data.

D.

The data normalization processes failed.

Which of the following are the first steps a company should take after discovering a data breach? (Select two).

A.

Delete data.

B.

Notify affected users.

C.

Assess the breach.

D.

Back up the system.

E.

Issue a press release.

F.

Delay reporting.

Which of the following is the best variable formal to store a customer's age using the least possible amount of storage data?

A.

Int

B.

Float

C.

Char

D.

Double

Which of the following is the most appropriate to consider when creating a schema of a central group broken into detailed subcategories?

A.

Relational

B.

Hierarchical

C.

Snowflake

D.

Star