CompTIA DA0-002 - CompTIA Data+ Exam (2025)
An analyst needs to produce a final dataset using the following tables:
CourseID
SectionNumber
StudentID
MATH1000
1
10009
MATH1000
2
10007
PSYC1500
1
10009
PSYC1500
1
10015
StudentID
FirstName
LastName
10009
Jane
Smith
10007
John
Doe
10015
Robert
Roe
The expected output should be formatted as follows:
| CourseID | SectionNumber | StudentID | FirstName | LastName |
Which of the following actions is the best way to produce the requested output?
The human resources department wants to know the number of employees who earn $125,000 or more. However, the department is concerned about duplicates in the dataset. Given the following table:
Employee_ID
Level
Salary
001
1
10000
002
2
20000
003
2
256000
004
2
125000
001
1
10000
002
2
20000
Which of the following SQL statements resolves this issue?
A data professional wants to identify all customers who made a purchase in January. Given the following table:
CustomerID
Month
Sales
0001
January
13000
0002
March
10000
0003
April
23000
0004
May
10000
Which of the following types of functions should the professional use to flag the customers?
A data analyst needs to modify a dashboard that was created by another employee. Upon opening the dashboard, the analyst notices that the information is not loading properly. Which of the following should the analyst do to troubleshoot this error?
A data analyst is designing a report for the business review team. The team lists the following requirements for the report:
• Specific data points
• Color branding
• Labels and terminology
• Suggested charts and tables
Which of the following components is missing from the requirements?
A grocery store wants to view the revenue from the previous year, highlighting individual departments. Which of the following is the most appropriate chart to communicate this information?
A data analyst is analyzing the following dataset:
Transaction Date
Quantity
Item
Item Price
12/12/12
11
USB Cords
9.99
11/11/11
3
Charging Block
8.89
10/10/10
5
Headphones
50.15
Which of the following methods should the analyst use to determine the total cost for each transaction?
Which of the following is the most efficient to use when programming repeatable tasks?
A product goes viral on social media, creating high demand. Distribution channels are facing supply chain issues because the testing and training models that are used for sales forecasting have not encountered similar demand. Which of the following best describes this situation?
A data analyst needs to identify outliers from a given dataset. Which of the following visualizations is the best way to identify outliers?
