Black Friday Sale Special - Limited Time 70% Discount Offer - Ends in 0d 00h 00m 00s - Coupon code: mxmas70

Home > CompTIA > CompTIA Data+ > DA0-002

DA0-002 CompTIA Data+ Exam (2025) Question and Answers

Question # 4

Which of the following best explains the purpose of data lineage?

A.

To see the steps and path of data flow through different systems

B.

To better understand the granularity of data variable relationships

C.

To track data transformations from acquisition through reporting

D.

To look up data definitions, ensuring consistent use across business units

Full Access
Question # 5

Which of the following is a NoSQL database?

A.

PostgreSQL

B.

MySQL

C.

Oracle

D.

MongoDB

Full Access
Question # 6

Which of the following best describes the semi-structured data that is gathered when web scraping?

A.

JSON

B.

CSV

C.

CSS

D.

HTML

Full Access
Question # 7

A data analyst creates a report, and some of the fields are empty. Which of the following conditions should the analyst add to a query to provide a list of all the records with empty fields?

A.

WHERE [ColumnName] = NULL

B.

WHERE [ColumnName] IS NULL

C.

WHERE [ColumnName] IS NOT NULL

D.

WHERE [ColumnName] = 'NULL'

Full Access
Question # 8

A company gives users adequate data access permissions to allow them to fulfill their duties but nothing more. Which of the following concepts best describes this practice?

A.

Active Directory

B.

Hierarchical access

C.

Zero Trust

D.

Least privilege

Full Access
Question # 9

A report triggers an error that prevents information from being displayed. However, the report was functional before a database upgrade. Which of the following should a data analyst do first to troubleshoot the problem?

A.

Ensure the system has permissions for the report service.

B.

Change the report's refresh rate.

C.

Verify the connection to the database.

D.

Check whether the data structures were modified.

Full Access
Question # 10

A data analyst learns that a report detailing employee sales is reflecting sales only for the current month. Which of the following is the most likely cause?

A.

Lack of permissions

B.

An error in SQL code

C.

Report refresh failure

D.

Connectivity issues

Full Access
Question # 11

A senior manager needs a report that can be generated and accessed at any time. Which of the following delivery methods should a data analyst use?

A.

Ad hoc

B.

Dynamic

C.

Self-service

D.

Static

Full Access
Question # 12

Which of the following elements is the most important to include in a dashboard for internal technical audiences?

A.

Methodology section

B.

Dynamic features

C.

Key performance indicators

D.

Company branding

Full Access
Question # 13

A developer builds an online survey that requires all questions to have an answer. Which of the following inconsistencies does this setting prevent?

A.

Missing values

B.

Duplication

C.

Data corruption

D.

Completeness

Full Access
Question # 14

A table contains several rows of cellular numbers with call timestamps, call durations, called numbers, and carriers of the called number. Which of the following allows a data analyst to sort the cellular numbers based on the carriers of the called numbers and include the total call durations?

A.

SELECT cellular_number, called_number_carrier, SUM(call_duration) FROM calls GROUP BY cellular_number ORDER BY called_number_carrier;

B.

SELECT cellular_number, SUM(call_duration) FROM calls GROUP BY call_duration ORDER BY called_number_carrier;

C.

SELECT cellular_number, called_number_carrier, SUM(call_duration) FROM calls GROUP BY cellular_number, called_number_carrier ORDER BY called_number_carrier;

D.

SELECT cellular_number, called_number_carrier, SUM(call_duration) FROM calls GROUP BY call_duration ORDER BY called_number_carrier;

Full Access
Question # 15

While interacting with a data set, a data analyst learns that copies of the data are distributed across several data repositories around the globe. Which of the following concepts best describes this practice?

A.

Redundancy

B.

High availability

C.

Replication

D.

Duplication

Full Access
Question # 16

Which of the following best represents a type of infrastructure that requires a company to purchase and maintain all of its own servers?

A.

Private

B.

Cloud

C.

Hybrid

D.

Public

Full Access
Question # 17

A data analyst receives four files that need to be unified into a single spreadsheet for further analysis. All of the files have the same structure, number of columns, and field names, but each file contains different values. Which of the following methods will help the analyst convert the files into a single spreadsheet?

A.

Merging

B.

Appending

C.

Parsing

D.

Clustering

Full Access
Question # 18

A data analyst needs to provide a weekly sales report for the Chief Financial Officer. Which of the following delivery methods is the most appropriate?

A.

A granular daily report in a dashboard

B.

A detailed text document

C.

A spreadsheet with raw data

D.

A high-level email

Full Access
Question # 19

Which of the following explains the purpose of UAT?

A.

To begin the software application development process to enhance user experience

B.

To ensure all parts of the software application work together after each sprint

C.

To review software application crashes, create patches, and deploy to users

D.

To validate and verify that a software application meets the needs and requirements of users

Full Access
Question # 20

An analyst is building a reporting deck. The deck must include tracking and visualizing metrics and row-level security. Which of the following actions should the analyst take after meeting all of the requirements?

A.

Show a mock-up to the team.

B.

Explain the desired level of reporting detail.

C.

Present an analysis of the data.

D.

Find out the project due date.

Full Access
Question # 21

A data analyst wants to understand several datasets at the variable level. Which of the following should the analyst consult to find this information?

A.

Data lineage

B.

Data dictionary

C.

Data flow diagram

D.

Data versioning

Full Access
Question # 22

A business intelligence analyst is creating an employee retention dashboard that looks at data from the last five years. The analyst is interested in identifying patterns that can be studied further. Which of the following is the best method to apply to the dashboard?

A.

Predictive

B.

Prescriptive

C.

Diagnostic

D.

Descriptive

Full Access
Question # 23

The following SQL code returns an error in the program console:

SELECT firstName, lastName, SUM(income)

FROM companyRoster

SORT BY lastName, income

Which of the following changes allows this SQL code to run?

A.

SELECT firstName, lastName, SUM(income) FROM companyRoster HAVING SUM(income) > 10000000

B.

SELECT firstName, lastName, SUM(income) FROM companyRoster GROUP BY firstName, lastName

C.

SELECT firstName, lastName, SUM(income) FROM companyRoster ORDER BY firstName, income

D.

SELECT firstName, lastName, SUM(income) FROM companyRoster

Full Access
Question # 24

Which of the following best describes an assessment a data analyst would use to validate that the number of records in a dataset matches the expected results?

A.

Source control

B.

Unit test

C.

Stress test

D.

Health check

Full Access
Question # 25

A sales manager wants a dashboard that shows sales aggregated by region and identifies high-volume sales by salesperson per region. Which of the following communication techniques best displays this information?

A.

Defined parameters

B.

Filter options

C.

Level of detail

D.

User persona

Full Access
Question # 26

Which of the following is the most efficient to use when programming repeatable tasks?

A.

LLM

B.

Deep learning

C.

NLP

D.

RPA

Full Access
Question # 27

A data analyst is creating a pivot table for a large dataset for an upcoming board meeting. Which of the following is the purpose of the pivot table?

A.

To visualize the data in a dashboard

B.

To retrieve and clean data from several sources

C.

To summarize and analyze the data

D.

To organize the data for reporting

Full Access
Question # 28

A data professional wants to identify all customers who made a purchase in January. Given the following table:

CustomerID

Month

Sales

0001

January

13000

0002

March

10000

0003

April

23000

0004

May

10000

Which of the following types of functions should the professional use to flag the customers?

A.

Statistical

B.

Logical

C.

Mathematical

D.

Date

Full Access
Question # 29

Given the following tables:

Individual table

ID

FirstName

LastName

1

John

Doe

Output

ID

FullName

1

JohnDoe

Which of the following is the best option to display output from FirstName and LastName as FullName?

A.

Concatenate

B.

Filter

C.

Join

D.

Group

Full Access
Question # 30

A data analyst is following up on a recent, company-wide data audit of customer invoice data. Which of the following is the best option for the analyst to use?

A.

PCI DSS

B.

GDPR

C.

ISO

D.

PII

Full Access
Question # 31

A product goes viral on social media, creating high demand. Distribution channels are facing supply chain issues because the testing and training models that are used for sales forecasting have not encountered similar demand. Which of the following best describes this situation?

A.

Model bias

B.

Data drift

C.

Incorrect sizing

D.

Skewing

Full Access
Question # 32

A data analyst needs to remove all duplicate values between two tables, "Employees" and "Managers," using SQL SELECT statements. Which of the following should the analyst use for this task?

A.

SELECT * FROM Employees UNION ALL SELECT * FROM Managers

B.

SELECT * FROM Employees UNION SELECT * FROM Managers

C.

SELECT * FROM Employees JOIN SELECT * FROM Managers

D.

SELECT * FROM Employees CROSS JOIN SELECT * FROM Managers

Full Access
Question # 33

A data analyst is preparing a survey for Paralympic Games athletes. Which of the following should the analyst consider when creating this survey?

A.

Idioms

B.

Color contrast

C.

Refresh speed

D.

Granularity

Full Access
Question # 34

A data analyst is analyzing the following dataset:

Transaction Date

Quantity

Item

Item Price

12/12/12

11

USB Cords

9.99

11/11/11

3

Charging Block

8.89

10/10/10

5

Headphones

50.15

Which of the following methods should the analyst use to determine the total cost for each transaction?

A.

Parsing

B.

Scaling

C.

Compressing

D.

Deriving

Full Access
Question # 35

Which of the following is business intelligence software?

A.

SAS

B.

Python

C.

Notepad++

D.

Tableau

Full Access
Question # 36

A data analyst needs to join together a table data source and a web API data source using Python. Which of the following is the best way to accomplish this task?

A.

Convert the data from the API and database to a varchar format and convert them to pandas DataFrames that are then merged together.

B.

Convert the data from the API and database to a JSON format and convert them to pandas DataFrames that are then merged together.

C.

Convert the data from the API and database to a TXT format and convert them to pandas DataFrames that are then merged together.

D.

Convert the data from the API and database to a string format and convert them to pandas DataFrames that are then merged together.

Full Access