# Data+ v1.0

Page:    1 / 14
Exam contains 197 questions

An analyst has generated a report that includes the number of months in the first two quarters of 2019 when sales exceeded \$50,000:

Which of the following functions did the analyst use to generate the data in the Sales_indicator column?

• A. Aggregate
• B. Logical
• C. Date
• D. Sort

While reviewing survey data, an analyst notices respondents entered “Jan,” “January,” and “01” as responses for the month of January. Which of the following steps should be taken to ensure data consistency?

• A. Delete any of the responses that do not have “January” written out.
• B. Replace any of the responses that have “01”.
• C. Filter on any of the responses that do not say “January” and update them to “January”.
• D. Sort any of the responses that say “Jan” and update them to “01”.

Which of the following data cleansing issues will be fixed when a DISTINCT function is applied?

• A. Missing data
• B. Duplicate data
• C. Redundant data
• D. Invalid data

A county in Illinois is conducting a survey to determine the mean annual income per household. The county is 427sq mi (2.65q km). Which of the following sampling methods would MOST likely result in a representative sample?

• A. A stratified phone survey of 100 people that is conducted between 2:00 p.m. and 3:00 p.m.
• B. A systematic survey that is sent to 100 single-family homes in the county
• C. Surveys sent to ten randomly selected homes within 5mi (8km) of the county’s office
• D. Surveys sent to 100 randomly selected homes that are reflective of the population

Which of the following statistical methods requires two or more categorical variables?

• A. Simple linear regression
• B. Chi-squared test
• C. Z-test
• D. Two-sample t-test

Which of the following data manipulation techniques is an example of a logical function?

• A. WHERE
• B. AGGREGATE
• C. BOOLEAN
• D. IF

A sales team wants visibility of current sales numbers, pipeline, and team performance. The team would also like to see calculations of individuals’ earned commissions and projected commissions based on sales, but they want that information to be kept confidential. Which of the following would be the BEST way to provide this visibility?

• A. Create a dashboard displaying a data refresh date so users know the current sales numbers and configure permissions to control access.
• B. Create a dashboard for sales numbers, pipeline, and team and individual performance for the management team.
• C. Create a dashboard with filters for the overall team, individuals, and management. Users can filter to see the data they want.
• D. Create a dashboard with views for team, individuals, and management. Configure permissions to control access.

Which of the following is a characteristic of a relational database?

• A. It utilizes key-value pairs.
• B. It has undefined fields.
• C. It is structured in nature.
• D. It uses minimal memory.

A data analyst is asked on the morning of April 9, 2020, to create a sales report that identifies sales year to date. The daily sales data is current through the end of the day. Which of the following date ranges should be on the report?

• A. January 1, 2020 to April 1, 2020
• B. January 1, 2020 to April 7, 2020
• C. January 1, 2020 to April 8, 2020
• D. January 1, 2020 to April 9, 2020

Given the following data tables:

Which of the following MDM processes needs to take place FIRST?

• A. Creation of a data dictionary
• B. Compliance with regulations
• C. Standardization of data field names
• D. Consolidation of multiple data fields

Which of the following is used for calculations and pivot tables?

• A. IBM SPSS
• B. SAS
• C. Microsoft Excel
• D. Domo

Given the following report:

Which of the following components need to be added to ensure the report is point-in-time and static? (Choose two.)

• A. A control group for the phrases
• B. A summary of the KPIs
• C. Filter buttons for the status
• D. The date when the report was last accessed
• E. The time period the report covers
• F. The date on which the report was run

An analyst has been asked to validate data quality. Which of the following are the BEST reasons to validate data for quality control purposes? (Choose two.)

• A. Retention
• B. Integrity
• C. Transmission
• D. Consistency
• E. Encryption
• F. Deletion

A research analyst wants to determine whether the data being analyzed is connected to other datapoints. Which of the following is the BEST type of analysis to conduct?

• A. Trend analysis
• B. Performance analysis
• D. Exploratory analysis

Which of the following variable name formats would be problematic if used in the majority of data software programs?

• A. First_Name_
• B. FirstName
• C. First_Name
• D. First Name