What role in a data governance is typically responsible for day-to-day oversight of data use?
A. Data processors.
B. Data custodians
C. Data owners.
D. Data stewards.
Which of the following would be considered non-personally identifiable information?
A. Cell phone device name
B. Customer’s name
C. Government ID number
D. Telephone number
A financial institution is reporting on sales performance to a company at the account level. Due to the sensitive nature of the government the does il with, some account information is not shown. Which of the following fields should be masked?
A. Sales volume
B. Start date
C. Product name
D. Customer name
Which of the following programming languages are best suited for analysis and machine learning applications? (Select two).
A. Ruby
B. Rust
C. PHP
D. Python
E. Kotlin
F. R
F. R
A data analyst was asked to create a chart that shows the relationship between study
hours and exam scores for each student using the data sets in the table below:

Which of the following charts would BEST represent the relationship between the
variables?
A. A histogram
B. A scatter plot
C. A heat map
D. A bar chart
A data set for sales per month includes the following data:

Which of the following cleaning and profiling methods should be applied to the data set?
A. Data outliers
B. Invalid data
C. Duplicate data
D. Data type validation
You have two databases tables that you would like to join together using a foreign key
relationship.
What term best describes this action?
A. Blending.
B. Appending.
C. Mixing.
D. Merging.
Which of the following are reasons to conduct data cleansing? (Select two).
A. To perform web scraping
B. To track KPls
C. To improve accuracy
D. To review data sets
E. To increase the sample size
F. To calculate trends
F. To calculate trends
An analyst wants to test the association between the number of doors in a car and the number of gears in the car. Which of the following is the best test to use?
A. F-test
B. Acceptance test
C. Chi-squared test
D. Z-test
A marketing analytics team received customer transaction data from two different sources.
The data is complete and accurate; however, the field names appear to be inconsistent.
Given the following tables:

Which of the following is considered best practice if the team wants to consolidate the files
and conduct further analysis?
A. Standardize the field names.
B. Recode the data values.
C. Overwrite the field names in one of the tables.
D. Edit the field names in the data dictionary.
| Page 3 out of 13 Pages |