An analyst is designing a dashboard to determine which site has the highest percentage of new customers. The analyst must choose an appropriate chart to include in the dashboard. The following data is available:
Which of the following types of charts should be considered to best display the data?
An analyst runs a report on a daily basis, and the number of datapoints must be validated before the data can be analyzed. The number of datapoints increases each day by approximately 20% of the total number from the day before. On a given day, the number of datapoints was 8,798. Which of the following should be the total number of datapoints on the next day?
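The growth arithmetic in the question above can be checked with a short sketch. It assumes the 20% figure applies exactly to the previous day's total; the variable names are illustrative only.

```python
# Previous day's count, taken from the question text.
previous_day = 8_798
# Assumed exact daily growth rate (the question says "approximately 20%").
growth_rate = 0.20

# Next day's total = previous total plus 20% of the previous total.
next_day = previous_day * (1 + growth_rate)
print(round(next_day))
```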
A company wants to know how its customers interact with an e-commerce website based on clicks over items. Which of the following is the primary requirement for this report?
A data analyst has been asked to organize the table below in the following ways:
By sales from high to low
By state in alphabetical order
Which of the following functions will allow the data analyst to organize the table in this manner?
A research analyst collects ten data points from 1,000 specimens. The analyst will not need any additional data to complete the analysis and will not need to retrieve information by specifier. Which of the following is the best data structure for the analyst to use?
Which of the following is the BEST reason to use database views instead of tables?
A data analyst needs to create a data visualization that aids in understanding the cumulative impact of sequentially introduced values that are positive or negative. Which of the following data visualization methods should the analyst use?
Which of the following best describes the process of examining data for statistics and information about the data?
Given the following report:
Which of the following components need to be added to ensure the report is point-in-time and static? (Choose two.)
A data architect is designing a data solution for a retail clothing store chain. Each store has a database that tracks sales transactions. The data architect needs to create a summary table that will be used for a senior executive dashboard. The summary table should not contain duplicate store information. Which of the following should the data architect create?
An analyst conducted a preliminary analysis for a data set and identified several patterns and anomalies. Which of the following analysis techniques did the analyst use?
A data analyst needs to apply quality control concepts to a data set for accuracy. Which of the following is the best way to do this?
The current date is July 14, 2020. A data analyst has been asked to create a report that shows the company’s year-over-year Q2 2020 sales. Which of the following reports should the analyst compare?
Which of the following best describes a difference between JSON and XML?
A client wants a new report that will be automatically emailed to all global sales teams on a weekly basis. Each sales team must be able to view the sales for its region and the combined sales for all regions. Which of the following would be the most efficient method for meeting the requirements?
A data analyst has been asked to create a sales report that calculates the rolling 12-month average for sales. If the report will be published on November 1, 2020, which of the following months should the report cover?
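A rolling window like the one in the question above can be worked out with month arithmetic. The sketch below assumes the window ends with the last complete month before the publication date; `rolling_window` is a hypothetical helper name, not part of any reporting tool.

```python
from datetime import date

def rolling_window(publish, months=12):
    """Return ((year, month), (year, month)) for the first and last month covered.

    Assumption: the window ends with the last complete month before `publish`.
    """
    # Index months as year * 12 + (month - 1) so subtraction wraps across years.
    end = publish.year * 12 + (publish.month - 1) - 1
    start = end - (months - 1)
    return (start // 12, start % 12 + 1), (end // 12, end % 12 + 1)

first_month, last_month = rolling_window(date(2020, 11, 1))
print(first_month, last_month)
```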
A data analyst received the information in the table below from a recently completed marketing campaign:
Which of the following is the total order conversion rate?
Which of the following techniques is used to quantify data?
You should always choose the analytics tool that is most appropriate for any given situation, even if that means acquiring a new tool.
A data analyst has been asked to derive a new variable labeled “Promotion_flag” based on the total quantity sold by each salesperson. Given the table below:
Which of the following functions would the analyst consider appropriate to flag “Yes” for every salesperson who has a number above 1,000,000 in the Quantity_sold column?
A data analyst is reviewing SQL code and sees a query that uses terms such as MIN, SUM, and COUNT. Which of the following types of functions best describes these terms?
Which of the following is the first step an analyst should perform upon receiving a business request for analysis?
Which of the following best describes a business analytics tool with interactive visualization and business intelligence capabilities and an interface that is simple enough for end users to create their own reports and dashboards?
An analyst is building a new dashboard for a user. After an initial conversation with the user, the analyst created a mock-up of the dashboard. Which of the following best explains why the analyst created the mock-up?
A data analyst is designing a dashboard that will provide a story of sales and determine which site is providing the highest sales volume per customer. The analyst must choose an appropriate chart to include in the dashboard. The following data is available:
Which of the following types of charts should be considered?
Which of the following variable name formats would be problematic if used in the majority of data software programs?
Joseph is interpreting a left-skewed distribution of test scores. Joe scored at the mean, Alfonso scored at the median, and Gaby scored at the end of the tail.
Who had the highest score?
A data set was recorded using multimedia technology. Which of the following is a necessary step on the way to interpretation?
A data scientist wants to see which products make the most money and which products attract the most customer purchasing interest in their company.
Which of the following data manipulation techniques would he use to obtain this information?
A salesperson who is prospecting potential clients collected the following data:
Which of the following is an issue with this data?
Which of the following types of dashboards should a business intelligence engineer develop in order to provide information about failed data pipelines?
What subset of Structured Query Language (SQL) is used to add, remove, modify, or retrieve the information stored within a relational database?
An analyst needs to provide a chart to identify the composition between the categories of the survey response data set:
Which of the following charts would be BEST to use?
A sales manager requested a report that contains the first name, last name, and phone number of all the company’s customers and employees. The data engineer needs to return all the records from several tables, even duplicates. Which of the following is the best way to join the two tables?
Which of the following contains alphanumeric values?
Which of the following statements would be used to append two tables that have the same number of columns?
A report is scheduled to run and be distributed at the end of business each day. On Mondays, one of the recipients opens the previous week's reports and combines them to calculate the weekly totals and projections for the coming week. This is a tedious process, and the recipient asks an analyst for help. Which of the following should the analyst recommend?
Which of the following is the most likely reason for a data analyst to optimize a query using parameterization?
Which of the following is an example of a strategy to reduce statistical errors?
A data analyst is working with a team to create a dashboard for a client who requires on-demand access. Which of the following is the best delivery method to support the client's requirement?
A data analyst needs to write a SQL query measuring last month's website visits and distribute a summary report to the marketing team. Which of the following is the analyst creating?
An analyst has received the requirements for an internal user dashboard. The analyst confirms the data sources and then creates a wireframe. Which of the following is the NEXT step the analyst should take in the dashboard creation process?
A client has requested an analysis of all pet care items purchased by current customers and their social media connections in the past 12 months. Which of the following data analysis techniques would be the best choice given these requirements?
A data analyst needs to create a dashboard using the company's yearly revenue data sets. Which of the following would be the best way to plot the information to show the top-performing region?
Which of the following is an example of a data-mining ETL tool?
Which of the following types of analysis is used when comparing last week's sales to the previous week's sales?
An analyst is currently working on a ticket to revamp a company-wide dashboard that has been in use for five years. Which of the following should be the first step in the development process?
A survey asks participants to rate a company on a scale of one to ten. Which of the following best describes the rating variable?
Given the following tables:
Which of the following will be the dimensions from a FULL JOIN of the tables above?
An analyst is working with a data set that lists individuals' first and last names in separate columns. Which of the following processes should the analyst use to combine the first and last names into a single spreadsheet cell?
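The idea of joining two name columns into one value can be sketched as follows. The names are made-up sample data, and a spreadsheet would use its own function rather than this code.

```python
# Hypothetical sample columns; the question does not supply any data.
first_names = ["Ada", "Grace"]
last_names = ["Lovelace", "Hopper"]

# Join each first name and last name into one string, separated by a space.
full_names = [f"{first} {last}" for first, last in zip(first_names, last_names)]
print(full_names)
```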
A reporting analyst needs to create a report that refreshes automatically and is accessible to the entire sales organization. Which of the following tools is the most appropriate to use for this task?
Which of the following is the best approach to use to gain a general understanding of a data set?
Which of the following is an example of a discrete variable?
Which of the following data analysis tools increases the efficiency of data visualizations?
A publishing group has requested a dashboard to track submissions before publication. A key requirement is that all changes are tracked, as multiple users will be checking out documents and editing them before submissions are considered final. Which of the following is the BEST way to meet this stakeholder requirement?
Which of the following is the best technique for transferring data from one database to another with some data manipulation?
A database administrator needs to ensure only approved users can access specific database tables to perform financial functions. Which of the following is the best access control method for the administrator to use?
You are working with a dataset and need to swap the values in rows with those in columns.
What action do you need to perform?
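Swapping rows with columns can be illustrated with a small sketch. The table contents are hypothetical, and `zip(*rows)` is just one way to do this in plain Python.

```python
# Hypothetical table: the first row is the header, the rest are records.
rows = [
    ["region", "Q1", "Q2"],
    ["East", 10, 12],
    ["West", 7, 9],
]

# zip(*rows) groups the i-th item of every row, so each column becomes a row.
swapped = [list(column) for column in zip(*rows)]
print(swapped)
```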
A user receives a large custom report to track company sales across various date ranges. The user then completes a series of manual calculations for each date range. Which of the following should an analyst suggest so the user has a dynamic, seamless experience?
Daniel is using Structured Query Language (SQL) to work with data stored in a relational database.
He would like to add several new rows to a database table.
What command should he use?
An analyst compiled a high-level report that includes the following data points:
Total dollars closed for the year
Annual quota/goal
Top 10 customers
Average deal size
Largest deals lost
Which of the following groups is the most likely audience for this report?
A data analyst for a media company needs to determine the most popular movie genre. Given the table below:
Which of the following must be done to the Genre column before this task can be completed?
A marketing analytics team received customer transaction data from two different sources. The data is complete and accurate; however, the field names appear to be inconsistent. Given the following tables:
Which of the following is considered best practice if the team wants to consolidate the files and conduct further analysis?
Given the following table:
Which of the following methods is the best way to describe the changes in the values in the table?
Given the following athlete workout data (with inconsistent units or formats for time/distance), which of the following best describes the data quality issue?
An analyst has written the following code:
SELECT *
FROM Cust_table
WHERE age > 60 AND City = 'New York'
Which of the following criteria is the analyst retrieving?
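To see how the two conditions combine, the query can be run against a small in-memory SQLite table. The column layout and the rows are hypothetical, since the question does not show the table, and the string literal is written in single quotes, the standard SQL form.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
# Hypothetical schema; only age and City appear in the original query.
conn.execute("CREATE TABLE Cust_table (name TEXT, age INTEGER, City TEXT)")
conn.executemany(
    "INSERT INTO Cust_table VALUES (?, ?, ?)",
    [
        ("Ana", 72, "New York"),  # passes both conditions
        ("Ben", 45, "New York"),  # fails the age condition
        ("Cy", 68, "Boston"),     # fails the city condition
        ("Dee", 60, "New York"),  # age is not strictly greater than 60
    ],
)

# AND keeps only the rows for which both conditions are true.
matching_rows = conn.execute(
    "SELECT * FROM Cust_table WHERE age > 60 AND City = 'New York'"
).fetchall()
print(matching_rows)
conn.close()
```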
An analyst is working for an organization that has a vast amount of data stored across various systems and databases. Employees have difficulty locating and understanding the currently available data assets. Which of the following tools should the analyst implement to best solve this issue?
A database consists of one fact table that is composed of multiple dimensions. Each dimension is represented by a denormalized table. This structure is an example of a:
Given the customer table below:
Which of the following chart types is the most appropriate to represent the average spending of active customers vs. inactive customers?
The number of phone calls that the call center receives in a day is an example of:
An analyst wants to include a graph in a quarterly sales report that shows the comparison between two quantitative variables. Which of the following visual diagrams can the analyst use to most effectively represent this relationship?
A database administrator needs to increase performance on a large dimension table. Which of the following is the best way to accomplish this task?
A table in a hospital database has a column for patient height in inches and a column for patient height in centimeters. This is an example of:
Which of the following data types best describe 4Ac1? (Select two).
A data analyst has been asked to create a daily manufacturing report for the floor manager. Which of the following metrics should be included in the report?
Which of the following tools would be best to use to calculate the interquartile range, median, mean, and standard deviation of a column in a table that has 5,000,000 rows?
Randy scored 76 on a math test, Katie scored 86 on a science test, Ralph scored 80 on a history test, and Jean scored 80 on an English test. The table below contains the mean and standard deviation of the scores for each of the courses:
Using this information, which of the following students had the BEST score?
Which of the following is the best variable format to store a customer's age using the least possible amount of storage data?
What category of data stewardship work is focused on ensuring that the organization respects the wishes of data subjects?
Which of the following describes the method of sampling in which elements of data are selected randomly from each of the small subgroups within a population?
Which of the following summary statements upholds integrity in data reporting?
Which one of the following is a common data warehouse schema?
Which of the following is an object associated with a table that sorts and stores table row data in a key-value pair?
Given the following grocery store orders:
If a query is made to the table with the following logic:
Order_Total > 132 OR (Order_Total >= 25 AND Order_Total < 74)
Which of the following is the number of orders that will be returned by the query?
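The filter logic can be sketched as a predicate and applied to a list of totals. The order table is not included here, so the totals below are hypothetical boundary values and the resulting count is not the answer to the question.

```python
def matches(order_total):
    """Mirror of: Order_Total > 132 OR (Order_Total >= 25 AND Order_Total < 74)."""
    return order_total > 132 or (25 <= order_total < 74)

# Hypothetical totals chosen to sit on and around each boundary.
sample_totals = [10, 25, 50, 74, 100, 132, 150]
returned = [total for total in sample_totals if matches(total)]
print(returned, len(returned))
```

Note that 25 is included (>=), while 74 and 132 are excluded (< and > are strict).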
What analytics suite is offered by Microsoft and directly integrates with SQL Server Databases?
The scores of ten students on a test are 17, 23, 30, 36, 45, 51, 58, 66, 72, 77. Which of the following values is the range, a measure of dispersion, of these scores?
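The range is the difference between the largest and smallest value, which a brief sketch can confirm for the scores listed above.

```python
# Scores taken from the question text.
scores = [17, 23, 30, 36, 45, 51, 58, 66, 72, 77]

# Range = maximum value minus minimum value.
score_range = max(scores) - min(scores)
print(score_range)
```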
Different people manually type a series of handwritten surveys into an online database. Which of the following issues will MOST likely arise with this data? (Choose two.)
A data governance analyst who is reviewing a retailer's data set notices that sales data is captured at the regional level but not at the individual store level. Which of the following best describes the issue with this data set?
Which of the following is a control measure for preventing a data breach?
After a merger, an analyst needs to enhance a very complicated quarterly report so that it is more user friendly for new team members. Which of the following elements would help reduce questions?
What would be an example of an acceptable form of primary identification for the Data+ exam?
Which of the following is concatenation typically used to combine?
Jhon is working on an ELT process that sources data from six different source systems.
Looking at the source data, he finds that data about the same people exists in two of the six systems.
What does he have to make sure he checks for in his ELT process?
An analyst modified a data set that had a number of issues. Given the original and modified versions:
Which of the following data manipulation techniques did the analyst use?
A data analyst needs to create a dashboard to help identify trends in the data sets. Which of the following is an appropriate consideration for dashboard development?
An analyst is working on a project for a director. During this process, the analyst pulled the data, created summarized tables and graphs with descriptions, created a report summary, and inserted all items into a report. After writing the report, which of the following would be the most appropriate next step?
An analyst is reviewing the following data:
Car ID    Speed
1231      55
5664      36
5644      18
6505      67
5464      36
6456      38
Which of the following should the analyst include in the measures of central tendency for speed?
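The usual measures of central tendency can be computed with Python's standard `statistics` module. This sketch assumes each row of the listing above is a four-digit car ID followed by a two-digit speed, which is an interpretation of the data rather than something the question states.

```python
import statistics

# Speeds under the assumed four-digit ID / two-digit speed reading of the rows.
speeds = [55, 36, 18, 67, 36, 38]

mean_speed = statistics.mean(speeds)      # arithmetic average
median_speed = statistics.median(speeds)  # middle of the sorted values
mode_speed = statistics.mode(speeds)      # most frequent value
print(mean_speed, median_speed, mode_speed)
```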
A company's human resources department has asked a data analyst to categorize the income of all employees into five salary bands:
Which of the following types of functions would be the most appropriate to use?
Given the diagram below:
Which of the following types of sampling is depicted in the image?
Which of the following is the median of the number set: 3, 7, 5, 6, 9?
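The median is the middle value once the numbers are sorted. A minimal check with the standard library:

```python
import statistics

# Number set taken from the question text.
numbers = [3, 7, 5, 6, 9]

# Sorting first makes the middle position easy to see.
ordered = sorted(numbers)
median_value = statistics.median(numbers)
print(ordered, median_value)
```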
Exhibit.
Which of the following logical statements results in Table B?
While reviewing survey data, a research analyst notices data is missing from all the responses to a single question. Which of the following methods would BEST address this issue?
Which of the following would be the best way to identify multicollinear attributes in a data set?
A site reliability team wants to monitor the stability of their website so they can proactively diagnose issues when they occur. Which of the following deliverables would best suit their needs?
Which of the following types of analysis would be best for an analyst to use to examine the relationships between authors who cited other authors in a library of research papers?
Which of the following is used for calculations and pivot tables?
An analyst wants to create a historical data set for the past five years with each year in its own data set. Which of the following methods is the best way to create this historical data set?
A collections manager has a team calling customers who are past due on their accounts in an attempt to collect payments. The manager receives the call list in the form of a printed report that is generated by the accounting department at the beginning of each week. Consequently, the collections team calls some customers who have made payments in the time since the report was last printed. Which of the following reporting enhancements could the accounting department implement to best reduce the number of calls on current accounts?
Which of the following query statements would be used when filtering data in a relational database management system? (Select two).
A user imports a data file into the accounts payable system each day. On a regular basis, the field input is not what the system is expecting, so it results in an error for the row and a broken import process. To resolve the issue, the user opens the file, finds the error in the row, and manually corrects it before attempting the import again. The import sometimes breaks on subsequent attempts, though. Which of the following changes should be made to this process to reduce the number of errors?
A data profiling rule checks the quality of all email addresses in a database. The rule returns a value with the number of email addresses that conformed to the rule. Which of the following options describes this value?
Which of the following is a characteristic of a relational database?
An analyst has been asked to validate data quality. Which of the following are the BEST reasons to validate data for quality control purposes? (Choose two.)
Which of the following are reasons to create and maintain a data dictionary? (Choose two.)
An analyst for a concert venue is analyzing the number of tickets sold for a recent event. Which of the following types of data is the number of sold tickets an example of?
A column is being used to store strings of variable lengths. Performance is a concern, so the column needs to use as little space as possible. Which of the following data types best meets these requirements?