What type of variable is the dependent variable from a logistic regression?
In a user-defined aggregate function, what is FFUNC?
What is a key consideration when preparing a presentation intended for analysts?
After running a density plot you realize that the data has a long tail to the right. What can you do to make the dataset more normally distributed?
In time series analysis, what statement describes a MA(q) process?
In hypothesis testing, when does a Type I error occur?
What does a branch represent in a decision tree?
You build a decision tree to classify five different types of customers based on their browsing history from a sample of 500. The resulting decision tree has 17 layers. One of the leaf nodes has only three customers.
What do you conclude?
Which component of a final presentation provides a succinct overview of the business situation that was the impetus to initiate the project?
When should you consider using multinomial logistic regression over binary logistic regression?
What is part of the model output for a linear regression?
What does “MAD” in MADlib stand for?
After which phase of the data analytics lifecycle should you determine if the model needs any recalibration?
You have been given a task to improve sales force compensation of your organization. As a result of a study, your team decides to classify personnel as follows:
● Did not meet quota
● Met quota
● Exceeded 150% of quota
In which data analytics lifecycle phase should you define these categories for analysis purposes?
Refer to the exhibit.
What is the approximate R-squared value for a linear regression model fitted to the data associated with this scatterplot?
Which visualization technique should be avoided?
On which type of data should you run K-means clustering?