An e-commerce developer built an application for automatic classification of online products in order to allow customers to select products faster. The goal is to provide more relevant products to the user based on prior purchases.
Which of the following factors is necessary for a supervised machine learning algorithm to be successful?
A company producing consumable goods wants to identify groups of people with similar tastes for the purpose of targeting different products for each group. You have to choose and apply an appropriate ML type for this problem.
Which ONE of the following options represents the BEST possible solution for this above-mentioned task?
SELECT ONE OPTION
Arihant Meditation is a startup using Al to aid people in deeper and better meditation based on analysis of various factors such as time and duration of the meditation, pulse and blood pressure, EEG patters etc. among others. Their model accuracy and other functional performance parameters have not yet reached their desired level.
Which ONE of the following factors is NOT a factor affecting the ML functional performance?
SELECT ONE OPTION
Which ONE of the following options describes a scenario of A/B testing the LEAST?
SELECT ONE OPTION
A team of software testers is attempting to create an AI algorithm to assist in software testing. This particular team has gone through over 40 iterations of testing and cannot afford to spend as much time as it takes to run the full regression test suite. They are hoping to have the algorithm reduce the amount of testing required, thus reducing the time needed for each testing cycle.
How can an AI-based tool be expected to assist in this reduction?
Which ONE of the following statements correctly describes the importance of flexibility for Al systems?
SELECT ONE OPTION
Pairwise testing can be used in the context of self-driving cars for controlling an explosion in the number of combinations of parameters.
Which ONE of the following options is LEAST likely to be a reason for this incredible growth of parameters?
SELECT ONE OPTION
In a certain coffee producing region of Colombia, there have been some severe weather storms, resulting in massive losses in production. This caused a massive drop in stock price of coffee.
Which ONE of the following types of testing SHOULD be performed for a machine learning model for stock-price prediction to detect influence of such phenomenon as above on price of coffee stock.
SELECT ONE OPTION
Data used for an object detection ML system was found to have been labelled incorrectly in many cases.
Which ONE of the following options is most likely the reason for this problem?
SELECT ONE OPTION
You are testing an autonomous vehicle which uses AI to determine proper driving actions and responses. You have evaluated the parameters and combinations to be tested and have determined that there are too many to test in the time allowed. It has been suggested that you use pairwise testing to limit the parameters. Given the complexity of the software under test, what is likely the outcome from using pairwise testing?
Which of the following are the three activities in the data acquisition activities for data preparation?
Which of the following is a dataset issue that can be resolved using pre-processing?
Which ONE of the following activities is MOST relevant when addressing the scenario where you have more than the required amount of data available for the training?
SELECT ONE OPTION
A bank wants to use an algorithm to determine which applicants should be given a loan. The bank hires a data scientist to construct a logistic regression model to predict whether the applicant will repay the loan or not. The bank has enough data on past customers to randomly split the data into a training dataset and a test/validation dataset. A logistic regression model is constructed on the training dataset using the following independent variables:
Gender
Marital status
Number of dependents
Education
Income
Loan amount
Loan term
Credit score
The model reveals that those with higher credit scores and larger total incomes are more likely to repay their loans. The data scientist has suggested that there might be bias present in the model based on previous models created for other banks.
Given this information, what is the best test approach to check for potential bias in the model?
Consider a machine learning model where the model is attempting to predict if a patient is at risk for stroke. The model collects information on each patient regarding their blood pressure, red blood cell count, smoking status, history of heart disease, cholesterol level, and demographics. Then, using a decision tree the model predicts whether or not the associated patient is likely to have a stroke in the near future. Once the model is created using a training dataset, it is used to predict a stroke in 80 additional patients. The table below shows a confusion matrix on whether or not the model made a correct or incorrect prediction.
The testers have calculated what they believe to be an appropriate functional performance metric for the model. They calculated a value of 0.6667.
Which metric did the testers calculate?
The stakeholders of a machine learning model have confirmed that they understand the objective and purpose of the model, and ensured that the proposed model aligns with their business priorities. They have also selected a framework and a machine learning model that they will be using. What should be the next step to progress along the machine learning workflow?
Which ONE of the following combinations of Training, Validation, Testing data is used during the process of learning/creating the model?
SELECT ONE OPTION
Before deployment of an AI-based system, a developer is expected to demonstrate in a test environment how decisions are made. Which of the following characteristics does decision making fall under?
Which of the following is a problem with AI-generated test cases that are generated from the requirements?
A tourist calls an airline to book a ticket and is connected with an automated system which is able to recognize speech, understand requests related to purchasing a ticket, and provide relevant travel options. When the tourist asks about the expected weather at the destination or potential impacts on operations because of the tight labor market, the only response from the automated system is, "I don’t understand your question."
This AI system should be categorized as?
Which ONE of the following options BEST DESCRIBES clustering?
SELECT ONE OPTION
"Splendid Healthcare" has started developing a cancer detection system based on ML. The type of cancer they plan on detecting has 2% prevalence rate in the population of a particular geography. It is required that the model performs well for both normal and cancer patients.
Which ONE of the following combinations requires MAXIMIZATION?
SELECT ONE OPTION
Upon testing a model used to detect rotten tomatoes, the following data was observed by the test engineer, based on certain number of tomato images.
For this confusion matrix which combinations of values of accuracy, recall, and specificity respectively is CORRECT?
SELECT ONE OPTION
You have access to the training data that was used to train an AI-based system. You can review this information and use it as a guideline when creating your tests. What type of characteristic is this?