Which is NOT a valid OCI Data Science notebook session approach?
You are a computer vision engineer building an image recognition model. You decide to use Oracle Data Labeling to annotate your image data. Which of the following THREE are possible ways to annotate an image in Data Labeling?
Which THREE types of data are used for Data Labeling?
You are attempting to save a model from a notebook session to the model catalog by using the Accelerated Data Science (ADS) SDK, with resource principal as the authentication signer, and you get a 404 authentication error. Which two should you look for to ensure permissions are set up correctly?
As a data scientist, you are tasked with creating a model training job that is expected to take different hyperparameter values on every run. What is the most efficient way to set those parameters with Oracle Data Science Jobs?
You are using Oracle Cloud Infrastructure (OCI) Anomaly Detection to train a model to detect anomalies in pump sensor data. What are you trying to determine? How does the required False Alarm Probability setting affect an anomaly detection model?
You are working in your notebook session and find that your notebook session does not have enough compute CPU and memory for your workload. How would you scale up your notebook session without losing your work?
Which Oracle Accelerated Data Science (ADS) classes can be used for easy access to datasets fromreference libraries and index websites, such as scikit-learn?
How can you convert a fixed load balancer to a flexible load balancer?
Which statement best describes Oracle Cloud Infrastructure Data Science Jobs?
When preparing your model artifact to save it to the Oracle Cloud Infrastructure (OCI) DataScience model catalog, you create a score.py file. What is the purpose of the score.py file?
Which Security Zone policy is NOT valid?
Which of the following best describes the principal goal of data science?
Which function's objective is to represent the difference between the predictive value and the target value?
You want to make your model more frugal to reduce the cost of collecting and processing data. You plan to do this by removing features that are highly correlated. You would like to create a heatmap that displays the correlation so that you can identify candidate features to remove. Which Accelerated Data Science (ADS) SDK method is appropriate to display the comparability between Continuous and Categorical features?
In which two ways can you improve data durability in Oracle Cloud Infrastructure Object Storage?
You’re going to create an Oracle Cloud Infrastructure Anomaly Detection model for multivariate data. Where do you need to store the training data?
You have an embarrassingly parallel or distributed batch job with a large amount of data running using Data Science Jobs. What would be the best approach to run the workload?
What is the name of the machine learning library used in Apache Spark?
You are working as a data scientist for a healthcare company. They decided to analyze the data to find patterns in a large volume of electronic medical records. You are asked to build a PySpark solution to analyze these records in a JupyterLab notebook. What is the order of recommended steps to develop a PySpark application in OCI Data Science?
You are a data scientist using Oracle AutoML to produce a model and you are evaluating the score metric for the model. Which TWO of the following prevailing metrics would you use for evaluating a multiclass classification model?
What happens when a notebook session is deactivated?
Why is data sampling useful for data scientists?
As a data scientist, you are tasked with creating a model training job that is expected to take different hyperparameter values on every run. What is the most efficient way to set those parameters with Oracle Data Science Jobs?
You have an embarrassingly parallel or distributed batch job on a large amount of data that you consider running using Data Science Jobs. What would be the best approach to run the workload?
You are a data scientist working inside a notebook session and you attempt to pip install a package from a public repository that is not included in your conda environment. After running this command, you get a network timeout error. What might be missing from your network configuration?
Which Oracle Accelerated Data Science (ADS) classes can be used for easy access to datasets from reference libraries and index websites such as scikit-learn?
You have an image classification model in the model catalog which is deployed as an HTTP endpoint using model deployments. Your tenancy administrator is seeing increased demands and has asked you to increase the load balancing bandwidth from the default of 10Mbps. You are provided with the following information:
Payload size in KB = 1024
Estimated requests per second = 120 requests/second (Monday through Friday, in every month, in every year)
Buffer percentage = 20%What is the optimal load balancing bandwidth to redeploy your model?
What is a common maxim about data scientists?
Which Oracle Data Safe feature minimizes the amount of personal data and allows internal test, development, and analytics teams to operate with reduced risk?
You want to use ADSTuner to tune the hyperparameters of a supported model you recently trained. You have just started your search and want to reduce the computational cost as well as assess the quality of the model class that you are using. What is the most appropriate search space strategy to choose?
Where are OCI secrets stored?
You are a data scientist building a pipeline in the Oracle Cloud Infrastructure (OCI) Data Science service for your machine learning project. You want to optimize the pipeline completion time by running some steps in parallel. Which statement is true about running pipeline steps in parallel?
Which statement is true about standards?
As a data scientist, you are working on a global health dataset that has data from more than 50 countries. You want to encode three features, such as 'countries', 'race', and 'body organ' as categories. Which option would you use to encode the categorical feature?
You have just received a new dataset from a colleague. You want to quickly find out summary information about the dataset, such as the types of features, the total number of observations, and distributions of the data. Which Accelerated Data Science (ADS) SDK method from the ADSDataset class would you use?
You realize that your model deployment is about to reach its utilization limit. What would you do to avoid the issue before requests start to fail? Pick THREE.
Which technique can be used for feature engineering in the machine learning lifecycle?
Which statement about Oracle Cloud Infrastructure Multi-Factor Authentication (MFA) is NOT valid?
You have received machine learning model training code, without clear information about the optimal shape to run the training. How would you proceed to identify the optimal compute shape for your model training that provides a balanced cost and processing time?
Six months ago, you created and deployed a model that predicts customer churn for a call centre. Initially, it was yielding quality predictions. However, over the last two months, users are questioning the credibility of the predictions. Which TWO methods would you employ to verify the accuracy of the model?
True or false? Data scientists typically need a combination of technical skills, nontechnical ones, and suitable personality traits to be successful.
Which statement about resource principals is true?
Which OCI Data Science interaction method can function without the need of scripting?
Where do calls to stdout and stderr from score.py go in a model deployment?
Which feature of Oracle Cloud Infrastructure Data Science provides an interactive coding environment for building and training machine learning models?
You have a dataset with fewer than 1000 observations, and you are using Oracle AutoML to build a classifier. While visualizing the results of each stage of the Oracle AutoML pipeline, you notice that no visualization has been generated for one of the stages. Which stage is not visualized?