Month End Sale Limited Time 70% Discount Offer - Ends in 0d 00h 00m 00s - Coupon code: 70special

Oracle 1z0-1110-25 Oracle Cloud Infrastructure 2025 Data Science Professional Exam Practice Test

Page: 1 / 16
Total 158 questions

Oracle Cloud Infrastructure 2025 Data Science Professional Questions and Answers

Testing Engine

  • Product Type: Testing Engine
$37.5  $124.99

PDF Study Guide

  • Product Type: PDF Study Guide
$33  $109.99
Question 1

Which is NOT a valid OCI Data Science notebook session approach?

Options:

A.

Ensure you don’t execute long-running Python processes in a notebook cell

B.

Run the process directly in the terminal and use Python logging to get updates on the progress of your job

C.

Avoid having multiple users in the same notebook session due to the possibility of resource contention and write conflicts

D.

While connecting to data in OCI Object Storage from your notebook session, the best practice is to make a local copy on the device and then upload it to your notebook session block volume

E.

Authenticate using your notebook session’s resource principal to access other OCI resources. Resource principals provide a more secure way to authenticate to resources compared to the OCI configuration and API approach

Question 2

You are a computer vision engineer building an image recognition model. You decide to use Oracle Data Labeling to annotate your image data. Which of the following THREE are possible ways to annotate an image in Data Labeling?

Options:

A.

Adding labels to an image using semantic segmentation, by drawing multiple bounding boxes to an image

B.

Adding a single label to an image

C.

Adding labels to an image by drawing a bounding box to an image is not supported by Data Labeling

D.

Adding labels to an image using object detection, by drawing bounding boxes to an image

E.

Adding multiple labels to an image

Question 3

Which THREE types of data are used for Data Labeling?

Options:

A.

Audio

B.

Text Document

C.

Images

D.

Graphs

Question 4

You are attempting to save a model from a notebook session to the model catalog by using the Accelerated Data Science (ADS) SDK, with resource principal as the authentication signer, and you get a 404 authentication error. Which two should you look for to ensure permissions are set up correctly?

Options:

A.

The model artifact is saved to the block volume of the notebook session

B.

A dynamic group has rules that match the notebook sessions in its compartment

C.

The policy for your user group grants manage permissions for the model catalog in this compartment

D.

The policy for a dynamic group grants manage permissions for the model catalog in this compartment

E.

The networking configuration allows access to Oracle Cloud Infrastructure services through a Service Gateway

Question 5

As a data scientist, you are tasked with creating a model training job that is expected to take different hyperparameter values on every run. What is the most efficient way to set those parameters with Oracle Data Science Jobs?

Options:

A.

Create a new job every time you need to run your code and pass the parameters as environment variables

B.

Create your code to expect different parameters as command-line arguments, and create a new job every time you run the code

C.

Create a new job by setting the required parameters in your code, and create a new job for every code change

D.

Create your code to expect different parameters either as environment variables or as command-line arguments, which are set on every job run with different values

Question 6

You are using Oracle Cloud Infrastructure (OCI) Anomaly Detection to train a model to detect anomalies in pump sensor data. What are you trying to determine? How does the required False Alarm Probability setting affect an anomaly detection model?

Options:

A.

It is used to disable the reporting of false alarms

B.

It changes the sensitivity of the model to detecting anomalies

C.

It determines how many false alarms occur before an error message is generated

D.

It adds a score to each signal indicating the probability that it’s a false alarm

Question 7

You are working in your notebook session and find that your notebook session does not have enough compute CPU and memory for your workload. How would you scale up your notebook session without losing your work?

Options:

A.

Deactivate your notebook session, provision a new notebook session on a larger compute shape, and recreate all your file changes

B.

Download your files and data to your local machine, delete your notebook session, provision a new notebook session on a larger compute shape, and upload your files from your local machine to the new notebook session

C.

Ensure your files and environments are written to the block volume storage under the /home/datascience directory, deactivate the notebook session, and activate the notebook with a larger compute shape selected

D.

Create a temporary bucket in Object Storage, write all your files and data to Object Storage, delete the notebook session, provision a new notebook session on a larger compute shape, and copy your files and data from your temporary bucket to your new notebook session

Question 8

Which Oracle Accelerated Data Science (ADS) classes can be used for easy access to datasets fromreference libraries and index websites, such as scikit-learn?

Options:

A.

DatasetBrowser

B.

DatasetFactory

C.

ADSTuner

D.

SecretKeeper

Question 9

How can you convert a fixed load balancer to a flexible load balancer?

Options:

A.

There is no way to convert the load balancer

B.

Use Update Shape workflows

C.

Delete the fixed load balancer and create a new one

D.

Using the Edit Listener option

Question 10

Which statement best describes Oracle Cloud Infrastructure Data Science Jobs?

Options:

A.

Jobs let you define and run repeatable tasks on fully managed infrastructure.

B.

Jobs let you define and run repeatable tasks on customer-managed infrastructure.

C.

Jobs let you define and run repeatable tasks on fully managed third-party cloud infrastructures.

D.

Jobs let you define and run all Oracle Cloud DevOps workloads.

Question 11

When preparing your model artifact to save it to the Oracle Cloud Infrastructure (OCI) DataScience model catalog, you create a score.py file. What is the purpose of the score.py file?

Options:

A.

Configure the deployment infrastructure

B.

Execute the inference logic code

C.

Define the compute scaling strategy

D.

Define the inference server dependencies

Question 12

Which Security Zone policy is NOT valid?

Options:

A.

A boot volume can be moved from a security zone to a standard compartment

B.

A compute instance cannot be moved from a security zone to a standard compartment

C.

Resources in a security zone should not be accessible from the public internet

D.

Resources in a security zone must be automatically backed up regularly

Question 13

Which of the following best describes the principal goal of data science?

Options:

A.

To collect and archive exhaustive datasets from various source systems for corporate record-keeping uses.

B.

To mine and analyze large amounts of data in order to uncover information that can be used for operational improvements and business gains.

C.

To collect and prepare data for use as part of analytics applications.

D.

Data science is focused on output of the analysis.

Question 14

Which function's objective is to represent the difference between the predictive value and the target value?

Options:

A.

Optimizer function

B.

Fit function

C.

Update function

D.

Cost function

Question 15

You want to make your model more frugal to reduce the cost of collecting and processing data. You plan to do this by removing features that are highly correlated. You would like to create a heatmap that displays the correlation so that you can identify candidate features to remove. Which Accelerated Data Science (ADS) SDK method is appropriate to display the comparability between Continuous and Categorical features?

Options:

A.

pearson_plot()

B.

cramersv_plot()

C.

correlation_ratio_plot()

D.

corr()

Question 16

In which two ways can you improve data durability in Oracle Cloud Infrastructure Object Storage?

Options:

A.

Setup volumes in a RAID1 configuration

B.

Enable server-side encryption

C.

Enable Versioning

D.

Limit delete permissions

E.

Enable client-side encryption

Question 17

You’re going to create an Oracle Cloud Infrastructure Anomaly Detection model for multivariate data. Where do you need to store the training data?

Options:

A.

Your local machine

B.

MySQL database

C.

Autonomous Data Warehouse

D.

Object Storage Bucket

Question 18

You have an embarrassingly parallel or distributed batch job with a large amount of data running using Data Science Jobs. What would be the best approach to run the workload?

Options:

A.

Create a job in Data Science Jobs and then start the number of simultaneous job runs required for your workload

B.

Create a new job for every job run that you have to run in parallel, because the Data Science Job service can have only one job per job

C.

Create the job in Data Science Jobs and start a job run. When it is done, start a new job run until you achieve the number of runs required

D.

Reconfigure the job run because Data Science Jobs does not support embarrassingly parallel

Question 19

What is the name of the machine learning library used in Apache Spark?

Options:

A.

MLib

B.

GraphX

C.

Structured Streaming

D.

HadoopML

Question 20

You are working as a data scientist for a healthcare company. They decided to analyze the data to find patterns in a large volume of electronic medical records. You are asked to build a PySpark solution to analyze these records in a JupyterLab notebook. What is the order of recommended steps to develop a PySpark application in OCI Data Science?

Options:

A.

Launch a notebook session, configure core-site.xml, install a PySpark conda environment, develop your PySpark application, create a Data Flow application with the Accelerated Data Science (ADS) SDK

B.

Configure core-site.xml, install a PySpark conda environment, create a Data Flow application with the Accelerated Data Science (ADS) SDK, develop your PySpark application, launch a notebook session

C.

Install a Spark conda environment, configure core-site.xml, launch a notebook session, create a Data Flow application with the Accelerated Data Science (ADS) SDK, develop your PySpark application

D.

Launch a notebook session, install a PySpark conda environment, configure core-site.xml, develop your PySpark application, create a Data Flow application with the Accelerated Data Science (ADS) SDK

Question 21

You are a data scientist using Oracle AutoML to produce a model and you are evaluating the score metric for the model. Which TWO of the following prevailing metrics would you use for evaluating a multiclass classification model?

Options:

A.

Mean squared error

B.

Explained variance score

C.

Recall

D.

F1-score

E.

R-squared

Question 22

What happens when a notebook session is deactivated?

Options:

A.

Compute cost increases due to frequent deactivation

B.

The data on boot volume is preserved

C.

The underlying compute instance stops

D.

The block volume attached to the notebook is permanently deleted

Question 23

Why is data sampling useful for data scientists?

Options:

A.

It lets them analyze datasets in small batches to reduce their use of system resources.

B.

It reduces the amount of data storage space that’s required for data science applications.

C.

It enables them to use a representative subset of data to build accurate analytical models more quickly.

Question 24

As a data scientist, you are tasked with creating a model training job that is expected to take different hyperparameter values on every run. What is the most efficient way to set those parameters with Oracle Data Science Jobs?

Options:

A.

Create a new job every time you need to run your code and pass the parameters as environment variables

B.

Create a new job by setting the required parameters in your code and create a new job for every code change

C.

Create your code to expect different parameters either as environment variables or as command-line arguments, which are set on every job run with different values

D.

Create your code to expect different parameters as command-line arguments and create a new job every time you run the code

Question 25

You have an embarrassingly parallel or distributed batch job on a large amount of data that you consider running using Data Science Jobs. What would be the best approach to run the workload?

Options:

A.

Create the job in Data Science Jobs and start a job run. When it is done, start a new job run until you achieve the number of runs required

B.

Create the job in Data Science Jobs and then start the number of simultaneous job runs required for your workload

C.

Reconfigure the job run because Data Science Jobs does not support embarrassingly parallel workloads

D.

Create a new job for every job run that you have to run in parallel, because the Data Science Jobs service can have only one job run per job

Question 26

You are a data scientist working inside a notebook session and you attempt to pip install a package from a public repository that is not included in your conda environment. After running this command, you get a network timeout error. What might be missing from your network configuration?

Options:

A.

The NAT Gateway with public internet access

B.

Service Gateway with private subnet access

C.

FastConnect to an on-premises network

D.

Primary Virtual Network Interface Card (VNIC)

Question 27

Which Oracle Accelerated Data Science (ADS) classes can be used for easy access to datasets from reference libraries and index websites such as scikit-learn?

Options:

A.

DataLabeling

B.

DatasetBrowser

C.

SecretKeeper

D.

DatasetFactory

Question 28

You have an image classification model in the model catalog which is deployed as an HTTP endpoint using model deployments. Your tenancy administrator is seeing increased demands and has asked you to increase the load balancing bandwidth from the default of 10Mbps. You are provided with the following information:

    Payload size in KB = 1024

    Estimated requests per second = 120 requests/second (Monday through Friday, in every month, in every year)

    Buffer percentage = 20%What is the optimal load balancing bandwidth to redeploy your model?

Options:

A.

452 Mbps

B.

52 Mbps

C.

7052 Mbps

D.

1152 Mbps

Question 29

What is a common maxim about data scientists?

Options:

A.

They spend 80% of their time finding and preparing data and 20% analyzing it.

B.

They spend 80% of their time analyzing data and 20% finding and preparing it.

C.

They spend 80% of their time on failed analytics projects and 20% doing useful work.

Question 30

Which Oracle Data Safe feature minimizes the amount of personal data and allows internal test, development, and analytics teams to operate with reduced risk?

Options:

A.

Data encryption

B.

Security assessment

C.

Data masking

D.

Data discovery

E.

Data auditing

Question 31

You want to use ADSTuner to tune the hyperparameters of a supported model you recently trained. You have just started your search and want to reduce the computational cost as well as assess the quality of the model class that you are using. What is the most appropriate search space strategy to choose?

Options:

A.

Detailed

B.

ADSTuner doesn’t need a search space to tune the hyperparameters

C.

Perfunctory

D.

Pass a dictionary that defines a search space

Question 32

Where are OCI secrets stored?

Options:

A.

OCI Object Storage

B.

OCI Vault

C.

Autonomous Data Warehouse

D.

Oracle Databases

Question 33

You are a data scientist building a pipeline in the Oracle Cloud Infrastructure (OCI) Data Science service for your machine learning project. You want to optimize the pipeline completion time by running some steps in parallel. Which statement is true about running pipeline steps in parallel?

Options:

A.

Steps in a pipeline can be run only sequentially

B.

Pipeline steps can be run in sequence or in parallel, as long as they create a directed acyclic graph (DAG)

C.

All pipeline steps are always run in parallel

D.

Parallel steps cannot be run if they are completely independent of each other

Question 34

Which statement is true about standards?

Options:

A.

They may be audited

B.

They are the result of a regulation or contractual requirement or an industry requirement

C.

They are methods and instructions on how to maintain or accomplish the directives of the policy

D.

They are the foundation of corporate governance

Question 35

As a data scientist, you are working on a global health dataset that has data from more than 50 countries. You want to encode three features, such as 'countries', 'race', and 'body organ' as categories. Which option would you use to encode the categorical feature?

Options:

A.

DataFrameLabelEncode()

B.

auto_transform()

C.

OneHotEncoder()

D.

show_in_notebook()

Question 36

You have just received a new dataset from a colleague. You want to quickly find out summary information about the dataset, such as the types of features, the total number of observations, and distributions of the data. Which Accelerated Data Science (ADS) SDK method from the ADSDataset class would you use?

Options:

A.

show_corr()

B.

to_xgb()

C.

compute()

D.

show_in_notebook()

Question 37

You realize that your model deployment is about to reach its utilization limit. What would you do to avoid the issue before requests start to fail? Pick THREE.

Options:

A.

Update the deployment to add more instances

B.

Delete the deployment

C.

Update the deployment to use fewer instances

D.

Update the deployment to use a larger virtual machine (more CPUs/memory)

E.

Reduce the load balancer bandwidth limit so that fewer requests come in

Question 38

Which technique can be used for feature engineering in the machine learning lifecycle?

Options:

A.

Principal Component Analysis (PCA)

B.

K-means clustering

C.

Support Vector Machines (SVM)

D.

Gradient boosting

Question 39

Which statement about Oracle Cloud Infrastructure Multi-Factor Authentication (MFA) is NOT valid?

Options:

A.

Users cannot disable MFA for themselves

B.

A user can register only one device to use for MFA

C.

Users must install a supported authenticator app on the mobile device they intend to register for MFA

D.

An administrator can disable MFA for another user

Question 40

You have received machine learning model training code, without clear information about the optimal shape to run the training. How would you proceed to identify the optimal compute shape for your model training that provides a balanced cost and processing time?

Options:

A.

Start with a smaller shape and monitor the Job Run metrics and time required to complete the model training. If the compute shape is not fully utilized, tune the model parameters, and rerun the job. Repeat the process until the shape resources are fully utilized

B.

Start with the strongest compute shape Jobs support and monitor the Job Run metrics and time required to complete the model training. Tune the model so that it utilizes as much compute resources as possible, even at an increased cost

C.

Start with a smaller shape and monitor the utilization metrics and time required to complete the model training. If the compute shape is fully utilized, change to compute that has more resources and rerun the job. Repeat the process until the processing time does not improve

D.

Start with a random compute shape and monitor the utilization metrics and time required to finish the model training. Perform model training optimizations and performance tests in advance to identify the right compute shape before running the model training as a job

Question 41

Six months ago, you created and deployed a model that predicts customer churn for a call centre. Initially, it was yielding quality predictions. However, over the last two months, users are questioning the credibility of the predictions. Which TWO methods would you employ to verify the accuracy of the model?

Options:

A.

Retrain the model

B.

Validate the model using recent data

C.

Drift monitoring

D.

Redeploy the model

E.

Operational monitoring

Question 42

True or false? Data scientists typically need a combination of technical skills, nontechnical ones, and suitable personality traits to be successful.

Options:

A.

True

B.

False

Question 43

Which statement about resource principals is true?

Options:

A.

When you authenticate using a resource principal, you need to create and manage credentials to access OCI resources.

B.

A resource principal is not a secure way to authenticate to resources, compared to the OCI configuration and API key approach.

C.

The Data Science service does not provide authentication via a notebook session’s or job run’s resource principal to access other OCI resources.

D.

A resource principal is a feature of IAM that enables resources to be authorized principal actors.

Question 44

Which OCI Data Science interaction method can function without the need of scripting?

Options:

A.

OCI Console

B.

CLI

C.

Language SDKs

D.

REST APIs

Question 45

Where do calls to stdout and stderr from score.py go in a model deployment?

Options:

A.

The file that was defined for them on the Virtual Machine (VM)

B.

The predict log in the Oracle Cloud Infrastructure (OCI) Logging service as defined in the deployment

C.

The OCI Cloud Shell, which can be accessed from the console

D.

The OCI console

Question 46

Which feature of Oracle Cloud Infrastructure Data Science provides an interactive coding environment for building and training machine learning models?

Options:

A.

Model Catalog

B.

Jobs

C.

Notebook Sessions

D.

Projects

Question 47

You have a dataset with fewer than 1000 observations, and you are using Oracle AutoML to build a classifier. While visualizing the results of each stage of the Oracle AutoML pipeline, you notice that no visualization has been generated for one of the stages. Which stage is not visualized?

Options:

A.

Feature selection

B.

Algorithm selection

C.

Adaptive sampling

D.

Hyperparameter tuning

Page: 1 / 16
Total 158 questions