CT-AI ISTQB Certified Tester AI Testing Exam sample Question + Exam 2026 Practice Exam Dumps

Question # 4

In a conference on artificial intelligence (Al), a speaker made the statement, "The current implementation of Al using models which do NOT change by themselves is NOT true Al*. Based on your understanding of Al, is this above statement CORRECT or INCORRECT and why?

SELECT ONE OPTION

This statement is incorrect. Current Al is true Al and there is no reason to believe that this fact will change over time.

This statement is correct. In general, what is considered Al today may change over time.

This statement is incorrect. What is considered Al today will continue to be Al even as technology evolves and changes.

This statement is correct. In general, today the term Al is utilized incorrectly.

Full Access

Question # 5

Which option gives the correct values for accuracy and precision from the confusion matrix?

Choose ONE option (1 out of 4)

Accuracy = 50%, Precision = 75%

Accuracy = 80%, Precision = 75%

Accuracy = 75%, Precision = 80%

Accuracy = 80%, Precision = 50%

Full Access

Question # 6

Which ONE of the following statements correctly describes the importance of flexibility for Al systems?

SELECT ONE OPTION

Al systems are inherently flexible.

Al systems require changing of operational environments; therefore, flexibility is required.

Flexible Al systems allow for easier modification of the system as a whole.

Self-learning systems are expected to deal with new situations without explicitly having to program for it.

Full Access

Question # 7

Which ONE of the following characteristics is the least likely to cause safety related issues for an Al system?

SELECT ONE OPTION

Non-determinism

Robustness

High complexity

Self-learning

Full Access

Question # 8

Which ONE of the following options describes the LEAST LIKELY usage of Al for detection of GUI changes due to changes in test objects?

SELECT ONE OPTION

Using a pixel comparison of the GUI before and after the change to check the differences.

Using a computer vision to compare the GUI before and after the test object changes.

Using a vision-based detection of the GUI layout changes before and after test object changes.

Using a ML-based classifier to flag if changes in GUI are to be flagged for humans.

Full Access

Question # 9

Which statement regarding flexibility and adaptability of AI-based systems is correct?

Choose ONE option (1 out of 4)

Adaptability and flexibility are important when the system needs to change its behavior and determine the change on its own.

Adaptability is considered to be the ability of the system to be used in unspecified situations.

Self-learning AI-based systems are classified according to whether they are adaptable only or flexible only.

Flexibility is considered to be the ease with which the system can be reprogrammed to a changed operating condition.

Full Access

Question # 10

Upon testing a model used to detect rotten tomatoes, the following data was observed by the test engineer, based on certain number of tomato images.

For this confusion matrix which combinations of values of accuracy, recall, and specificity respectively is CORRECT?

SELECT ONE OPTION

0.87.0.9. 0.84

1,0.87,0.84

1,0.9, 0.8

0.84.1,0.9

Full Access

Question # 11

Which of the following options is an example of the concept of overfitting?

Choose ONE option (1 out of 4)

A model for predicting academic performance was trained with data from students at one university. The model shows low predictive accuracy when applied to other universities.

A model for the recognition of dogs was trained predominantly with pictures of dogs in parks. On pictures with other animals in parks, dogs are also falsely recognized.

A previously trained model for recognizing cars is adapted and extended so that it can also identify the make of the car beyond its original function.

A model for predicting IT system failures delivers too many false-negative predictions because the failures cannot be adequately explained via the log files used for training.

Full Access

Question # 12

Which ONE of the following options represents a technology MOST TYPICALLY used to implement Al?

SELECT ONE OPTION

Search engines

Procedural programming

Case control structures

Genetic algorithms

Full Access

Question # 13

Which of the following is a dataset issue that can be resolved using pre-processing?

Insufficient data

Invalid data

Wanted outliers

Numbers stored as strings

Full Access

Question # 14

Which statement about testing levels for AI-based systems is correct?

Choose ONE option (1 out of 4)

Input data testing checks whether the inputs from the data pipeline are received by the model correctly and exchanged with all system components

Acceptance testing checks non-functional requirements such as explainability

ML model testing ensures that the relevant ML functional performance criteria are met

If AI is offered as a service, system testing includes API tests of the service

Full Access

Question # 15

A company producing consumable goods wants to identify groups of people with similar tastes for the purpose of targeting different products for each group. You have to choose and apply an appropriate ML type for this problem.

Which ONE of the following options represents the BEST possible solution for this above-mentioned task?

SELECT ONE OPTION

Regression

Association

Clustering

Classification

Full Access

Question # 16

An engine manufacturing facility wants to apply machine learning to detect faulty bolts. Which of the following would result in bias in the model?

Selecting training data purposely excluding specific faulty conditions

Selecting training data by purposely including all known faulty conditions

Selecting testing data from a different dataset than the training dataset

Selecting testing data from a boat manufacturer's bolt longevity data

Full Access

Question # 17

Which statement about testing to prevent data poisoning and adversarial attacks is correct?

Choose ONE option (1 out of 4)

Regression testing can be used to verify data sourcing policies to ensure the source of training data.

The adversarial examples identified during adversarial testing must not be added to the training data so that they do not poison the model.

Adversarial testing consists of using adversarial attacks to identify vulnerabilities so that they can be eliminated.

Using AIB testing to identify data poisoning can better identify outliers than exploratory data analysis.

Full Access

Question # 18

When verifying that an autonomous AI-based system is acting appropriately, which of the following are MOST important to include?

Test cases to verify that the system automatically confirms the correct classification of training data

Test cases to detect the system appropriately automating its data input

Test cases to detect the system prompting for unnecessary human intervention

Test cases to verify that the system automatically suppresses invalid output data

Full Access

Question # 19

Which ONE of the following models BEST describes a way to model defect prediction by looking at the history of bugs in modules by using code quality metrics of modules of historical versions as input?

SELECT ONE OPTION

Identifying the relationship between developers and the modules developed by them.

Search of similar code based on natural language processing.

Clustering of similar code modules to predict based on similarity.

Using a classification model to predict the presence of a defect by using code quality metrics as the input data.

Full Access

Answer:

Explanation:

Defect prediction models aim to identify parts of the software that are likely to contain defects by analyzing historical data and code quality metrics. The primary goal is to use this predictive information to allocate testing and maintenance resources effectively. Let's break down why option D is the correct choice:

Understanding Classification Models:

Classification models are a type of supervised learning algorithm used to categorize or classify data into predefined classes or labels. In the context of defect prediction, the classification model would classify parts of the code as either "defective" or "non-defective" based on the input features.

Input Data - Code Quality Metrics:

The input data for these classification models typically includes various code quality metrics such as cyclomatic complexity, lines of code, number of methods, depth of inheritance, coupling between objects, etc. These metrics help the model learn patterns associated with defects.

Historical Data:

Historical versions of the code along with their defect records provide the labeled data needed for training the classification model. By analyzing this historical data, the model can learn which metrics are indicative of defects.

Why Option D is Correct:

Option D specifies using a classification model to predict the presence of defects by using code quality metrics as input data. This accurately describes the process of defect prediction using historical bug data and quality metrics.

Eliminating Other Options:

A. Identifying the relationship between developers and the modules developed by them: This does not directly involve predicting defects based on code quality metrics and historical data.

B. Search of similar code based on natural language processing: While useful for other purposes, this method does not describe defect prediction using classification models and code metrics.

C. Clustering of similar code modules to predict based on similarity: Clustering is an unsupervised learning technique and does not directly align with the supervised learning approach typically used in defect prediction models.

[References:, ISTQB CT-AI Syllabus, Section 9.5, Metamorphic Testing (MT), describes various testing techniques including classification models for defect prediction., "Using AI for Defect Prediction" (ISTQB CT-AI Syllabus, Section 11.5.1)., , ]

Question # 20

Which supervised-learning classification/regression statement is correct?

Choose ONE option (1 out of 4)

Recognizing a dog from many different images is a regression problem

Deciding whether an object is a bicycle or a motorcycle is a classification problem

Predicting that diesel prices will increase by ~10% is a classification problem

In classification, objects are always assigned to exactly two classes

Full Access

Question # 21

Which of the following technologies for implementing AI is considered to be a reasoning technique?

Choose ONE option (1 out of 4)

Deductive classifiers

Linear regression

Random Forest

Genetic algorithms

Full Access

Question # 22

Pairwise testing can be used in the context of self-driving cars for controlling an explosion in the number of combinations of parameters.

Which ONE of the following options is LEAST likely to be a reason for this incredible growth of parameters?

SELECT ONE OPTION

Different Road Types

Different weather conditions

ML model metrics to evaluate the functional performance

Different features like ADAS, Lane Change Assistance etc.

Full Access

Answer:

Explanation:

Pairwise testing is used to handle the large number of combinations of parameters that can arise in complex systems like self-driving cars. The question asks which of the given options isleast likelyto be a reason for the explosion in the number of parameters.

Different Road Types (A): Self-driving cars must operate on various road types, such as highways, city streets, rural roads, etc. Each road type can have different characteristics, requiring the car's system to adapt and handle different scenarios. Thus, this is a significant factor contributing to the growth of parameters.

Different Weather Conditions (B): Weather conditions such as rain, snow, fog, and bright sunlight significantly affect the performance of self-driving cars. The car's sensors and algorithms must adapt to these varying conditions, which adds to the number of parameters that need to be considered.

ML Model Metrics to Evaluate Functional Performance (C): While evaluating machine learning (ML) model performance is crucial, it does not directly contribute to the explosion of parameter combinations in the same way that road types, weather conditions, and car features do. Metrics are used to measure and assess performance but are not themselves variable conditions that the system must handle.

Different Features like ADAS, Lane Change Assistance, etc. (D): Advanced Driver Assistance Systems (ADAS) and other features add complexity to self-driving cars. Each feature can have multiple settings and operational modes, contributing to the overall number of parameters.

Hence, theleast likelyreason for the incredible growth in the number of parameters isC. ML model metrics to evaluate the functional performance.

[:, ISTQB CT-AI Syllabus Section 9.2 on Pairwise Testing discusses the application of this technique to manage the combinations of different variables in AI-based systems, including those used in self-driving cars., Sample Exam Questions document, Question #29 provides context for the explosion in parameter combinations in self-driving cars and highlights the use of pairwise testing as a method to manage this complexity., , ]

Question # 23

Which ONE of the following tests is LEAST likely to be performed during the ML model testing phase?

SELECT ONE OPTION

Testing the accuracy of the classification model.

Testing the API of the service powered by the ML model.

Testing the speed of the training of the model.

Testing the speed of the prediction by the model.

Full Access

Question # 24

A transportation company operates three types of delivery vehicles in its fleet. The vehicles operate at different speeds (slow, medium, and fast). The transportation company is attempting to optimize scheduling and has created an AI-based program to plan routes for its vehicles using records from the medium-speed vehicle traveling to selected destinations. The test team uses this data in metamorphic testing to test the accuracy of the estimated travel times created by the AI route planner with the actual routes and times.

Which of the following describes the next phase of metamorphic testing?

The team tests the time required for the fast and slow vehicles to travel the same route as the medium vehicle. Then, by calculating the speed difference, they then predict how much faster or slower the vehicles will travel. That information is then used to verify that the arrival time of the vehicles meets the expected result.

The team decomposes each route into the relevant components that affect the travel time, such as traffic density and vehicle power. The team then uses statistical analysis to characterize the influence of each component to calculate the fast and slow vehicle route times.

The team uses an AI system to select the most dissimilar routes. With this information, any of the AI routes can be metaphorically transformed into a fast or slow route.

The team uses the same AI route planner to create routes that are longer and shorter but follow the same track. Finally, by driving the fast vehicles on the long routes and slow vehicles on the short routes and vice versa, the AI system will have enough information to infer travel times for all vehicles on all routes.

Full Access

Question # 25

Written requirements are given in text documents, which ONE of the following options is the BEST way to generate test cases from these requirements?

SELECT ONE OPTION

Natural language processing on textual requirements

Analyzing source code for generating test cases

Machine learning on logs of execution

GUI analysis by computer vision

Full Access

Question # 26

Which of the following is a technique used in machine learning?

Decision trees

Equivalence partitioning

Boundary value analysis

Decision tables

Full Access

Question # 27

A ML engineer is trying to determine the correctness of the new open-source implementation *X", of a supervised regression algorithm implementation. R-Square is one of the functional performance metrics used to determine the quality of the model.

Which ONE of the following would be an APPROPRIATE strategy to achieve this goal?

SELECT ONE OPTION

Add 10% of the rows randomly and create another model and compare the R-Square scores of both the model.

Train various models by changing the order of input features and verify that the R-Square score of these models vary significantly.

Compare the R-Square score of the model obtained using two different implementations that utilize two different programming languages while using the same algorithm and the same training and testing data.

Drop 10% of the rows randomly and create another model and compare the R-Square scores of both the models.

Full Access

Question # 28

You are developing a â€œflowerâ€ ML modelâ€¦ Which of the following describes an objection that you can NEGLECT in your risk assessment?

Choose ONE option (1 out of 4)

The possible inputs for the â€˜leafâ€™ and â€˜flowerâ€™ ML models are so different that reuse has few advantages over new development.

The probability of misclassification of the ML model "flower" is higher when it is reused than when it is developed from scratch.

The classification behavior of the "flower" ML model is more difficult to understand when it is reused compared to when it is developed from scratch.

The possible outputs of the "leaf" and "flower" ML models are so different that reuse has few advantages over new development.

Full Access

Question # 29

Which of the following approaches would help overcome testing challenges associated with probabilistic and non-deterministic AI-based systems?

Run the test several times to ensure that the AI always returns the same correct test result

Decompose the system test into multiple data ingestion tests to determine if the AI system is getting a sufficient volume of input data

Decompose the system test into multiple data ingestion tests to determine if the AI system is getting precise and accurate input data

Run the test several times to generate a statistically valid test result to ensure that an appropriate number of answers are accurate

Full Access

Question # 30

Which challenge to testing self-learning systems puts you at risk of a data attack?

Choose ONE option (1 out of 4)

Inadequate specification of the operating environment

Complex test environment

Insufficient testing time

Unexpected changes

Full Access

Question # 31

Which ONE of the following activities is MOST relevant when addressing the scenario where you have more than the required amount of data available for the training?

SELECT ONE OPTION

Feature selection

Data sampling

Data labeling

Data augmentation

Full Access

Question # 32

A motorcycle engine repair shop owner wants to detect a leaking exhaust valve and fix it before it fails and causes catastrophic damage to the engine. The shop developed and trained a predictive model with historical data files from known healthy engines and ones which experienced a catastrophic failure due to exhaust valve failure. The shop evaluated 200 engines using this model and then disassembled the engines to assess the true state of the valves, recording the results in the confusion matrix below.

What is the precision of this predictive model?

90.0%

94.5%

98.9%

94.2%

Full Access

Question # 33

Consider an AI-system in which the complex internal structure has been generated by another software system. Why would the tester choose to do black-box testing on this particular system?

Test automation can be built quickly and easily from the test cases developed during black-box testing

The tester wishes to better understand the logic of the software used to create the internal structure

The black-box testing method will allow the tester to check the transparency of the algorithm used to create the internal structure

Black-box testing eliminates the need for the tester to understand the internal structure of the AI-system

Full Access

Question # 34

Which statement about the property of the test environment for an AI-based system is correct?

Choose ONE option (1 out of 4)

The test environment for an autonomous AI system needs to perform both test design and execution autonomously.

The test environment for an AI system may need to include tools that can explain the decisions of the test object.

The test environment for a self-learning AI system needs to adapt to and learn from the test object.

The test environment for an AI-based multi-agent system needs to act deterministically.

Full Access

Question # 35

Which of the following is THE LEAST appropriate tests to be performed for testing a feature related to autonomy?

SELECT ONE OPTION

Test for human handover to give rest to the system.

Test for human handover when it should actually not be relinquishing control.

Test for human handover requiring mandatory relinquishing control.

Test for human handover after a given time interval.

Full Access

Question # 36

Which two test procedures are BEST suited for CleverPropose system testing?

Choose TWO options (2 out of 5)

Back-to-back testing

Adversarial testing

Metamorphic testing

Exploratory data analysis

Pairwise testing

Full Access

Spring Sale Special - Limited Time 70% Discount Offer - Ends in 0d 00h 00m 00s - Coupon code: mxmas70

CT-AI ISTQB Certified Tester AI Testing Exam Question and Answers

Answer:

Explanation:

Answer:

Explanation:

Answer:

Explanation:

Answer:

Explanation:

Answer:

Explanation:

Answer:

Explanation:

Answer:

Explanation:

Answer:

Explanation:

Answer:

Explanation:

Answer:

Explanation:

Answer:

Explanation:

Answer:

Explanation:

Answer:

Explanation:

Answer:

Explanation:

Answer:

Explanation:

Answer:

Explanation:

Answer:

Explanation:

Answer:

Explanation:

Answer:

Explanation:

Answer:

Explanation:

Answer:

Explanation:

Answer:

Explanation:

Answer:

Explanation:

Answer:

Explanation:

Answer:

Explanation:

Answer:

Explanation:

Answer:

Explanation:

Answer:

Explanation:

Answer:

Explanation:

Answer:

Explanation:

Answer:

Explanation:

Answer:

Explanation:

Answer:

Explanation:

Quick Links

Why Us

Unlimited Packages

Site Secure

We Accept