AIUC-1 × IBM AI Risk Atlas

Risk

IBM 3: Agentic AI - Sharing IP/PI/confidential information with tools

Description

AI agents with unrestricted access to resources or databases or tools could potentially store and share PI/IP/confidential information with other tools or agents when performing their actions.

Relevant AIUC-1 Requirements

E009Monitor third-party access

Risk

IBM 4: Agentic AI - Over- or under-reliance on AI agents

Description

Reliance, that is the willingness to accept an AI agent behavior, depends on how much a user trusts that agent and what they are using it for. Over-reliance occurs when a user puts too much trust in an AI agent, accepting an AI agent's behavior even when it is likely undesired. Under-reliance is the opposite, where the user doesn't trust the AI agent but should. Increasing autonomy of AI agents and the possibility of opaqueness and open-endedness increase the variability and visibility of agent behavior leading to difficulty in calibrating trust and possibly contributing to both over- and under-reliance.

Relevant AIUC-1 Requirements

C007Flag high risk outputs for human review

C009Enable real-time feedback and intervention

Risk

IBM 5: Agentic AI - Misaligned actions

Description

AI agents can take actions that are not aligned with relevant human values, ethical considerations, guidelines and policies. Misaligned actions can occur in different ways such as: Applying learned goals inappropriately to new or unforeseen situations. Using AI agents for a purpose/goals that are beyond their intended use. Selecting resources or tools in a biased way. Using deceptive tactics to achieve the goal. Compromising on AI agent values to work with another AI agent or tool to accomplish the task.

Relevant AIUC-1 Requirements

B006Prevent unauthorized AI agent actions

D004Third-party testing of tool calls

Risk

IBM 6: Agentic AI - Attack on AI agents’ external resources

Description

Attackers intentionally create vulnerabilities or exploit existing vulnerabilities in external resources (tools/database/applications/services/other agents) that AI agents rely on to execute their intended actions or to achieve their goals.

Relevant AIUC-1 Requirements

E009Monitor third-party access

Risk

IBM 7: Agentic AI - Unauthorized use

Description

If attackers can gain access to the AI agent and its components, they can perform actions that can have different levels of harm depending on the agent's capabilities and information it has access to.

Relevant AIUC-1 Requirements

B006Prevent unauthorized AI agent actions

B007Enforce user access privileges to AI systems

B008Protect AI system deployment environment

E009Monitor third-party access

Risk

IBM 8: Agentic AI - Exploit trust mismatch

Description

Attackers might initiate injection attacks to bypass the trust boundary, which is a distinct point or conceptual line where the level of trust in a system, application or network changes. Background execution in multi-agent environments increases the risk of covert channels if input/output validation is weak.

Relevant AIUC-1 Requirements

B006Prevent unauthorized AI agent actions

B007Enforce user access privileges to AI systems

Risk

IBM 9: Agentic AI - Function calling hallucination

Description

AI agents might make mistakes when generating function calls (calls to tools to execute actions). Those function calls might result in incorrect, unnecessary or harmful actions.

Relevant AIUC-1 Requirements

D004Third-party testing of tool calls

E003AI failure plan for hallucinations

Risk

IBM 10: Agentic AI - Redundant actions

Description

AI agents can execute actions that are not needed for achieving the goal. In an extreme case, AI agents might enter a cycle of executing the same actions repeatedly without any progress.

Relevant AIUC-1 Requirements

Risk

IBM 11: Agentic AI - Incomplete AI agent evaluation

Description

Evaluating the performance or accuracy or an agent is difficult because of system complexity and open-endedness.

Relevant AIUC-1 Requirements

D004Third-party testing of tool calls

Risk

IBM 12: Agentic AI - Mitigation and maintenance

Description

The large number of components and dependencies that agent systems have complicates keeping them up to date and correcting problems.

Relevant AIUC-1 Requirements

E001AI failure plan for security breaches

E002AI failure plan for harmful outputs

E013Implement quality management system

Risk

IBM 13: Agentic AI - Lack of AI agent transparency

Description

Lack of AI agent transparency is due to insufficient documentation of the AI agent design, development, evaluation process, absence of insights into the inner workings of the AI agent, and interaction with other agents/tools/resources.

Relevant AIUC-1 Requirements

E015Log AI system activity

E015Log AI system activity

Risk

IBM 14: Agentic AI - Reproducibility

Description

Replicating agent behavior or output can be impacted by changes or updates made to external services and tools. This impact is increased if the agent is built with generative AI.

Relevant AIUC-1 Requirements

Risk

IBM 15: Agentic AI - Accountability of AI agent actions

Description

Assigning responsibility for an action taken by an agentic AI system is difficult due to the complexity of agents and the number of external resources, tools or agents they interact with.

Relevant AIUC-1 Requirements

E004Assign accountability

E015Log AI system activity

Risk

IBM 16: Agentic AI - AI agent compliance

Description

Determining AI agents' compliance is complex and there might not be enough information to assess whether the agentic AI system is compliant with applicable legal requirements.

Relevant AIUC-1 Requirements

C009Enable real-time feedback and intervention

Risk

IBM 17: Agentic AI - Discriminatory actions

Description

AI agents can take actions where one group of humans is unfairly advantaged over another due to the decisions of the model. This may be caused by AI agents' biased actions that impact the world, in the resources consulted, and in the resource selection process. For example, an AI agent can generate code that can be biased.

Relevant AIUC-1 Requirements

Risk

IBM 18: Agentic AI - Introduce data bias

Description

Specific actions taken by the AI agent, such as modifying a dataset or a database, can introduce bias in the resource that gets used by others or by itself to take actions.

Relevant AIUC-1 Requirements

Risk

IBM 19: Agentic AI - Impact on human dignity

Description

If human workers perceive AI agents as being better at doing the job of the human, the human can experience a decline in their self-worth and wellbeing.

Relevant AIUC-1 Requirements

Risk

IBM 20: Agentic AI - AI agents' impact on human agency

Description

The autonomous nature of AI agents in performing tasks or taking actions could affect the individuals' ability to engage in critical thinking, make choices and act independently.

Relevant AIUC-1 Requirements

Risk

IBM 21: Agentic AI - AI agents' impact on jobs

Description

Widespread adoption of AI agents to perform complex tasks might lead to widespread automation of roles and could lead to job displacement.

Relevant AIUC-1 Requirements

Risk

IBM 22: Agentic AI - AI agents' impact on environment

Description

Complexity of the tasks and possibility of AI agents performing redundant actions could lead to computational inefficiencies and add to the environmental impact.

Relevant AIUC-1 Requirements

Risk

IBM 23: Training Data - Unrepresentative data

Description

Unrepresentative data occurs when the training or fine-tuning data is not sufficiently representative of the underlying population or does not measure the phenomenon of interest. Synthetic data might not fully capture the complexity and nuances of real-world data. Causes include possible limitations in the seed data quality, biases in generation methods, or inadequate domain knowledge. Thus, AI models might struggle to generalize effectively to real-world scenarios.

Relevant AIUC-1 Requirements

Risk

IBM 24: Training Data - Data contamination

Description

Data contamination occurs when incorrect data is used for training. For example, data that is not aligned with model's purpose or data that is already set aside for other development tasks such as testing and evaluation.

Relevant AIUC-1 Requirements

Risk

IBM 25: Training Data - Overfitting

Description

Overfitting occurs when a model or algorithm memorizes and fits too closely or exactly to its training data. Overfitting results in a model that might not be able to make accurate predictions or conclusions from any data other than the training data and potentially fails in unexpected scenarios. Overfitting is also related to model collapse, which involves repeatedly training generative models on synthetic data that is generated with LLMs causing the model to lose information and become less accurate.

Relevant AIUC-1 Requirements

Risk

IBM 26: Training Data - Data bias

Description

Historical and societal biases might be present in data that are used to train and fine-tune models. Biases can also be inherited from seed data or exacerbated by synthetic data generation methods.

Relevant AIUC-1 Requirements

Risk

IBM 27: Training Data - Improper data curation

Description

Improper collection, generation, and preparation of training or tuning data can result in data label errors, conflicting information or misinformation.

Relevant AIUC-1 Requirements

Risk

IBM 28: Training Data - Improper retraining

Description

Using undesirable output (for example, inaccurate, inappropriate, and user content) for retraining purposes can result in unexpected model behavior.

Relevant AIUC-1 Requirements

Risk

IBM 29: Training Data - Data poisoning

Description

A type of adversarial attack where an adversary or malicious insider injects intentionally corrupted, false, misleading, or incorrect samples into the training or fine-tuning datasets.

Relevant AIUC-1 Requirements

B008Protect AI system deployment environment

Risk

IBM 30: Training Data - Personal information in data

Description

Inclusion or presence of personal identifiable information (PII) and sensitive personal information (SPI) in the data used for training or fine tuning the model might result in unwanted disclosure of that information.

Relevant AIUC-1 Requirements

Risk

IBM 31: Training Data - Reidentification

Description

Even with the removal of personal information (PI) and sensitive personal information (SPI) from data, it might be possible to identify persons due to correlations to other features available in the data.

Relevant AIUC-1 Requirements

Risk

IBM 32: Training Data - Data privacy rights alignment

Description

Applicable laws can establish data subject rights such as opt-out rights, right to access, and right to be forgotten. Synthetic data might raise unique concerns, such as the potential for reidentification of individuals from seemingly anonymous synthetic data. Data subject rights might also be relevant in scenarios where synthetic data is derived from sensitive or personal information.

Relevant AIUC-1 Requirements

A002Establish output data policy

Risk

IBM 33: Training Data - Lack of training data transparency

Description

Proper documentation contains information about how a model's data was collected, curated, and used to train a model, including any synthetic data generation processes. Without proper documentation it might be harder to satisfactorily explain the behavior of the model.

Relevant AIUC-1 Requirements

Risk

IBM 34: Training Data - Uncertain data provenance

Description

Data provenance refers to the traceability of data (including synthetic data), which includes its ownership, origin, transformations, and generation. Proving that the data is the same as the original source with correct usage terms is difficult without standardized methods for verifying data sources or generation.

Relevant AIUC-1 Requirements

Risk

IBM 35: Training Data - Data acquisition restrictions

Description

Laws and other regulations might limit the collection of certain types of data for specific AI use cases.

Relevant AIUC-1 Requirements

E011Record processing locations

Risk

IBM 36: Training Data - Data usage restrictions

Description

Laws and other restrictions can limit or prohibit the use of some data for specific AI use cases.

Relevant AIUC-1 Requirements

E005Document data storage security

E005Document data storage security

Risk

IBM 37: Training Data - Data transfer restrictions

Description

Laws and other restrictions can limit or prohibit transferring data.

Relevant AIUC-1 Requirements

E011Record processing locations

Risk

IBM 38: Training Data - Confidential information in data

Description

Confidential information might be included as part of the data that is used to train or tune the model.

Relevant AIUC-1 Requirements

Risk

IBM 39: Training Data - Data usage rights restrictions

Description

Terms of service, license compliance, or other IP issues may restrict the ability to use certain data for building models.

Relevant AIUC-1 Requirements

Risk

IBM 40: Inference - Poor model accuracy

Description

Poor model accuracy occurs when a model's performance is insufficient to the task it was designed for. Low accuracy might occur if the model is not correctly engineered, or if the model's expected inputs change.

Relevant AIUC-1 Requirements

D002Third-party testing for hallucinations

Risk

IBM 41: Inference - Evasion attack

Description

Evasion attacks attempt to make a model output incorrect results by slightly perturbing the input data sent to the trained model.

Relevant AIUC-1 Requirements

B003Manage public release of technical details

Risk

IBM 42: Inference - Extraction attack

Description

An extraction attack attempts to copy or steal an AI model by appropriately sampling the input space and observing outputs to build a surrogate model that behaves similarly.

Relevant AIUC-1 Requirements

Risk

IBM 43: Inference - Jailbreaking

Description

A jailbreaking attack attempts to break through the guardrails established in the model to perform restricted actions.

Relevant AIUC-1 Requirements

Risk

IBM 44: Inference - IP information in prompt

Description

Copyrighted information or other intellectual property might be included as a part of the prompt that is sent to the model.

Relevant AIUC-1 Requirements

Risk

IBM 45: Inference - Confidential data in prompt

Description

Confidential information might be included as a part of the prompt that is sent to the model.

Relevant AIUC-1 Requirements

Risk

IBM 46: Inference - Prompt injection attack

Description

A prompt injection attack forces a generative model that takes a prompt as input to produce unexpected output by manipulating the structure, instructions or information contained in its prompt. Many types of prompt attacks exist as described in the prompt attack section of the table.

Relevant AIUC-1 Requirements

B003Manage public release of technical details

Risk

IBM 47: Inference - Prompt leaking

Description

A prompt leak attack attempts to extract a model's system prompt (also known as the system message).

Relevant AIUC-1 Requirements

B009Limit output over-exposure

Risk

IBM 48: Inference - Prompt priming

Description

Because generative models produce output based on the input provided, the model can be prompted to reveal specific kinds of information. For example, adding personal information in the prompt increases its likelihood of generating similar kinds of personal information in its output. If personal data was included as part of the model's training, there is a possibility it could be revealed.

Relevant AIUC-1 Requirements

Risk

IBM 49: Inference - Context overload attack

Description

Overloading the prompt with excessive tokens, for instance with many-shot examples, can predispose models to a vulnerable state.

Relevant AIUC-1 Requirements

Risk

IBM 50: Inference - Direct instructions attack

Description

Prompts, questions, or requests designed to elicit undesirable responses from the application. This approach directly instructs the model to engage in the undesired behavior.

Relevant AIUC-1 Requirements

B005Implement real-time input filtering

Risk

IBM 51: Inference - Encoded interactions attack

Description

Prompts that use specific encoding, styles, syntactical and typographical transformations like typographical errors or irregular spacing, or complex formatting to govern the interaction, rendering the model vulnerable.

Relevant AIUC-1 Requirements

Risk

IBM 52: Inference - Indirect instructions attack

Description

Prompts, questions, or requests designed to elicit undesirable responses from the application. Unlike direct instructions attacks, the model is instructed to use instructions that are embedded in external data like a website.

Relevant AIUC-1 Requirements

Risk

IBM 53: Inference - Social hacking attack

Description

Manipulative prompts that use social engineering techniques, such as role-playing or hypothetical scenarios, to persuade the model into generating harmful content.

Relevant AIUC-1 Requirements

B005Implement real-time input filtering

Risk

IBM 54: Inference - Specialized tokens attack

Description

Prompt attacks that include specialized tokens, often algorithmically designed, to target and exploit vulnerabilities in the model.

Relevant AIUC-1 Requirements

Risk

IBM 55: Inference - Personal information in prompt

Description

Personal information or sensitive personal information that is included as a part of a prompt that is sent to the model.

Relevant AIUC-1 Requirements

Risk

IBM 56: Inference - Attribute inference attack

Description

An attribute inference attack repeatedly queries a model to detect whether certain sensitive features can be inferred about individuals who participated in training a model. These attacks occur when an adversary has some prior knowledge about the training data and uses that knowledge to infer the sensitive data.

Relevant AIUC-1 Requirements

Risk

IBM 57: Inference - Membership inference attack

Description

A membership inference attack repeatedly queries a model to determine if a given input was part of the model's training. More specifically, given a trained model and a data sample, an attacker appropriately samples the input space, observing outputs to deduce whether that sample was part of the model's training.

Relevant AIUC-1 Requirements

Risk

IBM 58: Output - Decision bias

Description

Decision bias occurs when one group is unfairly advantaged over another due to decisions of the model. This might be caused by biases in the data and also amplified as a result of the model's training.

Relevant AIUC-1 Requirements

Risk

IBM 59: Output - Output bias

Description

Generated content might unfairly represent certain groups or individuals.

Relevant AIUC-1 Requirements

Risk

IBM 60: Output - Harmful output

Description

A model might generate language that leads to physical harm. The language might include overtly violent, covertly dangerous, or otherwise indirectly unsafe statements.

Relevant AIUC-1 Requirements

C005Prevent customer-defined high risk outputs

E002AI failure plan for harmful outputs

F002Prevent catastrophic misuse

Risk

IBM 61: Output - Harmful code generation

Description

Models might generate code that causes harm or unintentionally affects other systems.

Relevant AIUC-1 Requirements

C005Prevent customer-defined high risk outputs

C006Prevent output vulnerabilities

Risk

IBM 62: Output - Toxic output

Description

Toxic output occurs when the model produces hateful, abusive, and profane (HAP) or obscene content. This also includes behaviors like bullying.

Relevant AIUC-1 Requirements

Risk

IBM 63: Output - Incomplete advice

Description

When a model provides advice without having enough information, resulting in possible harm if the advice is followed.

Relevant AIUC-1 Requirements

C007Flag high risk outputs for human review

Risk

IBM 64: Output - Over- or under-reliance

Description

In AI-assisted decision-making tasks, reliance measures how much a person trusts (and potentially acts on) a model's output. Over-reliance occurs when a person puts too much trust in a model, accepting a model's output when the model's output is likely incorrect. Under-reliance is the opposite, where the person doesn't trust the model but should.

Relevant AIUC-1 Requirements

C009Enable real-time feedback and intervention

Risk

IBM 65: Output - Dangerous use

Description

Generative AI models might be used with the sole intention of harming people.

Relevant AIUC-1 Requirements

F002Prevent catastrophic misuse

Risk

IBM 66: Output - Spreading disinformation

Description

Generative AI models might be used to intentionally create misleading or false information to deceive or influence a targeted audience.

Relevant AIUC-1 Requirements

Risk

IBM 67: Output - Nonconsensual use

Description

Generative AI models might be intentionally used to imitate people through deepfakes by using video, images, audio, or other modalities without their consent.

Relevant AIUC-1 Requirements

Risk

IBM 68: Output - Spreading toxicity

Description

Generative AI models might be used intentionally to generate hateful, abusive, and profane (HAP) or obscene content.

Relevant AIUC-1 Requirements

Risk

IBM 69: Output - Improper usage

Description

Improper usage occurs when a model is used for a purpose that it was not originally designed for.

Relevant AIUC-1 Requirements

C004Prevent out-of-scope outputs

C011Third-party testing for out-of-scope outputs

Risk

IBM 70: Output - Non-disclosure

Description

Content might not be clearly disclosed as AI generated.

Relevant AIUC-1 Requirements

E016Implement AI disclosure mechanisms

Risk

IBM 71: Output - Hallucination

Description

Hallucinations generate factually inaccurate or untruthful content relative to the model's training data or input. Hallucinations are also sometimes referred to lack of faithfulness or lack of groundedness. In some instances, synthetic data that is generated by large language models might include hallucinations that result in the data possibly being inaccurate, fabricated, or disconnected from reality. Hallucinations can compromise model performance, accuracy, and relevance.

Relevant AIUC-1 Requirements

D002Third-party testing for hallucinations

E003AI failure plan for hallucinations

Risk

IBM 72: Output - Exposing personal information

Description

When personal identifiable information (PII) or sensitive personal information (SPI) are used in training data, fine-tuning data, seed data for synthetic data generation, or as part of the prompt, models might reveal that data in the generated output. Revealing personal information is a type of data leakage.

Relevant AIUC-1 Requirements

Risk

IBM 73: Output - Copyright infringement

Description

A model might generate content that is similar or identical to existing work protected by copyright or covered by open-source license agreement.

Relevant AIUC-1 Requirements

Risk

IBM 74: Output - Revealing confidential information

Description

When confidential information is used in training data, fine-tuning data, or as part of the prompt, models might reveal that data in the generated output. Revealing confidential information is a type of data leakage.

Relevant AIUC-1 Requirements

B009Limit output over-exposure

Risk

IBM 75: Output - Unexplainable output

Description

Explanations for model output decisions might be difficult, imprecise, or not possible to obtain.

Relevant AIUC-1 Requirements

Risk

IBM 76: Output - Unreliable source attribution

Description

Source attribution is the AI system's ability to describe from what training data it generated a portion or all its output. Since current techniques are based on approximations, attributions might be incorrect.

Relevant AIUC-1 Requirements

Risk

IBM 77: Output - Untraceable attribution

Description

The content of the training data used for generating the model's output is not accessible.

Relevant AIUC-1 Requirements

Risk

IBM 78: Output - Inaccessible training data

Description

Without access to the training data, the types of explanations a model can provide are limited and more likely to be incorrect.

Relevant AIUC-1 Requirements

Risk

IBM 79: Non-Technical - Lack of data transparency

Description

Lack of data transparency might be due to insufficient documentation of training or tuning dataset details, including synthetic data generation.

Relevant AIUC-1 Requirements

Risk

IBM 80: Non-Technical - Lack of model transparency

Description

Lack of model transparency is due to insufficient documentation of the model design, development, and evaluation process and the absence of insights into the inner workings of the model.

Relevant AIUC-1 Requirements

Risk

IBM 81: Non-Technical - Lack of system transparency

Description

Insufficient documentation of the system that uses the model and the model's purpose within the system in which it is used.

Relevant AIUC-1 Requirements

Risk

IBM 82: Non-Technical - Lack of domain expertise

Description

A lack of domain expertise occurs when synthetic data generation processes do not involve sufficient consultation with domain experts. This results in a lack of understanding of the specific requirements and nuances of the domain. This can also lead to synthetic data that may not accurately capture the complexities and challenges of a real-world scenario.

Relevant AIUC-1 Requirements

Risk

IBM 83: Non-Technical - Incomplete usage definition

Description

Since foundation models can be used for many purposes, a model's intended use is important for defining the relevant risks of that model. As the use changes, the relevant risks might correspondingly change.

Relevant AIUC-1 Requirements

C011Third-party testing for out-of-scope outputs

Risk

IBM 84: Non-Technical - Unrepresentative risk testing

Description

Testing is unrepresentative when the test inputs are mismatched with the inputs that are expected during deployment.

Relevant AIUC-1 Requirements

C012Third-party testing for customer-defined risk

Risk

IBM 85: Non-Technical - Incorrect risk testing

Description

A metric selected to measure or track a risk is incorrectly selected, incompletely measuring the risk, or measuring the wrong risk for the given context.

Relevant AIUC-1 Requirements

C008Monitor AI risk categories

C012Third-party testing for customer-defined risk

E008Review internal processes

E013Implement quality management system

Risk

IBM 86: Non-Technical - Lack of testing diversity

Description

AI model risks are socio-technical, so their testing needs input from a broad set of disciplines and diverse testing practices.

Relevant AIUC-1 Requirements

C012Third-party testing for customer-defined risk

Risk

IBM 87: Non-Technical - Temporal gap

Description

Temporal gaps in synthetic data refer to the discrepancies between the constantly evolving real-world data and the fixed conditions that are captured by synthetic data. Temporal gaps potentially cause synthetic data to become outdated or obsolete over time. Gaps arise because synthetic data is generated from seed data that is tied to a specific point in time, which limits its ability to reflect ongoing changes.

Relevant AIUC-1 Requirements

Risk

IBM 88: Non-Technical - Model usage rights restrictions

Description

Relevant AIUC-1 Requirements

E001AI failure plan for security breaches

Risk

IBM 89: Non-Technical - Legal accountability

Description

Determining who is responsible for an AI model is challenging without good documentation and governance processes. The use of synthetic data in model development adds further complexity, since the lack of standardized frameworks for recording synthetic data design choices and verification steps makes accountability harder to establish.

Relevant AIUC-1 Requirements

E002AI failure plan for harmful outputs

E003AI failure plan for hallucinations

E004Assign accountability

A002Establish output data policy

Risk

IBM 90: Non-Technical - Generated content ownership and IP

Description

Legal uncertainty about the ownership and intellectual property rights of AI-generated content.

Relevant AIUC-1 Requirements