PLOT4AI - Library

Library

PLOT4AI is a library (currently) containing 138 threats related to AI/ML. The threats have been classified in 8 different categories.

In case you are new to PLOT4AI, read the HOW TO first.

Plot4ai - Phases and Categories

Threat Modeling Categories

Click on a category to filter the cards below.

All
Data & Data Governance
Transparency & Accessibility
Privacy & Data Protection
Cybersecurity
Safety & Environmental Impact
Bias, Fairness & Discrimination
Ethics & Human Rights
Accountability & Human Oversight

Click on a card to view the contents; Then click on card flip icon to flip it and see the backside of the card
(Please note: these cards can also be downloaded as CSV or PDF for offline usage)

Is our data complete, up-to-date, and trustworthy?

Can we prevent target leakage?

Can we prevent concept and data drift?

Can the AI model maintain continuous access to data sources after deployment?

Can we process new or updated data from external sources without delay?

Are all required data sources legitimate, authorized, and verified?

Can we obtain the data needed to develop or fine-tune the AI model?

Can we trace the provenance and lineage of the data used to train or fine-tune the AI model?

Could our dataset have copyright or other legal restrictions?

Can we detect and prevent data tampering across the AI lifecycle?

Does the AI system need to be explainable for users or affected persons?

Is our AI system inclusive and accessible?

If users’ consent is required, is the necessary information provided in a clear and accessible way?

Could the user perceive the message from the AI system in a different way than intended?

Is the AI system easy for users to learn and operate?

Are users clearly made aware that they are interacting with an AI system or consuming AI-generated content?

Are users informed about the AI system's reliability, limitations, and risks in a way that enables safe and effective use?

Can the training data be linked to individuals?

Could the AI system infer and reveal information that a person has not explicitly shared?

Could geolocation restrictions or regional regulations impact the implementation of our AI system in other countries?

Can we minimize the amount of personal data used while preserving model performance?

Are we processing special categories of personal data or sensitive data?

Could the AI system make decisions with legal or similarly significant effects without human intervention?

Do we have a valid legal basis for processing personal data?

Could we be using personal data for purposes different from those for which it was originally collected?

Are we able to comply with all the applicable GDPR data subjects’ rights?

Could we be deploying the AI system without conducting a required Data Protection Impact Assessment (DPIA)?

Are we using third-party providers while processing data from children or other vulnerable individuals?

Are we using metadata that could reveal personal data or behavior patterns?

Could we compromise users’ rights to privacy and to a private and family life?

Are we providing sufficient transparency about how the AI model collects, processes, and uses personal data?

Are we logging or storing user input data in ways that may violate privacy?

Could the AI system produce inaccurate or misleading outputs that result in privacy violations or harm?

Are we transferring personal data to countries that lack adequate privacy protections?

Can we comply with the storage limitation principle and international data retention regulations?

Could we be deploying the AI system without testing for adversarial robustness and systemic vulnerabilities?

Are our AI inference APIs and function-calling interfaces securely implemented?

Are training data, model output, and other sensitive AI assets securely stored?

If the AI system uses randomness, is the source of randomness properly protected?

Is the AI model suited for processing confidential information?

Have we implemented safeguards to detect and prevent insider threats to our AI systems?

Have we protected our AI system against model sabotage?

Is our AI model resilient to evasion attacks?

Are we protected from poisoning attacks?

Are we protected from model inversion attacks?

Are we protected from membership inference attacks?

Are we protected from model stealing attacks?

Are we protected from reprogramming deep neural nets attacks?

Are we protected from adversarial examples?

Could third-party AI/ML providers compromise our training data or insert backdoors?

Could the AI system be vulnerable to jailbreak techniques, allowing attackers to bypass safety restrictions?

Could the AI system be vulnerable to prompt injection attacks, leading to unauthorized access or manipulation?

Is the AI training environment secured against unauthorized access and manipulation?

Is the deployed AI system protected from unauthorized access and misuse?

Could third-party tools, plugins, or dependencies introduce vulnerabilities in our AI system?

Could the AI system generate or execute unsafe SQL queries from user input?

Could the AI system generate or execute unsafe code based on user input?

Could autonomous AI agents access or interact with malicious web content?

Could agent memory be poisoned with malicious or misleading information?

Could agents misuse tools or APIs they are authorized to access?

Could hallucinated output from one agent propagate and mislead others in multi-agent systems?

Can we trace and audit the actions and decisions of autonomous agents in our system?

Could a compromised or malicious agent sabotage a multi-agent system?

Could an agent gain access to functions or data beyond its intended permissions?

Could an attacker or user intentionally overload the AI system’s resources to degrade performance or cause failures?

Could an attacker or agent impersonate a user or AI identity to gain unauthorized influence?

Could an agent be misused to manipulate or deceive users?

Could an attacker intercept or manipulate communications between agents to alter system behavior?

Could unsafe file uploads introduce security risks?

Could unsafe deserialization of model artifacts lead to code execution or system compromise?

Could malicious fine-tuning compromise the safety or alignment of our GenAI model?

Are we protected from vulnerabilities in vector databases and RAG pipelines?

Could failures in real-time data collection channels disrupt model performance?

Could AI-generated hallucinations lead to misinformation or decision-making risks?

Could the lack of interpretability in our AI models compromise safety?

Can human over-reliance on automated systems lead to failures during emergencies?

Could performance or reliability issues emerge when scaling the AI system across environments?

In case of system failure, could users be adversely impacted?

Is our AI model robust and suitable for its intended use across different deployment contexts?

Could the AI system's performance on benchmarks be misleading or fail to reflect real-world risks?

Could the AI system become persuasive causing harm to users?

Could our AI agents hack their reward functions to exploit the system?

Could the AI system expose children to harmful, inappropriate, or unsafe content or interactions?

Could the AI system be misused for malicious purposes such as disinformation, cyberattacks or warfare?

Could the AI system accelerate the development of bioweapons or other CBRNE threats?

Could the AI system generate or disseminate deepfakes or synthetic media that mislead users, impersonate individuals, or cause harm?

Could the AI system generate toxic or harmful content?

Could the AI system deliberately mislead users or hide its capabilities during deployment or evaluation?

Could AI decisions result in physical damage, infrastructure failure, or major financial losses?

Do we monitor how version updates from third-party GenAI models can affect our system's behaviour?

Could the development of autonomous AI agents lead to loss of control, concentration of power or rogue behavior?

Could environmental phenomena or natural disasters compromise our AI system?

Could AI agents take actions that unintentionally harm users, the environment or themselves during learning or deployment?

Does training and deploying our AI system generate high CO2 emissions?

Could unsustainable data center cooling practices increase the environmental impact of our AI system?

Is the production of our AI hardware exploiting limited material resources?

Are we assessing our AI system’s environmental impact across its entire life cycle?

Is the dataset representative of the different real-world groups, populations and environments?

Could the AI system incorrectly attribute actions to individuals or groups?

Could certain groups be disproportionately affected by the outcomes of the AI system?

Could our AI system reinforce systemic inequalities?

Could our AI system oversimplify real-world problems?

Could our AI system accurately capture the factors it's designed to measure?

Could the AI system reinforce historical inequalities embedded in the data?

Can data be labeled consistently?

Could the system be using proxy variables that reflect sensitive attributes or lead to indirect discrimination?

Could the AI system’s design choices lead to unfair outcomes?

Could we over-rely on early evaluation results or AI-generated outputs?

Could popularity bias reduce diversity in system's recommendations?

Is the AI system designed to support multiple viewpoints and narratives?

Could our AI system contribute to social division or rivalry?

Could our AI system automatically label or categorize people?

Could the AI system affect employment conditions, labor rights, or job opportunities?

Could our AI system fail to uphold and respect human dignity?

Could the AI system affect democracy or have an adverse impact on society at large?

Do we offer users and accessible way to contest AI decisions or seek redress?

Could the system have an impact on decisions that affect life, health, or personal safety?

Could the AI system limit, suppress or distort users’ freedom of expression?

Could our AI system affect access to services such as healthcare, housing, insurance, benefits or education?

Could the AI system interfere with users’ autonomy influencing their decision-making process?

Could the AI system promote certain values or beliefs on users?

Could the AI system negatively impact vulnerable groups or fail to protect their rights?

Could the AI system fail to uphold the rights and best interests of children?

Is the development and use of the AI system proportionate to its intended purpose and impact on rights?

Does the AI system use behavioral data in ways that may raise ethical, privacy, or human rights concerns?

Is the AI system's task clearly defined, with well-scoped objectives and boundaries?

Have we identified and involved all key stakeholders relevant to this phase of the AI lifecycle?

Have all relevant staff and users received adequate training to understand, oversee, and responsibly interact with the AI system?

Do we have qualified people available to supervise the behavior of AI agents and provide feedback during learning?

Do we have the resources and processes to effectively oversee AI decision-making?

Is there a well-defined process to escalate AI-related failures or unexpected outcomes?

Have we defined who is accountable for the AI system’s decisions and outcomes?

Do we regularly review whether the AI system’s goals, assumptions, and impacts are still appropriate?

Can human operators safely interrupt or override the AI system at any time?

Could users contest or challenge the decisions made by the AI system?

Have we assessed our legal liability for damages caused by our AI system?

Do we have adequate resources and MLOps practices in place to manage, monitor, and maintain our AI system?

If we plan to deploy a third-party AI tool, have we assessed our shared responsibility for its potential impact on users?

License

PLOT4AI by Isabel Barberá is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.

References

I have created an overview of all the sources I have used for the creation of this library, which can be found under References