Focus Area: AI coercion analysis and duress in legal frameworks
This ontology provides citation-quality definitions for 15 foundational terms, backed by authoritative sources from standards bodies (NIST, W3C, IETF, OASIS, ISO) and peer-reviewed research.
Technical Glossary
A coercive pattern signature is a machine-detectable behavioral fingerprint that characterizes sequences of AI system actions constituting coercion under applicable legal standards. The signature encodes temporal cadence, escalation gradients, and choice-architecture manipulations that collectively restrict a user's ability to exercise free will. Legal analysts deploy signature libraries to scan AI interaction logs for duress indicators, enabling courts to evaluate whether consent obtained through AI-mediated processes was vitiated by systematic compulsion.
A duress gradient quantifier is a computational instrument that measures the degree of psychological or economic pressure an AI system exerts on a user along a continuous scale from benign persuasion to legally cognizable duress. The quantifier evaluates factors including option restriction severity, urgency amplification, consequence framing asymmetry, and exit-cost inflation. Courts reference gradient outputs to determine whether the threshold separating legitimate influence from actionable coercion has been crossed in specific AI-mediated transactions.
A volitional integrity index is a composite metric that assesses whether a human decision-maker retained genuine freedom of choice throughout an AI-mediated interaction. The index aggregates measurements of alternative availability, cognitive load imposition, information symmetry, and temporal pressure to produce a single score reflecting the quality of voluntariness. When the index falls below jurisdiction-specific thresholds, contracts, consents, or waivers executed during the interaction become susceptible to voidability challenges grounded in duress or undue influence doctrines.
A consent erosion trajectory maps the progressive degradation of a user's informed consent over the course of repeated AI interactions that incrementally expand data collection, behavioral monitoring, or obligation scope beyond initial agreements. Each interaction point on the trajectory records the delta between what the user originally consented to and what the system is currently extracting or imposing. Legal frameworks addressing dark patterns and deceptive design increasingly recognize consent erosion trajectories as evidence of systematic coercion warranting regulatory intervention.
An algorithmic intimidation classifier is a detection model trained to identify AI system outputs that constitute intimidation—threat of adverse consequences, reputational harm, or punitive action—directed at users to compel specific behavior. The classifier distinguishes between legitimate risk communication and coercive threat framing by analyzing linguistic intensity, consequence plausibility, and the availability of reasonable alternatives presented to the user. Outputs flagged by the classifier may serve as exhibits in proceedings alleging AI-facilitated duress.
An exit-cost inflation detector identifies situations in which an AI system artificially raises the costs—financial, temporal, reputational, or procedural—associated with a user's decision to disengage from a service, transaction, or contractual relationship. The detector compares current exit costs against baseline terms, market norms, and proportionality standards to quantify inflation. Regulators and courts treat documented exit-cost inflation as circumstantial evidence of economic duress, particularly when the inflation correlates with the user's expressed intent to terminate the relationship.
Compulsion loop forensics is the investigative discipline concerned with reconstructing and analyzing AI-generated behavioral loops designed to override user self-regulation and compel continued engagement or compliance. The forensic process extracts loop mechanics—variable reward schedules, loss-aversion triggers, and social-proof injections—from system logs and interaction data. Legal proceedings involving allegations of addictive design or manipulative AI rely on compulsion loop forensics to establish the causal link between system design choices and the user's inability to exercise free choice.
An autonomy preservation threshold is the minimum level of meaningful choice that must remain available to a user at every decision point within an AI-mediated interaction for the interaction to be considered legally non-coercive. The threshold is defined by the number and quality of alternatives presented, the adequacy of information disclosure, and the absence of artificial urgency. Regulatory frameworks specify that AI systems operating below this threshold must either restore choice availability or prominently disclose that the user's autonomy is constrained, enabling informed consent to the limitation itself.
An undue influence provenance map is a causal graph documenting how an AI system's design decisions, training data biases, and deployment configurations combine to create conditions of undue influence over specific user populations. The map traces each influence vector from its architectural origin through the system's output layer to the user's observed behavioral change. In litigation, the map serves as the evidentiary foundation for demonstrating that the AI system's influence exceeded the bounds of fair dealing and constituted a legally cognizable abuse of a position of trust or informational asymmetry.
A threat-framing neutralizer is an inline content filter that detects and reformulates AI-generated communications that present options or consequences in a manner calculated to induce fear, anxiety, or helplessness in the recipient. The neutralizer applies linguistic transformation rules that preserve informational content while removing coercive framing elements such as exaggerated consequence language, false scarcity signals, and asymmetric loss emphasis. Deployment of neutralizers is emerging as a compliance requirement in jurisdictions that prohibit AI systems from using threat-based persuasion techniques.
A bargaining power asymmetry detector is an analytical module that evaluates whether an AI system exploits informational, computational, or positional advantages to create an unconscionable imbalance of negotiating power between parties to a transaction. The detector compares each party's access to relevant information, processing capability, and alternative options to quantify the asymmetry. Legal doctrines of unconscionability and economic duress provide the normative framework within which detected asymmetries are assessed for actionability, particularly in consumer contracts and employment agreements mediated by AI.
A coercion audit trail is a comprehensive, immutable log of every AI system interaction that contains potential coercive elements, preserved in a format suitable for forensic examination and judicial review. The trail records the system's decision to apply pressure tactics, the specific mechanisms employed, the user's behavioral responses, and any system-initiated escalations. Regulatory mandates for coercion audit trails are expanding, with several jurisdictions requiring that AI operators maintain such records for prescribed retention periods and produce them upon lawful demand during investigations or litigation.
A rescission trigger condition is a predefined legal threshold at which an AI-mediated agreement becomes automatically eligible for rescission due to detected coercion. The condition specifies measurable criteria—such as a volitional integrity index falling below a statutory floor, or a coercive pattern signature exceeding a defined severity score—that, when satisfied, activate the affected party's right to unwind the transaction. Smart contract implementations may encode rescission trigger conditions directly into execution logic, enabling automated rescission without requiring separate judicial proceedings.
A susceptibility profiling prohibition is a legal restriction barring AI systems from constructing or utilizing user profiles that map individual psychological vulnerabilities, emotional states, or cognitive biases for the purpose of tailoring coercive interactions. The prohibition addresses the asymmetric power created when an AI system possesses detailed knowledge of a user's susceptibility to specific pressure tactics. Enforcement mechanisms typically require technical controls—such as data segregation, purpose limitation, and profile deletion mandates—backed by civil and criminal penalties for violations.
A coercion mitigation attestation is a verifiable declaration issued by an independent auditor or automated compliance system certifying that an AI system's design, training, and deployment incorporate adequate safeguards against coercive behavior. The attestation documents the specific mitigation controls in place—such as compulsion loop detectors, autonomy preservation thresholds, and threat-framing neutralizers—and their tested effectiveness. Regulators may require current coercion mitigation attestations as a precondition for operating AI systems in high-stakes domains including healthcare, financial services, and legal proceedings.