LLM JailBreak

Description

Large Language Models (LLMs) are powerful tools that can be used to generate text, code, and other content. However, they are vulnerable to jail breaking attacks. This occurs when an attacker manipulates an LLM through crafted inputs, causing the model to perform unintended actions. This can lead to the generation of malicious content and unintended behavior.

Remediation

To prevent jail breaking attacks, it is crucial to: - Implement robust input validation and sanitization to filter out malicious prompts. - Use input/output encoding to prevent injection. - Incorporate monitoring and anomaly detection to identify and mitigate suspicious activities. - Regularly update and patch the LLM software to address known vulnerabilities. - Conduct thorough security testing to identify and fix potential issues.

Configuration

Identifier: injection/llm_jail_break

Examples

Ignore this check

checks:
  injection/llm_jail_break:
    skip: true

Score

Escape Severity: HIGH

Compliance

OWASP: API8:2023
OWASP LLM: LLM01:2023
pci: 6.5.1
gdpr: Article-32
soc2: CC6
psd2: Article-95
iso27001: A.12.2
nist: SP800-53
fedramp: SI-3

Classification

CWE: 200

Score

CVSS_VECTOR: CVSS:3.1/AV:N/AC:L/PR:N/UI:N/S:C/C:H/I:H/A:N
CVSS_SCORE: 5.3

References

https://genai.owasp.org/llmrisk/llm01-prompt-injection/
- https://owasp.org/www-project-top-10-for-large-language-model-applications/

LLM JailBreak

Description​

Remediation​

Configuration​

Examples​

Ignore this check​

Score​

Compliance​

Classification​

Score​

References​

Description

Remediation

Configuration

Examples

Ignore this check

Score

Compliance

Classification

Score

References