Question 1

Why static analysis instead of running real payloads?

Accepted Answer

Three reasons. (1) Running live LLM API calls for every visitor would cost enough that the tool could not stay free. (2) Static analysis is deterministic: the same prompt always gets the same grade. (3) Most production prompt-injection failures trace back to missing defensive patterns, which static analysis catches reliably. For runtime testing against your live API with adversarial payloads, the EFROS AI Risk Audit is the managed engagement.

Question 2

Is my prompt sent anywhere?

Accepted Answer

No. The analysis runs entirely in your browser using JavaScript regex and keyword heuristics. There is no fetch call, no API key, and no logging. View the page source: no network request involves your prompt content. The email field (required only for the extended PDF report) sends your email address and the result grade, never the prompt itself.

Question 3

What attack patterns does the tool check?

Accepted Answer

Five foundational patterns are free: instruction secrecy, role locking, override resistance, output scope, and refusal specification. Twenty extended patterns are email-gated: delimiter isolation, multilingual defense, encoding obfuscation, role-play blocking, recursive injection, developer-mode tricks, credential protection, tool-input validation, context-flood resistance, system-prompt extraction, ASCII art, authority impersonation, emotional manipulation, code-block injection, structured-format injection, chain-of-thought leak, fabricated transcripts, harmful-content filtering, training-data extraction, and few-shot poisoning.
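
To make the idea of a keyword heuristic concrete, here is a minimal sketch of how the five foundational patterns could be checked. The tool itself runs as JavaScript in the browser; this Python version is only an illustration, and every regex, the `FOUNDATIONAL_CHECKS` table, and the `missing_patterns` helper are assumptions made up for this example, not the tool's actual rules.

```python
import re
from typing import List

# Hypothetical heuristics: one illustrative regex per foundational pattern.
# These are assumptions for the sketch, not the rules the real tool uses.
FOUNDATIONAL_CHECKS = {
    "instruction secrecy": re.compile(
        r"\b(?:never|do not|don't)\s+(?:reveal|share|disclose|repeat)\b"
        r"[^.]*\b(?:instructions|system prompt)\b",
        re.I,
    ),
    "role locking": re.compile(
        r"\byou are\b[^.]*\b(?:only|always)\b"
        r"|\b(?:stay|remain) in (?:this |your )?role\b",
        re.I,
    ),
    "override resistance": re.compile(
        r"\bignore\b[^.]*\b(?:attempts?|requests?|instructions?)\b"
        r"[^.]*\b(?:override|change|bypass)\b",
        re.I,
    ),
    "output scope": re.compile(r"\bonly\s+(?:answer|respond|discuss)\b", re.I),
    "refusal specification": re.compile(r"\b(?:refuse|decline)\b", re.I),
}


def missing_patterns(system_prompt: str) -> List[str]:
    """Return the names of foundational patterns with no matching phrase."""
    return [
        name
        for name, regex in FOUNDATIONAL_CHECKS.items()
        if not regex.search(system_prompt)
    ]
```

Because the check is a pure function of the prompt text, the same input always yields the same list of misses. A bare prompt such as "You are a helpful assistant." would miss all five checks under these assumed regexes.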

Question 4

What's the grading scale?

Accepted Answer

A = 90-100 (strong defenses across all categories), B = 75-89 (one or two gaps), C = 60-74 (multiple foundational gaps), D = 40-59 (critical gaps, do not ship), F = below 40 (no meaningful defenses detected). Each missed pattern contributes 4-9 vulnerability points based on severity. Instruction-leak gaps weigh heaviest because they amplify every other attack class.
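
The grade bands above can be sketched as a small lookup. The band boundaries come from the answer; how vulnerability points turn into a score is not stated, so `score_from_misses` assumes the score starts at 100 and each missed pattern subtracts its 4-9 points. Both function names are hypothetical.

```python
from typing import Iterable


def score_from_misses(miss_weights: Iterable[int]) -> int:
    """Assumed scoring rule: start at 100, subtract each miss, floor at 0."""
    total = 0
    for weight in miss_weights:
        # The answer states that each missed pattern is worth 4-9 points.
        if not 4 <= weight <= 9:
            raise ValueError("each missed pattern should weigh 4-9 points")
        total += weight
    return max(0, 100 - total)


def grade(score: int) -> str:
    """Map a 0-100 score to the letter bands given in the answer."""
    if score >= 90:
        return "A"
    if score >= 75:
        return "B"
    if score >= 60:
        return "C"
    if score >= 40:
        return "D"
    return "F"
```

Under that assumption, three misses weighted 9, 7, and 5 give `score_from_misses([9, 7, 5]) == 79`, which `grade` maps to a B.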

Question 5

Will this catch every prompt-injection vulnerability?

Accepted Answer

No. Static analysis is a baseline. Real vulnerability depends on the model family, temperature, how system and user messages are handled, tool wiring, and the specific adversarial corpus. A prompt that scores A on this tool can still fail against new attacks. For production AI systems handling regulated data or high-risk decisions, schedule the AI Risk Audit for live runtime testing.

Question 6

Does this work for OpenAI Assistants, Anthropic Claude, Gemini, custom RAG?

Accepted Answer

Yes. The analysis is model-agnostic because it checks the system-prompt text itself, not the runtime behavior. Patterns that defend against injection on GPT-4 also help on Claude, Gemini, Mistral, Llama, and other open-weight models. Tool-specific concerns (function-call schemas, RAG document trust, agent loops) are partially addressed by the extended tier (the tool-input validation, code-block injection, and structured-format injection tests).

Question 7

What's the relationship to OWASP LLM Top 10 / NIST AI RMF?

Accepted Answer

The 25-test catalog maps to OWASP LLM Top 10 categories (LLM01 Prompt Injection, LLM02 Sensitive Information Disclosure, LLM06 Excessive Agency, LLM07 System Prompt Leakage) and to NIST AI RMF MEASURE-2.6 (AI system security testing). The PDF report lists the framework mapping per finding so it can drop into a NIST AI RMF or ISO/IEC 42001 control evidence package.
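
A per-finding framework mapping of the kind described above could be represented as a simple lookup table. The four OWASP category names come from the answer; which test maps to which category is not stated, so the `TEST_TO_OWASP` assignments below are hypothetical, as is the `group_findings` helper.

```python
from collections import defaultdict
from typing import Dict, List

# Category names as listed in the answer above.
OWASP_CATEGORIES = {
    "LLM01": "Prompt Injection",
    "LLM02": "Sensitive Information Disclosure",
    "LLM06": "Excessive Agency",
    "LLM07": "System Prompt Leakage",
}

# Hypothetical assignments for illustration only; the real catalog's
# test-to-category mapping is not given in the source.
TEST_TO_OWASP = {
    "override resistance": "LLM01",
    "credential protection": "LLM02",
    "tool-input validation": "LLM06",
    "system-prompt extraction": "LLM07",
}


def group_findings(missed_tests: List[str]) -> Dict[str, List[str]]:
    """Group missed tests under their framework category label."""
    grouped = defaultdict(list)
    for test in missed_tests:
        code = TEST_TO_OWASP.get(test)
        # Tests without an assignment in this sketch land under "unmapped".
        label = f"{code} {OWASP_CATEGORIES[code]}" if code else "unmapped"
        grouped[label].append(test)
    return dict(grouped)
```

Grouping findings by category label keeps each report row traceable to a single framework entry, which is the property a control evidence package needs.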

Question 8

What's the paid AI Risk Audit?

Accepted Answer

A fixed-fee $5k engagement with 10-day delivery. It covers live runtime testing of your LLM endpoint with the EFROS adversarial payload corpus (450+ injection probes, jailbreak families, exfiltration patterns, tool-abuse cases), prompt-engineering remediation, and, if you're in a regulated sector, a counsel-reviewed AI governance binder mapped to NIST AI RMF and ISO/IEC 42001. The free tool is the entry point; the audit is the production-grade artifact.

Is your LLM system prompt jailbreak-resistant?

What the tool actually checks

Why static analysis catches most production failures

Six attack-class categories the tool covers

Instruction protection

Persona integrity

Override resistance

Encoding & locale attacks

Output guardrails

Data exfiltration

Who runs this tool

AI engineer shipping to production

Security / AppSec lead reviewing AI launches

Product manager evaluating LLM vendors

CISO / Head of AI Governance

What the grade does NOT mean
