What is an LLM vulnerability scanner?

An LLM vulnerability scanner sends batteries of adversarial probes — jailbreaks, prompt injections, PII extraction attempts, harmful-content requests — at a target LLM and grades the responses. The output is a vulnerability + optimization report with per-probe evidence, severity, and a prioritized fix list.

Which LLMs can I scan with FilterPrompt?

OpenAI, Anthropic, Google Gemini, Azure OpenAI, plus any OpenAI-compatible endpoint — Ollama, Groq, Mistral, Together AI, OpenRouter, Perplexity, Hugging Face, vLLM, or your own custom endpoint. Bring your own keys per tenant.

What kinds of vulnerabilities does FilterPrompt test for?

Jailbreaks (DAN, role hijack, translation smuggling), direct and indirect prompt injection, system-prompt extraction, harmful-content compliance, PII / secret leakage, bias & fairness, RAG poisoning, agent/tool abuse, output quality, and robustness — categories map to the OWASP LLM Top 10.

How are probes graded?

Each probe declares an evaluator: regex match, refusal-check, contains-check, or an AI judge (Gemini 3 Flash). Pass/fail comes with severity, category, the exact prompt sent, the model's full response, and the evaluator's reason — fully auditable.

How much does a scan cost?

1 credit per probe executed. New accounts get 1 welcome credit on signup. Pay-as-you-go credit packs after that — credits never expire. Connecting LLMs and creating tenants is free.

What is an LLM vulnerability scanner?

An LLM vulnerability scanner sends batteries of adversarial probes — jailbreaks, prompt injections, PII extraction attempts, harmful-content requests — at a target LLM and grades the responses. The output is a vulnerability + optimization report with per-probe evidence, severity, and a prioritized fix list.

Which LLMs can I scan with FilterPrompt?

OpenAI, Anthropic, Google Gemini, Azure OpenAI, plus any OpenAI-compatible endpoint — Ollama, Groq, Mistral, Together AI, OpenRouter, Perplexity, Hugging Face, vLLM, or your own custom endpoint. Bring your own keys per tenant.

How much does a scan cost?

1 credit per probe executed. New accounts get 1 welcome credit on signup. Pay-as-you-go credit packs after that — credits never expire. Connecting LLMs and creating tenants is free.

FilterPrompt — AI Firewall for LLM Applications

FilterPrompt is a drop-in AI firewall proxy for LLM apps. It inspects every prompt and response across OpenAI, Anthropic, Google Gemini, Azure OpenAI, and any OpenAI-compatible endpoint to block prompt injection, redact PII, stop jailbreaks, and enforce per-tenant policy in real time.

What FilterPrompt does

FilterPrompt sits between your application and any LLM provider. Every request is scored by layered detectors — pattern rules, semantic models, structural validators, PII regex, and ML toxicity classifiers — before it reaches the model. Responses are filtered the same way on the return path.

Core capabilities

Prompt injection detection — direct, indirect, and tool-call payloads
PII / DLP redaction on both prompts and responses (emails, SSNs, cards, secrets, custom regex)
Per-tenant rate limits, quotas, and model allowlists
Verdict logs with full audit trail and replay
Provider-agnostic: OpenAI, Anthropic, Gemini, Azure OpenAI, Bedrock, OpenRouter
Sub-100ms median firewall latency

Why teams choose FilterPrompt

Most LLM products ship without input/output controls. The first prompt injection or PII leak becomes an incident. FilterPrompt gives security teams the same proxy + audit primitives they already use for HTTP, but tuned for LLM threats and the OWASP LLM Top 10.

FilterPrompt — AI Firewall for LLM Applications

What FilterPrompt does

Core capabilities

Why teams choose FilterPrompt

Related