// first 10 minutes
LLM prompt injection — quick-start
LLM prompt injection — first 10 minutes. capture the adversarial input before it is overwritten. print this, check boxes, then run the primary tools.
checklist
- preserve the exact input that triggered the injection: user message, RAG chunk, uploaded document, or tool result — with full context, not just the suspect string.
- preserve the model's output alongside the input — the deviation from expected behavior is the evidence.
- note the injection source category: direct user input, indirect via RAG retrieval, injected via tool result, or embedded in an uploaded document.
- pull the full prompt context window for the affected session if your logging captures it — system prompt, conversation history, and the injected payload.
- check whether the injection caused the model to ignore the system prompt, exfiltrate data via a crafted tool call, or output instructions for the user to follow.
- identify what the model was instructed to do by the injected content — not just what it did.
- check whether the same injection vector has been triggered in other sessions — query your logging for similar patterns.
- preserve any tool calls the model made during or after the injection — the call graph is the blast-radius map.
- note the model version, temperature, and any safety configuration at the time — context for the defense.
- begin the primary tool path below — prompt injection pattern analyzer.
primary tools
- 01llm prompt injection attempt log forensic analyzerdrop llm api/chat injection log export · parse user turn + matched pattern + model response · runs locally
- 02prompt injection attempt detector in uploaded docdrop pdf / docx / image · detect known prompt-injection payload patterns · runs locally
- 03indirect prompt injection document artifact detectordrop uploaded doc + chat export · detect hidden instruction payloads in attachments · runs locally
- 04mcp prompt injection via tool result detectordrop mcp server tool result log · detect injection payloads in tool responses · runs locally
- 05rag prompt injection via retrieved doc detectordrop retrieved docs · detect injection payloads in retrievals · runs locally
- 06llm jailbreak conversation artifact detectorscan conversation exports for dan · roleplay bypass · injection patterns · severity · export csv · runs locally
- 07llm guardrail bypass score anomaly detectordrop safety classifier log export · detect score manipulation + threshold edge cases · runs locally