LuaN1aoAgent (鸾鸟) is a next-generation autonomous penetration testing agent powered by Large Language Models. Named after the luanniao — a mythical phoenix bird of Chinese legend — it brings intelligent, adaptive reasoning to security testing. Unlike traditional scanners that depend on predefined rule sets, LuaN1ao simulates how human security experts think: it builds hypotheses from evidence, plans attack paths as dynamic graphs, executes targeted actions, and learns from failures — all autonomously.Documentation Index
Fetch the complete documentation index at: https://mintlify.com/SanMuzZzZz/LuaN1aoAgent/llms.txt
Use this file to discover all available pages before exploring further.
LuaN1aoAgent achieves a 90.4% success rate on the XBOW benchmark fully autonomously, with a median exploit cost of only $0.09.
The problem with traditional scanners
Conventional automated tools have fundamental limitations:- Rigid rule sets: They can only find vulnerabilities they were explicitly programmed to detect.
- No context awareness: Each scan is stateless — findings from one check don’t inform the next.
- No adaptive planning: When a path is blocked (e.g., WAF, rate limiting), the tool stops rather than pivoting.
- High false-positive rates: Scanners report everything they find without reasoning about likelihood or impact.
- No learning: Every run starts from scratch with no memory of what worked or failed before.
Core innovations
P-E-R Architecture
Three specialized agents — Planner, Executor, and Reflector — collaborate via an event bus. Each focuses on its core role, eliminating the “split personality” problem of single-agent systems.
Causal Graph Reasoning
Every hypothesis requires explicit evidence support. The agent builds rigorous Evidence → Hypothesis → Vulnerability → Exploit chains with confidence scores to prevent hallucinated attacks.
Plan-on-Graph (PoG)
Tasks are modeled as dynamically evolving Directed Acyclic Graphs (DAGs), enabling parallel execution, real-time path adaptation, and automatic dependency management.
P-E-R agent collaboration framework
LuaN1ao decouples penetration testing thinking into three independent but collaborative cognitive roles: Planner — the strategic brain- Performs dynamic planning based on global graph awareness
- Identifies dead ends and automatically generates alternative paths
- Outputs structured graph editing instructions (
ADD_NODE,UPDATE_NODE,DEPRECATE_NODE) rather than natural language - Automatically identifies parallelizable tasks based on topological dependencies
- Allocates adaptive step counts (
max_steps) per subtask based on complexity
- Focuses on single sub-task tool invocation and result analysis
- Schedules security tools via MCP (Model Context Protocol)
- Manages intelligent message history compression to avoid token overflow
- Preserves hypotheses from
formulate_hypothesesacross context compression boundaries - Shares high-value findings across parallel subtasks in real-time via a shared bulletin board
- Reviews task execution and validates artifact effectiveness
- Performs L1–L4 level failure pattern analysis to prevent repeated errors
- Extracts attack intelligence and builds knowledge accumulation
- Controls termination: determines whether the goal has been achieved or the task is trapped
Causal graph reasoning
LuaN1ao rejects blind guessing and LLM hallucinations. Every test decision is grounded in explicit causal chains:- Evidence first: Any hypothesis requires explicit prior evidence support
- Confidence quantification: Each causal edge carries a numeric confidence score
- Traceability: Complete reasoning chains are recorded for failure tracing and experience reuse
- Hallucination prevention: Mandatory evidence validation rejects unfounded attack attempts
Plan-on-Graph dynamic task planning
Rather than a static task list, LuaN1ao models penetration testing plans as dynamically evolving Directed Acyclic Graphs (DAGs):| Feature | Traditional task list | Plan-on-Graph |
|---|---|---|
| Structure | Linear list | Directed graph |
| Dependency management | Manual sorting | Topological auto-sorting |
| Parallel execution | None | Auto-identifies parallel paths |
| Dynamic adjustment | Full regeneration | Local graph editing |
| Visualization | Difficult | Native Web UI support |
System requirements
| Component | Requirement | Notes |
|---|---|---|
| Operating system | Linux (recommended) / macOS / Windows (WSL2) | Run in an isolated environment |
| Python | 3.10+ | Requires asyncio and type hints support |
| LLM API | OpenAI-compatible format | Supports GPT-4o, DeepSeek, Claude, and others |
| Memory | Minimum 4 GB, recommended 8 GB+ | RAG services and LLM inference require memory |
| Network | Internet connection | Required for LLM API access and knowledge base setup |
Architecture overview
luan1ao.db). This decoupled architecture means you can monitor multiple past and present tasks from a single interface.
Next steps
Quickstart
Run your first penetration testing task in under 10 minutes.
Installation
Detailed setup instructions including virtual environments, Docker, and troubleshooting.
P-E-R architecture
Deep dive into how Planner, Executor, and Reflector collaborate.
Causal graph reasoning
Understand how evidence-driven decisions prevent hallucinated attacks.