🏰 Watchtower (AI-Powered Penetration Testing Framework)

Watchtower is a simple AI-powered penetration testing automation CLI tool that leverages LLMs and LangGraph to orchestrate agentic workflows that you can use to test your websites locally. Generate useful pentest reports for your websites.

Penetration testing is red team activity and should be done with permission.

⚠️ Legal Disclaimer

Watchtower is designed exclusively for authorized security testing and educational purposes.

Legal Use: Authorized penetration testing, security research, educational environments.
Illegal Use: Unauthorized access, malicious activities, any form of cyber attack.

You are fully responsible for ensuring you have explicit written permission before testing any system. Unauthorized access to computer systems is illegal under laws including the Computer Fraud and Abuse Act (CFAA), GDPR, and equivalent international legislation.

By using Watchtower, you agree to use it only on systems you own or have explicit authorization to test.

✨ Core Features

Multi-Agent Architecture:
- Planner: Analyzes the target and current findings to strategize the next sequence of actions.
- Worker: Dynamically executes tools requested by the Planner.
- Analyst: Parses tool stdout/stderr, filters false positives, and converts raw findings into structured schema data.
Dynamic Tool Arsenal: Integrated with 23 security tools using Python subprocess wrappers. The interactive CLI auto-checks your PATH and lets you exclude tools dynamically.
- Network: nmap, masscan
- Web Recon: httpx, whatweb, wafw00f
- Subdomain: subfinder, amass, dnsrecon
- Vulnerability: nuclei, nikto, sqlmap, wpscan, retire.js
- SSL/TLS: testssl.sh, sslyze
- Content/Params: gobuster, ffuf, arjun, kiterunner
- Security Analysis: xsstrike, gitleaks, cmseek, dalfox
State Management: Uses SQLite locally to store a historical record of observations and findings.
Parallel Reconnaissance: Orchestrates multiple tools concurrently (e.g., httpx + whatweb) to accelerate assessment cycles.
Smart Truncation: Output-aware clipping that prioritizes security vulnerabilities and critical findings over generic logs.
LLM Agnostic: Seamlessly swap between OpenAI, Google Gemini, and OpenRouter APIs via .env files.

🚀 Quick Start

1. 📋 Requirements & Prerequisites

Prerequisites:

OS: Linux or macOS recommended (Windows supported via WSL2 for some networking tools).
Python: 3.11+ installed.
API Keys: An active API key for OpenRouter, OpenAI, or Gemini.

Tool Requirements: To utilize the AI's full capabilities, you must have the actual CLI binaries installed and accessible in your system PATH (e.g., nmap, nuclei, httpx). The framework will automatically detect which tools are missing and skip them in the UI.

Install all tools:

./install_tools.sh # or with sudo

2. 💻 Installation

git clone https://github.com/fzn0x/watchtower.git
cd watchtower

python -m venv venv
source venv/bin/activate  # On Windows: venv\Scripts\activate
pip install -r requirements.txt

3. ⚙️ Configuration

cp .env.example .env

Populate .env with your API keys. You only need to fulfill one of the API setups:

OPENAI_API_KEY=""
GEMINI_API_KEY=""
OPENROUTER_API_KEY=""

(Optional) Explicitly define which model strings the framework should use. Defaults are already configured:

OPENAI_MODEL_NAME="gpt-4-turbo"
GEMINI_MODEL_NAME="gemini-1.5-pro"
OPENROUTER_MODEL_NAME="anthropic/claude-3-opus"

4. ▶️ Running the Framework

You must specify a target URL or IP using the -t or --target flag.

python -m watchtower.main -t https://www.example.com

Upon execution, Watchtower will display an interactive CLI checkbox prompt. It automatically highlights which of the 23 tools are successfully installed on your machine. You can use <Space> to enable/disable specific tools allowing you to narrowly focus the LLM's payload, or hit <Enter> to confirm the selection.

Headless Mode: If you want to bypass the interactive menu or are integrating Watchtower into an automated CI/CD pipeline, use the --skip-ask-tools flag to auto-run with everything available on your PATH:

python -m watchtower.main -t https://www.example.com --skip-ask-tools

Authenticated Pentesting: Watchtower supports authenticated workflows via session cookies or custom headers.

python -m watchtower.main -t https://api.example.com --cookie "session=xyz123" --header "X-API-Key: secret-key"

📊 Generating Reports

Watchtower automatically stores all executed commands, terminal outputs, and confirmed vulnerabilities in a local SQLite memory file (pentest_memory.db).

You can extract all findings into a cleanly formatted PDF document without re-running the pentest:

python -m watchtower.main --report "pentest_report.pdf"

5. 🔌 Custom Providers

Watchtower dynamically supports almost any LLM provider on the market via LangChain and LiteLLM integrations. You can override the default models from the CLI using the --provider, --model, and --apikey flags.

 python -m watchtower.main -t https://www.example.com --provider=https://api.dgrid.ai/api/v1 --model=anthropic/claude-opus-4.5 --apikey "API_KEY"

Example response (using httpx tool):

INFO: ==> Node Executed: [WORKER]
INFO: HTTP Request: POST https://api.dgrid.ai/api/v1/chat/completions "HTTP/1.1 200 OK"
INFO: ==> Node Executed: [ANALYST]
INFO:     - Updated state 'findings': [{'title': 'Overly Permissive CORS Configuration', 'severity': 'Medium', 'description': 'The server is configured with Access-Control-Allow-Origin: * which allows any website to make cross-origin requests to this domain. This could potentially allow malicious websites to interact with the API on behalf of authenticated users, leading to data theft or unauthorized actions if sensitive endpoints exist.', 'evidence': 'Access-Control-Allow-Origin: *\naccess-control-allow-headers: *\naccess-control-allow-methods: GET, HEAD, OPTIONS'}, {'title': 'Missing Security Headers', 'severity': 'Low', 'description': 'The response is missing several recommended security headers including X-Frame-Options (clickjacking protection), Content-Security-Policy (XSS and injection protection), and Strict-Transport-Security (HSTS for enforcing HTTPS). While some headers like X-Content-Type-Options and Referrer-Policy are present, the absence of these headers reduces the overall security posture.', 'evidence': 'HTTP/1.1 200 OK\nDate: Fri, 27 Feb 2026 13:28:07 GMT\nContent-Type: text/html; charset=utf-8\n[Headers present: x-content-type-options: nosniff, referrer-policy: strict-origin-when-cross-origin]\n[Missing: X-Frame-Options, Content-Security-Policy, Strict-Transport-Security]'}]

⚠️ Disclaimer: The --apikey argument requires the exact property name of the variable stored inside your .env file (e.g., MY_GROQ_KEY), not the raw API key string itself. This prevents your secrets from leaking into bash history.

Supported Providers Include:

Anthropic
OpenAI
OpenRouter
Litellm
Amazon Bedrock
Vercel AI Gateway
Moonshot AI
Mistral
MiniMax
OpenCode Zen
GLM Models
Z.AI
Synthetic
Qianfan
Any custom HTTP URLs (Acts as a drop-in OpenAI-compatible endpoint, automatically routing requests via LangChain's ChatOpenAI client)
Others: https://docs.litellm.ai/docs/providers

Example CLI Execution (Custom Endpoint):

python -m watchtower.main -t https://example.com --provider=https://api.dgrid.ai/api/v1 --model=anthropic/claude-opus-4.5 --apikey "API_KEY"

🗺️ Roadmap

Initial LangGraph Planner/Worker architecture.
Integrate core web and network reconnaissance tools.
Add Pydantic structured output fallback for open-source OpenRouter models.
Parallel tool execution and Smart Truncation.
Support for authenticated pentesting (cookies/headers).
Advanced business logic analysis enhancements.

📌 Important Notes

API Costs: The multi-agent workflow consumes tokens rapidly during active scanning as the Planner loops through observations. Be mindful of your API budgets.
Hallucinations: While the Analyst agent filters false positives, LLMs can still hallucinate conclusions based on ambiguous tool outputs. Always manually verify findings.
Network Stability: Some tools (like masscan or ffuf) are extremely noisy. You may trigger upstream edge-protections (Cloudflare) on your targets, which will pollute the observation logs with 403s.

❓ FAQ & Troubleshooting

Q: `model: [model_name] does not support feature: structured-outputs`

This error occurs when using experimental, free, or unsupported models via OpenRouter (e.g. some Qwen or DeepSeek variants). Watchtower relies on LangChain's Structured Outputs mechanism to force the AI to return perfectly formatted JSON objects. If you see this error, switch your OPENROUTER_MODEL_NAME in the .env file to a fully-supported model like:

anthropic/claude-3.5-sonnet
openai/gpt-4o
google/gemini-1.5-pro
meta-llama/llama-3.1-70b-instruct

Note: The framework includes a custom string-fallback parser for models that don't natively support API structured outputs, but using supported commercial models yields significantly better pentesting logic.

Q: `429 Too Many Requests: [model]:free is temporarily rate-limited upstream`

This means you are using an explicitly :free model tier on OpenRouter, and the upstream providers (like Venice or Novita) are currently rate-limiting free-tier requests due to high global traffic. You simply need to wait out the timeout or switch to a slightly different (or paid) model endpoint.

📄 License

This project is licensed under the MIT License. See the LICENSE file for details.

By using this software, you agree to the conditions outlined in the Legal Disclaimer. Watchtower's developers assume no liability for the misuse of this tool.

🙏 Author & Acknowledgements

Created and maintained by fzn0x.

A deep and sincere thank you to the open-source security community. Watchtower stands on the shoulders of giants—this framework would not exist without the incredible developers who built and maintain the underlying penetration testing and reconnaissance tools that power the core worker engine. Thank you for making security accessible.

Name		Name	Last commit message	Last commit date
Latest commit History 11 Commits
assets/imgs		assets/imgs
watchtower		watchtower
.env.example		.env.example
.gitignore		.gitignore
CONTRIBUTING.md		CONTRIBUTING.md
Dockerfile		Dockerfile
LICENSE		LICENSE
Makefile		Makefile
README.md		README.md
SECURITY.md		SECURITY.md
docker-compose.yml		docker-compose.yml
install_tools.sh		install_tools.sh
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

🏰 Watchtower (AI-Powered Penetration Testing Framework)

⚠️ Legal Disclaimer

✨ Core Features

🚀 Quick Start

1. 📋 Requirements & Prerequisites

2. 💻 Installation

3. ⚙️ Configuration

4. ▶️ Running the Framework

📊 Generating Reports

5. 🔌 Custom Providers

🗺️ Roadmap

📌 Important Notes

❓ FAQ & Troubleshooting

Q: `model: [model_name] does not support feature: structured-outputs`

Q: `429 Too Many Requests: [model]:free is temporarily rate-limited upstream`

📄 License

🙏 Author & Acknowledgements

About

Uh oh!

Releases 3

Contributors 1

Languages

Folders and files

Latest commit

History

Repository files navigation

🏰 Watchtower (AI-Powered Penetration Testing Framework)

⚠️ Legal Disclaimer

✨ Core Features

🚀 Quick Start

1. 📋 Requirements & Prerequisites

2. 💻 Installation

3. ⚙️ Configuration

4. ▶️ Running the Framework

📊 Generating Reports

5. 🔌 Custom Providers

🗺️ Roadmap

📌 Important Notes

❓ FAQ & Troubleshooting

Q: model: [model_name] does not support feature: structured-outputs

Q: 429 Too Many Requests: [model]:free is temporarily rate-limited upstream

📄 License

🙏 Author & Acknowledgements

About

Topics

Resources

License

Contributing

Security policy

Uh oh!

Stars

Watchers

Forks

Releases 3

Contributors 1

Languages

Q: `model: [model_name] does not support feature: structured-outputs`

Q: `429 Too Many Requests: [model]:free is temporarily rate-limited upstream`