
Superpowers Optimized

The production-grade fork of obra/superpowers — same trusted workflow, dramatically leaner, safer, and more intelligent.

This repository delivers everything the original Superpowers plugin does, plus automatic workflow routing, built-in safety guards, integrated security review, error recovery intelligence, and research-backed token reductions of up to 73% per session. Developers using Claude Code, Cursor, Codex, and OpenCode report faster iterations, fewer hallucinations, and zero accidental destructive commands.

Why developers switch

| Feature | Original Superpowers | Superpowers Optimized | Real-world impact |
| --- | --- | --- | --- |
| Workflow selection | Manual | Automatic 3-tier (micro / lightweight / full) | Zero overhead on simple tasks |
| Safety & hooks | None | 7 proactive hooks (dangerous-command blocker, secrets protector, edit tracker, session stats, stop reminders, skill activator, session start) | Zero risk of rm -rf or secret leaks |
| Security review | None | Built into code review with OWASP checklist | Security issues caught before merge |
| Error recovery | None | Project-specific known-issues.md | No rediscovering the same bug |
| Token efficiency | Standard | Always-on context hygiene + exploration tracking | 5-10x smaller context windows |
| Discipline enforcement | Instructional tone | Rationalization tables, red flags, iron laws | Fewer LLM shortcuts |
| Progress visibility | None | Session stats (skills used, duration, actions) | See what the plugin did for you |
| Cross-session memory | None | Persistent state.md + known-issues.md | Continuous learning across sessions |

Token example (measured on a typical feature-implementation task): Original → 4,820 tokens · Optimized → 1,290 tokens (73% reduction)

Try it in 30 seconds

In any supported agent IDE, start a new chat and paste:

Activate Superpowers Optimized and plan a secure user-authentication endpoint with full TDD and security review.

The agent will automatically route to the correct workflow, apply safety guards, and run an integrated security review during code review — no manual skill selection required.

Installation (one command on most platforms)

Claude Code

  1. /plugin marketplace add REPOZY/superpowers-optimized
  2. /plugin install superpowers-optimized@superpowers-optimized

Cursor

/plugin-add superpowers-optimized

Codex / OpenCode

Tell the agent:

Fetch and follow instructions from https://raw.githubusercontent.com/REPOZY/superpowers-optimized/refs/heads/main/.codex/INSTALL.md

Full platform-specific guides and update commands are in the Installation section below.


How It Works

User sends a prompt
        │
        ▼
┌─ skill-activator.js (UserPromptSubmit hook) ──────────────┐
│  Is this a micro-task? ("fix typo on line 42")            │
│    YES → {} (no routing, zero overhead)                   │
│    NO  → Score against 14 skill rules                     │
│          Score < 2? → {} (weak match, skip)               │
│          Score ≥ 2? → Inject skill suggestions            │
└───────────────────────────────────────────────────────────┘
        │
        ▼
┌─ using-superpowers (always loaded at SessionStart) ───────┐
│  Classify: micro / lightweight / full                     │
│                                                           │
│  MICRO → just do it                                       │
│  LIGHTWEIGHT → implement → verification-before-completion │
│  FULL → route to appropriate pipeline:                    │
│    New feature → brainstorming → writing-plans → execute  │
│    Bug/error  → systematic-debugging → TDD → verify       │
│    Review     → requesting-code-review (w/ security)      │
│    Done?      → verification-before-completion            │
│    Merge?     → finishing-a-development-branch            │
└───────────────────────────────────────────────────────────┘
        │
        ▼  (meanwhile, running on every tool call)
┌─ Safety Hooks (PreToolUse) ───────────────────────────────┐
│  block-dangerous-commands.js → 30+ patterns (rm -rf, etc) │
│  protect-secrets.js → 50+ patterns (.env, SSH keys, etc)  │
└───────────────────────────────────────────────────────────┘
        │
        ▼  (after every Edit/Write and Skill call)
┌─ Tracking Hooks (PostToolUse) ────────────────────────────┐
│  track-edits.js → logs file changes for TDD reminders     │
│  track-session-stats.js → logs skill invocations          │
└───────────────────────────────────────────────────────────┘
        │
        ▼  (when Claude finishes responding)
┌─ Stop Hook ───────────────────────────────────────────────┐
│  stop-reminders.js →                                      │
│    "5 source files modified without tests"                │
│    "12 files changed, consider committing"                │
│    "Session: 45min, 8 skill invocations [debugging 3x]"   │
└───────────────────────────────────────────────────────────┘
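The routing and gating in the diagram above can be condensed into a small pure function. This is a hypothetical simplification: the real `skill-activator.js` matches against 14 rule sets and a richer micro-task pattern list, and the rule names and weights below are illustrative only.

```javascript
// Hypothetical condensation of the skill-activator routing shown above.
// Patterns, rule names, and weights are illustrative, not the real 14 rules.
const MICRO_PATTERNS = [/\bfix typo\b/i, /\brename\b/i, /\bbump version\b/i];

const SKILL_RULES = [
  { skill: "systematic-debugging", test: /\b(bug|error|crash|stack trace)\b/i, weight: 2 },
  { skill: "test-driven-development", test: /\b(test|TDD|coverage)\b/i, weight: 1 },
  { skill: "brainstorming", test: /\b(design|plan|feature)\b/i, weight: 1 },
];

function routePrompt(prompt) {
  // Micro-task? Return an empty object: no routing, zero overhead.
  if (MICRO_PATTERNS.some((p) => p.test(prompt))) return {};
  const suggestions = SKILL_RULES
    .filter((r) => r.test.test(prompt))
    .map((r) => ({ skill: r.skill, score: r.weight }));
  const total = suggestions.reduce((sum, s) => sum + s.score, 0);
  // Weak match (score < 2)? Skip injection entirely.
  if (total < 2) return {};
  return { suggestions };
}
```

The key design choice is that both exits on the cheap path return `{}`, so simple prompts pay nothing for the routing machinery.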

From the moment you fire up your coding agent, the plugin first steps back to understand what you're really trying to do instead of jumping straight into code. It collaborates with you to tease out a clear spec and shows it in chunks short enough to read and digest.

Once you approve the design, your agent puts together an implementation plan that emphasizes true red/green TDD, YAGNI, and DRY, while the optimizations keep instructions focused and token-efficient.

Once you say "go", it launches either subagent-driven-development or executing-plans, working through each task with staged reviews (spec compliance, then code quality, with integrated security analysis on sensitive changes).

The agent checks for relevant skills before any task; these are mandatory workflows, not suggestions. The overhead stays proportional: micro-tasks skip everything, lightweight tasks get one gate, and full tasks get the complete pipeline.


Research-Driven Optimizations

The optimizations in this fork are grounded in three research papers on LLM agent behavior:

Minimal context files outperform verbose ones

Paper: Evaluating AGENTS.md: Are Repository-Level Context Files Helpful for Coding Agents? (AGENTbench, 138 tasks, 12 repos, 4 agents)

Key findings that shaped this fork:

  • LLM-generated context files decreased success rates by ~2-3% while increasing inference costs by over 20%. More instructions made tasks harder, not easier.
  • Developer-written context files only helped ~4% — and only when kept minimal. Detailed directory enumerations and comprehensive overviews didn't help agents find relevant files faster.
  • Agents used 14-22% more reasoning tokens when given longer context files, suggesting cognitive overload rather than helpful guidance.
  • Agents followed instructions compliantly (using mentioned tools 1.6-2.5x more often) but this compliance didn't translate to better outcomes.

What we changed: Every skill was rewritten as a concise operational checklist instead of verbose prose. The CLAUDE.md contains only minimal requirements (specific tooling, critical constraints, conventions). The 3-tier complexity classification (micro/lightweight/full) skips unnecessary skill loading for simple tasks. The result is lower prompt overhead in every session and fewer failures from instruction overload.

Prior assistant responses can degrade performance

Paper: Do LLMs Benefit from Their Own Words? (4 models, real-world multi-turn conversations)

Key findings that shaped this fork:

  • Removing prior assistant responses often maintained comparable quality while reducing context by 5-10x. Models over-condition on their own previous outputs.
  • Context pollution is real: models propagate errors across turns — incorrect code parameters carry over, hallucinated facts persist, and stylistic artifacts constrain subsequent responses.
  • ~36% of prompts in ongoing conversations are self-contained "new asks" that perform equally well without assistant history.
  • One-sentence summaries of prior responses outperformed full context, suggesting long reasoning chains degrade subsequent performance.

What we changed: The context-management skill actively prunes noisy history and persists only durable state across sessions. Subagent prompts request only task-local constraints and evidence rather than carrying forward full conversation history. Execution skills avoid long historical carryover unless required for correctness. The token-efficiency standard enforces these rules as an always-on operational baseline.
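The pruning rule can be illustrated with a minimal sketch. The message shape and the `summarize` heuristic here are assumptions for illustration; the actual skill operates on the agent's context through prose instructions, not a literal array.

```javascript
// Illustrative sketch of the history-pruning rule described above.
// Message shape and summarize() are assumptions, not the plugin's real API.
function pruneHistory(messages) {
  return messages.map((m) => {
    if (m.role !== "assistant") return m; // keep user turns verbatim
    // Replace each prior assistant response with a one-sentence summary,
    // per the finding that short summaries outperform full carried context.
    return { role: "assistant", content: summarize(m.content) };
  });
}

function summarize(text) {
  // Placeholder heuristic: keep only the first sentence as durable state.
  const first = text.split(/(?<=[.!?])\s/)[0];
  return first.length < text.length ? first : text;
}
```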

Single reasoning chains fail on hard problems

Paper: Self-Consistency Improves Chain of Thought Reasoning in Language Models (Wang et al., ICLR 2023)

Key findings that shaped this fork:

  • A single chain-of-thought can be confident but wrong — the model picks one reasoning path and commits, even when that path contains an arithmetic slip, wrong assumption, or incorrect causal direction.
  • Generating multiple independent reasoning paths and taking majority vote significantly improves accuracy across arithmetic, commonsense, and symbolic reasoning tasks.
  • Consistency correlates with accuracy — when paths agree, the answer is almost always correct. When they scatter, the problem is genuinely hard or ambiguous, which is itself a useful signal.
  • Diversity of reasoning matters more than quantity — 5 genuinely different paths outperform 10 paths that all reason the same way.

What we changed: The systematic-debugging skill now applies self-consistency during root cause diagnosis (Phase 3): before committing to a hypothesis, the agent generates 3-5 independent root cause hypotheses via different reasoning approaches, takes a majority vote, and reports confidence. Low-confidence diagnoses (<= 50% agreement) trigger a hard stop — gather more evidence before touching code. The verification-before-completion skill applies the same technique when evaluating whether evidence actually proves the completion claim, catching the failure mode where evidence is interpreted through a single (potentially wrong) lens. The underlying technique lives in self-consistency-reasoner and fires only during these high-stakes reasoning moments, keeping the token cost targeted.
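The voting step reduces to a small function. This is a hypothetical sketch of the gate only; generating the 3-5 independent hypotheses is the model's job and happens in prose, not code.

```javascript
// Sketch of the majority-vote + confidence gate described above.
// Input: root-cause hypotheses from independent reasoning paths.
function diagnose(hypotheses) {
  const counts = new Map();
  for (const h of hypotheses) counts.set(h, (counts.get(h) ?? 0) + 1);
  const [winner, votes] = [...counts.entries()].sort((a, b) => b[1] - a[1])[0];
  const confidence = votes / hypotheses.length;
  if (confidence <= 0.5) {
    // Hard stop: scattered paths mean the problem is genuinely ambiguous.
    return { action: "gather-more-evidence", confidence };
  }
  return { action: "proceed", rootCause: winner, confidence };
}
```

When the paths agree, agreement itself is the confidence signal; when they scatter, the gate forbids touching code until more evidence is in.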

Combined impact

These research insights drive four core principles throughout the fork:

  1. Less is more — concise skills, minimal always-on instructions, and explicit context hygiene
  2. Fresh context beats accumulated context — subagents get clean, task-scoped prompts instead of inheriting polluted history
  3. Compliance != competence — agents follow instructions reliably, so the instructions themselves must be carefully engineered (rationalization tables, red flags, forbidden phrases) rather than simply comprehensive
  4. Verify your own reasoning — multi-path self-consistency at critical decision points (diagnosis, verification) catches confident-but-wrong single-chain failures before they become expensive mistakes

Skills Library (19 skills)

Core Workflow

  • using-superpowers — Mandatory workflow router with 3-tier complexity classification (micro/lightweight/full)
  • token-efficiency — Always-on: concise responses, parallel tool batching, exploration tracking, no redundant work
  • context-management — Persist durable state to state.md for cross-session continuity

Design & Planning

  • brainstorming — Socratic design refinement with engineering rigor (requirements verification, edge case analysis, trade-off evaluation)
  • writing-plans — Executable implementation plans with exact paths, verification commands, and TDD ordering
  • claude-md-creator — Create lean, high-signal CLAUDE/AGENTS context files for repositories

Execution

  • executing-plans — Batch execution with verification checkpoints and engineering rigor for complex tasks
  • subagent-driven-development — Parallel subagent execution with two-stage review gates (spec compliance, then code quality), blocked-task escalation, and E2E process hygiene
  • dispatching-parallel-agents — Concurrent subagent workflows for independent tasks
  • using-git-worktrees — Isolated workspace creation on feature branches

Quality & Testing

  • test-driven-development — RED-GREEN-REFACTOR cycle with rationalization tables, testing anti-patterns, and advanced test strategy (integration, E2E, property-based, performance)
  • systematic-debugging — 5-phase root cause process: known-issues check, investigation, pattern comparison, self-consistency hypothesis testing, fix-and-verify
  • verification-before-completion — Evidence gate for completion claims with multi-path verification reasoning and configuration change verification
  • self-consistency-reasoner — Internal multi-path reasoning technique (Wang et al., ICLR 2023) embedded in debugging and verification

Review & Integration

  • requesting-code-review — Structured code review with integrated security analysis (OWASP, auth flows, secrets handling, dependency vulnerabilities)
  • receiving-code-review — Technical feedback handling with pushback rules and no-sycophancy enforcement
  • finishing-a-development-branch — 4-option branch completion (merge/PR/keep/discard) with safety gates

Intelligence

  • error-recovery — Maintains project-specific known-issues.md mapping recurring errors to solutions, consulted before debugging
  • frontend-craftsmanship — Production-grade, accessible frontend standards (semantic HTML, CSS tokens, WCAG AA, fluid typography, reduced-motion)
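As a sketch of how the known-issues.md consultation could work before a debugging cycle starts: the `### <error signature>` layout below is an assumption for illustration, and the real skill defines its own file format.

```javascript
// Hypothetical sketch: match a fresh error against entries recorded in
// known-issues.md before launching systematic-debugging.
// The "### <signature>" layout is an assumption, not the skill's real format.
function lookupKnownIssue(markdown, errorMessage) {
  const entries = markdown.split(/^### /m).slice(1);
  for (const entry of entries) {
    const [signature, ...solution] = entry.trim().split("\n");
    if (errorMessage.includes(signature.trim())) {
      return solution.join("\n").trim(); // recorded fix: skip rediscovery
    }
  }
  return null; // unknown error: fall through to full debugging
}
```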

Hooks (7 total)

  • skill-activator (UserPromptSubmit) — Micro-task detection + confidence-threshold skill matching
  • track-edits (PostToolUse: Edit/Write) — Logs file changes for TDD reminders
  • track-session-stats (PostToolUse: Skill) — Tracks skill invocations for progress visibility
  • stop-reminders (Stop) — TDD reminders, commit nudges, and session summary
  • block-dangerous-commands (PreToolUse: Bash) — 30+ patterns blocking destructive commands with 3-tier severity
  • protect-secrets (PreToolUse: Read/Edit/Write/Bash) — 50+ patterns protecting sensitive files with allowlisting
  • session-start (SessionStart) — Injects using-superpowers routing into every session
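The two blocking hooks reduce to pattern checks over the proposed tool call. The sketch below shows only a handful of illustrative patterns and a simplified two-outcome shape; the shipped hooks carry 30+/50+ patterns with 3-tier severity.

```javascript
// Illustrative subset of block-dangerous-commands.js pattern matching.
// A few example patterns only; the real hook ships 30+ with tiered severity.
const DANGEROUS = [
  { pattern: /\brm\s+(-[a-z]*r[a-z]*f|-[a-z]*f[a-z]*r)\b/i, severity: "block", reason: "recursive force delete" },
  { pattern: /\bgit\s+push\b.*--force\b/, severity: "warn", reason: "force push" },
  { pattern: />\s*\/dev\/sd[a-z]\b/, severity: "block", reason: "raw disk write" },
];

function checkCommand(cmd) {
  for (const rule of DANGEROUS) {
    if (rule.pattern.test(cmd)) {
      return { allowed: rule.severity !== "block", severity: rule.severity, reason: rule.reason };
    }
  }
  return { allowed: true };
}
```

Running the check as a PreToolUse hook means the command is rejected before it ever reaches a shell, which is what makes the "zero risk" claim structural rather than behavioral.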

Agents

  • code-reviewer — Senior code review agent with persistent cross-session memory

Philosophy

  • Test-Driven Development — Write tests first, always
  • Systematic over ad-hoc — Process over guessing
  • Complexity reduction — Simplicity as primary goal
  • Evidence over claims — Verify before declaring success
  • Proportional overhead — Micro-tasks skip everything, full tasks get the full pipeline

Read more: Superpowers for Claude Code

Contributing

Skills live directly in this repository. To contribute:

  1. Fork the repository
  2. Create a branch for your skill
  3. Follow the existing skill structure in skills/ (each skill has a SKILL.md with YAML frontmatter)
  4. Submit a PR
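A minimal SKILL.md skeleton might look like the following. The field names and section headings are a plausible convention, not the authoritative schema; copy an existing skill in skills/ as your starting point.

```markdown
---
name: my-new-skill
description: One-line trigger description the router can score against
---

# my-new-skill

## When to use
- Concrete trigger conditions, one per line

## Checklist
1. Step one, phrased as an operational instruction
2. Verification command with its expected output
```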

License

MIT License - see LICENSE file for details
