b84230e389a862764f4ad8f212638a5ed55e7351
---ci---
phase: 1
milestone: v0.2.0
status: execute
decisions:
- id: D-001
decision: Git log as primary project memory, .ci/ for long-lived references only
rationale: Eliminates state drift, enables reconstruction from commit messages alone
confidence: 0.95
alternatives: [hybrid file+git, pure git with no .ci/]
- id: D-002
decision: ---ci--- YAML blocks in commit bodies for machine-parseable metadata
rationale: Structured and human-readable; grep-friendly; round-trips through parser
confidence: 0.92
alternatives: [JSON payload, conventional-commit-only]
- id: D-003
decision: Phase+milestone branch naming (phase/NN-slug, milestone/vX.X-slug)
rationale: Branch list immediately shows project state; merged equals complete
confidence: 0.88
alternatives: [trunk+tags, milestone-only branches]
requirements:
covered: [ARCH-01, ARCH-02, ARCH-03, ARCH-04, ARCH-05, ARCH-06]
lessons:
- Commit body YAML must round-trip through parser — tested before shipping
- .ci/audit/ removal required updating 4 test suites that depended on audit files
---/ci---
New modules: commit-parser, commit-builder, git-context, git-branch, ci-files
Core rewrites: DecisionEngine, EscalationProtocol, OrchestratorAgent
Removed: .ci/audit/, .planning/ directory support
Tests: 25 suites, 218 passing (up from 20/158)
CI — Continuous Intelligence
Fully autonomous AI-driven software engineering harness.
Overview
CI (Continuous Intelligence) is an autonomous-first software engineering harness that eliminates human-in-the-loop overhead while preserving the rigor of guided development. It receives a specification, resolves ambiguities through a single Clarify phase, then executes the full pipeline — research, plan, execute, verify — autonomously.
Installation
npm install -g @continuous-intelligence/ci
Or from source:
git clone <repo-url>
cd ci
npm install
npm run build
npm link
Quick Start
# Initialize from inline specification
ci init "Build a REST API for task management"
# Initialize from a specification file
ci init --spec ./specs/my-project.md
# Initialize with interactive clarify phase
ci init --clarify "Build a REST API for task management"
# Run the full autonomous pipeline
ci run --all
# Run a specific phase
ci run research
ci run plan
ci run execute
ci run verify
# Execute an ad-hoc task
ci quick "Add authentication middleware"
# Verify a phase
ci verify 1
# Check project status
ci status
# Review autonomous decisions
ci audit
ci audit --verbose
# Debug an issue
ci debug "Tests failing on CI"
# Rollback a phase
ci rollback 1
# Ship a phase (verify, security, commit, tag)
ci ship 1
Autonomy Levels
| Level | Behavior |
|---|---|
full |
No human interaction after Clarify. Escalate only irreversible decisions. |
supervised |
Escalate on every Escalation Gate plus verification failures. |
guided |
Escalate on every Decision Gate. Closest to Learnship behavior. |
Configuration
CI uses .ci/config.json for project configuration:
{
"autonomy": {
"level": "full",
"escalation_hooks": ["deploy", "delete_data", "merge_to_main"],
"clarify_budget": 10,
"decision_confidence_threshold": 0.6,
"max_revision_iterations": 3,
"max_verification_retries": 2,
"escalation_timeout_ms": 300000
},
"model_profile": "quality",
"parallelization": {
"enabled": true,
"max_concurrent_agents": 5,
"min_plans_for_parallel": 2
},
"verification": {
"automated_only": true,
"escalate_visual": true,
"escalate_external_integration": true,
"test_first": false
},
"security": {
"auto_accept_low_severity": true,
"auto_mitigate_medium_severity": true,
"escalate_high_severity": true
},
"git": {
"branching_strategy": "phase",
"auto_commit": true,
"auto_push": false
}
}
Architecture
Pipeline
SPECIFY → CLARIFY → RESEARCH → PLAN → EXECUTE → VERIFY → COMPLETE
↕ ↕ ↕ ↕
(questions) (auto-decide) (auto-run) (auto-verify)
Decision Engine
Every autonomous decision is classified by confidence:
- High (>0.85): Auto-decide, log to audit trail
- Medium (0.60-0.85): Auto-decide with assumption logging, flag for review
- Low (<0.60): Escalate to human
18 Agents
All 17 Learnship agents retained, plus the CI Orchestrator:
| Agent | Role | Modification |
|---|---|---|
| orchestrator | Pipeline controller | New — replaces interactive workflows |
| planner | Plan creation | Never sets autonomous: false |
| executor | Task execution | Never pauses for checkpoints |
| verifier | Output verification | Generates automated tests, not human UAT |
| researcher | Domain research | Logs assumptions, never flags for human |
| challenger | Plan stress-testing | Binding verdicts, only escalates <0.60 |
| security-auditor | Security audit | Auto-dispositions threats |
| debugger | Bug fixing | Auto-fixes when confidence > threshold |
| Others | Various | Unchanged from Learnship |
Verification Layers
- Structural: File existence, import/export wiring, no stubs
- Behavioral: Generated automated tests for must-haves
- Security: STRIDE analysis with auto-disposition
- Code Quality: Multi-persona review with P0 auto-fix
Specification Format
# Project: My Project
## Objective
Build a REST API for task management.
## Requirements
- User authentication (JWT-based)
- CRUD operations for tasks
- Real-time notifications
## Constraints
- Must use Node.js
- Must be production-ready
## Out of Scope
- Admin dashboard
- Mobile apps
Escalation Protocol
When CI cannot proceed autonomously:
- Irreversible Action: Deploy, delete, merge to protected branch
- Verification Failure: Tests pass but functional verification fails
- Low Confidence Decision: Critical decision below threshold
- Security Escalation: High-severity threat detected
- Specification Ambiguity: Multiple valid interpretations
Each escalation includes a recommended default with auto-proceed timeout.
Differences from Learnship
| Dimension | Learnship | CI |
|---|---|---|
| Human Interactions | 19+/lifecycle | 1-2/lifecycle |
| Decision Making | Human decides, agent implements | Agent decides, human reviews post-hoc |
| Verification | Human UAT | Automated tests + escalation |
| Specification | Multi-round conversation | Single spec file |
| Learning Curve | Moderate | Low (5 core commands) |
License
MIT
Releases
29
Languages
TypeScript
97.8%
JavaScript
1.2%
Shell
1%