
Jon Chery f7fff95cbe feat(P03): full STRIDE + CWE security verification with reduced false positives

---ci---
project: ci
phase: 3
milestone: v0.8
status: complete
decisions:
  - id: D-029
    decision: Full STRIDE 7-category coverage with CWE mapping
    rationale: Industry standard threat classification with actionable CWE remediation
    confidence: 0.88
  - id: D-030
    decision: Reduce exec/eval false positives via string interpolation detection
    rationale: execSync("ls") is safe; execSync(`rm ${x}`) is not
    confidence: 0.85
requirements:
  covered: [SEC-01, SEC-02, SEC-03, SEC-04, SEC-05, SEC-06]
---/ci---

SEC-01: Fixed STRIDE category misassignments. Hardcoded password is
information_disclosure (CWE-259), not spoofing. exec with interpolation
is elevation_of_privilege (CWE-78), not tampering. All 17 patterns
correctly categorized.

SEC-02: Added missing STRIDE categories: repudiation (empty catch blocks,
CWE-778) and spoofing (jwt.decode without verify, CWE-287). Also added
denial_of_service (JSON body parser without size limit, CWE-400) and
prototype pollution (CWE-1321), weak crypto (CWE-328), unsafe
deserialization (CWE-502), path traversal (CWE-22).

SEC-03: Reduced false positives: exec/eval patterns now require string
interpolation (template literal or dynamic concat), not all exec/calls.

SEC-04: Every SECURITY_PATTERNS entry has a cwe field with valid CWE ID.

SEC-05: Confidence-based auto-disposition: each pattern has a confidence
score. High confidence findings are flagged, medium require verification,
low are suppressed. Threshold configurable via constructor.

SEC-06: Security passed=false when any high-severity finding exists
(already enforced by hasHighFail check, now more explicit).

2026-05-29 20:23:09 +00:00

Name	Last commit	Date
.githooks	feat(P06): 3-tier versioning, branch hierarchy enforcement, ARCHITECTURE-PLAN synthesis	2026-05-29 17:18:10 +00:00
opencode	feat(P06): docs & hardening — AGENTS.md/README fixes, agent tests, Gitea tests, multi-project tests, version 0.7.0	2026-05-29 18:20:46 +00:00
scripts	docs(rebrand): rename & rebrand CI → CIAgent across all documentation, templates, and scripts	2026-05-29 17:58:48 +00:00
src	feat(P03): full STRIDE + CWE security verification with reduced false positives	2026-05-29 20:23:09 +00:00
templates	feat(P06): docs & hardening — AGENTS.md/README fixes, agent tests, Gitea tests, multi-project tests, version 0.7.0	2026-05-29 18:20:46 +00:00
.gitignore	v0.2.0: Git-native architecture (#1)	2026-05-29 12:59:45 +00:00
AGENTS.md	feat(P06): docs & hardening — AGENTS.md/README fixes, agent tests, Gitea tests, multi-project tests, version 0.7.0	2026-05-29 18:20:46 +00:00
jest.config.js	v0.2.0: Git-native architecture (#1)	2026-05-29 12:59:45 +00:00
LICENSE	feat: implement CI (Continuous Intelligence) autonomous engineering harness	2026-05-28 23:24:42 +00:00
package-lock.json	fix(P01): replace hardcoded phase=1 in orchestrator and fix getDecisions double-fetch	2026-05-29 19:46:46 +00:00
package.json	feat(P06): docs & hardening — AGENTS.md/README fixes, agent tests, Gitea tests, multi-project tests, version 0.7.0	2026-05-29 18:20:46 +00:00
README.md	feat(P06): docs & hardening — AGENTS.md/README fixes, agent tests, Gitea tests, multi-project tests, version 0.7.0	2026-05-29 18:20:46 +00:00
tsconfig.json	feat: implement CI (Continuous Intelligence) autonomous engineering harness	2026-05-28 23:24:42 +00:00

README.md

CIAgent — Continuous Intelligence

Fully autonomous, git-native AI-driven software engineering harness.

Overview

CIAgent (Continuous Intelligence) is an autonomous-first software engineering harness that eliminates human-in-the-loop overhead while preserving the rigor of guided development. It receives a specification, resolves ambiguities through a single Clarify phase, then executes the full pipeline — research, plan, execute, verify — autonomously.

The git log IS the project memory. Every decision, escalation, lesson learned, and verification result is encoded in commit messages using structured ---ci--- YAML blocks. To gather context, an agent reaches for git log first, not file reads. Another agent with access to only commit messages (no code, no diffs) can reconstruct the project state completely.

Installation

From source (package not yet published to npm):

git clone https://git.cloudinit.dev/continuous-intelligence/ci.git
cd ci
npm install
npm run build
npm link

Quick Start

# Initialize from inline specification
ciagent init "Build a REST API for task management"

# Initialize from a specification file
ciagent init --spec ./specs/my-project.md

# Run the full autonomous pipeline
ciagent run --all

# Run a specific phase
ciagent run research
ciagent run plan
ciagent run execute
ciagent run verify

# Execute an ad-hoc task
ciagent quick "Add authentication middleware"

# Check project status (reads from git log + branches)
ciagent status

# Review autonomous decisions (extracted from git log ---ci--- blocks)
ciagent audit
ciagent audit --verbose

# Debug an issue
ciagent debug "Tests failing on CI"

# Rollback a phase
ciagent rollback 1

# Ship a phase (verify, security, commit, tag)
ciagent ship 1

Git-Native Architecture (v0.2.0)

The Commit Schema

Every CIAgent-generated commit contains a ---ci--- YAML block with structured metadata:

feat(P01-01-02): create user registration endpoint

---ci---
phase: 1
milestone: v1.0
plan: 01-01
task: 01-01-02
status: execute
decisions:
  - id: D-003
    decision: Use bcrypt with 12 rounds for password hashing
    rationale: Industry standard; argon2 not available in target env
    confidence: 0.88
    alternatives: [argon2, scrypt]
requirements:
  covered: [AUTH-01]
---/ci---

- POST /auth/register validates email and password
- Checks for duplicate users
- Returns JWT token on success
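
The block layout above lends itself to a simple extraction step. The following is a minimal sketch, not the project's actual commit-parser module: `extractCiBlock` and `CiBlock` are hypothetical names, and only unindented `key: value` scalars are read. Nested entries such as `decisions:` would need a real YAML parser.

```typescript
// Hypothetical helper for illustration; not CIAgent's commit-parser.
interface CiBlock {
  raw: string;                      // text between the two markers, untouched
  scalars: Record<string, string>;  // top-level `key: value` pairs only
}

function extractCiBlock(message: string): CiBlock | null {
  // The markers must sit on their own lines; the body is captured lazily.
  const match = /^---ci---\r?\n([\s\S]*?)\r?\n---\/ci---$/m.exec(message);
  if (!match) return null;

  const raw = match[1] ?? "";
  const scalars: Record<string, string> = {};
  for (const line of raw.split(/\r?\n/)) {
    // Indented lines and keys without an inline value (e.g. `decisions:`) are skipped.
    const kv = /^([A-Za-z_][\w-]*):[ \t]+(\S.*)$/.exec(line);
    if (kv) {
      const [, key = "", value = ""] = kv;
      scalars[key] = value.trim();
    }
  }
  return { raw, scalars };
}
```

Applied to the example commit above, `scalars.phase` would be the string `"1"` and `scalars.status` would be `"execute"`. Values stay as strings in this sketch; type coercion is left to the caller.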

What Lives Where

Where	What	Why
`.ciagent/config.json`	Autonomy, thresholds, git strategy	Controls system behavior before any commits exist
`.ciagent/PROJECT.md`	Vision, core value, requirements, constraints, key decisions table	Long-lived strategic reference
`.ciagent/ARCHITECTURE.md`	System architecture, component boundaries, data flow	Long-lived technical reference
`.ciagent/ROADMAP.md`	Phase breakdown, milestone mapping, success criteria	Long-lived planning reference
`.ciagent/REQUIREMENTS.md`	v1/v2 requirements with REQ-IDs and traceability	Long-lived requirements reference
Git commit bodies	Decisions, escalations, lessons, compounds, verification results	Dynamic event stream — the audit trail
Git branches	Phase/milestone status	`phase/NN-slug` and `milestone/vX.X-slug` encode project structure

Branch Strategy

main
 └── milestone/v1.0-mvp
      ├── phase/01-authentication       # in progress if not merged
      ├── phase/02-task-management
      └── phase/03-realtime-notifications

Branch exists + not merged = phase in progress
Branch merged to milestone = phase complete
Milestone branch merged to main = milestone complete
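
The three rules above can be sketched as a pure function over two branch lists. This is an illustrative sketch only: `phaseStatuses` is a hypothetical name, and the inputs are assumed to come from something like `git branch --format="%(refname:short)"` and `git branch --merged <milestone-branch> --format="%(refname:short)"`.

```typescript
type PhaseStatus = "in_progress" | "complete";

// Hypothetical helper: maps each phase/* branch to a status.
// `allBranches` is every local branch name; `mergedIntoMilestone` is the
// subset that git reports as merged into the milestone branch.
function phaseStatuses(
  allBranches: string[],
  mergedIntoMilestone: string[],
): Record<string, PhaseStatus> {
  const merged = new Set(mergedIntoMilestone.map((b) => b.trim()));
  const result: Record<string, PhaseStatus> = {};
  for (const name of allBranches.map((b) => b.trim())) {
    if (!name.startsWith("phase/")) continue; // ignore main and milestone/*
    result[name] = merged.has(name) ? "complete" : "in_progress";
  }
  return result;
}
```

One caveat with this approach: `git branch --merged` also lists a branch that was just created and has no commits of its own, so a brand-new phase branch would read as complete until its first commit lands.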

Context Reconstruction Protocol

An agent starting a session gathers context in this order:

1. git log --oneline -20 — recent activity
2. git branch -a — phase/milestone structure
3. git log -1 --format="%b" — latest ---ci--- block
4. .ciagent/config.json — autonomy + thresholds
5. .ciagent/PROJECT.md — vision + constraints (when needed)
6. .ciagent/ROADMAP.md — phase plan + success criteria (when needed)
7. .ciagent/REQUIREMENTS.md — REQ-IDs + traceability (when planning)
8. .ciagent/ARCHITECTURE.md — system structure (when researching)

Steps 1-3 take <1 second and provide 80% of the context needed.
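
The three git commands at the top of that list could be wrapped as follows. This is a sketch under assumptions: `gatherGitContext`, `GitRunner`, and `SessionContext` are hypothetical names, not the project's git-context module, and the git invocation is injected so the function stays testable without a repository.

```typescript
// A runner takes git arguments and returns stdout as text.
type GitRunner = (args: string[]) => string;

interface SessionContext {
  recentCommits: string[]; // `git log --oneline -20`, one entry per commit
  branches: string[];      // `git branch -a`, current-branch marker removed
  latestBody: string;      // `git log -1 --format=%b`, holds the latest ---ci--- block
}

function gatherGitContext(git: GitRunner): SessionContext {
  const lines = (out: string): string[] =>
    out.split("\n").map((l) => l.trim()).filter((l) => l.length > 0);

  return {
    recentCommits: lines(git(["log", "--oneline", "-20"])),
    // `git branch -a` prefixes the checked-out branch with "* ".
    branches: lines(git(["branch", "-a"])).map((b) => b.replace(/^\*\s+/, "")),
    latestBody: git(["log", "-1", "--format=%b"]).trim(),
  };
}
```

In Node, one plausible runner is `(args) => execFileSync("git", args, { encoding: "utf8" })` using `execFileSync` from `node:child_process`. Passing arguments as an array avoids shell interpolation.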

The Reconstruction Test

An agent with access to only commit messages (no code, no diffs, no .ciagent/ files) can reconstruct:

Reconstructable	How
Project specification	Init commit body
Current phase	`---ci---.phase` field + branch status
Current milestone	Branch names + `---ci---.milestone` field
All decisions with rationale	`git log --grep="decisions:" --format="%b"`
Decision confidence	Each decision has `confidence: 0.XX`
Alternatives considered	Each decision has `alternatives: [...]`
Requirements coverage	`git log --grep="requirements:" --format="%b"`
Lessons learned	`git log --grep="lessons:" --format="%b"`
Compounded solutions	`git log --grep="compound:" --format="%b"`
Escalation history	`git log --grep="escalation:" --format="%b"`
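
As an illustration of the "All decisions with rationale" row, the output of `git log --grep="decisions:" --format="%b"` could be scanned for decision entries. The sketch below is a line-based reader that assumes the exact indentation shown in the commit schema; `parseDecisions` and `Decision` are hypothetical names, and a real implementation would more likely use a YAML parser.

```typescript
interface Decision {
  id: string;
  fields: Record<string, string>; // decision, rationale, confidence, alternatives, ...
}

// Hypothetical helper: reads every `decisions:` list found in the given text.
function parseDecisions(logBodies: string): Decision[] {
  const decisions: Decision[] = [];
  let current: Decision | null = null;
  let inDecisions = false;

  for (const line of logBodies.split(/\r?\n/)) {
    if (/^decisions:\s*$/.test(line)) {
      inDecisions = true;
      current = null;
      continue;
    }
    if (!inDecisions) continue;

    // `  - id: D-003` starts a new entry.
    const start = /^\s+- id:\s*(\S+)\s*$/.exec(line);
    if (start) {
      current = { id: start[1] ?? "", fields: {} };
      decisions.push(current);
      continue;
    }
    // Indented `key: value` lines belong to the current entry.
    const field = /^\s+([A-Za-z_]\w*):\s*(.+)$/.exec(line);
    if (field && current) {
      current.fields[field[1] ?? ""] = (field[2] ?? "").trim();
      continue;
    }
    // Anything else (next top-level key, closing marker, blank line) ends the list.
    inDecisions = false;
    current = null;
  }
  return decisions;
}
```

Field values are kept as raw strings, so `confidence` would arrive as `"0.88"` and `alternatives` as `"[argon2, scrypt]"`.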

Commit Types

In addition to conventional commit types, CIAgent uses:

Type	When Used
`decision`	Autonomous decision logged (no code change)
`compound`	Compounded solution captured
`escalation`	Escalation raised or resolved
`verify`	Verification pass/fail
`wip`	Work-in-progress checkpoint

Autonomy Levels

Level	Behavior
`full`	No human interaction after Clarify. Escalate only irreversible decisions.
`supervised`	Escalate on every Escalation Gate plus verification failures.
`guided`	Escalate on every Decision Gate.

Configuration

CIAgent uses .ciagent/config.json for project configuration:

{
  "autonomy": {
    "level": "full",
    "escalation_hooks": ["deploy", "delete_data", "merge_to_main"],
    "clarify_budget": 10,
    "decision_confidence_threshold": 0.6,
    "max_revision_iterations": 3,
    "max_verification_retries": 2,
    "escalation_timeout_ms": 300000
  },
  "model_profile": "quality",
  "parallelization": {
    "enabled": true,
    "max_concurrent_agents": 5,
    "min_plans_for_parallel": 2
  },
  "verification": {
    "automated_only": true,
    "escalate_visual": true,
    "escalate_external_integration": true,
    "test_first": false
  },
  "security": {
    "auto_accept_low_severity": true,
    "auto_mitigate_medium_severity": true,
    "escalate_high_severity": true
  },
  "git": {
    "branching_strategy": "phase",
    "auto_commit": true,
    "auto_push": false
  }
}
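
A consumer of this file might validate the `autonomy` section before trusting it. The following is a minimal sketch, assuming only the field names visible in the example above; `readAutonomy` and `AutonomyConfig` are hypothetical and do not describe CIAgent's actual loader.

```typescript
type AutonomyLevel = "full" | "supervised" | "guided";

interface AutonomyConfig {
  level: AutonomyLevel;
  escalation_hooks: string[];
  decision_confidence_threshold: number;
}

const LEVELS: readonly string[] = ["full", "supervised", "guided"];

function isLevel(value: unknown): value is AutonomyLevel {
  return typeof value === "string" && LEVELS.indexOf(value) !== -1;
}

// Hypothetical helper: parses config.json text and checks the autonomy section.
function readAutonomy(configJson: string): AutonomyConfig {
  const parsed: unknown = JSON.parse(configJson);
  const autonomy =
    (parsed as { autonomy?: Record<string, unknown> } | null)?.autonomy ?? {};

  const level = autonomy["level"];
  if (!isLevel(level)) {
    throw new Error(`unknown autonomy level: ${String(level)}`);
  }
  const threshold = autonomy["decision_confidence_threshold"];
  if (typeof threshold !== "number" || threshold < 0 || threshold > 1) {
    throw new Error("decision_confidence_threshold must be a number in [0, 1]");
  }
  const hooks = autonomy["escalation_hooks"];
  const escalation_hooks = Array.isArray(hooks)
    ? hooks.filter((h): h is string => typeof h === "string")
    : [];

  return { level, escalation_hooks, decision_confidence_threshold: threshold };
}
```

Failing fast on an unknown level or an out-of-range threshold is a design choice of this sketch; whether the real tool falls back to defaults instead is not stated in this README.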

Architecture

Pipeline

SPECIFY → CLARIFY → RESEARCH → PLAN → EXECUTE → TEST → VERIFY → COMPLETE
                 ↕               ↕         ↕     ↕      ↕
            (questions)    (auto-decide) (auto-run) (auto-test) (auto-verify)

Git-Native Core Modules

Module	Purpose
`commit-parser`	`---ci---` YAML block extraction and parsing from commit messages
`commit-builder`	Structured commit message generation for all commit types
`git-context`	Project state reconstruction from `git log` + `git branch`
`git-branch`	Phase/milestone branch lifecycle management
`ciagent-files`	`.ciagent/` long-lived reference file management with update discipline

Decision Engine

Every autonomous decision is classified by confidence:

High (>0.85): Auto-decide, commit as ---ci--- block
Medium (0.60-0.85): Auto-decide with assumption logging, flag for review
Low (<0.60): Escalate to human

Decisions are committed to git as decision type commits. The audit trail is git log --grep="decisions:".
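
The three confidence bands can be written as a small classifier. This is a sketch rather than the engine's real code: `classifyDecision` and the `Disposition` labels are hypothetical, and the default bounds simply mirror the 0.60 and 0.85 figures listed above, with both boundary values treated as Medium.

```typescript
type Disposition = "auto_decide" | "auto_decide_flagged" | "escalate";

// Hypothetical helper mirroring the High / Medium / Low bands.
function classifyDecision(
  confidence: number,
  low: number = 0.6,   // below this: escalate to human
  high: number = 0.85, // above this: auto-decide without a review flag
): Disposition {
  if (!(confidence >= 0 && confidence <= 1)) {
    throw new RangeError(`confidence must be within [0, 1], got ${confidence}`);
  }
  if (confidence > high) return "auto_decide";
  if (confidence >= low) return "auto_decide_flagged";
  return "escalate";
}
```

Under these bounds, decision D-003 from the commit schema example (confidence 0.88) would be auto-decided, while a decision at exactly 0.85 would still be flagged for review.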

19 Agents

Agent	Role	CIAgent Modification
orchestrator	Pipeline controller	Git-first context loading, `---ci---` commit generation
planner	Plan creation	Never sets `autonomous: false`
executor	Task execution	Never pauses for checkpoints
verifier	Output verification	Generates automated tests, not human UAT
researcher	Domain research	Logs assumptions, never flags for human
tester	Integration/e2e tests	Detects and runs existing test files, never writes tests
challenger	Plan stress-testing	Binding verdicts, only escalates <0.60
security-auditor	Security audit	Auto-dispositions threats
debugger	Bug fixing	Auto-fixes when confidence > threshold
Others	Various	Delegates to active intelligence backend

Verification Layers

Structural: File existence, import/export wiring, no stubs
Behavioral: Test infrastructure and requirement traceability (partially implemented — static analysis, no test generation yet)
Security: Regex-based threat pattern scanning with auto-disposition (partially implemented — no STRIDE analysis yet)
Code Quality: Regex-based code quality checks (partially implemented — no multi-persona review yet)

Specification Format

# Project: My Project

## Objective
Build a REST API for task management.

## Requirements
- User authentication (JWT-based)
- CRUD operations for tasks
- Real-time notifications

## Constraints
- Must use Node.js
- Must be production-ready

## Out of Scope
- Admin dashboard
- Mobile apps

Escalation Protocol

When CIAgent cannot proceed autonomously:

Irreversible Action: Deploy, delete, merge to protected branch
Verification Failure: Tests pass but functional verification fails
Low Confidence Decision: Critical decision below threshold
Security Escalation: High-severity threat detected
Specification Ambiguity: Multiple valid interpretations

Each escalation is committed as an escalation type commit. Resolved escalations produce a follow-up commit with the resolution. The full escalation history is available via git log --grep="escalation:".

Current Limitations

Agent implementations: 5 core agents have intrinsic logic (planner, executor, verifier, researcher, tester); 13 agents delegate to backends. Full LLM-powered agent behavior requires an intelligence backend.
Package not published to npm: Install from source only until a publishing pipeline is configured.
Behavioral/Security/Quality verification layers: Partially implemented — structural verification is complete; behavioral does static analysis; security does regex-based threat scanning; quality does regex-based code quality checks.

Differences from Learnship

Dimension	Learnship	CIAgent
Project memory	`.planning/` directory files (legacy)	Git log + `---ci---` commit blocks
Audit trail	`.ciagent/audit/*.json` files (legacy)	`git log --grep="decisions:"`
State management	`STATE.md` + `STATE.md.json` (legacy)	Reconstructed from git on demand
Phase discovery	Read `.planning/phases/` directory (legacy)	`git branch -a | grep phase/`
Human Interactions	19+/lifecycle	1-2/lifecycle
Decision Making	Human decides, agent implements	Agent decides, human reviews post-hoc
Verification	Human UAT	Automated tests + escalation
Specification	Multi-round conversation	Single spec file

Repository

git.cloudinit.dev/continuous-intelligence/ci

License

MIT

Releases 29

v0.11.0: Multi-Session & Persona Specialization Latest

2026-06-01 20:23:07 +00:00

Languages

TypeScript 97.8%

JavaScript 1.2%

Shell 1%