continuous-intelligence/ci

Author	SHA1	Message	Date
Jon Chery	8e50049ba5	feat(P01): add ideation engine + ciagent ideate command — IDEATE-01,02,03,17 + MULTI-01 ---ci--- phase: 1 milestone: v0.10 status: execute decisions: - id: D-080 decision: Three-tier ideation (mechanical, backend-enriched, cross-project) rationale: Mechanical tier always produces output without backend confidence: 0.92 - id: D-089 decision: No separate codebase map command rationale: Git-native + .ciagent/ covers mapping; avoids tree-sitter dep confidence: 0.88 requirements: covered: - IDEATE-01 - IDEATE-02 - IDEATE-03 - IDEATE-17 - MULTI-01 ---/ci--- Add IdeationEngine core module with 15 signal collectors: - Uncovered/partial requirements from REQUIREMENTS.md - Coverage gaps (documented but unimplemented agents) - Repeated lessons from git history - Low-confidence decisions from ---ci--- blocks - Escalation patterns from git history - Compound solution patterns - Architecture drift (ARCHITECTURE.md vs src/) - Verification inversion (missing test files) - Improvement patterns (cross-referencing lessons + requirements) - Spec ambiguity (should/could/might patterns) - Spec missing (common requirement categories) - Cascade impact (--affected from git diff) - External signals (npm audit, dependency staleness) - Cross-project lesson mining Add ciagent ideate CLI command with flags: --category, --affected, --spec, --external, --cross-project, --output Add active_projects to CIAgentConfig (backwards compatible with active_project). Add IDEATE pipeline stage between RESEARCH and PLAN. Update IdeationAgent to delegate to IdeationEngine. 533 tests passing.	2026-05-30 20:13:43 +00:00
Jon Chery	a8b50f5109	feat(ci): v0.9.0 — Distribution & Expansion milestone complete CI / build-and-test (push) Has been cancelled Details Publish to npm / publish (push) Has been cancelled Details ---ci--- project: ci phase: 6 milestone: v0.9 status: complete artifacts: tags: [v0.9.0] decisions: - id: D-047 decision: v0.9 theme = Distribution & Expansion rationale: npm publish + OpenAI/Anthropic backends + agent flesh + parallel execution confidence: 0.92 - id: D-049 decision: Feature milestone — patch tags v0.8.1-v0.8.6 then v0.9.0 rationale: OpenAI backend, agent flesh, npm publish all feat confidence: 0.95 - id: D-059 decision: Rename OllamaBaseBackend to LLMBaseBackend + thin OllamaBaseBackend subclass rationale: 15 of 17 methods backend-agnostic confidence: 0.92 - id: D-060 decision: OpenAI/Anthropic backends use native fetch() not SDK packages rationale: No dependency bloat; fetch native in Node 18+ confidence: 0.85 - id: D-066 decision: Concurrency limiter internal (no p-limit dependency) rationale: 15 lines; avoids dependency for trivial feature confidence: 0.90 - id: D-067 decision: Promise.allSettled for review agents at orchestrator lines 373-400 rationale: Current sequential loop replaced with parallel execution confidence: 0.88 requirements: covered: [PUBLISH-01, PUBLISH-02, PUBLISH-03, PUBLISH-04, OPENAI-01, OPENAI-02, OPENAI-03, OPENAI-04, OPENAI-05, FLESH-01, FLESH-02, FLESH-03, FLESH-04, FLESH-05, ANTHROPIC-01, ANTHROPIC-02, FLESH-06, FLESH-07, NPM-01, NPM-02, PARALLEL-01, PARALLEL-02, PARALLEL-03, INTEG-01, INTEG-02, INTEG-03, INTEG-04, INTEG-05] ---/ci--- 6 phases, 28 tasks, 4077 net lines added, 57 test suites, 527 tests, zero stub agents	2026-05-30 02:19:44 +00:00
Jon Chery	70f9f720e6	feat(P06): integration \u0026 hardening — version 0.8.0, agent tests, E2E, docs, fallbacks ---ci--- project: ci phase: 6 milestone: v0.8 status: complete decisions: - id: D-037 decision: v0.8.0 release with 6 phases complete rationale: All verification layers now deliver what they claim confidence: 0.95 requirements: covered: [INT-01, INT-02, INT-03, INT-04, INT-05, INT-06, INT-07, INT-08] ---/ci--- INT-06: Version bumped to 0.8.0 in package.json and src/version.ts. INT-07: New test suites for SecurityAuditorAgent (5 tests), DocWriterAgent (5 tests), DebuggerAgent (5 tests), ChallengerAgent (4 tests). INT-08: Zod validation test suite with 9 cases: valid input, missing fields, path traversal, absolute paths, contradictory success+error, invalid operation, negative tokens, fail+error, emptyBackendResult. INT-04: ciagent review command now has mechanical fallback — runs CodeReviewerAgent regex review without backend. INT-05: ciagent debug command now has mechanical fallback — runs DebuggerAgent stack trace parsing + git bisect without backend. INT-01: E2E verification test — fixture with defects fails L3/L4; clean project passes all 4 layers. INT-02: AGENTS.md updated — removed 'not yet implemented' caveats for L2/L3/L4; updated test count to 44 suites, 454 tests. INT-03: PROJECT.md updated — removed Out of Scope for STRIDE, multi-persona review, and behavioral test generation.	2026-05-29 20:46:44 +00:00
Jon Chery	93967feb68	feat(P05): flesh 4 agents with intrinsic mechanical logic ---ci--- project: ci phase: 5 milestone: v0.8 status: complete decisions: - id: D-033 decision: Flesh SecurityAuditorAgent with STRIDE-aware mechanical scanning rationale: Runs L3 security patterns intrinsically; no backend required confidence: 0.90 - id: D-034 decision: Flesh DocWriterAgent with template-based doc update rationale: Updates ROADMAP.md phase status, REQUIREMENTS.md req status, reads git log for new decisions confidence: 0.85 - id: D-035 decision: Flesh DebuggerAgent with stack trace parsing + git bisect rationale: Parses stack traces to find file:line, bisects to find introducing commit confidence: 0.80 - id: D-036 decision: Flesh ChallengerAgent with plan DAG/wave/must-have/REQ validation rationale: Validates plan structure mechanically; catches circular deps and gaps confidence: 0.82 requirements: covered: [AGENT-01, AGENT-02, AGENT-03, AGENT-04] ---/ci--- AGENT-01: SecurityAuditorAgent.mechanicalAudit() runs STRIDE+ CWE pattern scan intrinsically. Each finding has stride_category, cwe, severity, and disposition (accept/mitigate/flag based on confidence threshold). AGENT-02: DocWriterAgent.mechanicalDocUpdate() reads plan data, updates .ciagent/ROADMAP.md phase status to complete, .ciagent/REQUIREMENTS.md pending→covered, and reads git log for new decision entries. AGENT-03: DebuggerAgent.mechanicalDebug() parses stack traces (4 regex patterns for different formats), identifies root file:line, runs git bisect to find introducing commit, suggests git revert. AGENT-04: ChallengerAgent.mechanicalChallenge() validates plan structure: circular dependency detection via DFS, wave ordering validation, must-haves presence check, and requirement coverage check.	2026-05-29 20:30:45 +00:00
Jon Chery	07e5e70c9b	feat(P04): 3-persona code review, fix L4 pass/fail, flesh CodeReviewerAgent ---ci--- project: ci phase: 4 milestone: v0.8 status: complete decisions: - id: D-031 decision: 3-persona quality review: security, performance, maintainability rationale: Each persona detects different class of issues; aggregate gives complete picture confidence: 0.82 - id: D-032 decision: L4 P0>0 = fail (not P0>3); P1 = warning (not pass) rationale: Any P0 finding is critical; P1 findings should never pass silently confidence: 0.95 requirements: covered: [QUAL-01, QUAL-02, QUAL-03, QUAL-04, QUAL-05] ---/ci--- QUAL-01: Added 3-persona review with distinct pattern sets: SecurityReviewer (injection, auth, crypto), PerformanceReviewer (sync I/O, timer leaks, DoS), MaintainabilityReviewer (type safety, dead code, tech debt). QUAL-02: CodeReviewerAgent fleshed with mechanical 3-persona review. Works without backend by running regex-based scan across all personas. QUAL-03: L4 passed=false when ANY P0 finding exists (was >3). P1 findings now return status='warning' (was always 'pass'). QUAL-04: TypeScript strict mode check remains in quality layer. QUAL-05: CodeReviewerAgent.mechanicalReview() provides regex-based review as fallback when no backend is available.	2026-05-29 20:26:21 +00:00
Jon Chery	d6ba76e660	fix(P01): add SIGTERM/SIGINT signal handlers for graceful shutdown ---ci--- project: ci phase: 1 milestone: v0.8 status: in_progress decisions: - id: D-026 decision: Graceful drain on SIGTERM/SIGINT: dispose timers then exit rationale: Prevents orphaned setTimeout timers from leaking when process is killed confidence: 0.88 requirements: covered: [FIX-07] ---/ci--- FIX-07: cli/index.ts registers SIGTERM/SIGINT handlers that call escalationProtocol.dispose() before process.exit. OrchestratorAgent registers its EscalationProtocol instance via registerEscalationProtocol(). SIGINT exits with code 130, SIGTERM with 143 (standard signal+128 convention).	2026-05-29 20:05:48 +00:00
Jon Chery	5fb285cf46	fix(P01): add Zod BackendResult validation and fix opencode silent success ---ci--- project: ci phase: 1 milestone: v0.8 status: in_progress decisions: - id: D-022 decision: Validate BackendResult at boundary with Zod schema rationale: External backend output is untrusted; runtime validation prevents corrupt commit streams confidence: 0.92 - id: D-023 decision: opencode parseResult returns success:false on malformed JSON rationale: Silent success:true on parse failure masks backend errors; fail loudly instead confidence: 0.95 requirements: covered: [FIX-02, FIX-03] ---/ci--- FIX-02: Add Zod BackendResultSchema and validateBackendResult() in backends/types.ts. backendResultToAgentResult() in base.ts now validates before passing through. Invalid results produce success:false with error detail. Path traversal protection: artifact paths with '..' or leading '/' are rejected. FIX-03: opencode.ts parseResult() no longer defaults to success:true when JSON parsing fails entirely. Both the inner parse error and the no-JSON match case now return emptyBackendResult() with descriptive error messages.	2026-05-29 19:52:51 +00:00
Jon Chery	2306493a77	fix(P01): replace hardcoded phase=1 in orchestrator and fix getDecisions double-fetch ---ci--- project: ci phase: 1 milestone: v0.8 status: in_progress decisions: - id: D-021 decision: 6-phase wave-ordered vertical slices for v0.8 rationale: Each phase independently demoable; critical fixes first confidence: 0.90 requirements: covered: [FIX-01, FIX-06] ---/ci--- FIX-01: Replace 5 hardcoded phase=1 literals in orchestrator.ts mechanical execution path with this.pipelineState!.current_phase. The orchestrator correctly tracks current_phase but commits always embedded literal 1. FIX-06: Replace getDecisions() redundant double-fetch with single getRecentCommits(50) call, delegating to existing getDecisionsFromCommits(). Old code called getRecentCommits(50) once per grep match entry (O(N*M) when it should be O(1)).	2026-05-29 19:46:46 +00:00
Jon Chery	a416413c7d	feat(P06): docs & hardening — AGENTS.md/README fixes, agent tests, Gitea tests, multi-project tests, version 0.7.0 ---ci--- phase: 6 milestone: v0.7.0 plan: 06 task: P06-all status: execute ---/ci---	2026-05-29 18:20:46 +00:00
Jon Chery	4de1f65c10	feat(P04): pipeline stage delegation — EXECUTE=3 agents, TEST=tester, VERIFY=verifier, COMPLETE=doc-writer+ship ---ci--- phase: 4 milestone: v1.0 plan: 04 task: PIPE-01-04 status: execute ---/ci---	2026-05-29 18:13:39 +00:00
Jon Chery	6902c37ced	fix(P03): improve planner task descriptions — avoid redundant REQ-ID in task lines ---ci--- phase: 3 milestone: v0.6.0 plan: 03 task: 03-03 status: execute ---/ci---	2026-05-29 18:11:49 +00:00
Jon Chery	bbabd2dc0a	feat(P03): core agent flesh — VerifierAgent, ResearcherAgent, TesterAgent intrinsic logic	2026-05-29 18:08:38 +00:00
Jon Chery	99df4fe4e2	feat(P02): orchestrator enrichment — GitAgentContext, multi-phase, error recovery, timer cleanup, TEST stage ---ci--- phase: 2 milestone: v0.6 status: execute decisions: - id: D-001 decision: Pass GitAgentContext to agents instead of bare AgentContext rationale: Agents need git-native context (gitContext, gitBranch, ciFiles, milestone) to operate autonomously confidence: 0.95 - id: D-002 decision: Implement multi-phase iteration with totalPhases derived from ROADMAP.md rationale: Milestones can span multiple phases; orchestrator must advance through all of them confidence: 0.90 - id: D-003 decision: Add executeStageWithRecovery with retry + plan revision + escalation rationale: Robust error recovery requires multiple fallback levels before giving up confidence: 0.85 - id: D-004 decision: Add timer-to-escalation mapping in EscalationProtocol for proper cleanup rationale: resolveEscalation must clearTimeout for the corresponding timer to prevent resource leaks confidence: 0.90 - id: D-005 decision: Add dispose() to EscalationProtocol called in orchestrator finally block rationale: Ensures all timers are cleaned up on orchestrator exit regardless of outcome confidence: 0.95 - id: D-006 decision: Add mechanical TEST stage fallback running npm test via execSync rationale: When no backend is available, tests can still be run mechanically confidence: 0.85 ---/ci---	2026-05-29 18:05:36 +00:00
Jon Chery	4a58aa1657	refactor(rebrand): rename & rebrand CI → CIAgent across all source and test files - Type renames: CIConfig → CIAgentConfig, DEFAULT_CI_CONFIG → DEFAULT_CIAGENT_CONFIG - Type renames: CiMetadata → CIAgentMetadata, ParsedCiCommit → ParsedCIAgentCommit - Function renames: initCI → initCIAgent, isCIInitialized → isCIAgentInitialized - Function renames: extractCiBlock → extractCIAgentBlock, parseCiBlock → parseCIAgentBlock - Class renames: CiFiles → CIAgentFiles - Import paths: ci-files.js → ciagent-files.js - Directory paths: .ci/ → .ciagent/ across all source and test files - Check names: ".ci directory exists" → ".ciagent directory exists" - Check names: "CI config valid" → "CIAgent config valid" - Temp dir names: ci--test- → ciagent--test- - CLI examples: "ci init" → "ciagent init" - Fix deepMerge infinite recursion bug in config.ts - ---ci---/---/ci--- block markers preserved unchanged - All 31 test suites, 370 tests passing ---ci--- phase: 1 milestone: v0.5 plan: 07 task: 07-01-01 status: execute ---/ci---	2026-05-29 18:01:13 +00:00
Jon Chery	5753e2dc96	fix(P03): honest execution — real rollback, honest orchestrator, git-native verification ---ci--- project: ci phase: 3 milestone: v0.5 status: complete decisions: - id: D-026 decision: Phase 3 Honest Execution complete rationale: All HONEST requirements covered; no more fake success returns confidence: 0.95 alternatives: [] requirements: covered: [HONEST-01, HONEST-02, HONEST-03] ---/ci---	2026-05-29 16:44:46 +00:00
CI	fb3f1df13e	release(v0.4.0): purge learnship, migrate .planning→.ci, fix backends, add test coverage - Remove all learnship references: Decision.learnship_equivalent field, agent persona prompts, opencode.json permissions, test fixtures - Migrate verification layers from .planning/ to .ci/: structural checks .ci/ dir + ROADMAP.md, behavioral checks ROADMAP.md - Fix ollama-local: remove sync require+curl blocking, use async fetchAvailableModels() in callModel - Fix opencode.json: use __OPENCODE_DIR__ template tokens, remove legacy learnship permission entries - Remove duplicate install script from package.json (keep postinstall) - Fix quality any-regex false positives (target type annotations only) - Add backends test coverage: backends.test.ts, tool-registry.test.ts - Version bump 0.3.0 → 0.4.0 - Artifacts module: rename .planning→.ci internal paths - Remove dead TODO_PATTERN/FIXME_PATTERN constants ---ci--- phase: 3 milestone: v0.4 status: complete requirements: covered: [REQ-09, REQ-10, REQ-11, REQ-13, REQ-14, REQ-17] partial: [] decisions: - id: D-001 decision: purge all learnship references from codebase rationale: project is CI-only, learnship is no longer a dependency confidence: 0.99 category: scope alternatives: [keep for historical reference] - id: D-002 decision: migrate verification from .planning/ to .ci/ paths rationale: .planning/ is removed schema, all current state lives in .ci/ confidence: 0.95 category: architecture alternatives: [keep dual-path support] - id: D-003 decision: use __OPENCODE_DIR__ template tokens in opencode.json rationale: hardcoded ~ paths fail in containers and non-standard homes confidence: 0.90 category: implementation_approach alternatives: [keep tilde expansion] ---/ci---	2026-05-29 16:18:30 +00:00
CI	940b85bfae	feat(backends): multi-backend intelligence layer — LLM + Agent backends, persona-loading agents, honest CLI commands Add IntelligenceBackend abstraction with two categories: - LLMBackend (OllamaLocal, OllamaCloud): CI runs tool loop, provides tools, constructs prompts - AgentBackend (Opencode): agent runs own tool loop, CI serializes request Refactor all 18 agents from hardcoded stubs to persona loaders that delegate to the active backend or fail honestly when no backend is available. Refactor OrchestratorAgent.executeStage() from monolithic switch to agent delegation via STAGE_AGENT_MAP for intelligent stages (research, plan, execute, verify), with mechanical stages (specify, clarify, complete) staying inline. Wire CLI commands with --backend flag and auto-detection (opencode → ollama-local → ollama-cloud). Harden rollback/ship with real git operations. No command returns fake success.	2026-05-29 15:58:34 +00:00
grimacing	6e637e4af0	v0.2.0: Git-native architecture (#1 )	2026-05-29 12:59:45 +00:00
CI	9cf5c000d9	feat: implement CI (Continuous Intelligence) autonomous engineering harness Implements the full PRD for CI - a fully autonomous AI-driven software engineering harness derived from Learnship's architecture. Core components: - CI Orchestrator agent with autonomous pipeline (SPECIFY → CLARIFY → RESEARCH → PLAN → EXECUTE → VERIFY → COMPLETE) - Decision Engine with confidence thresholds (high/medium/low) - Clarify Phase with question budget and default acceptance - Escalation Protocol with timeout auto-proceed - Audit Trail system (.ci/audit/) for post-hoc review - Error Recovery with retry, plan revision, and rollback 18 agents (all Learnship agents + Orchestrator): - Autonomous behavioral modifications per PRD §7.1 - Agent registry with factory pattern 11 CLI commands: - ci init, ci run, ci quick, ci debug, ci verify - ci review, ci status, ci audit, ci clarify - ci rollback, ci ship 4-layer verification system: - Structural, Behavioral, Security, Code Quality 3 autonomy levels: full, supervised, guided Compatible with Learnship artifact schemas (.planning/)	2026-05-28 23:24:42 +00:00

19 Commits