The Blueprint 365
Overview
Blueprint 365 is an 11-stage AI agentic pipeline that transforms raw business requirements into a perfect, fully-implemented system reflecting requirements with 100% fidelity. Each stage is powered by a specialized large language model (LLM) with strict governance, traceability, and gap detection at every step.
Unlike traditional development workflows that rely on human translation of requirements through specification, design, and implementation — introducing ambiguity, drift, and omissions at every handoff — Blueprint 365 uses a deterministic multi-model pipeline where every artifact is verified against its source before the next stage begins.
Pipeline Flow
[USER] --> Stage 1 --> Stage 2 --> Stage 3 --> Stage 4 --> Stage 5 --> Stage 6 --> Stage 7 --> Stage 8 --> Stage 9 --> Stage 10 --> Stage 11 --> [SYSTEM]
REQGATH DOCGEN GA-A ARCH GA-B DETAIL GA-C PROMPT GA-D CODE GA-E
The pipeline operates in five phases:
- Requirements Phase (Stages 1-3): Raw user input → validated, unambiguous requirements document
- Design Phase (Stages 4-6): Requirements → high-level architecture → detailed design specifications
- Prompt Phase (Stages 7-9): Design → optimized code generation prompts
- Execution Phase (Stage 10): Prompts → production-ready code
- Certification Phase (Stage 11): Final quality attestation and critique
Stage Descriptions
Stage 1 — Requirement Gathering
| Property | Value |
|---|---|
| LLM | Kimi |
| Category | Strong |
| Independence | First stage — no predecessor |
| Output Token | VALIDATED_DETERMINISTIC |
The entry point of the pipeline. Raw user input is ingested and transformed into a structured, unambiguous, deterministic requirements specification. The system:
- Decomposes vague or high-level statements into specific, actionable requirements
- Eliminates ambiguity by requiring quantification of all metrics and constraints
- Identifies implicit assumptions and surfaces them for user confirmation
- Validates that requirements are internally consistent and free of contradictions
- Produces a numbered, traceable requirements list that every subsequent stage references
Key rule: No requirement enters the pipeline unless it is stated in a form that can be objectively verified as "done" or "not done."
Stage 2 — Document Generation
| Property | Value |
|---|---|
| LLM | Sonnet |
| Category | Light-Strong |
| Independence | Stage 1 (Kimi) |
| Output Token | SRS_GENERATED |
Transforms the validated requirements from Stage 1 into a formal Software Requirements Specification (SRS) document. This document serves as the single source of truth for the entire pipeline.
The SRS includes:
- Functional requirements — what the system must do, mapped to Stage 1 line items
- Non-functional requirements — performance, security, scalability, compliance constraints
- User scenarios — happy path, edge cases, and exceptional flows
- Acceptance criteria — measurable conditions for each requirement
- Dependencies and constraints — external systems, libraries, platforms, regulatory considerations
The SRS is written in a standardized format that downstream stages can parse and trace against.
Stage 3 — Gap Analysis A (GA-A)
| Property | Value |
|---|---|
| LLM | DeepSeek |
| Category | Strong |
| Independence | Stage 2 (Sonnet) |
| Output Token | VERIFIED_RECONCILED or REJECTED_GAP_DETECTED |
First verification gate. GA-A compares the Stage 2 SRS against the Stage 1 requirements and identifies:
- Missing requirements — requirements from Stage 1 not addressed in the SRS
- Scope creep — content in the SRS not traceable to a Stage 1 requirement
- Ambiguity — statements in the SRS that lack the precision of Stage 1
- Contradictions — internal inconsistencies within the SRS
Outcomes:
VERIFIED_RECONCILED— the SRS fully and accurately reflects all requirements. Pipeline proceeds to Stage 4.REJECTED_GAP_DETECTED— gaps found. Pipeline loops back to Stage 2 (or Stage 1 if the requirements themselves need refinement).
This stage enforces the fundamental rule: no downstream artifact may contain anything not authorized by Stage 1.
Stage 4 — Technical Architecture
| Property | Value |
|---|---|
| LLM | Opus 🏆 |
| Category | Premium Strong |
| Independence | Stage 3 (DeepSeek) |
| Output Token | DESIGN_STABILIZED |
Produces the high-level technical architecture. This stage:
- Designs the system structure: components, modules, layers, interfaces
- Selects technology stack: languages, frameworks, databases, infrastructure
- Defines data flow: how data moves between components and external systems
- Establishes architectural patterns: microservices, event-driven, layered, etc.
- Identifies integration points with existing systems
- Documents architectural decisions with rationale (Architecture Decision Records)
Every architectural decision must trace to one or more Stage 1 requirements. If a requirement could be satisfied by multiple architectures, the system selects the simplest option and documents the alternatives considered.
Stage 5 — Gap Analysis B (GA-B)
| Property | Value |
|---|---|
| LLM | Nemotron |
| Category | Strong |
| Independence | Stage 4 (Opus) |
| Output Token | ARCHITECTURE_VERIFIED or REJECTED_ARCHITECTURAL_GAP_DETECTED |
Second verification gate. GA-B validates the Stage 4 architecture against the Stage 1 requirements and the Stage 2 SRS:
- Requirements coverage — every functional and non-functional requirement addressed by the architecture
- Architectural gold-plating — detects unauthorized additions (over-engineered components not justified by requirements)
- Feasibility assessment — flags architectural choices that conflict with constraints (budget, timeline, team capability, regulatory)
- Integration validation — confirms all external interfaces are accounted for
Outcomes:
ARCHITECTURE_VERIFIED— architecture is complete and correct. Proceed to Stage 6.REJECTED_ARCHITECTURAL_GAP_DETECTED— issues found. Loop back to Stage 4.
Stage 6 — Detailed Architecture
| Property | Value |
|---|---|
| LLM | MiniMax |
| Category | Execution |
| Independence | Stage 5 (Nemotron) |
| Output Token | DETAILED_DESIGN_COMPLETE |
Expands the high-level architecture into detailed, implementation-ready specifications:
- Component specifications — exact APIs, data structures, algorithms for each component
- Database schemas — tables, indexes, relationships, migration strategy
- API contracts — request/response formats, endpoints, authentication, error codes
- State machines — state transitions, triggers, side effects for stateful components
- Configuration design — environment variables, feature flags, deployment parameters
- Testing strategy — unit, integration, end-to-end test specifications
At this stage, the design is precise enough that a developer could implement each component without ambiguity.
Stage 7 — Gap Analysis C (GA-C)
| Property | Value |
|---|---|
| LLM | DeepSeek |
| Category | Strong |
| Independence | Stage 6 (MiniMax) |
| Output Token | DETAILED_DESIGN_VERIFIED |
Third verification gate. GA-C validates the detailed design against the SRS and high-level architecture:
- Design-requirements traceability — every detailed design element mapped to a requirement
- Architecture compliance — detailed design faithfully implements the architecture without deviation
- Completeness check — no gaps between high-level architecture and detailed design
- Edge case coverage — all boundary conditions, error states, and exceptional flows are designed for
Stage 8 — Prompt Generation
| Property | Value |
|---|---|
| LLM | Opus 🏆 |
| Category | Premium Strong |
| Independence | Stage 7 (DeepSeek) |
| Output Token | PROMPTS_GENERATED |
Transforms the detailed design into optimized code generation prompts. Each prompt is engineered to produce a specific component or module with minimal ambiguity. This stage:
- Breaks the detailed design into logically independent work units
- Generates system prompts that define the role, context, constraints, and output format for each unit
- Includes inline tests, examples, and expected behavior descriptions
- Chains dependent prompts to ensure consistent cross-module interfaces
- Optimizes prompt structure for the target code generation model's strengths
The output is a set of ready-to-execute prompts that, when fed to the code generation model, produce production-quality code.
Stage 9 — Gap Analysis D (GA-D)
| Property | Value |
|---|---|
| LLM | Nemotron |
| Category | Strong |
| Independence | Stage 8 (Opus) |
| Output Token | PROMPTS_VERIFIED |
Fourth verification gate. GA-D validates the prompts against the detailed design:
- Prompt-design alignment — each prompt corresponds to a defined component or module
- Completeness — no design element without a corresponding prompt
- Omission detection — identifies design details missing from prompts
- Prompt quality — checks for ambiguity, missing context, or conflicting instructions within prompts
Stage 10 — Code Generation
| Property | Value |
|---|---|
| LLM | Big Pickle |
| Category | Execution |
| Independence | Stage 9 (Nemotron) |
| Output Token | CODE_GENERATED |
Executes the verified prompts to produce production-ready source code. This stage:
- Generates code for each prompt independently
- Assembles generated code into a coherent project structure
- Applies consistent coding standards, patterns, and conventions across all modules
- Generates accompanying files: build scripts, configuration files, Dockerfiles, CI/CD configurations
- Produces inline documentation, API references, and README files
The output is a complete, buildable codebase — not fragments or snippets.
Stage 11 — Final Quality Attestation & Critique (GA-E)
| Property | Value |
|---|---|
| LLM | Kimi |
| Category | Strong |
| Independence | Stage 10 (Big Pickle) |
| Output Token | ATTESTED, DEFERRED, or REJECTED |
The final verification gate — the most rigorous. GA-E performs a comprehensive audit of the entire pipeline output:
- 100% requirements coverage — every Stage 1 requirement implemented in code
- Scenario coverage — all happy paths, edge cases, and exceptional scenarios addressed
- Compilation verification — code is syntactically correct and buildable
- Full traceability — every line of code traces back through prompts → design → SRS → requirement
- Operational readiness — identifies all operational concerns (connection strings, secrets, infrastructure, integration points)
- User Intervention Manifest — complete list of every human-provided item needed to run the system (API keys, database connection strings, environment variables, third-party service accounts)
Outcomes:
ATTESTED— system passes all checks. Pipeline output is certified as production-ready.DEFERRED— minor issues identified. Documented for human resolution but pipeline output is substantially complete.REJECTED— critical failures detected. Pipeline must be restarted from the affected stage.
LLM Independence Rule
Every stage must use a different LLM than its immediate predecessor. No two consecutive stages may share the same LLM. This ensures independent analysis, prevents echo-chamber effects, and maximizes verification integrity.
| Stage | LLM | Why Different from Previous |
|---|---|---|
| 1 — Requirement Gathering | Kimi | — |
| 2 — Document Generation | Sonnet | Different from Kimi (different provider, different architecture) |
| 3 — GA-A | DeepSeek | Different from Sonnet (different provider, different training objective) |
| 4 — Technical Architecture | Opus 🏆 | Different from DeepSeek (premium reasoning model) |
| 5 — GA-B | Nemotron | Different from Opus (NVIDIA model, different training data) |
| 6 — Detailed Architecture | MiniMax | Different from Nemotron (MiniMax model family) |
| 7 — GA-C | DeepSeek | Different from MiniMax (re-uses DeepSeek for verification specialization) |
| 8 — Prompt Generation | Opus 🏆 | Different from DeepSeek (premium model for prompt engineering) |
| 9 — GA-D | Nemotron | Different from Opus (re-uses Nemotron for verification) |
| 10 — Code Generation | Big Pickle | Different from Nemotron (specialized code generation model) |
| 11 — GA-E | Kimi | Different from Big Pickle (closes the loop with the same model type as Stage 1 but with verification context) |
Each LLM is selected for its demonstrated strengths in the specific task category:
- Kimi — strong at structured analysis and comprehensive verification
- Sonnet — excellent document generation with balanced reasoning
- DeepSeek — reliable gap analysis with strong analytical capability
- Opus 🏆 — premium reasoning for architecture and prompt engineering
- Nemotron — robust verification with NVIDIA's specialized guardrails
- MiniMax — efficient execution for detailed specification
- Big Pickle — specialized code generation model
Governance Philosophy
Blueprint 365 applies four governance principles at every stage:
Input Validation
Every stage verifies the upstream output before processing it. Each artifact carries a signature from its producing stage. The consuming stage validates that:
- The artifact is well-formed and complete
- The producing stage's status token is valid
- The artifact references the correct version of upstream inputs
Scope Invariance
No stage may introduce requirements, features, or constraints not authorized by Stage 1. This is enforced by:
- GA stages (3, 5, 7, 9, 11) explicitly check for scope additions
- Any unauthorized addition triggers a
REJECTEDstatus - The pipeline halts until the scope drift is resolved at its source
Traceability
Every artifact in the pipeline traces back to Stage 1 requirements:
- Stage 2 SRS — each section references specific Stage 1 requirement IDs
- Stage 4 Architecture — each component references SRS sections
- Stage 6 Detailed Design — each specification references architecture components
- Stage 8 Prompts — each prompt references detailed design elements
- Stage 10 Code — each module references its generating prompt
- Stage 11 Attestation — verifies end-to-end traceability
Definition of Done
Every stage has explicit, machine-verifiable completion criteria:
- The stage's output token is produced (or rejection is issued)
- Output passes validation by the downstream GA stage
- All artifacts are versioned and stored with the pipeline execution record
- No pseudo-termination — the stage is not complete until all criteria are met
Status Tokens
Each stage outputs a deterministic status token that the next stage validates before processing. This creates an auditable chain of custody:
| Stage | Output Token | Meaning |
|---|---|---|
| 1 | VALIDATED_DETERMINISTIC |
Requirements are unambiguous and complete |
| 2 | SRS_GENERATED |
SRS document produced (validated by Stage 3) |
| 3 | VERIFIED_RECONCILED |
SRS matches requirements |
| 3 | REJECTED_GAP_DETECTED |
Gaps found in SRS (loop back) |
| 4 | DESIGN_STABILIZED |
Architecture approved |
| 5 | ARCHITECTURE_VERIFIED |
Architecture matches requirements and SRS |
| 5 | REJECTED_ARCHITECTURAL_GAP_DETECTED |
Architecture gaps found (loop back) |
| 6 | DETAILED_DESIGN_COMPLETE |
Detailed design produced |
| 7 | DETAILED_DESIGN_VERIFIED |
Detailed design matches architecture |
| 8 | PROMPTS_GENERATED |
Code generation prompts produced |
| 9 | PROMPTS_VERIFIED |
Prompts match detailed design |
| 10 | CODE_GENERATED |
Production-ready code produced |
| 11 | ATTESTED |
System fully certified |
| 11 | DEFERRED |
Minor issues found, documented for human resolution |
| 11 | REJECTED |
Critical failures detected |
Anti-Patterns Enforced
Hallucination Prevention
No stage may add features, capabilities, or behaviors that are not present in Stage 1 requirements. The GA stages use strict pattern matching to detect:
- Phantom features — capabilities described in design or code without requirement authorization
- Speculative generalization — "the system should also support..." statements
- Assumed integrations — connections to services not mentioned in requirements
Gold-Plating Detection
GA stages (3, 5, 7, 9) specifically flag architectural or design additions that exceed what requirements demand:
- Over-engineered solutions where simpler alternatives would suffice
- Premature optimization (caching layers, redundancy, clustering) without requirement justification
- Technology choices driven by preference rather than constraint
Scope Drift Prevention
If a user attempts to introduce new requirements mid-pipeline, the system:
- Records the new input as a scope change request
- Determines which stages must be re-executed to incorporate the change
- Presents the impact assessment (time, cost, rework scope) before proceeding
- Requires explicit user confirmation before incorporating the change
Ambiguity Rejection
Any requirement, specification, or design element that uses vague terms (fast, scalable, robust, user-friendly, efficient, etc.) without quantification is flagged and must be:
- Replaced with a measurable metric (e.g., "fast" → "responds within 200ms at p95 under 1000 concurrent users")
- Referenced to a specific Stage 1 requirement that provides the quantification
- Rejected if no quantification is available
Traceability Map
Stage 1 (Requirements)
|
v
Stage 2 (SRS Document)
|
v
Stage 3 (Gap Analysis) --> [Feedback Loop] --> Stage 1/2
|
v
Stage 4 (Architecture) --> [Trace to Stage 1]
|
v
Stage 5 (Gap Analysis) --> [Feedback Loop] --> Stage 4
|
v
Stage 6 (Detailed Design) --> [Trace to Stage 1/4]
|
v
Stage 7 (Gap Analysis) --> [Feedback Loop] --> Stage 6
|
v
Stage 8 (Prompts) --> [Trace to Stage 6]
|
v
Stage 9 (Gap Analysis) --> [Feedback Loop] --> Stage 8
|
v
Stage 10 (Code Generation) --> [Trace to Stage 1/6/8]
|
v
Stage 11 (Final Verification) --> [Final Certification]
Output Assurance
The Stage 11 certification guarantees:
| Criterion | Verification Method |
|---|---|
| 100% requirements implemented | Automated traceability scan — every Stage 1 line item mapped to code |
| Happy path coverage | Positive scenario tests generated and passing |
| Edge case coverage | Boundary condition tests generated and passing |
| Exceptional scenario coverage | Error handling tests generated and passing |
| Compilation | Build system invoked, zero errors, zero warnings |
| Traceability | End-to-end chain from code → prompt → design → SRS → requirement |
| Operational readiness | All connection strings, secrets, infrastructure, and integrations identified and documented in the User Intervention Manifest |
User Intervention Manifest
The final output includes a manifest that lists every human-provided item needed to run the system:
- API keys and secrets (with placeholders and source references)
- Database connection strings (with host, port, database name)
- Third-party service accounts and configuration
- Infrastructure provisioning steps (if not automated)
- Any manual configuration steps required
- Deployment environment specifications
This ensures that even though the code is fully generated, no operational detail is hidden or assumed.
Use Cases
Enterprise Application Development
A financial services firm needs a new customer onboarding system with strict regulatory requirements. Using Blueprint 365:
- Stage 1: Requirements are gathered covering KYC, AML, data privacy, and multi-jurisdiction compliance
- Stages 2-6: SRS and architecture are generated and verified against regulations
- Stages 7-10: Prompts generate the onboarding service, document upload, identity verification, and audit logging modules
- Stage 11: Final attestation confirms all regulatory requirements are met and generates the User Intervention Manifest with production secrets and integration details
Result: A production-ready system with full regulatory traceability, delivered in hours instead of weeks.
API Microservice Generation
A SaaS company needs a new billing microservice integrated with their existing platform:
- Requirements define the billing models, pricing tiers, invoice generation, payment gateway integration
- The pipeline generates the complete microservice: API endpoints, database schema, payment gateway client, webhook handlers
- The User Intervention Manifest lists the payment gateway API keys, database connection strings, and environment configuration
Result: A deployable microservice with zero manual coding, fully traced to business requirements.
Getting Started
Prerequisites
- Access to the Blueprint 365 pipeline runner (CLI or API)
- API keys for the required LLM providers (Kimi, Sonnet, DeepSeek, Opus, Nemotron, MiniMax, Big Pickle)
- A clear statement of business requirements in natural language
Running the Pipeline (CLI)
blueprint365 run --input requirements.txt
The pipeline processes each stage sequentially, outputting artifacts to the ./output/ directory. You can monitor progress via the CLI output:
[1/11] REQGATH (Kimi) → VALIDATED_DETERMINISTIC ✓
[2/11] DOCGEN (Sonnet) → SRS_GENERATED ✓
[3/11] GA-A (DeepSeek) → VERIFIED_RECONCILED ✓
[4/11] ARCH (Opus) → DESIGN_STABILIZED ✓
...
Single Step Execution
Run specific stages for iterative refinement:
# Re-run from Stage 4 after a gap detection
blueprint365 run --input requirements.txt --from-stage 4
Output Structure
./output/
├── requirements/
│ └── requirements-validated.md
├── srs/
│ └── srs-v1.md
├── architecture/
│ ├── architecture-v1.md
│ └── adrs/
├── detailed-design/
│ ├── components/
│ ├── schemas/
│ └── contracts/
├── prompts/
│ └── prompt-*.md
├── code/
│ ├── src/
│ ├── tests/
│ ├── docker/
│ └── README.md
├── attestation/
│ └── ga-e-report.md
└── manifest/
└── user-intervention-manifest.md
FAQ
Q: What happens when a GA stage rejects an artifact? A: The pipeline loops back to the stage that produced the artifact. That stage receives the rejection report as context and re-executes with the gap information. The feedback loop continues until the artifact passes. Each iteration is logged for auditability.
Q: Can I customize which LLMs are used for each stage? A: Yes. The LLM assignments are configurable in the pipeline configuration file. The only constraint is the Independence Rule: no two consecutive stages may use the same LLM.
Q: What if my requirements change mid-pipeline? A: Blueprint 365 supports scope change requests. Submit the new or modified requirements, and the pipeline determines which stages need re-execution. The impact assessment (affected stages, additional cost, timeline impact) is presented before proceeding.
Q: Is the generated code production-ready? A: Stage 11 attests that the code compiles, covers all scenarios, and is fully traced to requirements. However, any production deployment should include human review of the attestation report and User Intervention Manifest. The manifest explicitly lists every operational item a human must provide.
Q: How does the pipeline handle very large systems? A: The Stage 6 detailed design breaks the system into independently implementable modules. Stage 8 generates individual prompts per module, and Stage 10 can parallelize code generation across modules. For systems exceeding a single pipeline run, the architecture can define subsystem boundaries, and each subsystem runs through its own pipeline instance.