1 min readfrom Machine Learning

[P] Cold Validation: Open-source system where one AI agent audits another with zero shared context

We released an open-source architecture for independent AI agent verification. The core idea: the agent that built something should never review it. Cold validation uses two agents with strict separation - Builder (Claude Code) produces plans and code - Reviewer (Codex CLI) audits only artifacts — never sees reasoning - An orchestrator enforces phase gates and convergence The reviewer runs filesystem-isolated (temp dir, no repo access). Findings are tracked with durable fingerprints across rounds. The controller independently reconciles verdicts against blocking findings. Apache 2.0. 35 mechanical tests. 

GitHub: https://github.com/raxe-ai/cold-validation-architecture

Deep dive: https://raxe.ai/labs/cold-validation

submitted by /u/cyberamyntas
[link] [comments]

Want to read more?

Check out the full article on the original site

View original article

Tagged with

#rows.com
#no-code spreadsheet solutions
#financial modeling with spreadsheets
#natural language processing for spreadsheets
#generative AI for data analysis
#Excel alternatives for data analysis
#row zero
#Cold Validation
#AI agent
#open-source architecture
#independent verification
#Builder
#Reviewer
#Claude Code
#Codex CLI
#artifacts
#orchestrator
#phase gates
#filesystem-isolated
#durable fingerprints
[P] Cold Validation: Open-source system where one AI agent audits another with zero shared context