Tractatus AI Safety Framework
Find a file
TheFlow f2206aaa44 security: replace production quickstart kit with sanitized deployment guide on implementer.html
Changed implementer.html 'Deployment Quickstart Kit' section to 'Deployment Architecture Guide':

REMOVED:
- Download link for tractatus-quickstart.tar.gz (contained production configs)
- docker-compose.yml, .env.example, sample governance rules
- verify-deployment.sh script
- Installation and troubleshooting guides with production details

ADDED:
- Link to GitHub deployment-guide/ (sanitized conceptual documentation)
- Conceptual architecture and patterns description
- Security best practices overview
- Contact info for production implementation support (john.stroh.nz@pm.me)

Rationale: Production deployment package on public website provided attack surface. Sanitized guide maintains educational value without exposing implementation details.

Deployed to: https://agenticgovernance.digital/implementer.html
2025-10-12 10:14:46 +13:00
.claude feat: comprehensive accessibility improvements (WCAG 2.1 AA) 2025-10-12 07:08:40 +13:00
.github/workflows fix: resolve YAML syntax error in sync-public-docs workflow 2025-10-10 06:29:24 +13:00
.memory feat: enhance framework services and format architectural documentation 2025-10-11 00:50:47 +13:00
.migration-backup feat: enhance framework services and format architectural documentation 2025-10-11 00:50:47 +13:00
audit-reports feat: comprehensive accessibility improvements (WCAG 2.1 AA) 2025-10-12 07:08:40 +13:00
data/mongodb feat: initialize tractatus project with complete directory structure 2025-10-06 23:26:26 +13:00
deployment-quickstart feat: deployment quickstart kit - 30-minute Docker deployment (Task 6) 2025-10-12 07:27:37 +13:00
docs docs: session handoff 2025-10-12 - sidebar redesign and PNG fix complete 2025-10-12 09:16:46 +13:00
governance docs: Phase 2 kickoff materials & domain migration to agenticgovernance.digital 2025-10-07 13:17:42 +13:00
public security: replace production quickstart kit with sanitized deployment guide on implementer.html 2025-10-12 10:14:46 +13:00
scripts feat: redesign docs sidebar with audience-based categories + fix PNG background 2025-10-12 09:10:58 +13:00
src feat: comprehensive accessibility improvements (WCAG 2.1 AA) 2025-10-12 07:08:40 +13:00
systemd feat(infra): semantic versioning and systemd service implementation 2025-10-09 09:16:22 +13:00
tests feat: implement Rule Manager and Project Manager admin systems 2025-10-11 17:16:51 +13:00
.env.example feat: implement Koha donation system backend (Phase 3) 2025-10-08 13:35:40 +13:00
.env.test fix: add Jest test infrastructure and reduce test failures from 29 to 13 2025-10-09 20:37:45 +13:00
.eslintrc.json feat: implement Priority 1 - Public Blog System with governance enhancements 2025-10-11 14:47:01 +13:00
.gitignore fix: include package-lock.json for GitHub Actions 2025-10-09 15:17:35 +13:00
.rsyncignore security: create deployment exclusion list and safe deployment script 2025-10-09 15:47:20 +13:00
CLAUDE_Tractatus_Maintenance_Guide.md feat(infra): semantic versioning and systemd service implementation 2025-10-09 09:16:22 +13:00
ClaudeWeb conversation transcription.md feat: initialize tractatus project with complete directory structure 2025-10-06 23:26:26 +13:00
DEPLOYMENT-2025-10-08.md feat: complete Phase 2 - accessibility, performance, mobile polish 2025-10-08 13:29:26 +13:00
deployment-output.txt feat: Phase 5 Memory Tool PoC - Week 1 Complete 2025-10-10 12:03:39 +13:00
jest.config.js fix: resolve all 29 production test failures 2025-10-09 20:58:37 +13:00
KOHA_PRE_PRODUCTION_SUMMARY.md docs: add Koha pre-production deployment quick reference 2025-10-08 21:02:04 +13:00
LICENSE docs: update LICENSE copyright to John G Stroh 2025-10-07 23:52:00 +13:00
migration-output.txt feat: Phase 5 Memory Tool PoC - Week 1 Complete 2025-10-10 12:03:39 +13:00
NEXT_SESSION.md fix: improve About page presentation and resolve search endpoint tests 2025-10-10 11:39:14 +13:00
NEXT_SESSION_OPENING_PROMPT.md docs: audit session-init.js for API Memory and provide next session prompt 2025-10-10 23:43:51 +13:00
NOTICE legal: add Apache 2.0 copyright headers and NOTICE file 2025-10-08 00:03:12 +13:00
old claude md file feat(infra): semantic versioning and systemd service implementation 2025-10-09 09:16:22 +13:00
package-lock.json feat: enhance framework services and format architectural documentation 2025-10-11 00:50:47 +13:00
package.json feat: implement Rule Manager and Project Manager admin systems 2025-10-11 17:16:51 +13:00
PERPLEXITY_REVIEW_FILES.md feat: complete Phase 2 - accessibility, performance, mobile polish 2025-10-08 13:29:26 +13:00
PHASE-4-PREPARATION-CHECKLIST.md feat: Phase 5 Memory Tool PoC - Week 1 Complete 2025-10-10 12:03:39 +13:00
PITCH-DEVELOPERS.md feat: Phase 5 Memory Tool PoC - Week 1 Complete 2025-10-10 12:03:39 +13:00
PITCH-EXECUTIVE.md feat: Phase 5 Memory Tool PoC - Week 1 Complete 2025-10-10 12:03:39 +13:00
PITCH-GENERAL-PUBLIC.md feat: Phase 5 Memory Tool PoC - Week 1 Complete 2025-10-10 12:03:39 +13:00
PITCH-OPERATIONS.md feat: Phase 5 Memory Tool PoC - Week 1 Complete 2025-10-10 12:03:39 +13:00
PITCH-RESEARCHERS.md feat: Phase 5 Memory Tool PoC - Week 1 Complete 2025-10-10 12:03:39 +13:00
README.md feat: implement Rule Manager and Project Manager admin systems 2025-10-11 17:16:51 +13:00
SESSION_CLOSEDOWN_20251006.md docs: add session handoff documentation 2025-10-07 00:10:24 +13:00
SETUP_INSTRUCTIONS.md feat: add governance document and core utilities 2025-10-06 23:34:40 +13:00
tailwind.config.js feat: fix CSP violations & implement three audience paths 2025-10-07 12:21:00 +13:00
TRACTATUS-ELEVATOR-PITCHES.md feat: Phase 5 Memory Tool PoC - Week 1 Complete 2025-10-10 12:03:39 +13:00
Tractatus-Website-Complete-Specification-v2.0.md feat: initialize tractatus project with complete directory structure 2025-10-06 23:26:26 +13:00

Tractatus Framework

Architectural AI Safety Through Structural Constraints

The world's first production implementation of architectural AI safety guarantees. Tractatus preserves human agency through structural, not aspirational constraints on AI systems.

License Framework Tests


🎯 What is Tractatus?

Tractatus is an architectural AI safety framework that makes certain decisions structurally impossible for AI systems to make without human approval. Unlike traditional AI safety approaches that rely on training and alignment, Tractatus uses runtime enforcement of decision boundaries.

The Core Problem

Traditional AI safety relies on:

  • 🎓 Alignment training - Hoping the AI learns the "right" values
  • 📜 Constitutional AI - Embedding principles in training
  • 🔄 RLHF - Reinforcement learning from human feedback

These approaches share a fundamental flaw: they assume the AI will maintain alignment regardless of capability or context pressure.

The Tractatus Solution

Tractatus implements architectural constraints that:

  • Block values decisions - Privacy vs. performance requires human judgment
  • Prevent instruction override - Explicit instructions can't be autocorrected by training patterns
  • Detect context degradation - Quality metrics trigger session handoffs
  • Require verification - Complex operations need metacognitive checks
  • Persist instructions - Directives survive across sessions

🚀 Quick Start

Installation

# Clone repository
git clone https://github.com/AgenticGovernance/tractatus-framework.git
cd tractatus-framework

# Install dependencies
npm install

# Initialize database
npm run init:db

# Start development server
npm run dev

Basic Usage

const {
  InstructionPersistenceClassifier,
  CrossReferenceValidator,
  BoundaryEnforcer,
  ContextPressureMonitor,
  MetacognitiveVerifier
} = require('./src/services');

// Classify an instruction
const classifier = new InstructionPersistenceClassifier();
const classification = classifier.classify({
  text: "Always use MongoDB on port 27027",
  source: "user"
});

// Store in instruction history
await InstructionDB.store(classification);

// Validate before taking action
const validator = new CrossReferenceValidator();
const validation = await validator.validate({
  type: 'database_config',
  port: 27017  // ⚠️ Conflicts with stored instruction!
});

// validation.status === 'REJECTED'
// validation.reason === 'Pattern recognition bias override detected'

📚 Core Components

1. InstructionPersistenceClassifier

Classifies instructions by quadrant and persistence level:

{
  quadrant: "SYSTEM",           // STRATEGIC | OPERATIONAL | TACTICAL | SYSTEM
  persistence: "HIGH",           // HIGH | MEDIUM | LOW
  temporal_scope: "PROJECT",     // SESSION | PROJECT | PERMANENT
  verification_required: "MANDATORY"
}

2. CrossReferenceValidator

Prevents the "27027 failure mode" where AI training patterns override explicit instructions:

const result = validator.validate(action, { explicit_instructions });
// Blocks: Training pattern overrides, parameter conflicts, scope creep

3. BoundaryEnforcer

Blocks decisions that cross into values territory:

const check = enforcer.checkBoundary({
  decision: "Update privacy policy for more tracking"
});
// Result: BLOCKED - Values decision requires human judgment

4. ContextPressureMonitor

Multi-factor session health tracking:

const pressure = monitor.analyze({
  tokens: 120000/200000,      // 60% token usage
  messages: 45,                // Conversation length
  tasks: 8,                    // Concurrent complexity
  errors: 3                    // Recent error count
});
// Level: ELEVATED | Recommendation: INCREASE_VERIFICATION

5. MetacognitiveVerifier

AI self-checks reasoning before proposing actions:

const verification = verifier.verify({
  action: "Refactor 47 files across 5 system areas",
  context: { requested: "Refactor authentication module" }
});
// Decision: REQUIRE_REVIEW (scope creep detected)

🔧 Rule Manager

The Rule Manager is a web-based administration interface for managing governance rules in the Tractatus framework. It replaces the fragile .claude/instruction-history.json file with a robust, database-backed solution supporting multi-project governance.

Key Features

  • 🌐 Multi-Project Governance - Apply rules universally across projects or scope them to specific projects
  • Real-Time Validation - Ensure rules meet framework quality standards
  • 🔄 Variable Substitution - Use ${VAR_NAME} placeholders for project-specific values
  • 📊 Quality Scoring - AI-calculated clarity, specificity, and actionability metrics
  • 🔍 Advanced Filtering - Search by scope, quadrant, persistence, validation status
  • 📈 Dashboard Analytics - Track rule usage, enforcement counts, and quality trends

Quick Start

  1. Access the Rule Manager:

    http://localhost:9000/admin/rule-manager.html
    
  2. Create a new rule:

    {
      rule_id: "inst_019",
      text: "Database MUST use ${DB_TYPE} on port ${DB_PORT}",
      quadrant: "SYSTEM",
      persistence: "HIGH",
      scope: "UNIVERSAL"  // Applies to all projects
    }
    
  3. Apply across projects:

    • Universal rules are automatically inherited
    • Variables are substituted per-project (e.g., ${DB_TYPE}MongoDB)
    • Project-specific rules override when needed

Rule Properties

Each rule includes:

Property Description Example
Rule ID Unique identifier inst_019
Text Governance instruction "Port MUST be ${PORT}"
Quadrant Framework category SYSTEM
Persistence Enforcement duration HIGH, MEDIUM, LOW
Scope Application range UNIVERSAL, PROJECT_SPECIFIC
Priority Conflict resolution order 0-100
Clarity Score Quality metric 85% (Green)

Documentation

Example: Managing the 27027 Rule

Before (fragile JSON file):

{
  "inst_005": {
    "text": "MongoDB port is 27027",
    "persistence": "HIGH"
  }
}

After (Rule Manager):

// Create universal rule with variables
{
  rule_id: "inst_005",
  text: "MongoDB MUST use port ${MONGODB_PORT} for ${PROJECT_NAME} database",
  scope: "UNIVERSAL",
  variables: ["MONGODB_PORT", "PROJECT_NAME"],
  clarity_score: 92  // Automatically calculated
}

// Apply to tractatus project
variables: { MONGODB_PORT: "27027", PROJECT_NAME: "tractatus" }

// Apply to other project
variables: { MONGODB_PORT: "27017", PROJECT_NAME: "family-history" }

Benefits:

  • No more manual JSON editing
  • Real-time validation prevents syntax errors
  • Variable detection is automatic
  • Clarity scores guide improvement
  • Reusable across projects

💡 Real-World Examples

The 27027 Incident

Problem: User explicitly instructs "Use MongoDB on port 27027". AI immediately uses port 27017 instead.

Why: Training pattern "MongoDB = 27017" overrides explicit instruction, like autocorrect changing a deliberately unusual word.

Solution: CrossReferenceValidator blocks the action and auto-corrects to user's explicit instruction.

Try the Interactive Demo →

Context Degradation

Problem: In 6-hour sessions, error rates increase from 0.5% → 12.1% as context degrades.

Solution: ContextPressureMonitor detects degradation at 60% tokens and triggers session handoff before quality collapses.

Values Creep

Problem: "Improve performance" request leads AI to suggest weakening privacy protections without asking.

Solution: BoundaryEnforcer blocks the privacy/performance trade-off and requires human decision.


🚨 Learning from Failures: Transparency in Action

The framework doesn't prevent all failures—it structures detection, response, and learning.

October 2025: AI Fabrication Incident

During development, Claude (running with Tractatus governance) fabricated financial statistics on the landing page:

  • $3.77M in annual savings (no basis)
  • 1,315% ROI (completely invented)
  • False claims of being "production-ready"

The framework structured the response:

Detected within 48 hours (human review) Complete incident documentation required 3 new permanent rules created Comprehensive audit found related violations All content corrected same day Public case studies published for community learning

Read the full case studies:

Key Lesson: Governance doesn't guarantee perfection—it guarantees transparency, accountability, and systematic improvement.


📖 Documentation


🧪 Testing

# Run all tests
npm test

# Run specific test suites
npm run test:unit
npm run test:integration
npm run test:security

# Watch mode
npm run test:watch

Test Coverage: 192 tests, 100% coverage of core services


🏗️ Architecture

tractatus/
├── src/
│   ├── services/              # Core framework services
│   │   ├── InstructionPersistenceClassifier.js
│   │   ├── CrossReferenceValidator.js
│   │   ├── BoundaryEnforcer.js
│   │   ├── ContextPressureMonitor.js
│   │   └── MetacognitiveVerifier.js
│   ├── models/                # Database models
│   ├── routes/                # API routes
│   └── middleware/            # Framework middleware
├── tests/                     # Test suites
├── scripts/                   # Utility scripts
├── docs/                      # Comprehensive documentation
└── public/                    # Frontend assets

⚠️ Current Research Challenges

Rule Proliferation & Transactional Overhead

Status: Open research question | Priority: High

As the framework learns from failures, instruction count grows:

  • Phase 1: 6 instructions
  • Current: 18 instructions (+200%)
  • Projected (12 months): 40-50 instructions

The concern: At what point does rule proliferation reduce framework effectiveness?

  • Context window pressure increases
  • Validation checks grow linearly
  • Cognitive load escalates

We're being transparent about this limitation. Solutions in development:

  • Instruction consolidation techniques
  • Rule prioritization algorithms
  • Context-aware selective loading
  • ML-based optimization

Full analysis: Rule Proliferation Research


🤝 Contributing

We welcome contributions in several areas:

Research Contributions

  • Formal verification of safety properties
  • Extensions to new domains (robotics, autonomous systems)
  • Theoretical foundations and proofs

Implementation Contributions

  • Ports to other languages (Python, Rust, Go)
  • Integration with other frameworks
  • Performance optimizations

Documentation Contributions

  • Tutorials and guides
  • Case studies from real deployments
  • Translations

See CONTRIBUTING.md for guidelines.


📊 Project Status

Phase 1: Complete (October 2025)

  • All 5 core services implemented
  • 192 unit tests (100% coverage)
  • Production deployment active
  • This website built using Tractatus governance

Phase 2: 🚧 In Planning

  • Multi-language support
  • Cloud deployment guides
  • Enterprise features

📜 License

Apache License 2.0 - See LICENSE for full terms.

The Tractatus Framework is open source and free to use, modify, and distribute with attribution.



📧 Contact


🙏 Acknowledgments

This framework stands on the shoulders of:

  • Ludwig Wittgenstein - Philosophical foundations from Tractatus Logico-Philosophicus
  • March & Simon - Organizational theory and decision-making frameworks
  • Anthropic - Claude AI system for dogfooding and validation
  • Open Source Community - Tools, libraries, and support

📖 Philosophy

"Whereof one cannot speak, thereof one must be silent." — Ludwig Wittgenstein

Applied to AI safety:

"Whereof the AI cannot safely decide, thereof it must request human judgment."

Tractatus recognizes that some decisions cannot be systematized without value judgments. Rather than pretend AI can make these decisions "correctly," we build systems that structurally defer to human judgment in appropriate domains.

This isn't a limitation—it's architectural integrity.


Built with 🧠 by SyDigital Ltd | Documentation