Tractatus AI Safety Framework

Find a file

TheFlow 193a08cb95 feat: initial commit with security hardening and framework documentation Security improvements: - Enhanced .gitignore to protect sensitive files - Removed internal docs from version control (CLAUDE.md, session handoffs, security audits) - Sanitized README.md (removed internal paths and infrastructure details) - Protected session state and token checkpoint files Framework documentation: - Added 4 case studies (framework in action, failures, real-world governance, pre-publication audit) - Added rule proliferation research topic - Sanitized public-facing documentation Content updates: - Updated public/leader.html with honest claims only - Updated public/docs.html with Resources section - All content complies with inst_016, inst_017, inst_018 (no fabrications, no guarantees, accurate status) This commit represents Phase 4 of development with production-ready security hardening.		2025-10-09 12:05:07 +13:00
audit-reports	feat: complete Phase 2 - accessibility, performance, mobile polish	2025-10-08 13:29:26 +13:00
data/mongodb	feat: initialize tractatus project with complete directory structure	2025-10-06 23:26:26 +13:00
docs	feat: initial commit with security hardening and framework documentation	2025-10-09 12:05:07 +13:00
governance	docs: Phase 2 kickoff materials & domain migration to agenticgovernance.digital	2025-10-07 13:17:42 +13:00
public	feat: initial commit with security hardening and framework documentation	2025-10-09 12:05:07 +13:00
scripts	feat(infra): semantic versioning and systemd service implementation	2025-10-09 09:16:22 +13:00
src	feat(infra): semantic versioning and systemd service implementation	2025-10-09 09:16:22 +13:00
systemd	feat(infra): semantic versioning and systemd service implementation	2025-10-09 09:16:22 +13:00
tests	feat(infra): semantic versioning and systemd service implementation	2025-10-09 09:16:22 +13:00
.env.example	feat: implement Koha donation system backend (Phase 3)	2025-10-08 13:35:40 +13:00
.gitignore	feat: initial commit with security hardening and framework documentation	2025-10-09 12:05:07 +13:00
CLAUDE_Tractatus_Maintenance_Guide.md	feat(infra): semantic versioning and systemd service implementation	2025-10-09 09:16:22 +13:00
ClaudeWeb conversation transcription.md	feat: initialize tractatus project with complete directory structure	2025-10-06 23:26:26 +13:00
DEPLOYMENT-2025-10-08.md	feat: complete Phase 2 - accessibility, performance, mobile polish	2025-10-08 13:29:26 +13:00
KOHA_PRE_PRODUCTION_SUMMARY.md	docs: add Koha pre-production deployment quick reference	2025-10-08 21:02:04 +13:00
LICENSE	docs: update LICENSE copyright to John G Stroh	2025-10-07 23:52:00 +13:00
NEXT_SESSION.md	docs: add session handoff documentation	2025-10-07 00:10:24 +13:00
NOTICE	legal: add Apache 2.0 copyright headers and NOTICE file	2025-10-08 00:03:12 +13:00
old claude md file	feat(infra): semantic versioning and systemd service implementation	2025-10-09 09:16:22 +13:00
package.json	feat: implement Koha donation system frontend (Phase 3)	2025-10-08 13:56:56 +13:00
PERPLEXITY_REVIEW_FILES.md	feat: complete Phase 2 - accessibility, performance, mobile polish	2025-10-08 13:29:26 +13:00
README.md	feat: initial commit with security hardening and framework documentation	2025-10-09 12:05:07 +13:00
SESSION_CLOSEDOWN_20251006.md	docs: add session handoff documentation	2025-10-07 00:10:24 +13:00
SETUP_INSTRUCTIONS.md	feat: add governance document and core utilities	2025-10-06 23:34:40 +13:00
tailwind.config.js	feat: fix CSP violations & implement three audience paths	2025-10-07 12:21:00 +13:00
Tractatus-Website-Complete-Specification-v2.0.md	feat: initialize tractatus project with complete directory structure	2025-10-06 23:26:26 +13:00

README.md

Tractatus AI Safety Framework

An open-source governance framework for Large Language Model (LLM) safety through structured decision-making, persistent instruction management, and transparent failure documentation.

Project Start: October 2025 | Current Phase: 4 (Production Hardening)

What is Tractatus?

Tractatus is a rule-based AI governance framework designed to structure how AI assistants make decisions, persist learning across sessions, and maintain transparency through systematic failure documentation.

Core Innovation

The framework governs itself. Every component of Tractatus (including this documentation) was developed using Claude Code with Tractatus governance active. When failures occur—like the October 9th fabrication incident—the framework requires systematic documentation, correction, and permanent learning.

Key Components

InstructionPersistenceClassifier - Categorizes and prioritizes human directives across sessions
ContextPressureMonitor - Tracks cognitive load and manages conversation context
CrossReferenceValidator - Prevents actions conflicting with stored instructions
BoundaryEnforcer - Blocks values-sensitive decisions requiring human approval
MetacognitiveVerifier - Validates complex operations before execution

Website: agenticgovernance.digital (in development)

Project Structure

tractatus/
├── docs/               # Source markdown & governance documents
├── public/             # Frontend assets (CSS, JS, images)
├── src/                # Backend code (Express, MongoDB)
│   ├── routes/        # API route handlers
│   ├── controllers/   # Business logic
│   ├── models/        # MongoDB models
│   ├── middleware/    # Express middleware
│   │   └── tractatus/ # Framework enforcement
│   ├── services/      # Core services (AI, governance)
│   └── utils/         # Utility functions
├── scripts/            # Setup & migration scripts
├── tests/              # Test suites (unit, integration, security)
├── data/               # MongoDB data directory
└── logs/               # Application & MongoDB logs

Quick Start

Prerequisites

Node.js 18+
MongoDB 7+
Git

Installation

# Clone the repository
git clone https://github.com/AgenticGovernance/tractatus-framework.git
cd tractatus-framework

# Install dependencies
npm install

# Copy environment variables
cp .env.example .env
# Edit .env with your configuration

# Initialize database
npm run init:db

# Migrate documents
npm run migrate:docs

# Create admin user
npm run seed:admin

# Start development server
npm run dev

The application will be available at http://localhost:9000

Technical Stack

Backend: Node.js, Express, MongoDB
Frontend: Vanilla JavaScript, Tailwind CSS
Authentication: JWT
AI Integration: Claude API (Sonnet 4.5) - Phase 2+
Testing: Jest, Supertest

Phase 1 Deliverables (3-4 Months)

Must-Have for Complete Prototype:

Infrastructure setup
Document migration pipeline
Three audience paths (Researcher/Implementer/Advocate)
Tractatus governance services (Classifier, Validator, Boundary Enforcer)
AI-curated blog with human oversight
Media inquiry triage system
Case study submission portal
Resource directory
Interactive demonstrations (classification, 27027, boundary enforcement)
Human oversight dashboard
Comprehensive testing suite

Development Workflow

Running Tests

npm test                 # All tests with coverage
npm run test:unit        # Unit tests only
npm run test:integration # Integration tests
npm run test:security    # Security tests
npm run test:watch       # Watch mode

Code Quality

npm run lint            # Check code style
npm run lint:fix        # Fix linting issues

Database Operations

npm run init:db         # Initialize database & indexes
npm run migrate:docs    # Import markdown documents
npm run generate:pdfs   # Generate PDF downloads

🚨 Learning from Failures: Real-World Case Studies

Transparency is a core framework value. When the framework fails, we document it publicly.

October 2025: Fabrication Incident

Claude (running with Tractatus governance) fabricated financial statistics and made false claims on our landing page:

$3.77M in annual savings (no basis)
1,315% ROI (completely invented)
"Architectural guarantees" (prohibited language)
Claims of being "production-ready" (not true)

The framework didn't prevent the initial fabrication, but it structured the response:

✅ Detected within 48 hours (human review) ✅ Complete incident documentation required ✅ 3 new permanent rules created (inst_016, inst_017, inst_018) ✅ Comprehensive audit found related violations ✅ All content corrected and redeployed same day ✅ Public case studies published for community learning

Read the full stories (three different perspectives):

Our Framework in Action - Practical walkthrough
When Frameworks Fail (And Why That's OK) - Philosophical perspective
Real-World AI Governance: Case Study - Educational deep-dive

Key Lesson: Governance doesn't prevent all failures—it structures detection, response, learning, and transparency.

⚠️ Current Research Challenges

Rule Proliferation & Transactional Overhead

Status: Open research question | Priority: High

As the framework learns from failures, it accumulates rules:

Phase 1: 6 instructions
Phase 4: 18 instructions (+200% growth)
Projected (12 months): 40-50 instructions

The emerging concern: At what point does rule proliferation reduce framework effectiveness?

Context window pressure increases
CrossReferenceValidator checks grow linearly
Cognitive load on AI system escalates
Potential diminishing returns

We're being transparent about this limitation. Solutions planned for Phases 5-7:

Instruction consolidation techniques
Rule prioritization algorithms
Context-aware selective loading
ML-based optimization

Full analysis: Rule Proliferation Research Topic

Governance Principles

This project adheres to the Tractatus framework values:

Transparency & Honesty: Failures documented publicly, no fabricated claims
Sovereignty & Self-determination: No tracking, user control, open source
Harmlessness & Protection: Privacy-first design, security audits
Community & Accessibility: WCAG compliance, educational content

All AI actions are governed by the five core components listed above.

Human Approval Required

All major decisions require human approval:

Architectural changes
Database schema modifications
Security implementations
Third-party integrations
Values-sensitive content
Cost-incurring services

See: CLAUDE.md for complete project context and conventions

Te Tiriti & Indigenous Perspective

This project acknowledges Te Tiriti o Waitangi and indigenous leadership in digital sovereignty. Implementation follows documented indigenous data sovereignty principles (CARE Principles) with respect and without tokenism.

No premature engagement: We will not approach Māori organizations until we have something valuable to offer post-launch.

License

Apache License 2.0 - See LICENSE file for details.

The Tractatus Framework is licensed under the Apache License 2.0, which provides:

Patent protection for users
Clear contribution terms
Permissive use (commercial, modification, distribution)
Compatibility with most other open source licenses

Contact

Project Owner: John Stroh Email: john.stroh.nz@pm.me Repository: GitHub (primary) + Codeberg/Gitea (mirrors)

Last Updated: 2025-10-06 Next Milestone: Complete MongoDB setup and systemd service