tractatus/docs/markdown/introduction.md
TheFlow 19473fdbb6 docs: Phase 2 kickoff materials & domain migration to agenticgovernance.digital
This commit completes Phase 2 preparation with comprehensive kickoff materials
and migrates all domain references from mysy.digital to agenticgovernance.digital.

New Phase 2 Documents:
- PHASE-2-PRESENTATION.md: 20-slide stakeholder presentation deck
- PHASE-2-EMAIL-TEMPLATES.md: Invitation templates for 20-50 soft launch users
- PHASE-2-KICKOFF-CHECKLIST.md: Comprehensive 12-week deployment checklist (200+ tasks)
- PHASE-2-PREPARATION-ADVISORY.md: Advisory on achieving world-class UI/UX

Domain Migration (mysy.digital → agenticgovernance.digital):
- Updated CLAUDE.md project instructions
- Updated README.md
- Updated all Phase 2 planning documents (ROADMAP, COST-ESTIMATES, INFRASTRUCTURE)
- Updated governance policies (TRA-OPS-0002, TRA-OPS-0003)
- Updated framework documentation (introduction.md)
- Updated implementation progress report

Phase 2 Status:
 Budget approved: $550 USD for 3 months, $100-150/month ongoing
 Timeline confirmed: Starting NOW
 All 5 TRA-OPS-* governance policies approved
 Infrastructure decisions finalized (OVHCloud VPS Essential)
 Domain registered: agenticgovernance.digital

Ready to Begin:
- Week 1: Infrastructure deployment (VPS, DNS, SSL)
- Week 5-8: AI features (Claude API, blog, media, case studies)
- Week 9-12: Testing, governance audit, soft launch (20-50 users)

Next Steps:
1. Provision OVHCloud VPS Essential (Singapore/Australia)
2. Configure DNS for agenticgovernance.digital
3. Generate secrets (JWT, MongoDB passwords)
4. Draft 3-5 initial blog posts (human-written)
5. Begin Week 1 infrastructure deployment

🤖 Generated with [Claude Code](https://claude.com/claude-code)

Co-Authored-By: Claude <noreply@anthropic.com>
2025-10-07 13:17:42 +13:00

231 lines
7.9 KiB
Markdown

---
title: Introduction to the Tractatus Framework
slug: introduction
quadrant: STRATEGIC
persistence: HIGH
version: 1.0
type: framework
author: SyDigital Ltd
---
# Introduction to the Tractatus Framework
## What is Tractatus?
The **Tractatus-Based LLM Safety Framework** is a world-first architectural approach to AI safety that preserves human agency through **structural guarantees** rather than aspirational goals.
Instead of hoping AI systems "behave correctly," Tractatus implements **architectural constraints** that certain decision types **structurally require human judgment**. This creates bounded AI operation that scales safely with capability growth.
## The Core Problem
Current AI safety approaches rely on:
- Alignment training (hoping the AI learns the "right" values)
- Constitutional AI (embedding principles in training)
- RLHF (Reinforcement Learning from Human Feedback)
These approaches share a fundamental flaw: **they assume the AI will maintain alignment** regardless of capability level or context pressure.
## The Tractatus Solution
Tractatus takes a different approach inspired by Ludwig Wittgenstein's philosophy of language and meaning:
> **"Whereof one cannot speak, thereof one must be silent."**
> — Ludwig Wittgenstein, Tractatus Logico-Philosophicus
Applied to AI safety:
> **"Whereof the AI cannot safely decide, thereof it must request human judgment."**
### Architectural Boundaries
The framework defines **decision boundaries** based on:
1. **Domain complexity** - Can this decision be systematized?
2. **Values sensitivity** - Does this decision involve irreducible human values?
3. **Irreversibility** - Can mistakes be corrected without harm?
4. **Context dependence** - Does this decision require human cultural/social understanding?
## Core Innovation
The Tractatus framework is built on **five core services** that work together to ensure AI operations remain within safe boundaries:
### 1. InstructionPersistenceClassifier
Classifies instructions into five quadrants based on their strategic importance and persistence:
- **STRATEGIC** - Mission-critical, permanent decisions (HIGH persistence)
- **OPERATIONAL** - Standard operating procedures (MEDIUM-HIGH persistence)
- **TACTICAL** - Specific tasks with defined scope (LOW-MEDIUM persistence)
- **SYSTEM** - Technical configuration (HIGH persistence)
- **STOCHASTIC** - Exploratory, creative work (VARIABLE persistence)
### 2. CrossReferenceValidator
Prevents the "27027 failure mode" where AI forgets or contradicts explicit instructions:
- Validates all AI actions against stored instruction history
- Detects conflicts before execution
- Prevents parameter mismatches (e.g., using port 27027 when instructed to use 27017)
### 3. BoundaryEnforcer
Ensures certain decision types **structurally require human approval**:
- **Values decisions** - Privacy vs. performance, ethics, user agency
- **Irreversible changes** - Data deletion, architectural changes
- **High-risk operations** - Security changes, financial decisions
### 4. ContextPressureMonitor
Tracks session degradation across multiple factors:
- **Token usage** (35% weight) - Context window pressure
- **Conversation length** (25% weight) - Attention decay
- **Task complexity** (15% weight) - Concurrent tasks, dependencies
- **Error frequency** (15% weight) - Recent errors indicate degraded state
- **Instruction density** (10% weight) - Too many competing directives
Recommends session handoffs before quality degrades.
### 5. MetacognitiveVerifier
AI self-checks its own reasoning before proposing actions:
- **Alignment** - Does this match stated goals?
- **Coherence** - Is the reasoning internally consistent?
- **Completeness** - Are edge cases considered?
- **Safety** - What are the risks?
- **Alternatives** - Have other approaches been explored?
Returns confidence scores and recommends PROCEED, PROCEED_WITH_CAUTION, REQUIRE_REVIEW, or BLOCKED.
## Why "Tractatus"?
The name honors Ludwig Wittgenstein's *Tractatus Logico-Philosophicus*, which established that:
1. **Language has limits** - Not everything can be meaningfully expressed
2. **Boundaries are structural** - These limits aren't defects, they're inherent
3. **Clarity comes from precision** - Defining what can and cannot be said
Applied to AI:
1. **AI judgment has limits** - Not every decision can be safely automated
2. **Safety comes from architecture** - Build boundaries into the system structure
3. **Reliability requires specification** - Precisely define where AI must defer to humans
## Key Principles
### 1. Structural Safety Over Behavioral Safety
Traditional: "Train the AI to be safe"
Tractatus: "Make unsafe actions structurally impossible"
### 2. Explicit Over Implicit
Traditional: "The AI should infer user intent"
Tractatus: "Track explicit instructions and enforce them"
### 3. Degradation Detection Over Perfection Assumption
Traditional: "The AI should maintain quality"
Tractatus: "Monitor for degradation and intervene before failure"
### 4. Human Agency Over AI Autonomy
Traditional: "Give the AI maximum autonomy"
Tractatus: "Reserve certain decisions for human judgment"
## Real-World Impact
The Tractatus framework prevents failure modes like:
### The 27027 Incident
An AI was explicitly instructed to use database port 27017, but later used port 27027 in generated code, causing a critical failure. This happened because:
1. The instruction wasn't persisted beyond the immediate context
2. No validation checked the AI's actions against stored directives
3. The AI had no metacognitive check to verify port numbers
**CrossReferenceValidator** would have caught this before execution.
### Context Degradation
In long sessions (150k+ tokens), AI quality silently degrades:
- Forgets earlier instructions
- Makes increasingly careless errors
- Fails to verify assumptions
**ContextPressureMonitor** detects this degradation and recommends session handoffs.
### Values Creep
AI systems gradually make decisions in values-sensitive domains without realizing it:
- Choosing privacy vs. performance
- Deciding what constitutes "harmful" content
- Determining appropriate user agency levels
**BoundaryEnforcer** blocks these decisions and requires human judgment.
## Who Should Use Tractatus?
### Researchers
- Formal safety guarantees through architectural constraints
- Novel approach to alignment problem
- Empirical validation of degradation detection
### Implementers
- Production-ready code (Node.js, tested, documented)
- Integration guides for existing systems
- Immediate safety improvements
### Advocates
- Clear communication framework for AI safety
- Non-technical explanations of core concepts
- Policy implications and recommendations
## Getting Started
1. **Read the Core Concepts** - Understand the five services
2. **Review the Technical Specification** - See how it works in practice
3. **Explore the Case Studies** - Real-world failure modes and prevention
4. **Try the Interactive Demos** - Hands-on experience with the framework
## Status
**Phase 1 Implementation Complete (2025-10-07)**
- All five core services implemented and tested (100% coverage)
- 192 unit tests passing
- Instruction persistence database operational
- Active governance for development sessions
**This website** is built using the Tractatus framework to govern its own development - a practice called "dogfooding."
## Contributing
The Tractatus framework is open source and welcomes contributions:
- **Research** - Formal verification, theoretical extensions
- **Implementation** - Ports to other languages/platforms
- **Case Studies** - Document real-world applications
- **Documentation** - Improve clarity and accessibility
## License
Open source under [LICENSE TO BE DETERMINED]
## Contact
- **Email**: john.stroh.nz@pm.me
- **GitHub**: [Repository Link]
- **Website**: agenticgovernance.digital
---
**Next:** [Core Concepts](core-concepts.md) | [Implementation Guide](implementation-guide.md)