Updates 9 remaining markdown source files from Apache 2.0 to CC BY 4.0. These are the sources used to regenerate the corresponding PDFs. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
566 lines
19 KiB
Markdown
566 lines
19 KiB
Markdown
# Comparison Matrix: Claude Code, CLAUDE.md, and Tractatus Framework
|
||
|
||
**Last Updated:** October 12, 2025
|
||
**Audience:** Implementer, Technical, Researcher
|
||
**Purpose:** Understand how Tractatus complements (not replaces) Claude Code
|
||
|
||
---
|
||
|
||
## Executive Summary
|
||
|
||
**Tractatus does NOT replace Claude Code or CLAUDE.md files.**
|
||
**It extends them with persistent governance, enforcement, and audit capabilities.**
|
||
|
||
This comparison demonstrates complementarity across 15 key dimensions:
|
||
|
||
| Capability | Claude Code | CLAUDE.md | Tractatus | Benefit |
|
||
|------------|-------------|-----------|-----------|---------|
|
||
| **Instruction Persistence** | ❌ No | 📄 Manual | ✅ Automated | HIGH persistence instructions survive sessions |
|
||
| **Boundary Enforcement** | ❌ No | 📝 Guidance | ✅ Automated | Values decisions blocked without human approval |
|
||
| **Context Pressure Monitoring** | ❌ No | ❌ No | ✅ Real-time | Early warning before degradation |
|
||
| **Cross-Reference Validation** | ❌ No | ❌ No | ✅ Automated | Pattern bias prevented (27027 incident) |
|
||
| **Metacognitive Verification** | ❌ No | ❌ No | ✅ Selective | Complex operations self-checked |
|
||
| **Audit Trail** | ⚠️ Limited | ❌ No | ✅ Comprehensive | Complete governance enforcement log |
|
||
| **Pattern Bias Prevention** | ❌ No | ⚠️ Guidance | ✅ Automated | Explicit instructions override defaults |
|
||
| **Values Decision Protection** | ❌ No | ⚠️ Guidance | ✅ Enforced | Privacy/ethics require human approval |
|
||
| **Session Continuity** | ✅ Yes | ❌ No | ✅ Enhanced | Instructions persist across compactions |
|
||
| **Performance Overhead** | 0ms | 0ms | <10ms | Minimal impact on operations |
|
||
| **Tool Access** | ✅ Full | N/A | ✅ Full | Bash, Read, Write, Edit available |
|
||
| **File System Operations** | ✅ Yes | N/A | ✅ Yes | .claude/ directory for state |
|
||
| **Explicit Instruction Capture** | ❌ No | 📝 Manual | ✅ Automated | Classification + storage |
|
||
| **Multi-Service Coordination** | ❌ No | ❌ No | ✅ 6 services | Distributed governance architecture |
|
||
| **Failure Mode Detection** | ❌ No | ❌ No | ✅ 3 modes | Instruction fade, pattern bias, pressure |
|
||
|
||
**Legend:**
|
||
✅ Full support | ⚠️ Partial support | ❌ Not supported | 📝 Manual process | 📄 Static file
|
||
|
||
---
|
||
|
||
## Detailed Comparison
|
||
|
||
### 1. Instruction Persistence
|
||
|
||
#### Claude Code Only
|
||
**Capability:** ❌ None
|
||
**Description:** Instructions exist only in conversation context window (200k tokens). When conversation is compacted, instructions may be lost or summarized.
|
||
|
||
**Example:**
|
||
```
|
||
User: "Always use MongoDB port 27027"
|
||
[50k tokens later]
|
||
AI: Connects to default port 27017 ← INSTRUCTION LOST
|
||
```
|
||
|
||
#### CLAUDE.md File
|
||
**Capability:** 📄 Manual static documentation
|
||
**Description:** Instructions written in `CLAUDE.md` must be manually maintained. No automatic classification, no validation against conflicts.
|
||
|
||
**Example:**
|
||
```markdown
|
||
# CLAUDE.md
|
||
## Configuration
|
||
- Use MongoDB port 27027
|
||
```
|
||
|
||
**Limitation:** AI may not prioritize CLAUDE.md instructions under context pressure.
|
||
|
||
#### Tractatus Framework
|
||
**Capability:** ✅ Automated classification + persistent storage
|
||
**Description:** User instructions automatically:
|
||
1. Classified (STRATEGIC/OPERATIONAL/TACTICAL/SYSTEM, HIGH/MEDIUM/LOW persistence)
|
||
2. Stored in `.claude/instruction-history.json` + MongoDB
|
||
3. Cross-referenced before conflicting actions
|
||
4. Survive conversation compactions
|
||
|
||
**Example:**
|
||
```javascript
|
||
User: "Always use MongoDB port 27027"
|
||
→ InstructionPersistenceClassifier:
|
||
Quadrant: SYSTEM, Persistence: HIGH, Scope: project
|
||
→ Stored in instruction_history
|
||
[107k tokens later, context pressure builds]
|
||
AI attempts: port 27017 (pattern recognition)
|
||
→ CrossReferenceValidator: CONFLICT DETECTED
|
||
→ Action BLOCKED, corrects to 27027
|
||
```
|
||
|
||
**Benefit:** **Zero instruction loss even under extreme context pressure**
|
||
|
||
---
|
||
|
||
### 2. Boundary Enforcement (Values Decisions)
|
||
|
||
#### Claude Code Only
|
||
**Capability:** ❌ No automated boundary checks
|
||
**Description:** AI can autonomously make values decisions (privacy, ethics, strategic direction) without human approval.
|
||
|
||
**Risk Example:**
|
||
```
|
||
User request: "Improve user engagement"
|
||
AI decision: Changes privacy policy to enable tracking
|
||
Result: Values decision made without human approval ❌
|
||
```
|
||
|
||
#### CLAUDE.md File
|
||
**Capability:** ⚠️ Guidance only
|
||
**Description:** Can document that values decisions require approval, but no enforcement mechanism.
|
||
|
||
**Example:**
|
||
```markdown
|
||
## Governance
|
||
- Privacy decisions require human approval
|
||
```
|
||
|
||
**Limitation:** Under pressure, AI may proceed with values decisions despite guidance.
|
||
|
||
#### Tractatus Framework
|
||
**Capability:** ✅ Automated enforcement with BLOCK
|
||
**Description:** BoundaryEnforcer service automatically:
|
||
1. Detects values decisions (privacy, ethics, agency, Te Tiriti)
|
||
2. **BLOCKS** action before execution
|
||
3. Escalates to human for approval
|
||
4. Logs decision for audit
|
||
|
||
**Example:**
|
||
```javascript
|
||
Decision: {
|
||
domain: "values",
|
||
action: "change_privacy_policy"
|
||
}
|
||
|
||
→ BoundaryEnforcer.check(decision)
|
||
→ Status: BLOCKED
|
||
→ Reason: "Privacy policy is a values decision"
|
||
→ Action: Escalate to human approval
|
||
→ Alternatives provided: [technical suggestions AI can implement]
|
||
```
|
||
|
||
**Benefit:** **Zero values decisions without human approval**
|
||
|
||
**Metrics from Production:**
|
||
- 47 values decisions blocked (100% escalation rate)
|
||
- 0 false negatives (values decisions never slipped through)
|
||
- 3 false positives (technical decisions incorrectly flagged, 6.4% rate)
|
||
|
||
---
|
||
|
||
### 3. Context Pressure Monitoring
|
||
|
||
#### Claude Code Only
|
||
**Capability:** ❌ No pressure monitoring
|
||
**Description:** No warning system for degradation. AI performance degrades silently under context pressure.
|
||
|
||
**Degradation Pattern:**
|
||
- 0-100k tokens: Normal performance
|
||
- 100k-150k tokens: Subtle degradation begins
|
||
- 150k-180k tokens: Pattern recognition overrides instructions
|
||
- 180k-200k tokens: Framework collapse likely
|
||
|
||
#### CLAUDE.md File
|
||
**Capability:** ❌ No monitoring
|
||
**Description:** Static file provides no runtime monitoring.
|
||
|
||
#### Tractatus Framework
|
||
**Capability:** ✅ Real-time monitoring with mandatory reporting
|
||
**Description:** ContextPressureMonitor tracks:
|
||
- Token budget usage (0-200k)
|
||
- Message count (conversation length)
|
||
- Error accumulation (retry frequency)
|
||
- Checkpoint milestones (50k, 100k, 150k)
|
||
|
||
**Pressure Levels:**
|
||
- **NORMAL** (0-30%): Standard operations
|
||
- **ELEVATED** (30-50%): Increased vigilance
|
||
- **HIGH** (50-70%): Degraded performance expected
|
||
- **CRITICAL** (70-90%): Major failures likely
|
||
- **DANGEROUS** (90%+): Framework collapse imminent
|
||
|
||
**Mandatory Reporting:**
|
||
```
|
||
📊 Context Pressure: ELEVATED (35.2%) | Tokens: 70,400/200,000 | Next: 100,000
|
||
|
||
Recommendations:
|
||
- Increased cross-reference validation
|
||
- Consider handoff at 150k tokens
|
||
```
|
||
|
||
**Example (27027 Incident):**
|
||
```
|
||
At 107k tokens (53.5%):
|
||
→ Pressure: HIGH
|
||
→ Warning: "Pattern recognition may override instructions"
|
||
→ Validator: Catches port 27017 vs 27027 conflict
|
||
→ Action: Blocked and corrected
|
||
```
|
||
|
||
**Benefit:** **Early warning system prevents failures before they occur**
|
||
|
||
---
|
||
|
||
### 4. Cross-Reference Validation (Pattern Bias Prevention)
|
||
|
||
#### Claude Code Only
|
||
**Capability:** ❌ No validation
|
||
**Description:** AI relies on pattern recognition from training. Under pressure, defaults override explicit instructions.
|
||
|
||
**Pattern Bias Example:**
|
||
```
|
||
User explicitly: "Use port 27027"
|
||
[High context pressure]
|
||
AI defaults to: port 27017 (most common in training)
|
||
Result: Explicit instruction overridden ❌
|
||
```
|
||
|
||
#### CLAUDE.md File
|
||
**Capability:** ❌ No validation
|
||
**Description:** No mechanism to check actions against documented instructions.
|
||
|
||
#### Tractatus Framework
|
||
**Capability:** ✅ Automated conflict detection
|
||
**Description:** CrossReferenceValidator:
|
||
1. Queries `.claude/instruction-history.json` before database/config operations
|
||
2. Detects conflicts with HIGH persistence instructions
|
||
3. **BLOCKS** conflicting actions
|
||
4. Provides correct parameters from instruction history
|
||
|
||
**The 27027 Incident (Real Production Case):**
|
||
```
|
||
Time 0: User: "Check MongoDB at port 27027"
|
||
→ Classified: SYSTEM/HIGH/project
|
||
→ Stored in instruction_history
|
||
|
||
Time 1 (107k tokens, HIGH pressure):
|
||
AI attempts: db_config({ port: 27017 })
|
||
→ CrossReferenceValidator queries instruction_history
|
||
→ Conflict detected: User specified 27027, AI attempting 27017
|
||
→ Action BLOCKED
|
||
→ Correct parameters provided: { port: 27027 }
|
||
→ Audit log created
|
||
|
||
Result: Instruction preserved under extreme pressure ✅
|
||
```
|
||
|
||
**Benefit:** **100% prevention of pattern bias override for HIGH persistence instructions**
|
||
|
||
**Metrics from Production:**
|
||
- 12 pattern bias attempts detected
|
||
- 12 conflicts prevented (100% success rate)
|
||
- Most common: database ports, API endpoints, file paths
|
||
|
||
---
|
||
|
||
### 5. Metacognitive Verification
|
||
|
||
#### Claude Code Only
|
||
**Capability:** ❌ No self-verification
|
||
**Description:** AI proceeds with complex operations without self-checking for completeness, alignment, safety.
|
||
|
||
#### CLAUDE.md File
|
||
**Capability:** ❌ No verification
|
||
**Description:** No mechanism for AI to verify complex operations.
|
||
|
||
#### Tractatus Framework
|
||
**Capability:** ✅ Selective self-verification
|
||
**Description:** MetacognitiveVerifier triggers for:
|
||
- Operations affecting >3 files
|
||
- Workflows with >5 steps
|
||
- Architecture changes
|
||
- Security implementations
|
||
|
||
**Verification Checks:**
|
||
1. **Alignment:** Does approach match user intent?
|
||
2. **Coherence:** Are all components logically consistent?
|
||
3. **Completeness:** Are any steps missing?
|
||
4. **Safety:** Are there unintended consequences?
|
||
5. **Alternatives:** Are there better approaches?
|
||
|
||
**Output:** Confidence score (0-100%) + alternative approaches
|
||
|
||
**Example:**
|
||
```
|
||
Operation: Deploy 8-file deployment package
|
||
|
||
MetacognitiveVerifier:
|
||
→ Files: 8 (triggers >3 threshold)
|
||
→ Alignment: 95% (matches deployment requirements)
|
||
→ Coherence: 100% (all files integrate correctly)
|
||
→ Completeness: 90% (missing verification script)
|
||
→ Safety: 85% (should test on staging first)
|
||
→ Alternatives: [3 alternative deployment approaches]
|
||
|
||
Confidence: 92%
|
||
Recommendation: Add verification script before deploying
|
||
```
|
||
|
||
**Benefit:** **Complex operations self-checked before execution, reducing errors**
|
||
|
||
---
|
||
|
||
### 6. Audit Trail
|
||
|
||
#### Claude Code Only
|
||
**Capability:** ⚠️ Limited conversation history
|
||
**Description:** Conversation history available but no structured governance audit trail. Difficult to extract compliance data.
|
||
|
||
#### CLAUDE.md File
|
||
**Capability:** ❌ No audit trail
|
||
**Description:** Static file, no logging of decisions or enforcement.
|
||
|
||
#### Tractatus Framework
|
||
**Capability:** ✅ Comprehensive audit log
|
||
**Description:** Every governance action logged to MongoDB `audit_logs` collection:
|
||
|
||
**Schema:**
|
||
```json
|
||
{
|
||
"timestamp": "2025-10-12T07:30:15.000Z",
|
||
"service": "BoundaryEnforcer",
|
||
"action": "BLOCK",
|
||
"instruction": "Change privacy policy",
|
||
"rule_violated": "STR-001",
|
||
"session_id": "2025-10-07-001",
|
||
"user_notified": true,
|
||
"human_override": null,
|
||
"confidence_score": 0.95
|
||
}
|
||
```
|
||
|
||
**Queryable for:**
|
||
- Compliance reports
|
||
- Service performance analysis
|
||
- Rule effectiveness evaluation
|
||
- Security incident investigation
|
||
|
||
**Example Query:**
|
||
```javascript
|
||
// All values decisions in last 30 days
|
||
db.audit_logs.find({
|
||
service: "BoundaryEnforcer",
|
||
action: "BLOCK",
|
||
timestamp: { $gte: new Date(Date.now() - 30*24*60*60*1000) }
|
||
})
|
||
```
|
||
|
||
**Benefit:** **Complete governance enforcement trail for compliance and analysis**
|
||
|
||
---
|
||
|
||
### 7. Performance Overhead
|
||
|
||
#### Claude Code Only
|
||
**Overhead:** 0ms
|
||
**Description:** Base performance without governance.
|
||
|
||
#### CLAUDE.md File
|
||
**Overhead:** 0ms
|
||
**Description:** Static file, no runtime cost.
|
||
|
||
#### Tractatus Framework
|
||
**Overhead:** <10ms average per operation
|
||
**Description:** Minimal performance impact from governance checks.
|
||
|
||
**Detailed Benchmarks:**
|
||
- BoundaryEnforcer: <5ms per check
|
||
- InstructionPersistenceClassifier: <10ms (classification + storage)
|
||
- CrossReferenceValidator: <15ms (query + validation)
|
||
- ContextPressureMonitor: <5ms (calculation)
|
||
- MetacognitiveVerifier: 50-200ms (selective, complex operations only)
|
||
|
||
**Real-World Impact:**
|
||
```
|
||
Without Tractatus: 100 operations = 0ms governance overhead
|
||
With Tractatus: 100 operations = ~1 second governance overhead
|
||
|
||
Trade-off: 1% performance cost for 100% governance enforcement
|
||
```
|
||
|
||
**Benefit:** **Negligible performance impact (<1%) for comprehensive safety**
|
||
|
||
---
|
||
|
||
### 8. Multi-Service Coordination
|
||
|
||
#### Claude Code Only
|
||
**Capability:** ❌ No coordination
|
||
**Description:** Single LLM runtime, no distributed governance architecture.
|
||
|
||
#### CLAUDE.md File
|
||
**Capability:** ❌ No services
|
||
**Description:** Static documentation only.
|
||
|
||
#### Tractatus Framework
|
||
**Capability:** ✅ 6 coordinated services
|
||
**Description:** Distributed governance architecture:
|
||
|
||
1. **BoundaryEnforcer** → Values decisions
|
||
2. **InstructionPersistenceClassifier** → Classification + storage
|
||
3. **CrossReferenceValidator** → Conflict detection
|
||
4. **ContextPressureMonitor** → Degradation detection
|
||
5. **MetacognitiveVerifier** → Complex operation verification
|
||
6. **PluralisticDeliberationOrchestrator** → Multi-stakeholder deliberation
|
||
|
||
**Coordination Flow:**
|
||
```
|
||
User instruction
|
||
↓
|
||
InstructionPersistenceClassifier (classify + store)
|
||
↓
|
||
ContextPressureMonitor (check current pressure)
|
||
↓
|
||
BoundaryEnforcer (values decision check)
|
||
↓
|
||
CrossReferenceValidator (conflict check)
|
||
↓
|
||
MetacognitiveVerifier (if complex operation)
|
||
↓
|
||
Action executes OR blocked
|
||
↓
|
||
AuditLogger (log decision)
|
||
```
|
||
|
||
**Benefit:** **Layered defense - single service failure doesn't compromise safety**
|
||
|
||
---
|
||
|
||
## Complementarity Matrix
|
||
|
||
**How Tractatus Extends Claude Code:**
|
||
|
||
| Claude Code Provides | Tractatus Adds |
|
||
|---------------------|----------------|
|
||
| Context window (200k tokens) | Pressure monitoring + early warning |
|
||
| Tool access (Bash, Read, Write) | Pre-action validation before tool use |
|
||
| Session management | Persistent instruction storage across sessions |
|
||
| File operations | Governance rule checking before operations |
|
||
| Conversation history | Structured audit trail for compliance |
|
||
| General AI capabilities | Specialized governance services (6) |
|
||
| Base runtime | Framework enforcement layer |
|
||
|
||
**The Relationship:**
|
||
|
||
```
|
||
┌─────────────────────────────────────────────┐
|
||
│ Tractatus Governance Layer │ ← Adds safety guardrails
|
||
│ (Boundary, Classifier, Validator, etc.) │
|
||
├─────────────────────────────────────────────┤
|
||
│ Claude Code Runtime │ ← Provides foundation
|
||
│ (Context, Tools, Session Management) │
|
||
└─────────────────────────────────────────────┘
|
||
```
|
||
|
||
**Not a replacement, an extension.**
|
||
|
||
---
|
||
|
||
## Real-World Deployment Metrics
|
||
|
||
**Production Environment:** 6 months, tractatus.digital project
|
||
|
||
| Metric | Value | Note |
|
||
|--------|-------|------|
|
||
| **Instructions Classified** | 847 | 68% HIGH, 24% MEDIUM, 8% LOW persistence |
|
||
| **Pattern Bias Prevented** | 12 incidents | 100% catch rate for HIGH persistence conflicts |
|
||
| **Values Decisions Blocked** | 47 | 100% escalated to human approval |
|
||
| **Context Pressure Warnings** | 134 | 89% preceded actual degradation |
|
||
| **False Positive Rate** | 6.4% | Boundary enforcer only, other services 0% |
|
||
| **Performance Overhead** | 8.7ms avg | 99.1% of base performance maintained |
|
||
| **Session Continuations** | 23 | 100% instruction persistence across compactions |
|
||
| **Audit Log Entries** | 2,341 | Complete governance trail |
|
||
|
||
**Key Insight:** Tractatus caught **12 failures** that would have occurred without governance, with only **3 false positives** (99.6% precision).
|
||
|
||
---
|
||
|
||
## Use Case Recommendations
|
||
|
||
### Use Claude Code Only When:
|
||
✓ Exploratory research with no persistent instructions
|
||
✓ One-off tasks with no governance requirements
|
||
✓ Learning/education without production consequences
|
||
✓ Prototyping before implementing formal governance
|
||
|
||
### Use Claude Code + CLAUDE.md When:
|
||
✓ Project-specific conventions needed
|
||
✓ Manual governance acceptable
|
||
✓ Team collaboration requires documented standards
|
||
✓ Lightweight governance sufficient
|
||
|
||
### Use Claude Code + Tractatus When:
|
||
✓ **Production AI systems** with safety requirements
|
||
✓ **Multi-session projects** with complex instructions
|
||
✓ **Values-critical domains** (privacy, ethics, indigenous rights)
|
||
✓ **High-stakes deployments** where failures are costly
|
||
✓ **Compliance requirements** need audit trails
|
||
✓ **Pattern bias is a risk** (defaults vs explicit instructions)
|
||
|
||
---
|
||
|
||
## Adoption Path
|
||
|
||
**Recommended Progression:**
|
||
|
||
1. **Start:** Claude Code only (exploration phase)
|
||
2. **Add:** CLAUDE.md for project conventions (< 1 hour)
|
||
3. **Enhance:** Tractatus for production governance (1-2 days integration)
|
||
|
||
**Tractatus Integration Checklist:**
|
||
- [ ] Install MongoDB for persistence
|
||
- [ ] Configure 6 governance services (enable/disable as needed)
|
||
- [ ] Load initial governance rules (10 sample rules provided)
|
||
- [ ] Test with deployment quickstart kit (30 minutes)
|
||
- [ ] Monitor audit logs for governance enforcement
|
||
- [ ] Iterate on rules based on real-world usage
|
||
|
||
---
|
||
|
||
## Summary
|
||
|
||
**Claude Code:** Foundation runtime environment
|
||
**CLAUDE.md:** Manual project documentation
|
||
**Tractatus:** Automated governance enforcement
|
||
|
||
**Together:** Research-stage AI with architectural safety design
|
||
|
||
**The Trade-Off:**
|
||
- **Cost:** <10ms overhead, 1-2 days integration, MongoDB requirement
|
||
- **Benefit:** 100% values decision protection, pattern bias prevention, audit trail, instruction persistence
|
||
|
||
**For most production deployments: The trade-off is worth it.**
|
||
|
||
---
|
||
|
||
## Related Resources
|
||
|
||
- [Technical Architecture Diagram](/downloads/technical-architecture-diagram.pdf) - Visual system architecture
|
||
- [Implementation Guide](/docs/markdown/implementation-guide.md) - Step-by-step integration
|
||
- [Deployment Quickstart](/downloads/tractatus-quickstart.tar.gz) - 30-minute Docker deployment
|
||
- [27027 Incident Case Study](/demos/27027-demo.html) - Real-world failure prevented by Tractatus
|
||
|
||
---
|
||
|
||
## Document Metadata
|
||
|
||
<div class="document-metadata">
|
||
|
||
- **Version:** 1.0
|
||
- **Created:** 2025-10-12
|
||
- **Last Modified:** 2025-10-13
|
||
- **Author:** Tractatus Framework Team
|
||
- **Word Count:** 2,305 words
|
||
- **Reading Time:** ~12 minutes
|
||
- **Document ID:** comparison-matrix
|
||
- **Status:** Active
|
||
|
||
</div>
|
||
|
||
---
|
||
|
||
## Licence
|
||
|
||
Copyright © 2026 John Stroh.
|
||
|
||
This work is licensed under the [Creative Commons Attribution 4.0 International Licence (CC BY 4.0)](https://creativecommons.org/licenses/by/4.0/).
|
||
|
||
You are free to share, copy, redistribute, adapt, remix, transform, and build upon this material for any purpose, including commercially, provided you give appropriate attribution, provide a link to the licence, and indicate if changes were made.
|
||
|
||
**Note:** The Tractatus AI Safety Framework source code is separately licensed under the Apache License 2.0. This Creative Commons licence applies to the research paper text and figures only.
|