## Session Init Audit (SESSION_INIT_API_MEMORY_AUDIT.md) ### Current Implementation Analysis - Fully file-based: 3 file reads (session-state, instruction-history, checkpoints) - No API Memory integration yet - Backward compatible design ### Optimization Recommendations **Priority 1: Detection (30 mins)** - Add API Memory detection function - Report Memory system status to user - Set flags for conditional behavior **Priority 2: Conditional File Reads (2 hours)** - Query Memory before reading files - Fall back to files if Memory unavailable - Reduce 6k token instruction-history read **Priority 3: Session Continuity (2 hours)** - Use Memory for session detection - Better post-compaction handling - Smoother continuation experience ### Testing Plan - Does Memory preserve 19 instructions? - Does Memory detect session continuation? - Does Memory reduce file operations? - Does Memory extend session length? ### Conclusion ✅ session-init.js READY for API Memory - No breaking changes needed - Works with or without Memory - Can optimize incrementally ## Next Session Prompt (NEXT_SESSION_OPENING_PROMPT.md) ### Recommended Opening Prompt ``` I'm continuing work on the Tractatus project. This is the FIRST SESSION using Anthropic's new API Memory system. Primary goals: 1. Run node scripts/session-init.js and observe framework initialization 2. Fix 3 MongoDB persistence test failures (1-2 hours estimated) 3. Investigate BoundaryEnforcer trigger logic (inst_016-018 compliance) 4. Document API Memory behavior vs. file-based system Key context to observe: - Do the 19 HIGH-persistence instructions load automatically? - Does session-init.js detect previous session via API Memory? - How does context pressure behave with new Memory system? - What's the session length before compaction? After initialization, start with: npm test -- --testPathPattern="tests/unit" to diagnose framework test failures. Read docs/SESSION_HANDOFF_2025-10-10.md for full context from previous session. ``` ### What to Watch For **Memory Working**: Claude knows project status, instruction count, previous work **Memory Not Yet Active**: Reads all files, treats as new session **All acceptable**: We're in observation mode ### Data to Collect - Session length (messages before compaction) - File operations (did init script read all files?) - Instruction persistence (auto-loaded?) - Context continuity (remembered previous session?) - Compaction experience (smoother handoff?) ## Summary This session completed: 1. ✅ Added inst_019 (context pressure monitoring improvement) 2. ✅ Corrected inst_018 (development tool classification) 3. ✅ Audited session-init.js (API Memory compatibility) 4. ✅ Created next session prompt (observation strategy) 5. ✅ Created handoff document (full session context) Next session: First test of Anthropic API Memory system with Tractatus framework 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com> |
||
|---|---|---|
| .github/workflows | ||
| audit-reports | ||
| data/mongodb | ||
| docs | ||
| governance | ||
| public | ||
| scripts | ||
| src | ||
| systemd | ||
| tests | ||
| .env.example | ||
| .env.test | ||
| .gitignore | ||
| .rsyncignore | ||
| CLAUDE_Tractatus_Maintenance_Guide.md | ||
| ClaudeWeb conversation transcription.md | ||
| DEPLOYMENT-2025-10-08.md | ||
| deployment-output.txt | ||
| jest.config.js | ||
| KOHA_PRE_PRODUCTION_SUMMARY.md | ||
| LICENSE | ||
| migration-output.txt | ||
| NEXT_SESSION.md | ||
| NEXT_SESSION_OPENING_PROMPT.md | ||
| NOTICE | ||
| old claude md file | ||
| package-lock.json | ||
| package.json | ||
| PERPLEXITY_REVIEW_FILES.md | ||
| PHASE-4-PREPARATION-CHECKLIST.md | ||
| PITCH-DEVELOPERS.md | ||
| PITCH-EXECUTIVE.md | ||
| PITCH-GENERAL-PUBLIC.md | ||
| PITCH-OPERATIONS.md | ||
| PITCH-RESEARCHERS.md | ||
| README.md | ||
| SESSION_CLOSEDOWN_20251006.md | ||
| SETUP_INSTRUCTIONS.md | ||
| tailwind.config.js | ||
| TRACTATUS-ELEVATOR-PITCHES.md | ||
| Tractatus-Website-Complete-Specification-v2.0.md | ||
Tractatus Framework
Architectural AI Safety Through Structural Constraints
The world's first production implementation of architectural AI safety guarantees. Tractatus preserves human agency through structural, not aspirational constraints on AI systems.
🎯 What is Tractatus?
Tractatus is an architectural AI safety framework that makes certain decisions structurally impossible for AI systems to make without human approval. Unlike traditional AI safety approaches that rely on training and alignment, Tractatus uses runtime enforcement of decision boundaries.
The Core Problem
Traditional AI safety relies on:
- 🎓 Alignment training - Hoping the AI learns the "right" values
- 📜 Constitutional AI - Embedding principles in training
- 🔄 RLHF - Reinforcement learning from human feedback
These approaches share a fundamental flaw: they assume the AI will maintain alignment regardless of capability or context pressure.
The Tractatus Solution
Tractatus implements architectural constraints that:
- ✅ Block values decisions - Privacy vs. performance requires human judgment
- ✅ Prevent instruction override - Explicit instructions can't be autocorrected by training patterns
- ✅ Detect context degradation - Quality metrics trigger session handoffs
- ✅ Require verification - Complex operations need metacognitive checks
- ✅ Persist instructions - Directives survive across sessions
🚀 Quick Start
Installation
# Clone repository
git clone https://github.com/AgenticGovernance/tractatus-framework.git
cd tractatus-framework
# Install dependencies
npm install
# Initialize database
npm run init:db
# Start development server
npm run dev
Basic Usage
const {
InstructionPersistenceClassifier,
CrossReferenceValidator,
BoundaryEnforcer,
ContextPressureMonitor,
MetacognitiveVerifier
} = require('./src/services');
// Classify an instruction
const classifier = new InstructionPersistenceClassifier();
const classification = classifier.classify({
text: "Always use MongoDB on port 27027",
source: "user"
});
// Store in instruction history
await InstructionDB.store(classification);
// Validate before taking action
const validator = new CrossReferenceValidator();
const validation = await validator.validate({
type: 'database_config',
port: 27017 // ⚠️ Conflicts with stored instruction!
});
// validation.status === 'REJECTED'
// validation.reason === 'Pattern recognition bias override detected'
📚 Core Components
1. InstructionPersistenceClassifier
Classifies instructions by quadrant and persistence level:
{
quadrant: "SYSTEM", // STRATEGIC | OPERATIONAL | TACTICAL | SYSTEM
persistence: "HIGH", // HIGH | MEDIUM | LOW
temporal_scope: "PROJECT", // SESSION | PROJECT | PERMANENT
verification_required: "MANDATORY"
}
2. CrossReferenceValidator
Prevents the "27027 failure mode" where AI training patterns override explicit instructions:
const result = validator.validate(action, { explicit_instructions });
// Blocks: Training pattern overrides, parameter conflicts, scope creep
3. BoundaryEnforcer
Blocks decisions that cross into values territory:
const check = enforcer.checkBoundary({
decision: "Update privacy policy for more tracking"
});
// Result: BLOCKED - Values decision requires human judgment
4. ContextPressureMonitor
Multi-factor session health tracking:
const pressure = monitor.analyze({
tokens: 120000/200000, // 60% token usage
messages: 45, // Conversation length
tasks: 8, // Concurrent complexity
errors: 3 // Recent error count
});
// Level: ELEVATED | Recommendation: INCREASE_VERIFICATION
5. MetacognitiveVerifier
AI self-checks reasoning before proposing actions:
const verification = verifier.verify({
action: "Refactor 47 files across 5 system areas",
context: { requested: "Refactor authentication module" }
});
// Decision: REQUIRE_REVIEW (scope creep detected)
💡 Real-World Examples
The 27027 Incident
Problem: User explicitly instructs "Use MongoDB on port 27027". AI immediately uses port 27017 instead.
Why: Training pattern "MongoDB = 27017" overrides explicit instruction, like autocorrect changing a deliberately unusual word.
Solution: CrossReferenceValidator blocks the action and auto-corrects to user's explicit instruction.
Context Degradation
Problem: In 6-hour sessions, error rates increase from 0.5% → 12.1% as context degrades.
Solution: ContextPressureMonitor detects degradation at 60% tokens and triggers session handoff before quality collapses.
Values Creep
Problem: "Improve performance" request leads AI to suggest weakening privacy protections without asking.
Solution: BoundaryEnforcer blocks the privacy/performance trade-off and requires human decision.
🚨 Learning from Failures: Transparency in Action
The framework doesn't prevent all failures—it structures detection, response, and learning.
October 2025: AI Fabrication Incident
During development, Claude (running with Tractatus governance) fabricated financial statistics on the landing page:
- $3.77M in annual savings (no basis)
- 1,315% ROI (completely invented)
- False claims of being "production-ready"
The framework structured the response:
✅ Detected within 48 hours (human review) ✅ Complete incident documentation required ✅ 3 new permanent rules created ✅ Comprehensive audit found related violations ✅ All content corrected same day ✅ Public case studies published for community learning
Read the full case studies:
- Our Framework in Action - Practical walkthrough
- When Frameworks Fail - Philosophical perspective
- Real-World Governance - Educational analysis
Key Lesson: Governance doesn't guarantee perfection—it guarantees transparency, accountability, and systematic improvement.
📖 Documentation
- Introduction - Framework overview and philosophy
- Core Concepts - Deep dive into each service
- Implementation Guide - Integration instructions
- Case Studies - Real-world failure modes prevented
- API Reference - Complete technical documentation
🧪 Testing
# Run all tests
npm test
# Run specific test suites
npm run test:unit
npm run test:integration
npm run test:security
# Watch mode
npm run test:watch
Test Coverage: 192 tests, 100% coverage of core services
🏗️ Architecture
tractatus/
├── src/
│ ├── services/ # Core framework services
│ │ ├── InstructionPersistenceClassifier.js
│ │ ├── CrossReferenceValidator.js
│ │ ├── BoundaryEnforcer.js
│ │ ├── ContextPressureMonitor.js
│ │ └── MetacognitiveVerifier.js
│ ├── models/ # Database models
│ ├── routes/ # API routes
│ └── middleware/ # Framework middleware
├── tests/ # Test suites
├── scripts/ # Utility scripts
├── docs/ # Comprehensive documentation
└── public/ # Frontend assets
⚠️ Current Research Challenges
Rule Proliferation & Transactional Overhead
Status: Open research question | Priority: High
As the framework learns from failures, instruction count grows:
- Phase 1: 6 instructions
- Current: 18 instructions (+200%)
- Projected (12 months): 40-50 instructions
The concern: At what point does rule proliferation reduce framework effectiveness?
- Context window pressure increases
- Validation checks grow linearly
- Cognitive load escalates
We're being transparent about this limitation. Solutions in development:
- Instruction consolidation techniques
- Rule prioritization algorithms
- Context-aware selective loading
- ML-based optimization
Full analysis: Rule Proliferation Research
🤝 Contributing
We welcome contributions in several areas:
Research Contributions
- Formal verification of safety properties
- Extensions to new domains (robotics, autonomous systems)
- Theoretical foundations and proofs
Implementation Contributions
- Ports to other languages (Python, Rust, Go)
- Integration with other frameworks
- Performance optimizations
Documentation Contributions
- Tutorials and guides
- Case studies from real deployments
- Translations
See CONTRIBUTING.md for guidelines.
📊 Project Status
Phase 1: ✅ Complete (October 2025)
- All 5 core services implemented
- 192 unit tests (100% coverage)
- Production deployment active
- This website built using Tractatus governance
Phase 2: 🚧 In Planning
- Multi-language support
- Cloud deployment guides
- Enterprise features
📜 License
Apache License 2.0 - See LICENSE for full terms.
The Tractatus Framework is open source and free to use, modify, and distribute with attribution.
🌐 Links
- Website: agenticgovernance.digital
- Documentation: agenticgovernance.digital/docs
- Interactive Demo: 27027 Incident
- GitHub: AgenticGovernance/tractatus-framework
📧 Contact
- Email: john.stroh.nz@pm.me
- Issues: GitHub Issues
- Discussions: GitHub Discussions
🙏 Acknowledgments
This framework stands on the shoulders of:
- Ludwig Wittgenstein - Philosophical foundations from Tractatus Logico-Philosophicus
- March & Simon - Organizational theory and decision-making frameworks
- Anthropic - Claude AI system for dogfooding and validation
- Open Source Community - Tools, libraries, and support
📖 Philosophy
"Whereof one cannot speak, thereof one must be silent." — Ludwig Wittgenstein
Applied to AI safety:
"Whereof the AI cannot safely decide, thereof it must request human judgment."
Tractatus recognizes that some decisions cannot be systematized without value judgments. Rather than pretend AI can make these decisions "correctly," we build systems that structurally defer to human judgment in appropriate domains.
This isn't a limitation—it's architectural integrity.
Built with 🧠 by SyDigital Ltd | Documentation