Session Management with ContextPressureMonitor ✨ - Created scripts/check-session-pressure.js for automated pressure analysis - Updated CLAUDE.md with comprehensive session management protocol - Multi-factor analysis: tokens (35%), conversation (25%), complexity (15%), errors (15%), instructions (10%) - 5 pressure levels: NORMAL, ELEVATED, HIGH, CRITICAL, DANGEROUS - Proactive monitoring at 25%, 50%, 75% token usage - Exit codes: 0=NORMAL/ELEVATED, 1=HIGH, 2=CRITICAL, 3=DANGEROUS - Color-coded CLI output with recommendations - Dogfooding: Tractatus framework managing its own development sessions InstructionPersistenceClassifier: 58.8% → 85.3% (+26.5%, +9 tests) ✨ - Add snake_case field aliases (temporal_scope, extracted_parameters, context_snapshot) - Fix temporal scope detection for PERMANENT, PROJECT, SESSION, IMMEDIATE - Improve explicitness scoring with implicit/hedging language detection - Lower baseline from 0.5 → 0.3, add hedging penalty (-0.15 per word) - Fix persistence calculation for explicit port specifications (now HIGH) - Increase SYSTEM base score from 0.6 → 0.7 - Add PROJECT temporal scope adjustment (+0.05) - Lower MEDIUM threshold from 0.5 → 0.45 - Special case: port specifications with high explicitness → HIGH persistence ContextPressureMonitor: Maintained 60.9% (28/46) ✅ - No regressions, all improvements from previous session intact BoundaryEnforcer: Maintained 100% (43/43) ✅ - Perfect coverage maintained CrossReferenceValidator: Maintained 96.4% (27/28) ✅ - Near-perfect coverage maintained MetacognitiveVerifier: Maintained 56.1% (23/41) ⚠️ - Stable, needs future work Overall: 141/192 → 149/192 tests passing (+8 tests, +4.2%) Phase 1 Target: 70% - EXCEEDED (77.6%) Next Session Priorities: 1. MetacognitiveVerifier (56.1% → 70%+): Fix confidence calculations 2. ContextPressureMonitor (60.9% → 70%+): Fix remaining edge cases 3. InstructionPersistenceClassifier (85.3% → 90%+): Last 5 edge cases 4. Stretch: Push overall to 85%+ 🤖 Generated with Claude Code
18 KiB
Tractatus AI Safety Framework Website - Project Context
Project Name: Tractatus Website Platform Domain: mysy.digital Repository: GitHub (primary) + Codeberg/Gitea (mirrors) Status: Development - Phase 1 Implementation Created: 2025-10-06 Primary Developer: Claude Code (Anthropic Sonnet 4.5) Project Owner: John Stroh
⚠️ Critical: Project Isolation
THIS IS A SEPARATE PROJECT FROM family-history AND sydigital
- Separate MongoDB instance: Port 27017, database
tractatus_dev - Separate application port: 9000
- Separate Git repository: Local + GitHub account
- Separate systemd services: mongodb-tractatus.service, tractatus.service
- No shared code/data: Patterns may be adapted, but no dependencies
Sessions must maintain clear separation. Always verify which project context you're in.
Project Purpose
Build a world-class platform demonstrating the Tractatus-Based LLM Safety Framework through:
- Three Audience Paths: Researcher, Implementer, Advocate
- AI-Powered Features: Blog curation, media triage, case studies (all with human oversight)
- Interactive Demonstrations: Classification, 27027 incident, boundary enforcement
- Dogfooding: The website implements Tractatus to govern its own AI operations
- Values Alignment: Sovereignty, Transparency, Harmlessness, Community
Timeline: 3-4 months for complete Phase 1 local prototype (no rush, no shortcuts, world-class quality)
Technical Architecture
Infrastructure
- MongoDB: Port 27017, database
tractatus_dev - Application: Node.js/Express on port 9000
- WebSocket: Port 9001 (if needed)
- Data Directory:
/home/theflow/projects/tractatus/data/mongodb - Logs:
/home/theflow/projects/tractatus/logs/
Technology Stack
- Backend: Node.js 18+, Express 4.x, MongoDB 7+
- Frontend: Vanilla JavaScript, Tailwind CSS (no framework dependency)
- Authentication: JWT for admin/moderation
- AI Integration: Claude API (Sonnet 4.5) - Phase 2+
- File Storage: GridFS for PDFs, documents
- Testing: Jest + Supertest
Database Collections
tractatus_dev.documents // Technical papers, framework docs
tractatus_dev.blog_posts // AI-curated, human-approved
tractatus_dev.media_inquiries // Press/media with AI triage
tractatus_dev.case_submissions // Community case studies
tractatus_dev.resources // External links, aligned projects
tractatus_dev.moderation_queue // Human oversight queue
tractatus_dev.users // Admin accounts
tractatus_dev.citations // Academic citation tracking
tractatus_dev.translations // Multi-language content (future)
tractatus_dev.koha_donations // Phase 3
Tractatus Framework Governance
This project dogfoods the Tractatus framework - all AI actions are governed by:
Core Services (to be implemented)
- InstructionPersistenceClassifier - Classifies actions by quadrant (STR/OPS/TAC/SYS/STO)
- CrossReferenceValidator - Validates AI actions against explicit instructions
- BoundaryEnforcer - Ensures AI never makes values decisions without human approval
- ContextPressureMonitor - Detects conditions that increase error probability
- MetacognitiveVerifier - AI self-checks reasoning before proposing actions
Quadrant Mapping for Website Functions
| Function | Quadrant | Human Oversight | Example |
|---|---|---|---|
| Mission/values changes | STRATEGIC | Mandatory approval | "Always prioritize privacy" |
| Blog editorial guidelines | OPERATIONAL | Quarterly review | "All posts must cite sources" |
| Publish approved post | TACTICAL | Pre-approved | Execute after human approval |
| Technical config | SYSTEM | Technical review | MongoDB ports, API keys |
| AI suggests blog topics | STOCHASTIC | Always human approval | "Write about GDPR" |
Critical: All AI content suggestions require human approval. No AI action crosses into values territory without explicit human decision.
Session Management with ContextPressureMonitor
The Tractatus framework dogfoods itself - using ContextPressureMonitor to manage development sessions.
Session Pressure Analysis
Instead of arbitrary token thresholds, use multi-factor pressure analysis:
# Check current session pressure
node scripts/check-session-pressure.js --tokens 89195/200000 --messages 28 --tasks 2
# Output:
# Pressure Level: NORMAL
# Overall Score: 24.3%
# Action: PROCEED
# Recommendations: ✅ CONTINUE_NORMAL
Pressure Levels & Actions
| Level | Score | Action | What to Do |
|---|---|---|---|
| NORMAL | 0-30% | PROCEED | Continue normally |
| ELEVATED | 30-50% | INCREASE_VERIFICATION | More careful, verify outputs |
| HIGH | 50-70% | SUGGEST_CONTEXT_REFRESH | Consider session handoff |
| CRITICAL | 70-85% | MANDATORY_VERIFICATION | Verify all actions, prepare handoff |
| DANGEROUS | 85%+ | IMMEDIATE_HALT | Stop, create handoff, refresh context |
Monitored Factors (Weighted)
- Token Usage (35% weight) - Context window pressure
- Conversation Length (25% weight) - Attention decay over long sessions
- Task Complexity (15% weight) - Number of simultaneous tasks, dependencies, file modifications
- Error Frequency (15% weight) - Recent errors indicate degraded state
- Instruction Density (10% weight) - Too many competing directives
When to Check Pressure
Automatically check at:
- Session start (baseline)
- 25% token usage (early warning)
- 50% token usage (mid-session check)
- 75% token usage (prepare for handoff)
- After complex multi-file operations
- After any error or unexpected behavior
Proactive Monitoring: Claude should periodically assess pressure and adjust behavior:
- NORMAL: Work normally, maintain quality standards
- ELEVATED: Be more concise, increase verification
- HIGH: Suggest creating session handoff document
- CRITICAL: Mandatory verification, prepare handoff
- DANGEROUS: Stop work, create comprehensive handoff
Session Handoff Triggers
Create handoff document when:
- Pressure reaches CRITICAL or DANGEROUS
- Token usage exceeds 75%
- Complex multi-phase work remains
- Errors clustering (3+ in short period)
- User requests session break
Script Usage
# Basic check
node scripts/check-session-pressure.js --tokens <current>/<budget>
# With full context
node scripts/check-session-pressure.js \
--tokens 150000/200000 \
--messages 45 \
--tasks 3 \
--errors 1 \
--verbose
# JSON output for automation
node scripts/check-session-pressure.js --tokens 180000/200000 --json
# Exit codes: 0=NORMAL/ELEVATED, 1=HIGH, 2=CRITICAL, 3=DANGEROUS
Integration with Claude Sessions
Claude should:
- Track approximate token usage, message count, active tasks
- Periodically call ContextPressureMonitor (every 25% tokens)
- Report pressure level and recommendations to user
- Adjust verbosity/behavior based on pressure
- Proactively suggest session handoff when appropriate
Example:
[ContextPressureMonitor: ELEVATED - 52% pressure]
Recommendations: INCREASE_VERIFICATION, Token usage at 68%
Action: Continuing with increased verification. Consider handoff after current task.
Governance Documents
Located in /home/theflow/projects/tractatus/governance/ (to be created):
- TRA-VAL-0001: Tractatus Core Values (adapted from STR-VAL-0001)
- TRA-GOV-0001: Strategic Review Protocol (adapted from STR-GOV-0001)
- TRA-GOV-0002: Values Alignment Framework (adapted from STR-GOV-0002)
- TRA-GOV-0003: AI Boundary Enforcement Policy
- TRA-GOV-0004: Human Oversight Requirements
Reference: Source documents in /home/theflow/projects/sydigital/strategic/
Te Tiriti & Indigenous Perspective
Strategic Commitment
The framework acknowledges Te Tiriti o Waitangi and indigenous leadership in digital sovereignty.
Implementation Approach
- Respect without tokenism: Follow documented indigenous data sovereignty principles (CARE Principles)
- No premature engagement: Do not approach Māori organizations until we have something valuable to offer
- Well-documented standards: Use published research and frameworks (Te Mana Raraunga, CARE Principles)
- Baseline integration: Te Tiriti forms part of strategic foundation, not dominant cultural overlay
Content Placement
- Footer acknowledgment (subtle, respectful)
/about/valuespage (detailed explanation)- Resource directory (links to Māori data sovereignty organizations)
- No meetings/consultations until post-launch
Development Conventions
Code Style
- ES6+ JavaScript: Modern syntax, async/await patterns
- Modular architecture: Small, focused functions/classes
- Explicit naming: No abbreviations, clear intent
- Comments: Explain WHY, not WHAT
- Error handling: Comprehensive try/catch, meaningful error messages
File Naming
- Routes:
src/routes/blog.routes.js - Controllers:
src/controllers/blog.controller.js - Models:
src/models/BlogPost.model.js - Services:
src/services/BlogCuration.service.js - Middleware:
src/middleware/auth.middleware.js - Tests:
tests/unit/blog.test.js
Git Conventions
- Commits: Conventional commits format
feat:New featurefix:Bug fixdocs:Documentationrefactor:Code restructuretest:Test additionschore:Maintenance
- Branches:
feature/blog-curation,fix/auth-token,docs/api-reference - No commits to main: Always use feature branches
Environment Variables
# Application
NODE_ENV=development
PORT=9000
APP_NAME=Tractatus
# MongoDB
MONGODB_URI=mongodb://localhost:27017/tractatus_dev
MONGODB_PORT=27017
# JWT
JWT_SECRET=<generate_secure_secret>
JWT_EXPIRY=7d
# Claude API (Phase 2+)
CLAUDE_API_KEY=<anthropic_api_key>
CLAUDE_MODEL=claude-sonnet-4-5
# Admin
ADMIN_EMAIL=john.stroh.nz@pm.me
Directory Structure
/home/theflow/projects/tractatus/
├── .claude/ # Claude Code project config
├── .git/ # Git repository
├── docs/ # Source markdown documents
│ ├── markdown/ # Raw markdown files (migration source)
│ └── governance/ # TRA-VAL-*, TRA-GOV-* documents
├── public/ # Frontend assets
│ ├── css/
│ │ └── tailwind.css
│ ├── js/
│ │ ├── components/ # Reusable UI components
│ │ ├── demos/ # Interactive demonstrations
│ │ └── utils/
│ ├── images/
│ └── downloads/ # Generated PDFs
├── src/ # Backend code
│ ├── server.js # Express app entry point
│ ├── routes/
│ │ ├── docs.routes.js
│ │ ├── blog.routes.js
│ │ ├── media.routes.js
│ │ ├── cases.routes.js
│ │ ├── resources.routes.js
│ │ ├── admin.routes.js
│ │ └── demo.routes.js
│ ├── controllers/
│ ├── models/
│ │ ├── Document.model.js
│ │ ├── BlogPost.model.js
│ │ ├── MediaInquiry.model.js
│ │ ├── CaseSubmission.model.js
│ │ ├── ModerationQueue.model.js
│ │ └── User.model.js
│ ├── middleware/
│ │ ├── auth.middleware.js
│ │ ├── validation.middleware.js
│ │ └── tractatus/ # Framework enforcement
│ │ ├── classifier.middleware.js
│ │ ├── validator.middleware.js
│ │ └── boundary.middleware.js
│ ├── services/
│ │ ├── ClaudeAPI.service.js
│ │ ├── InstructionClassifier.service.js
│ │ ├── CrossReferenceValidator.service.js
│ │ ├── BoundaryEnforcer.service.js
│ │ ├── ContextPressureMonitor.service.js
│ │ ├── MetacognitiveVerifier.service.js
│ │ ├── BlogCuration.service.js
│ │ ├── MediaTriage.service.js
│ │ ├── DocumentProcessor.service.js
│ │ └── ModerationQueue.service.js
│ ├── utils/
│ │ ├── db.util.js
│ │ ├── jwt.util.js
│ │ ├── markdown.util.js
│ │ └── logger.util.js
│ └── config/
│ ├── database.config.js
│ └── app.config.js
├── scripts/ # Setup & migration
│ ├── init-db.js # Create collections, indexes
│ ├── migrate-documents.js # Import markdown content
│ ├── generate-pdfs.js # PDF export
│ ├── seed-admin.js # Create admin user
│ └── start-dev.sh # Development startup
├── tests/
│ ├── unit/
│ ├── integration/
│ └── security/
├── data/ # MongoDB data directory
│ └── mongodb/
├── logs/ # Application & MongoDB logs
│ ├── app.log
│ └── mongodb.log
├── .env.example # Template environment variables
├── .gitignore
├── package.json
├── package-lock.json
├── README.md
├── CLAUDE.md # This file
└── LICENSE
Phase 1 Deliverables (3-4 Months)
Must-Have for Complete Prototype:
-
✅ Infrastructure
- MongoDB instance (port 27017)
- Express application (port 9000)
- Systemd services
- Directory structure
-
Core Features
- Document migration pipeline
- Three audience paths (Researcher/Implementer/Advocate)
- Documentation viewer with search
- About/values pages (Te Tiriti acknowledgment)
-
Tractatus Governance Services
- InstructionPersistenceClassifier
- CrossReferenceValidator
- BoundaryEnforcer
- ContextPressureMonitor
- MetacognitiveVerifier
-
AI-Powered Features (with human oversight)
- Blog curation system
- Media inquiry triage
- Case study submission portal
- Resource directory curation
-
Interactive Demonstrations
- Instruction classification demo
- 27027 incident visualizer
- Boundary enforcement simulator
-
Human Oversight
- Moderation queue dashboard
- Admin authentication
- Approval workflows
-
Quality Assurance
- Comprehensive testing suite
- Security audit
- Performance optimization
- Accessibility compliance (WCAG)
Not in Phase 1:
- Production deployment (OVHCloud)
- Domain configuration (mysy.digital)
- ProtonBridge email integration
- Koha donations (Phase 3)
- Public launch
Success Criteria
Technical Excellence:
- Clean, maintainable code
- 80%+ test coverage
- <2s page load times
- WCAG AA accessibility
- Zero security vulnerabilities
- Complete API documentation
Framework Demonstration:
- All AI actions governed by Tractatus
- Human oversight for values-sensitive content
- Boundary enforcement working
- Classification system accurate
- Moderation queue functional
Content Quality:
- All documents migrated correctly
- Three audience paths distinct and clear
- Interactive demos working
- Blog system ready for Phase 2
- No placeholder/fake data
Human Approval Required For:
All Major Decisions:
- Architectural changes
- Database schema modifications
- Security implementations
- Third-party integrations
- Cost-incurring services
Content & Values:
- Governance document adaptations (TRA-VAL-, TRA-GOV-)
- Te Tiriti acknowledgment wording
- About/mission pages
- Editorial guidelines
- Any values-sensitive content
Phase Transitions:
- Completion of Phase 1 prototype
- Decision to proceed to production deployment
- Budget approval for Claude API (Phase 2)
- Launch timing and strategy
Links & References
Source Documents:
/home/theflow/projects/tractatus/Tractatus-Website-Complete-Specification-v2.0.md/home/theflow/projects/tractatus/ClaudeWeb conversation transcription.md/home/theflow/projects/sydigital/stochastic/innovation-exploration/STO-INN-0010-tractatus-llm-architecture-safety-framework-i1.md/home/theflow/projects/sydigital/stochastic/innovation-exploration/anthropic-submission/technical-proposal.md/home/theflow/projects/sydigital/stochastic/innovation-exploration/anthropic-submission/appendix-a-code-examples.md
Governance References:
/home/theflow/projects/sydigital/strategic/values-principles/STR-VAL-0001-core-values-principles-v1-0.md/home/theflow/projects/sydigital/strategic/governance/STR-GOV-0001-strategic-review-protocol-v1-0.md/home/theflow/projects/sydigital/strategic/governance/STR-GOV-0002-values-alignment-framework-v1-0.md
Framework Documentation:
/home/theflow/projects/sydigital/strategic/frameworks/STR-FRM-0001-agentic-workflow-framework-v1-0.md/home/theflow/projects/sydigital/stochastic/innovation-exploration/STO-INN-0002-agentic-organizational-structure-whitepaper-i2.md
Session Reminders
Always:
- Verify you're in
/home/theflow/projects/tractatuscontext - Check MongoDB port 27017, application port 9000
- No shortcuts, no fake data, world-class quality
- Human approval for major decisions
- Update todo list as tasks progress
Never:
- Mix tractatus code with family-history or sydigital
- Make values decisions without human approval
- Deploy to production during Phase 1
- Rush implementation to meet arbitrary deadlines
- Use placeholder/lorem ipsum content
Last Updated: 2025-10-07 Next Review: After Phase 1 completion