tractatus/docs/PHASE-2-PRESENTATION.md
TheFlow 2298d36bed fix(submissions): restructure Economist package and fix article display
- Create Economist SubmissionTracking package correctly:
  * mainArticle = full blog post content
  * coverLetter = 216-word SIR— letter
  * Links to blog post via blogPostId
- Archive 'Letter to The Economist' from blog posts (it's the cover letter)
- Fix date display on article cards (use published_at)
- Target publication already displaying via blue badge

Database changes:
- Make blogPostId optional in SubmissionTracking model
- Economist package ID: 68fa85ae49d4900e7f2ecd83
- Le Monde package ID: 68fa2abd2e6acd5691932150

Next: Enhanced modal with tabs, validation, export

🤖 Generated with [Claude Code](https://claude.com/claude-code)

Co-Authored-By: Claude <noreply@anthropic.com>
2025-10-24 08:47:42 +13:00

533 lines
14 KiB
Markdown

# Phase 2: Production Deployment & AI Features
## Tractatus AI Safety Framework
**Presented by**: Claude Code (Anthropic Sonnet 4.5)
**Prepared for**: John Stroh
**Date**: 2025-10-07
**Status**: APPROVED - Ready to Begin
---
## Slide 1: Executive Summary
### Phase 2 Overview
**Goal**: Transform local prototype → production platform with AI-powered features
**Timeline**: 2-3 months (starting NOW)
**Budget**:
- Total Phase 2: **$550 USD** (~$900 NZD)
- Ongoing: **$100-150/month** (~$165-250 NZD)
**Domain**: **agenticgovernance.digital** ✅ Registered
**Status**: All approvals granted, ready to deploy
---
## Slide 2: What We Built (Phase 1 Recap)
### Phase 1 Achievements ✅
**Infrastructure**:
- MongoDB database (tractatus_dev)
- Express application (port 9000)
- 118 integration tests (100% passing)
**Features**:
- Three audience paths (Researcher, Implementer, Advocate)
- Interactive demos (27027 incident, classification, boundary)
- Document viewer with 12+ technical papers
- Admin dashboard with moderation workflows
- API reference documentation
**Quality**:
- WCAG AA accessibility
- CSP compliance (script-src 'self')
- 85.3%+ test coverage on Tractatus services
- Mobile responsive
---
## Slide 3: What We're Building (Phase 2)
### Production Platform + AI Features
**Month 1: Infrastructure** (Weeks 1-4)
- Deploy to OVHCloud VPS (agenticgovernance.digital)
- SSL/TLS, security hardening, monitoring
- Nginx reverse proxy, automated backups
**Month 2: AI-Powered Features** (Weeks 5-8)
- Blog curation system (AI-assisted, human-approved)
- Media inquiry triage (classification + auto-drafts)
- Case study portal (community submissions)
**Month 3: Polish & Soft Launch** (Weeks 9-12)
- Governance enforcement audit
- End-to-end testing
- Soft launch to 20-50 users
- Feedback collection & iteration
---
## Slide 4: The Dogfooding Principle
### Tractatus Governs Itself
**Core Principle**: *"What cannot be systematized must not be automated."*
**Implementation**:
| AI Operation | Quadrant | Human Oversight |
|--------------|----------|-----------------|
| Blog topic suggestion | STOCHASTIC | Human selects topics |
| Blog outline generation | OPERATIONAL | Human reviews structure |
| **Blog publication decision** | **STRATEGIC** | **Human approves** |
| Media inquiry classification | OPERATIONAL | Human verifies |
| **Media response sending** | **STRATEGIC** | **Human approves** |
| Case study relevance analysis | OPERATIONAL | Human reviews |
| **Case study publication** | **STRATEGIC** | **Human approves** |
**Zero Tolerance**: AI cannot make values decisions without human approval
---
## Slide 5: Governance Framework (TRA-OPS-*)
### 5 Operational Policies Created
**TRA-OPS-0001**: AI Content Generation Policy (Master)
- Mandatory human approval for all public content
- Boundary enforcement (values require humans)
- $200/month API budget cap
**TRA-OPS-0002**: Blog Editorial Guidelines
- 4 content categories, citation standards
- AI assists; humans write & approve
**TRA-OPS-0003**: Media Inquiry Response Protocol
- AI classification + priority scoring
- No auto-send; all responses human-approved
**TRA-OPS-0004**: Case Study Moderation Standards
- Community submissions, AI relevance analysis
- Quality checklist, human publication decision
**TRA-OPS-0005**: Human Oversight Requirements
- Admin reviewer role & training
- SLAs: 4h (HIGH media), 48h (blog), 7d (case studies)
---
## Slide 6: Budget Breakdown
### Where the Money Goes
**One-Time Costs** (~$100):
- Domain (already paid)
- SSL certificates (Let's Encrypt - free)
- Initial security audit tools
**Monthly Recurring** (~$100-150):
- **Hosting** (OVHCloud VPS Essential): **$30**
- 2 vCores, 4GB RAM, 80GB SSD
- 1,000-5,000 visitors/month capacity
- **Claude API** (Sonnet 4.5): **$50**
- 30 blog outlines/month
- 50 media inquiries/month
- 20 case study analyses/month
- **Backups & Monitoring**: **$10-20**
- Off-site backups
- Uptime monitoring
- Error tracking (Sentry free tier)
**Total 3-Month Phase 2**: $550 USD (~$900 NZD)
---
## Slide 7: Infrastructure Architecture
### Production Stack
```
┌─────────────────┐
│ Internet │
└────────┬────────┘
┌────▼────┐
│ OVHCloud│ agenticgovernance.digital
│ DNS │ (No Cloudflare - sovereignty)
└────┬────┘
┌────▼────┐
│ Nginx │ SSL/TLS (Let's Encrypt)
│ :80/443 │ Reverse Proxy + Security Headers
└────┬────┘
┌────▼────┐
│ Node.js │ Tractatus Application
│ :9000 │ Express 4.x
└────┬────┘
┌────▼────┐
│ MongoDB │ tractatus_prod
│ :27017 │ 7.x with authentication
└─────────┘
```
**Security**: UFW firewall, Fail2ban, SSH key-only, automated updates
---
## Slide 8: AI Features in Detail
### Blog Curation System
**AI Role**: Suggest topics, generate outlines
**Human Role**: Select topics, write drafts, approve publication
**Workflow**:
1. AI scans AI safety news (weekly)
2. AI suggests 5-10 topics → Human selects 1-3
3. AI generates outline → Human reviews & edits
4. **Human writes full draft** (AI does NOT write)
5. Admin final approval → Publish
**Target**: 2-4 posts/month (8-16 total in Phase 2)
---
### Media Inquiry Triage
**AI Role**: Classify, prioritize, draft responses
**Human Role**: Verify, decide, send
**Categories**:
- **Press** (HIGH priority, 4h SLA)
- **Academic** (MEDIUM, 48h SLA)
- **Commercial** (MEDIUM, 7 days)
- **Community** (LOW, 14 days)
- **Spam** (IGNORE)
**Expected Volume**: 5-20 inquiries/month (soft launch)
---
### Case Study Portal
**AI Role**: Assess relevance, map to Tractatus framework
**Human Role**: Moderate, approve publication
**Submission Categories**:
- Hallucinations
- Boundary violations (AI making values decisions)
- Instruction overrides (27027-type)
- Context failures
- Bias/discrimination
**Target**: 3-5 community submissions/month
---
## Slide 9: Timeline & Milestones
### 12-Week Roadmap
**Weeks 1-4: Infrastructure** ✅ Ready to Execute
- Provision OVHCloud VPS (Singapore/Australia)
- Deploy application, configure SSL
- Security hardening, monitoring setup
- **Milestone**: Site live at https://agenticgovernance.digital
**Weeks 5-8: AI Features** ⏳ Awaiting Claude API key
- Integrate Claude Sonnet 4.5
- Build blog curation pipeline
- Implement media triage system
- Launch case study portal
- **Milestone**: All AI features operational
**Weeks 9-12: Polish & Launch** ⏳ Awaiting user cohort
- End-to-end testing
- Governance compliance audit
- Invite 20-50 soft launch users
- Collect feedback, iterate
- **Milestone**: Soft launch complete
---
## Slide 10: Success Criteria
### How We'll Know Phase 2 Succeeded
**Technical Success**:
- ✅ Site live with 99%+ uptime (30 days)
- ✅ Performance: <3s page load (95th percentile)
- Security: Zero critical vulnerabilities
- WCAG AA accessibility maintained
**Governance Success**:
- 100% human approval rate (no AI auto-publish)
- Zero boundary violations (values decisions)
- Audit trail complete (all AI decisions logged)
**User Success**:
- 20-50 soft launch users engaged
- 4+/5 average satisfaction rating
- 50+ readers/blog post average
- 5+ media inquiries handled
**Business Success**:
- Costs <$150/month
- Zero data breaches
- Positive user feedback
---
## Slide 11: Risks & Mitigation
### What Could Go Wrong?
| Risk | Probability | Impact | Mitigation |
|------|-------------|--------|------------|
| **Claude API costs exceed budget** | Medium | High | Rate limiting, $200 hard cap, alerts at 80% |
| **Security breach** | Low | Critical | Security audit, penetration testing, Fail2ban |
| **AI generates inappropriate content** | Medium | High | Mandatory human approval, no auto-publish |
| **Server downtime** | Medium | Medium | Monitoring, automated backups, <4h recovery |
| **Poor user adoption** | Medium | Medium | Clear onboarding, feedback loops, iteration |
**Overall Risk**: **LOW** - Strong governance, conservative approach
---
## Slide 12: Soft Launch Strategy
### Who Gets Early Access?
**Target Cohort**: 20-50 users across 3 audiences
**Researchers** (8-12 users):
- AI safety academics
- Philosophy/ethics researchers
- Computer science PhD students
**Implementers** (8-12 users):
- AI engineers at aligned companies
- Open-source AI developers
- Technical architects
**Advocates** (4-6 users):
- AI policy professionals
- Digital rights organizations
- Aligned nonprofits (EFF, Access Now)
**Invitation Method**: Personal email, curated list
**Feedback**: Structured survey + ongoing dialogue
---
## Slide 13: Phase 2 → Phase 3 Transition
### When to Proceed to Public Launch
**Exit Criteria**:
- All Phase 2 success metrics met
- Soft launch feedback positive (4+/5)
- Zero critical bugs
- Governance audit complete
- Your approval to proceed
**Phase 3 Preview** (3-6 months):
- Public launch & marketing campaign
- Koha donation system (micropayments)
- Multi-language support
- Community forums
- Academic partnerships
- Bug bounty program
**Not rushing**: Phase 2 soft launch could extend if needed for quality
---
## Slide 14: World-Class UI/UX Focus
### Excellence Standards
**Design Principles**:
- **Clarity over cleverness**: Users understand immediately
- **Accessibility first**: WCAG AA minimum, AAA aspirational
- **Performance**: <3s load, optimized for 3G networks
- **Consistency**: Design system for all components
- **Respect**: No dark patterns, honest communication
**Continuous Improvement**:
- User testing (soft launch feedback)
- Analytics (privacy-respecting, Plausible)
- A/B testing (ethical, transparent)
- Regular UX audits
**Benchmark**: Best-in-class documentation sites (Stripe, Tailwind, Anthropic)
---
## Slide 15: Next Steps (Action Items)
### What Happens Now?
**Immediate** (This Week):
- [ ] Sign TRA-OPS-* governance documents (formal approval)
- [ ] Provision OVHCloud VPS Essential (Singapore preferred)
- [ ] Create Anthropic Claude API account (production key)
- [ ] Set up payment methods (OVHCloud + Anthropic)
- [ ] Generate JWT secrets, MongoDB passwords (secure)
**Week 1-2**:
- [ ] Deploy infrastructure (server setup, SSL, security)
- [ ] Configure DNS (agenticgovernance.digital server IP)
- [ ] Deploy application code (Git-based workflow)
- [ ] Test production environment (health checks, monitoring)
**Week 3-4**:
- [ ] Integrate Claude API (test endpoints)
- [ ] Build blog curation pipeline
- [ ] Implement media triage system
- [ ] Launch case study portal
**Week 5-12**:
- [ ] Execute Phase 2 roadmap
- [ ] Weekly progress updates
- [ ] Soft launch preparation
---
## Slide 16: Your Role (John Stroh)
### What We Need From You
**Strategic Decisions**:
- Final approval on governance documents (sign-off)
- Soft launch user cohort selection (who to invite)
- Editorial direction (blog topics, tone)
- Phase 3 go/no-go decision
**Operational Tasks**:
- Blog content review & approval (2-4 posts/month)
- Media inquiry responses (HIGH priority, escalations)
- Case study moderation (assist admin if needed)
- Monthly budget review
**Time Commitment**:
- Phase 2 setup: 5-10 hours (one-time)
- Ongoing moderation: 5-10 hours/week
- Strategic reviews: 2 hours/month
**Support Available**:
- Claude Code for technical implementation
- Admin reviewer (if hired) for routine moderation
- Automated systems for monitoring, backups
---
## Slide 17: Why This Matters
### The Bigger Picture
**Problem**: AI safety approaches rely on behavioral alignment
**Limitation**: Alignment breaks down as capabilities scale
**Tractatus Approach**: Architectural constraints (structural safety)
**Advantage**: Safety guarantees independent of capability level
**This Platform**:
- **Demonstrates** the framework in production
- **Educates** researchers, implementers, advocates
- **Catalyzes** adoption (open source, replicable)
- **Influences** policy (proof of concept for regulation)
**Goal**: Make architectural AI safety the industry standard
---
## Slide 18: Questions & Discussion
### Open Issues for Discussion
**Technical**:
- OVHCloud region preference? (Singapore vs. Australia)
- Backup strategy: On-server only or off-site? (Backblaze B2)
- CDN needed? (Cloudflare basic or skip entirely)
**Content**:
- Initial blog topics? (27027 incident, framework intro, etc.)
- Soft launch invitation timing? (End of Month 2 or Month 3?)
- Media outreach? (Proactive or reactive only?)
**Governance**:
- Admin reviewer hiring? (Phase 2 or Phase 3?)
- Editorial board formation? (Phase 3 or later?)
- External audit? (Annual or Phase 3 milestone?)
**Anything else?**
---
## Slide 19: Summary & Approval
### Phase 2 Ready to Launch
**Approved** :
- Budget: $550 (Phase 2), $100-150/month (ongoing)
- Timeline: 2-3 months, starting NOW
- Governance: 5 TRA-OPS-* policies
- Infrastructure: OVHCloud VPS Essential
- AI Strategy: Blog, media, case studies with human oversight
**Deliverables**:
- Production site at agenticgovernance.digital
- Blog curation system (2-4 posts/month)
- Media inquiry triage (5-20 inquiries/month)
- Case study portal (3-5 submissions/month)
- Soft launch to 20-50 users
**Next Action**: Begin Week 1 infrastructure deployment
---
## Slide 20: Appendix - Resources
### Key Documents
**Planning**:
- PHASE-2-ROADMAP.md (comprehensive 3-month plan)
- PHASE-2-COST-ESTIMATES.md (budget breakdown)
- PHASE-2-INFRASTRUCTURE-PLAN.md (technical specs, deployment)
**Governance**:
- TRA-OPS-0001: AI Content Generation Policy
- TRA-OPS-0002: Blog Editorial Guidelines
- TRA-OPS-0003: Media Inquiry Response Protocol
- TRA-OPS-0004: Case Study Moderation Standards
- TRA-OPS-0005: Human Oversight Requirements
**Technical**:
- API Reference: /docs/api-reference.html
- Tractatus Framework Spec: /docs/technical-proposal.md
**Location**: `/home/theflow/projects/tractatus/docs/` and `governance/`
---
## Thank You
**Questions?**
**Ready to deploy?** Let's build world-class AI safety infrastructure.
---
**Presentation prepared by**: Claude Code (Anthropic Sonnet 4.5)
**Date**: 2025-10-07
**Status**: APPROVED - Phase 2 begins NOW
**Domain**: agenticgovernance.digital