- Create Economist SubmissionTracking package correctly: * mainArticle = full blog post content * coverLetter = 216-word SIR— letter * Links to blog post via blogPostId - Archive 'Letter to The Economist' from blog posts (it's the cover letter) - Fix date display on article cards (use published_at) - Target publication already displaying via blue badge Database changes: - Make blogPostId optional in SubmissionTracking model - Economist package ID: 68fa85ae49d4900e7f2ecd83 - Le Monde package ID: 68fa2abd2e6acd5691932150 Next: Enhanced modal with tabs, validation, export 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>
533 lines
14 KiB
Markdown
533 lines
14 KiB
Markdown
# Phase 2: Production Deployment & AI Features
|
|
## Tractatus AI Safety Framework
|
|
|
|
**Presented by**: Claude Code (Anthropic Sonnet 4.5)
|
|
**Prepared for**: John Stroh
|
|
**Date**: 2025-10-07
|
|
**Status**: APPROVED - Ready to Begin
|
|
|
|
---
|
|
|
|
## Slide 1: Executive Summary
|
|
|
|
### Phase 2 Overview
|
|
|
|
**Goal**: Transform local prototype → production platform with AI-powered features
|
|
|
|
**Timeline**: 2-3 months (starting NOW)
|
|
|
|
**Budget**:
|
|
- Total Phase 2: **$550 USD** (~$900 NZD)
|
|
- Ongoing: **$100-150/month** (~$165-250 NZD)
|
|
|
|
**Domain**: **agenticgovernance.digital** ✅ Registered
|
|
|
|
**Status**: All approvals granted, ready to deploy
|
|
|
|
---
|
|
|
|
## Slide 2: What We Built (Phase 1 Recap)
|
|
|
|
### Phase 1 Achievements ✅
|
|
|
|
**Infrastructure**:
|
|
- MongoDB database (tractatus_dev)
|
|
- Express application (port 9000)
|
|
- 118 integration tests (100% passing)
|
|
|
|
**Features**:
|
|
- Three audience paths (Researcher, Implementer, Advocate)
|
|
- Interactive demos (27027 incident, classification, boundary)
|
|
- Document viewer with 12+ technical papers
|
|
- Admin dashboard with moderation workflows
|
|
- API reference documentation
|
|
|
|
**Quality**:
|
|
- WCAG AA accessibility
|
|
- CSP compliance (script-src 'self')
|
|
- 85.3%+ test coverage on Tractatus services
|
|
- Mobile responsive
|
|
|
|
---
|
|
|
|
## Slide 3: What We're Building (Phase 2)
|
|
|
|
### Production Platform + AI Features
|
|
|
|
**Month 1: Infrastructure** (Weeks 1-4)
|
|
- Deploy to OVHCloud VPS (agenticgovernance.digital)
|
|
- SSL/TLS, security hardening, monitoring
|
|
- Nginx reverse proxy, automated backups
|
|
|
|
**Month 2: AI-Powered Features** (Weeks 5-8)
|
|
- Blog curation system (AI-assisted, human-approved)
|
|
- Media inquiry triage (classification + auto-drafts)
|
|
- Case study portal (community submissions)
|
|
|
|
**Month 3: Polish & Soft Launch** (Weeks 9-12)
|
|
- Governance enforcement audit
|
|
- End-to-end testing
|
|
- Soft launch to 20-50 users
|
|
- Feedback collection & iteration
|
|
|
|
---
|
|
|
|
## Slide 4: The Dogfooding Principle
|
|
|
|
### Tractatus Governs Itself
|
|
|
|
**Core Principle**: *"What cannot be systematized must not be automated."*
|
|
|
|
**Implementation**:
|
|
|
|
| AI Operation | Quadrant | Human Oversight |
|
|
|--------------|----------|-----------------|
|
|
| Blog topic suggestion | STOCHASTIC | Human selects topics |
|
|
| Blog outline generation | OPERATIONAL | Human reviews structure |
|
|
| **Blog publication decision** | **STRATEGIC** | **Human approves** |
|
|
| Media inquiry classification | OPERATIONAL | Human verifies |
|
|
| **Media response sending** | **STRATEGIC** | **Human approves** |
|
|
| Case study relevance analysis | OPERATIONAL | Human reviews |
|
|
| **Case study publication** | **STRATEGIC** | **Human approves** |
|
|
|
|
**Zero Tolerance**: AI cannot make values decisions without human approval
|
|
|
|
---
|
|
|
|
## Slide 5: Governance Framework (TRA-OPS-*)
|
|
|
|
### 5 Operational Policies Created
|
|
|
|
**TRA-OPS-0001**: AI Content Generation Policy (Master)
|
|
- Mandatory human approval for all public content
|
|
- Boundary enforcement (values require humans)
|
|
- $200/month API budget cap
|
|
|
|
**TRA-OPS-0002**: Blog Editorial Guidelines
|
|
- 4 content categories, citation standards
|
|
- AI assists; humans write & approve
|
|
|
|
**TRA-OPS-0003**: Media Inquiry Response Protocol
|
|
- AI classification + priority scoring
|
|
- No auto-send; all responses human-approved
|
|
|
|
**TRA-OPS-0004**: Case Study Moderation Standards
|
|
- Community submissions, AI relevance analysis
|
|
- Quality checklist, human publication decision
|
|
|
|
**TRA-OPS-0005**: Human Oversight Requirements
|
|
- Admin reviewer role & training
|
|
- SLAs: 4h (HIGH media), 48h (blog), 7d (case studies)
|
|
|
|
---
|
|
|
|
## Slide 6: Budget Breakdown
|
|
|
|
### Where the Money Goes
|
|
|
|
**One-Time Costs** (~$100):
|
|
- Domain (already paid)
|
|
- SSL certificates (Let's Encrypt - free)
|
|
- Initial security audit tools
|
|
|
|
**Monthly Recurring** (~$100-150):
|
|
- **Hosting** (OVHCloud VPS Essential): **$30**
|
|
- 2 vCores, 4GB RAM, 80GB SSD
|
|
- 1,000-5,000 visitors/month capacity
|
|
- **Claude API** (Sonnet 4.5): **$50**
|
|
- 30 blog outlines/month
|
|
- 50 media inquiries/month
|
|
- 20 case study analyses/month
|
|
- **Backups & Monitoring**: **$10-20**
|
|
- Off-site backups
|
|
- Uptime monitoring
|
|
- Error tracking (Sentry free tier)
|
|
|
|
**Total 3-Month Phase 2**: $550 USD (~$900 NZD)
|
|
|
|
---
|
|
|
|
## Slide 7: Infrastructure Architecture
|
|
|
|
### Production Stack
|
|
|
|
```
|
|
┌─────────────────┐
|
|
│ Internet │
|
|
└────────┬────────┘
|
|
│
|
|
┌────▼────┐
|
|
│ OVHCloud│ agenticgovernance.digital
|
|
│ DNS │ (No Cloudflare - sovereignty)
|
|
└────┬────┘
|
|
│
|
|
┌────▼────┐
|
|
│ Nginx │ SSL/TLS (Let's Encrypt)
|
|
│ :80/443 │ Reverse Proxy + Security Headers
|
|
└────┬────┘
|
|
│
|
|
┌────▼────┐
|
|
│ Node.js │ Tractatus Application
|
|
│ :9000 │ Express 4.x
|
|
└────┬────┘
|
|
│
|
|
┌────▼────┐
|
|
│ MongoDB │ tractatus_prod
|
|
│ :27017 │ 7.x with authentication
|
|
└─────────┘
|
|
```
|
|
|
|
**Security**: UFW firewall, Fail2ban, SSH key-only, automated updates
|
|
|
|
---
|
|
|
|
## Slide 8: AI Features in Detail
|
|
|
|
### Blog Curation System
|
|
|
|
**AI Role**: Suggest topics, generate outlines
|
|
**Human Role**: Select topics, write drafts, approve publication
|
|
|
|
**Workflow**:
|
|
1. AI scans AI safety news (weekly)
|
|
2. AI suggests 5-10 topics → Human selects 1-3
|
|
3. AI generates outline → Human reviews & edits
|
|
4. **Human writes full draft** (AI does NOT write)
|
|
5. Admin final approval → Publish
|
|
|
|
**Target**: 2-4 posts/month (8-16 total in Phase 2)
|
|
|
|
---
|
|
|
|
### Media Inquiry Triage
|
|
|
|
**AI Role**: Classify, prioritize, draft responses
|
|
**Human Role**: Verify, decide, send
|
|
|
|
**Categories**:
|
|
- **Press** (HIGH priority, 4h SLA)
|
|
- **Academic** (MEDIUM, 48h SLA)
|
|
- **Commercial** (MEDIUM, 7 days)
|
|
- **Community** (LOW, 14 days)
|
|
- **Spam** (IGNORE)
|
|
|
|
**Expected Volume**: 5-20 inquiries/month (soft launch)
|
|
|
|
---
|
|
|
|
### Case Study Portal
|
|
|
|
**AI Role**: Assess relevance, map to Tractatus framework
|
|
**Human Role**: Moderate, approve publication
|
|
|
|
**Submission Categories**:
|
|
- Hallucinations
|
|
- Boundary violations (AI making values decisions)
|
|
- Instruction overrides (27027-type)
|
|
- Context failures
|
|
- Bias/discrimination
|
|
|
|
**Target**: 3-5 community submissions/month
|
|
|
|
---
|
|
|
|
## Slide 9: Timeline & Milestones
|
|
|
|
### 12-Week Roadmap
|
|
|
|
**Weeks 1-4: Infrastructure** ✅ Ready to Execute
|
|
- Provision OVHCloud VPS (Singapore/Australia)
|
|
- Deploy application, configure SSL
|
|
- Security hardening, monitoring setup
|
|
- **Milestone**: Site live at https://agenticgovernance.digital
|
|
|
|
**Weeks 5-8: AI Features** ⏳ Awaiting Claude API key
|
|
- Integrate Claude Sonnet 4.5
|
|
- Build blog curation pipeline
|
|
- Implement media triage system
|
|
- Launch case study portal
|
|
- **Milestone**: All AI features operational
|
|
|
|
**Weeks 9-12: Polish & Launch** ⏳ Awaiting user cohort
|
|
- End-to-end testing
|
|
- Governance compliance audit
|
|
- Invite 20-50 soft launch users
|
|
- Collect feedback, iterate
|
|
- **Milestone**: Soft launch complete
|
|
|
|
---
|
|
|
|
## Slide 10: Success Criteria
|
|
|
|
### How We'll Know Phase 2 Succeeded
|
|
|
|
**Technical Success**:
|
|
- ✅ Site live with 99%+ uptime (30 days)
|
|
- ✅ Performance: <3s page load (95th percentile)
|
|
- ✅ Security: Zero critical vulnerabilities
|
|
- ✅ WCAG AA accessibility maintained
|
|
|
|
**Governance Success**:
|
|
- ✅ 100% human approval rate (no AI auto-publish)
|
|
- ✅ Zero boundary violations (values decisions)
|
|
- ✅ Audit trail complete (all AI decisions logged)
|
|
|
|
**User Success**:
|
|
- ✅ 20-50 soft launch users engaged
|
|
- ✅ 4+/5 average satisfaction rating
|
|
- ✅ 50+ readers/blog post average
|
|
- ✅ 5+ media inquiries handled
|
|
|
|
**Business Success**:
|
|
- ✅ Costs <$150/month
|
|
- ✅ Zero data breaches
|
|
- ✅ Positive user feedback
|
|
|
|
---
|
|
|
|
## Slide 11: Risks & Mitigation
|
|
|
|
### What Could Go Wrong?
|
|
|
|
| Risk | Probability | Impact | Mitigation |
|
|
|------|-------------|--------|------------|
|
|
| **Claude API costs exceed budget** | Medium | High | Rate limiting, $200 hard cap, alerts at 80% |
|
|
| **Security breach** | Low | Critical | Security audit, penetration testing, Fail2ban |
|
|
| **AI generates inappropriate content** | Medium | High | Mandatory human approval, no auto-publish |
|
|
| **Server downtime** | Medium | Medium | Monitoring, automated backups, <4h recovery |
|
|
| **Poor user adoption** | Medium | Medium | Clear onboarding, feedback loops, iteration |
|
|
|
|
**Overall Risk**: **LOW** - Strong governance, conservative approach
|
|
|
|
---
|
|
|
|
## Slide 12: Soft Launch Strategy
|
|
|
|
### Who Gets Early Access?
|
|
|
|
**Target Cohort**: 20-50 users across 3 audiences
|
|
|
|
**Researchers** (8-12 users):
|
|
- AI safety academics
|
|
- Philosophy/ethics researchers
|
|
- Computer science PhD students
|
|
|
|
**Implementers** (8-12 users):
|
|
- AI engineers at aligned companies
|
|
- Open-source AI developers
|
|
- Technical architects
|
|
|
|
**Advocates** (4-6 users):
|
|
- AI policy professionals
|
|
- Digital rights organizations
|
|
- Aligned nonprofits (EFF, Access Now)
|
|
|
|
**Invitation Method**: Personal email, curated list
|
|
|
|
**Feedback**: Structured survey + ongoing dialogue
|
|
|
|
---
|
|
|
|
## Slide 13: Phase 2 → Phase 3 Transition
|
|
|
|
### When to Proceed to Public Launch
|
|
|
|
**Exit Criteria**:
|
|
- All Phase 2 success metrics met ✅
|
|
- Soft launch feedback positive (4+/5) ✅
|
|
- Zero critical bugs ✅
|
|
- Governance audit complete ✅
|
|
- Your approval to proceed ✅
|
|
|
|
**Phase 3 Preview** (3-6 months):
|
|
- Public launch & marketing campaign
|
|
- Koha donation system (micropayments)
|
|
- Multi-language support
|
|
- Community forums
|
|
- Academic partnerships
|
|
- Bug bounty program
|
|
|
|
**Not rushing**: Phase 2 soft launch could extend if needed for quality
|
|
|
|
---
|
|
|
|
## Slide 14: World-Class UI/UX Focus
|
|
|
|
### Excellence Standards
|
|
|
|
**Design Principles**:
|
|
- **Clarity over cleverness**: Users understand immediately
|
|
- **Accessibility first**: WCAG AA minimum, AAA aspirational
|
|
- **Performance**: <3s load, optimized for 3G networks
|
|
- **Consistency**: Design system for all components
|
|
- **Respect**: No dark patterns, honest communication
|
|
|
|
**Continuous Improvement**:
|
|
- User testing (soft launch feedback)
|
|
- Analytics (privacy-respecting, Plausible)
|
|
- A/B testing (ethical, transparent)
|
|
- Regular UX audits
|
|
|
|
**Benchmark**: Best-in-class documentation sites (Stripe, Tailwind, Anthropic)
|
|
|
|
---
|
|
|
|
## Slide 15: Next Steps (Action Items)
|
|
|
|
### What Happens Now?
|
|
|
|
**Immediate** (This Week):
|
|
- [ ] Sign TRA-OPS-* governance documents (formal approval)
|
|
- [ ] Provision OVHCloud VPS Essential (Singapore preferred)
|
|
- [ ] Create Anthropic Claude API account (production key)
|
|
- [ ] Set up payment methods (OVHCloud + Anthropic)
|
|
- [ ] Generate JWT secrets, MongoDB passwords (secure)
|
|
|
|
**Week 1-2**:
|
|
- [ ] Deploy infrastructure (server setup, SSL, security)
|
|
- [ ] Configure DNS (agenticgovernance.digital → server IP)
|
|
- [ ] Deploy application code (Git-based workflow)
|
|
- [ ] Test production environment (health checks, monitoring)
|
|
|
|
**Week 3-4**:
|
|
- [ ] Integrate Claude API (test endpoints)
|
|
- [ ] Build blog curation pipeline
|
|
- [ ] Implement media triage system
|
|
- [ ] Launch case study portal
|
|
|
|
**Week 5-12**:
|
|
- [ ] Execute Phase 2 roadmap
|
|
- [ ] Weekly progress updates
|
|
- [ ] Soft launch preparation
|
|
|
|
---
|
|
|
|
## Slide 16: Your Role (John Stroh)
|
|
|
|
### What We Need From You
|
|
|
|
**Strategic Decisions**:
|
|
- Final approval on governance documents (sign-off)
|
|
- Soft launch user cohort selection (who to invite)
|
|
- Editorial direction (blog topics, tone)
|
|
- Phase 3 go/no-go decision
|
|
|
|
**Operational Tasks**:
|
|
- Blog content review & approval (2-4 posts/month)
|
|
- Media inquiry responses (HIGH priority, escalations)
|
|
- Case study moderation (assist admin if needed)
|
|
- Monthly budget review
|
|
|
|
**Time Commitment**:
|
|
- Phase 2 setup: 5-10 hours (one-time)
|
|
- Ongoing moderation: 5-10 hours/week
|
|
- Strategic reviews: 2 hours/month
|
|
|
|
**Support Available**:
|
|
- Claude Code for technical implementation
|
|
- Admin reviewer (if hired) for routine moderation
|
|
- Automated systems for monitoring, backups
|
|
|
|
---
|
|
|
|
## Slide 17: Why This Matters
|
|
|
|
### The Bigger Picture
|
|
|
|
**Problem**: AI safety approaches rely on behavioral alignment
|
|
**Limitation**: Alignment breaks down as capabilities scale
|
|
|
|
**Tractatus Approach**: Architectural constraints (structural safety)
|
|
**Advantage**: Safety guarantees independent of capability level
|
|
|
|
**This Platform**:
|
|
- **Demonstrates** the framework in production
|
|
- **Educates** researchers, implementers, advocates
|
|
- **Catalyzes** adoption (open source, replicable)
|
|
- **Influences** policy (proof of concept for regulation)
|
|
|
|
**Goal**: Make architectural AI safety the industry standard
|
|
|
|
---
|
|
|
|
## Slide 18: Questions & Discussion
|
|
|
|
### Open Issues for Discussion
|
|
|
|
**Technical**:
|
|
- OVHCloud region preference? (Singapore vs. Australia)
|
|
- Backup strategy: On-server only or off-site? (Backblaze B2)
|
|
- CDN needed? (Cloudflare basic or skip entirely)
|
|
|
|
**Content**:
|
|
- Initial blog topics? (27027 incident, framework intro, etc.)
|
|
- Soft launch invitation timing? (End of Month 2 or Month 3?)
|
|
- Media outreach? (Proactive or reactive only?)
|
|
|
|
**Governance**:
|
|
- Admin reviewer hiring? (Phase 2 or Phase 3?)
|
|
- Editorial board formation? (Phase 3 or later?)
|
|
- External audit? (Annual or Phase 3 milestone?)
|
|
|
|
**Anything else?**
|
|
|
|
---
|
|
|
|
## Slide 19: Summary & Approval
|
|
|
|
### Phase 2 Ready to Launch
|
|
|
|
**Approved** ✅:
|
|
- Budget: $550 (Phase 2), $100-150/month (ongoing)
|
|
- Timeline: 2-3 months, starting NOW
|
|
- Governance: 5 TRA-OPS-* policies
|
|
- Infrastructure: OVHCloud VPS Essential
|
|
- AI Strategy: Blog, media, case studies with human oversight
|
|
|
|
**Deliverables**:
|
|
- Production site at agenticgovernance.digital
|
|
- Blog curation system (2-4 posts/month)
|
|
- Media inquiry triage (5-20 inquiries/month)
|
|
- Case study portal (3-5 submissions/month)
|
|
- Soft launch to 20-50 users
|
|
|
|
**Next Action**: Begin Week 1 infrastructure deployment
|
|
|
|
---
|
|
|
|
## Slide 20: Appendix - Resources
|
|
|
|
### Key Documents
|
|
|
|
**Planning**:
|
|
- PHASE-2-ROADMAP.md (comprehensive 3-month plan)
|
|
- PHASE-2-COST-ESTIMATES.md (budget breakdown)
|
|
- PHASE-2-INFRASTRUCTURE-PLAN.md (technical specs, deployment)
|
|
|
|
**Governance**:
|
|
- TRA-OPS-0001: AI Content Generation Policy
|
|
- TRA-OPS-0002: Blog Editorial Guidelines
|
|
- TRA-OPS-0003: Media Inquiry Response Protocol
|
|
- TRA-OPS-0004: Case Study Moderation Standards
|
|
- TRA-OPS-0005: Human Oversight Requirements
|
|
|
|
**Technical**:
|
|
- API Reference: /docs/api-reference.html
|
|
- Tractatus Framework Spec: /docs/technical-proposal.md
|
|
|
|
**Location**: `/home/theflow/projects/tractatus/docs/` and `governance/`
|
|
|
|
---
|
|
|
|
## Thank You
|
|
|
|
**Questions?**
|
|
|
|
**Ready to deploy?** → Let's build world-class AI safety infrastructure.
|
|
|
|
---
|
|
|
|
**Presentation prepared by**: Claude Code (Anthropic Sonnet 4.5)
|
|
**Date**: 2025-10-07
|
|
**Status**: APPROVED - Phase 2 begins NOW
|
|
**Domain**: agenticgovernance.digital ✅
|