TheFlow 2298d36bed fix(submissions): restructure Economist package and fix article display

- Create Economist SubmissionTracking package correctly:
  * mainArticle = full blog post content
  * coverLetter = 216-word SIR— letter
  * Links to blog post via blogPostId
- Archive 'Letter to The Economist' from blog posts (it's the cover letter)
- Fix date display on article cards (use published_at)
- Target publication already displaying via blue badge

Database changes:
- Make blogPostId optional in SubmissionTracking model
- Economist package ID: 68fa85ae49d4900e7f2ecd83
- Le Monde package ID: 68fa2abd2e6acd5691932150

Next: Enhanced modal with tabs, validation, export

🤖 Generated with [Claude Code](https://claude.com/claude-code)

Co-Authored-By: Claude <noreply@anthropic.com>

2025-10-24 08:47:42 +13:00

14 KiB

Raw Blame History

Phase 2: Production Deployment & AI Features

Tractatus AI Safety Framework

Presented by: Claude Code (Anthropic Sonnet 4.5) Prepared for: John Stroh Date: 2025-10-07 Status: APPROVED - Ready to Begin

Slide 1: Executive Summary

Phase 2 Overview

Goal: Transform local prototype → production platform with AI-powered features

Timeline: 2-3 months (starting NOW)

Budget:

Total Phase 2: $550 USD (~$900 NZD)
Ongoing: $100-150/month (~$165-250 NZD)

Domain: agenticgovernance.digital ✅ Registered

Status: All approvals granted, ready to deploy

Slide 2: What We Built (Phase 1 Recap)

Phase 1 Achievements ✅

Infrastructure:

MongoDB database (tractatus_dev)
Express application (port 9000)
118 integration tests (100% passing)

Features:

Three audience paths (Researcher, Implementer, Advocate)
Interactive demos (27027 incident, classification, boundary)
Document viewer with 12+ technical papers
Admin dashboard with moderation workflows
API reference documentation

Quality:

WCAG AA accessibility
CSP compliance (script-src 'self')
85.3%+ test coverage on Tractatus services
Mobile responsive

Slide 3: What We're Building (Phase 2)

Production Platform + AI Features

Month 1: Infrastructure (Weeks 1-4)

Deploy to OVHCloud VPS (agenticgovernance.digital)
SSL/TLS, security hardening, monitoring
Nginx reverse proxy, automated backups

Month 2: AI-Powered Features (Weeks 5-8)

Blog curation system (AI-assisted, human-approved)
Media inquiry triage (classification + auto-drafts)
Case study portal (community submissions)

Month 3: Polish & Soft Launch (Weeks 9-12)

Governance enforcement audit
End-to-end testing
Soft launch to 20-50 users
Feedback collection & iteration

Slide 4: The Dogfooding Principle

Tractatus Governs Itself

Core Principle: "What cannot be systematized must not be automated."

Implementation:

AI Operation	Quadrant	Human Oversight
Blog topic suggestion	STOCHASTIC	Human selects topics
Blog outline generation	OPERATIONAL	Human reviews structure
Blog publication decision	STRATEGIC	Human approves
Media inquiry classification	OPERATIONAL	Human verifies
Media response sending	STRATEGIC	Human approves
Case study relevance analysis	OPERATIONAL	Human reviews
Case study publication	STRATEGIC	Human approves

Zero Tolerance: AI cannot make values decisions without human approval

Slide 5: Governance Framework (TRA-OPS-*)

5 Operational Policies Created

TRA-OPS-0001: AI Content Generation Policy (Master)

Mandatory human approval for all public content
Boundary enforcement (values require humans)
$200/month API budget cap

TRA-OPS-0002: Blog Editorial Guidelines

4 content categories, citation standards
AI assists; humans write & approve

TRA-OPS-0003: Media Inquiry Response Protocol

AI classification + priority scoring
No auto-send; all responses human-approved

TRA-OPS-0004: Case Study Moderation Standards

Community submissions, AI relevance analysis
Quality checklist, human publication decision

TRA-OPS-0005: Human Oversight Requirements

Admin reviewer role & training
SLAs: 4h (HIGH media), 48h (blog), 7d (case studies)

Slide 6: Budget Breakdown

Where the Money Goes

One-Time Costs (~$100):

Domain (already paid)
SSL certificates (Let's Encrypt - free)
Initial security audit tools

Monthly Recurring (~$100-150):

Hosting (OVHCloud VPS Essential): $30
- 2 vCores, 4GB RAM, 80GB SSD
- 1,000-5,000 visitors/month capacity
Claude API (Sonnet 4.5): $50
- 30 blog outlines/month
- 50 media inquiries/month
- 20 case study analyses/month
Backups & Monitoring: $10-20
- Off-site backups
- Uptime monitoring
- Error tracking (Sentry free tier)

Total 3-Month Phase 2: $550 USD (~$900 NZD)

Slide 7: Infrastructure Architecture

Production Stack

┌─────────────────┐
│   Internet      │
└────────┬────────┘
         │
    ┌────▼────┐
    │ OVHCloud│ agenticgovernance.digital
    │   DNS   │ (No Cloudflare - sovereignty)
    └────┬────┘
         │
    ┌────▼────┐
    │  Nginx  │ SSL/TLS (Let's Encrypt)
    │ :80/443 │ Reverse Proxy + Security Headers
    └────┬────┘
         │
    ┌────▼────┐
    │ Node.js │ Tractatus Application
    │  :9000  │ Express 4.x
    └────┬────┘
         │
    ┌────▼────┐
    │ MongoDB │ tractatus_prod
    │ :27017  │ 7.x with authentication
    └─────────┘

Security: UFW firewall, Fail2ban, SSH key-only, automated updates

Slide 8: AI Features in Detail

Blog Curation System

AI Role: Suggest topics, generate outlines Human Role: Select topics, write drafts, approve publication

Workflow:

AI scans AI safety news (weekly)
AI suggests 5-10 topics → Human selects 1-3
AI generates outline → Human reviews & edits
Human writes full draft (AI does NOT write)
Admin final approval → Publish

Target: 2-4 posts/month (8-16 total in Phase 2)

Media Inquiry Triage

AI Role: Classify, prioritize, draft responses Human Role: Verify, decide, send

Categories:

Press (HIGH priority, 4h SLA)
Academic (MEDIUM, 48h SLA)
Commercial (MEDIUM, 7 days)
Community (LOW, 14 days)
Spam (IGNORE)

Expected Volume: 5-20 inquiries/month (soft launch)

Case Study Portal

AI Role: Assess relevance, map to Tractatus framework Human Role: Moderate, approve publication

Submission Categories:

Hallucinations
Boundary violations (AI making values decisions)
Instruction overrides (27027-type)
Context failures
Bias/discrimination

Target: 3-5 community submissions/month

Slide 9: Timeline & Milestones

12-Week Roadmap

Weeks 1-4: Infrastructure ✅ Ready to Execute

Provision OVHCloud VPS (Singapore/Australia)
Deploy application, configure SSL
Security hardening, monitoring setup
Milestone: Site live at https://agenticgovernance.digital

Weeks 5-8: AI Features ⏳ Awaiting Claude API key

Integrate Claude Sonnet 4.5
Build blog curation pipeline
Implement media triage system
Launch case study portal
Milestone: All AI features operational

Weeks 9-12: Polish & Launch ⏳ Awaiting user cohort

End-to-end testing
Governance compliance audit
Invite 20-50 soft launch users
Collect feedback, iterate
Milestone: Soft launch complete

Slide 10: Success Criteria

How We'll Know Phase 2 Succeeded

Technical Success:

✅ Site live with 99%+ uptime (30 days)
✅ Performance: <3s page load (95th percentile)
✅ Security: Zero critical vulnerabilities
✅ WCAG AA accessibility maintained

Governance Success:

✅ 100% human approval rate (no AI auto-publish)
✅ Zero boundary violations (values decisions)
✅ Audit trail complete (all AI decisions logged)

User Success:

✅ 20-50 soft launch users engaged
✅ 4+/5 average satisfaction rating
✅ 50+ readers/blog post average
✅ 5+ media inquiries handled

Business Success:

✅ Costs <$150/month
✅ Zero data breaches
✅ Positive user feedback

Slide 11: Risks & Mitigation

What Could Go Wrong?

Risk	Probability	Impact	Mitigation
Claude API costs exceed budget	Medium	High	Rate limiting, $200 hard cap, alerts at 80%
Security breach	Low	Critical	Security audit, penetration testing, Fail2ban
AI generates inappropriate content	Medium	High	Mandatory human approval, no auto-publish
Server downtime	Medium	Medium	Monitoring, automated backups, <4h recovery
Poor user adoption	Medium	Medium	Clear onboarding, feedback loops, iteration

Overall Risk: LOW - Strong governance, conservative approach

Slide 12: Soft Launch Strategy

Who Gets Early Access?

Target Cohort: 20-50 users across 3 audiences

Researchers (8-12 users):

AI safety academics
Philosophy/ethics researchers
Computer science PhD students

Implementers (8-12 users):

AI engineers at aligned companies
Open-source AI developers
Technical architects

Advocates (4-6 users):

AI policy professionals
Digital rights organizations
Aligned nonprofits (EFF, Access Now)

Invitation Method: Personal email, curated list

Feedback: Structured survey + ongoing dialogue

Slide 13: Phase 2 → Phase 3 Transition

When to Proceed to Public Launch

Exit Criteria:

All Phase 2 success metrics met ✅
Soft launch feedback positive (4+/5) ✅
Zero critical bugs ✅
Governance audit complete ✅
Your approval to proceed ✅

Phase 3 Preview (3-6 months):

Public launch & marketing campaign
Koha donation system (micropayments)
Multi-language support
Community forums
Academic partnerships
Bug bounty program

Not rushing: Phase 2 soft launch could extend if needed for quality

Slide 14: World-Class UI/UX Focus

Excellence Standards

Design Principles:

Clarity over cleverness: Users understand immediately
Accessibility first: WCAG AA minimum, AAA aspirational
Performance: <3s load, optimized for 3G networks
Consistency: Design system for all components
Respect: No dark patterns, honest communication

Continuous Improvement:

User testing (soft launch feedback)
Analytics (privacy-respecting, Plausible)
A/B testing (ethical, transparent)
Regular UX audits

Benchmark: Best-in-class documentation sites (Stripe, Tailwind, Anthropic)

Slide 15: Next Steps (Action Items)

What Happens Now?

Immediate (This Week):

Sign TRA-OPS-* governance documents (formal approval)
Provision OVHCloud VPS Essential (Singapore preferred)
Create Anthropic Claude API account (production key)
Set up payment methods (OVHCloud + Anthropic)
Generate JWT secrets, MongoDB passwords (secure)

Week 1-2:

Deploy infrastructure (server setup, SSL, security)
Configure DNS (agenticgovernance.digital → server IP)
Deploy application code (Git-based workflow)
Test production environment (health checks, monitoring)

Week 3-4:

Integrate Claude API (test endpoints)
Build blog curation pipeline
Implement media triage system
Launch case study portal

Week 5-12:

Execute Phase 2 roadmap
Weekly progress updates
Soft launch preparation

Slide 16: Your Role (John Stroh)

What We Need From You

Strategic Decisions:

Final approval on governance documents (sign-off)
Soft launch user cohort selection (who to invite)
Editorial direction (blog topics, tone)
Phase 3 go/no-go decision

Operational Tasks:

Blog content review & approval (2-4 posts/month)
Media inquiry responses (HIGH priority, escalations)
Case study moderation (assist admin if needed)
Monthly budget review

Time Commitment:

Phase 2 setup: 5-10 hours (one-time)
Ongoing moderation: 5-10 hours/week
Strategic reviews: 2 hours/month

Support Available:

Claude Code for technical implementation
Admin reviewer (if hired) for routine moderation
Automated systems for monitoring, backups

Slide 17: Why This Matters

The Bigger Picture

Problem: AI safety approaches rely on behavioral alignment Limitation: Alignment breaks down as capabilities scale

Tractatus Approach: Architectural constraints (structural safety) Advantage: Safety guarantees independent of capability level

This Platform:

Demonstrates the framework in production
Educates researchers, implementers, advocates
Catalyzes adoption (open source, replicable)
Influences policy (proof of concept for regulation)

Goal: Make architectural AI safety the industry standard

Slide 18: Questions & Discussion

Open Issues for Discussion

Technical:

OVHCloud region preference? (Singapore vs. Australia)
Backup strategy: On-server only or off-site? (Backblaze B2)
CDN needed? (Cloudflare basic or skip entirely)

Content:

Initial blog topics? (27027 incident, framework intro, etc.)
Soft launch invitation timing? (End of Month 2 or Month 3?)
Media outreach? (Proactive or reactive only?)

Governance:

Admin reviewer hiring? (Phase 2 or Phase 3?)
Editorial board formation? (Phase 3 or later?)
External audit? (Annual or Phase 3 milestone?)

Anything else?

Slide 19: Summary & Approval

Phase 2 Ready to Launch

Approved ✅:

Budget: $550 (Phase 2), $100-150/month (ongoing)
Timeline: 2-3 months, starting NOW
Governance: 5 TRA-OPS-* policies
Infrastructure: OVHCloud VPS Essential
AI Strategy: Blog, media, case studies with human oversight

Deliverables:

Production site at agenticgovernance.digital
Blog curation system (2-4 posts/month)
Media inquiry triage (5-20 inquiries/month)
Case study portal (3-5 submissions/month)
Soft launch to 20-50 users

Next Action: Begin Week 1 infrastructure deployment

Slide 20: Appendix - Resources

Key Documents

Planning:

PHASE-2-ROADMAP.md (comprehensive 3-month plan)
PHASE-2-COST-ESTIMATES.md (budget breakdown)
PHASE-2-INFRASTRUCTURE-PLAN.md (technical specs, deployment)