Personal AI Architecture

A custom-built skills and automation system that extends Claude with persistent memory, domain expertise, and live workflow automation β€” purpose-built for how I actually work.

The Core Idea

Out of the box, AI assistants are generic. This system makes Claude permanently aware of my context, decisions, preferences, and domain-specific rules β€” so I never re-explain myself.

Layer 1 β€” Shared Primitive (the foundation)
brandon-context
Single source of truth. Identity, infrastructure, hard rules. Every other skill loads this β€” nothing is re-declared.
↓ loaded by
Layer 2 β€” Domain Skills (trigger-based, context-aware)
βš™οΈ n8n Automation
Full workflow builder. Pattern library, credential IDs, error handling standards, node type reference.
🌐 Web Project
All decisions for an active site redesign β€” tech stack, pages, CMS, integrations. Never re-asks what's already decided.
🎭 Brand Management
Brand voice, copy rules, representation facts, and hard prohibitions for an active personal brand project.
πŸ’¬ Communication
Personalized communication toolkit for key personal relationships β€” scripts, occasion planning, plain English only.
πŸ‘¨β€πŸ‘§ Relationship Advisor
Frameworks for navigating specific relationship dynamics β€” boundaries, finances, long-term connection.
πŸ“Š + More
Additional domain skills covering other active areas β€” this is a representative sample, not the full set.
↓ governed by
Layer 3 β€” Output & Behavior Standards
πŸ“„ Output Formatting
Consistent, shareable HTML output. Font stack, sizing, layout rules β€” applied automatically across all deliverables.
🧠 Operating Mode
Chief-of-staff mode. Direct, unhedged, no fluff. Proactively surfaces risks and cross-domain connections.
πŸ“ Long-Term Memory
Infrastructure, career context, projects, goals, and decisions persist across sessions automatically.
The primitive pattern β€” the shared context file is a single source of truth that all other skills reference instead of re-declaring. Same DRY (Don't Repeat Yourself) principle engineers use in software. When a fact changes once, it changes everywhere.
What this means in practice
  • Skills fire automatically based on what I ask β€” no manual loading
  • Context is never lost between sessions
  • Decisions made once are respected forever
  • Output format is consistent and shareable without post-processing
  • Claude acts as a chief of staff β€” it anticipates, not just responds

Skills System

Each skill is a structured markdown file that loads into Claude's context when relevant. The examples below are a sample β€” skills are added as new domains and projects require them.

How skills trigger

Each skill has a description block that acts as a trigger. Claude reads what I ask, matches it to skill descriptions, and loads the right one β€” automatically, before responding.

What skills encode

Decisions already made. Infrastructure details. Rules and prohibitions. Output standards. Workflow patterns. The goal: Claude never needs to ask about something I've already figured out.

πŸ”‘
Shared Context
Primitive layer. Identity, infrastructure, hard rules. Never loaded directly β€” referenced by all other skills.
βš™οΈ
Automation Builder
Full workflow builder β€” patterns, credentials, error handling, canvas standards, node reference.
🌐
Web Project
Full site project state β€” all decisions, tech stack, page inventory, integrations. Never re-asked.
🎭
Brand Management
Brand voice, copy guidelines, representation facts, hard prohibitions for an active brand project.
πŸ’¬
Communication
Personalized communication toolkit β€” scripts, occasion planning, plain English only.
πŸ‘¨β€πŸ‘§
Relationship Advisor
Frameworks for specific relationship dynamics β€” boundaries, finances, long-term connection.
πŸ“„
Output Formatting
Consistent, shareable HTML output standard β€” no external dependencies, opens anywhere.
βž•
And More...
Additional skills added as new domains and active projects require them. The system scales.
Design principle: Skills are modular, trigger-aware, and wired to a shared primitive. Adding a new domain means adding one file β€” the rest of the system adapts automatically.

Workflow: AI Writing Assistant

6–9 independent Claude agents collaborate to write, critique, and iteratively revise through up to two editorial rounds β€” each agent operating with zero knowledge of the others.

βœ… ActiveMulti-AgentUp to 9 AI agentsParallel execution

This workflow doesn't use one AI β€” it uses a chain of independent agents, each with blank context, simulating a real editorial team: writer, reader, critic, reviser.

Supported document types
Research ReportAnalytical EssayOpinion / EditorialExecutive BriefWhite PaperBlog PostShort Fiction

Workflow Canvas

AI Writing Assistant workflow canvas
πŸ”§ Design Reasoning
  • Why blank context per agent? A single AI in a long conversation becomes a sycophant β€” it yes-and's its own prior output. Blank context forces each agent to encounter the work cold.
  • Why parallel Supporter + Critic? Prevents each from anchoring on the other's perspective. Two truly independent reads.
  • Why up to two rounds? Diminishing returns. Round 1 catches structural issues. Round 2 catches what survived.
  • Failure mode designed against: Context contamination β€” AI agents converging on agreement the longer they interact.

The Agent Chain

Agent 1
Persona Generator
Creates a fictional expert identity + writing style profile
↓
Agent 2
Writer
Writes the initial full draft β€” blank context, no conversation history
↓
Agent 3
Target Reader Analyzer
Profiles who the piece is written for
↓ Round 1 β€” parallel execution
Agent 4
Supporter (R1)
Compelling target reader β€” substantive but supportive notes
Agent 5
Critic (R1)
Skeptical contra-reader β€” finds logical gaps and weaknesses
↓ feedback merged β†’ Writer revises
Agent 6
Writer Revision R1
Reads both feedback sets and produces Draft V2
↓ if 2-round mode selected
Agent 7
Supporter (R2)
Fresh context β€” zero memory of R1
Agent 8
Critic (R2)
Fresh context β€” harder to satisfy now
↓
Agent 9
Writer Revision R2
Final polished draft
What this is modeling: A real editorial process β€” writer, sympathetic reader, hostile reader, revision. Runs in minutes, costs cents, and the critic never pulls punches.
Technologies involved
n8n (self-hosted)Claude (6–9 instances)Anthropic APIParallel merge nodesConditional routingGmail API

Workflow: Smart Research with Auto Depth

Natural language query routed through AI depth classification, searched at the appropriate depth, synthesized into a structured brief delivered by email.

βœ… ActiveAI-PoweredDual-depth routing

The system decides how deeply to search based on what you asked β€” not a manual toggle.

Workflow Canvas

Smart Research workflow canvas
πŸ”§ Design Reasoning
  • Why Claude Haiku for classification? Classification is low-complexity β€” return one word. Haiku is 5-10x cheaper and faster than Sonnet. Haiku handles triage; Sonnet handles thinking.
  • Why two Tavily search modes? Deep search costs more and takes longer. The classifier ensures it only fires when the query warrants it.
  • Why normalize before synthesis? The synthesis node doesn't need to handle two different data shapes. Cleaner architecture, fewer failure points.
  • Known architectural boundary: Output is intelligence for review, not compliance-grade sourcing. The Quality Gate provides the verification layer for higher-confidence use cases.

How It Works

Technologies involved
n8n (self-hosted)Claude Haiku (classifier)Claude Sonnet (synthesis)Tavily Search APIGmail APIIF routing node

Workflow: Achievement Log

A real-time capture system for professional contributions β€” built on the premise that performance reviews reward people who document well, not just people who perform well.

βœ… ActiveWork / Productivity3 entry points

In most organizations, contribution visibility is a function of recency bias β€” what you did last month matters more than what you did in February. Strong performers who don't document undersell their value at the moment it counts most.

πŸ”§ Design Reasoning
  • Why three entry points? Friction is the enemy of capture rate. Three entry points means the right tool is always within reach.
  • Why a flat file via SSH, not a database? Portability and zero dependency. Opens in any app, survives any migration, readable directly by AI.
  • Why no AI in the capture step? Deliberate. Zero-latency, zero-failure-risk. AI is reserved for synthesis at review time.

Three Ways to Log a Win

Entry Point 1 β€” Web Form

Styled dark-theme form. Dropdown category, free-text entry. Submits and confirms in-browser.

Log a Win form
Entry Point 2 β€” Email Trigger

Send any email with [WIN] in the subject. n8n polls Gmail every minute and auto-ingests it.

Subject: [WIN] Closed Q2 benchmarking partnership
Entry Point 3 β€” iPhone Shortcut

One-tap iPhone shortcut pings a webhook with a voice-to-text note. Captures the thought in under 10 seconds from anywhere β€” no app to open, no form to fill, no friction between the moment and the record.

iPhone form

How It Works

Part of a Larger System β€” Capture β†’ Verify β†’ Synthesize
Step 1
Achievement Log
Captures contributions in real time
β†’
Step 2
Quality Gate
Scores and validates output before use
β†’
Step 3
Synthesis
Comprehensive record drives the review process
Technologies involved
n8n (self-hosted)Gmail APISSH / Mac MiniNotes CLIiPhone ShortcutsWebhook

Workflow: AI Output Quality Gate

The verification layer in a three-part professional system. Accepts any AI-generated text and returns a structured confidence report scored across six quality dimensions.

βœ… ActiveLLM-as-JudgeDual entry6 scoring dimensions

Most AI systems generate output. This one evaluates it. A second Claude instance scores output against a rigorous rubric before it reaches a decision-maker.

Workflow Canvas

Quality Gate workflow canvas
πŸ”§ Design Reasoning
  • Why LLM-as-judge? The scorer uses an explicit rubric, different task framing, and no access to the generation context β€” it encounters the output as a fresh evaluator.
  • Why document-type-aware profiles? Source Fidelity matters deeply for research; it's irrelevant for fiction. Three profiles apply appropriate dimension weights.
  • Why weighted dimensions? For financial services, Source Fidelity and Analytical Neutrality should outweigh Interpretability.
  • Why two entry points? Manual covers ad-hoc use. Webhook enables automated pipelines.
  • Known limitation: Cannot independently verify factual claims against external ground truth. Human review remains the final control for high-stakes decisions.

The Six Scoring Dimensions

  • πŸ“Ž
    Source Fidelity (25% weight in research) β€” Are claims traceable to cited sources?
  • 🎯
    Query Alignment (20% weight) β€” Does the output directly answer what was asked?
  • πŸ”¬
    Claim Specificity (20% weight) β€” Are assertions specific and falsifiable, or vague and generic?
  • βš–οΈ
    Analytical Neutrality (15% weight) β€” Is the tone objective? Flags directional language and hidden assumptions.
  • πŸ”
    Confidence Transparency (15% weight) β€” Does the output distinguish known vs. inferred vs. uncertain?
  • πŸ’‘
    Interpretability (5% weight) β€” Can a reader act on this without ambiguity?
Manual β€” Web Form

Paste any text, select document type, add context, enter email. Receive a scored HTML report with dimension breakdown, key findings, flagged passages, and recommended action.

Automated β€” Webhook

Any other n8n workflow can POST output directly via webhook. The Research workflow pipes its brief here automatically before delivery.

Technologies involved
n8n (self-hosted)LLM-as-Judge patternClaude Sonnet (evaluator)Anthropic APIGmail APIWebhook + Form triggers

System: Morning Intelligence Brief

Five specialized sub-workflows coordinated by an orchestrator β€” runs every weekday at 6:00 AM. Outputs are merged, synthesized by Claude, and delivered as a structured brief before the trading day begins.

βœ… In Daily UseMulti-Workflow SystemAI SynthesisScheduled Mon–Fri 6AM

The most complex system in the stack β€” five workflows running independently and in parallel, in daily production since early 2026.

Orchestrator Canvas

Morning Intelligence orchestrator canvas
πŸ”§ Design Reasoning
  • Why five parallel sub-workflows? Each domain has different data sources, API patterns, and failure modes. A single source failing doesn't take down the whole brief.
  • Why an orchestrator pattern? Linear workflows are sequential. The orchestrator fires all five simultaneously β€” roughly 90 seconds vs. 5 minutes sequentially.
  • Why the "One Thing" synthesis step? More information is not always the goal β€” actionable signal is. Claude Sonnet distills one sentence: the single most important insight before market open.
  • Why Check Day of Week? The brief only runs Monday through Friday. No market summary needed on weekends.
  • Data source coverage is actively expanding β€” the architecture accommodates new sources without structural changes to the orchestrator.

The Five Sub-Workflows

πŸ“ˆ Markets
Real-time market data, index levels, premarket movers, stocks to watch, and earnings calendar.
πŸ“° Headlines
Top news stories filtered for relevance β€” business, finance, and macro events.
πŸ€– Tech & AI
AI industry developments, technology sector news, and emerging trends.
πŸ† Sports
Results and schedules across F1, UFC/BKFC, NBA, MLB, and NHL.
🧠 Orchestrator
Fires all four in parallel, merges outputs, runs One Thing synthesis, delivers the brief.

Sample Output

The brief lands in the inbox at 6:00 AM every weekday. The ONE THING section leads every issue.

Morning Brief sample output
The ONE THING pattern: After four parallel research streams are merged, Claude Sonnet reads the complete dataset and produces one sentence β€” the single most important insight for the day. On May 5: "Oil's decline is lifting equities while semiconductor earnings will determine whether AI investment enthusiasm justifies current valuations." That's not a summary. That's a synthesis.
Stocks to Watch β€” intelligent labeling: Every ticker is tagged with the reason it's included β€” Premarket gainer, Most active, Earnings watch. The workflow classifies each stock by its signal type so the brief is scannable in under 30 seconds.
Runs on your own infrastructure: The footer of every brief reads "Generated at 06:00 AM MST Β· Brandon's Morning Brief Β· Mac Mini Homelab" β€” self-hosted, scheduled, reliable, and fully under personal control.
Technologies involved
n8n (self-hosted)Multi-workflow orchestrationClaude Sonnet (synthesis)Anthropic APIGmail APIParallel merge nodesSchedule trigger

Workflow: Script Breakdown

Upload a PDF document plus context. Claude Sonnet analyzes it and emails back four formatted sections: scene analysis, annotated rehearsal script, clean script, and camera-ready styling direction.

βœ… ActiveAI-PoweredPDF Processing

Upload a PDF, provide any relevant context, and receive a structured professional-grade breakdown in minutes.

Workflow Canvas

Script Breakdown workflow canvas
πŸ”§ Design Reasoning
  • Why native PDF to Claude? Text extraction mangles formatting β€” stage directions, character names, scene headers lost or flattened. Claude's native document understanding preserves what matters.
  • Why single-pass four-section output? One context window β€” all four outputs internally consistent. Four separate calls risk contradictions and cost roughly 4x more in API tokens.
  • Why the PDF binary handling matters: n8n's filesystem mode requires getBinaryDataBuffer() β€” not the standard binary field. Getting this wrong produces a silent failure where Claude receives an empty document.
Four output sections
  • Scene Analysis β€” objectives, subtext, power dynamics, beat-by-beat coaching notes
  • Annotated Rehearsal Script β€” full script with inline director-style coaching
  • Clean Script β€” unadorned, ready for memorization or scene partner use
  • Audition Appearance & Styling β€” camera-calibrated direction on clothing, color palette, hair, makeup, and accessories
Technologies involved
n8n (self-hosted)Claude Sonnet 4.6Anthropic APIPDF binary processingGmail API

Why This Matters for AI Strategy

What looks like personal productivity tooling is actually a working model of enterprise-grade AI integration patterns.

The patterns demonstrated here
  • πŸ”‘
    Shared context primitives β€” DRY principle applied to AI system design. One source of truth, referenced everywhere.
  • 🎯
    Skill-based modularity β€” Discrete, trigger-aware modules. Adding a new domain = adding a new skill file.
  • πŸ€–
    AI as classifier and middleware β€” Right model, right job. Haiku classifies; Sonnet synthesizes.
  • πŸ‘₯
    Multi-agent orchestration β€” Independent blank-context agents, parallel execution, conditional routing, feedback merging.
  • πŸ§ͺ
    LLM-as-judge evaluation β€” AI output scored by a second AI before reaching a decision-maker.
  • ⚑
    Multi-workflow orchestration at scale β€” Five independent workflows coordinated in parallel. Pipeline architecture, not automation scripting.
  • πŸ”„
    Human-in-the-loop design β€” Automate what's safe; confirm what matters.
  • πŸ“‹
    Error handling and observability β€” Every external API call retries twice before alerting. Production ops discipline.
The gap between "I use AI" and "I've built AI-integrated systems" is large. These workflows close that gap β€” they're not prompts, they're architectures.

What's Next

The system is in active use β€” what follows is how it evolves.

πŸ”—
Deeper Integration

Connecting workflows so output from one automatically feeds the next. The Quality Gate wired as a standard downstream step across all content-generating workflows is the near-term priority.

πŸ“‘
Broader Coverage

Extending research and intelligence capabilities to additional domains and data sources. The Morning Intelligence architecture accommodates new sub-workflows without structural changes to the orchestrator.

πŸ“Š
Operational Maturity

Monitoring, alerting, and performance visibility across the full workflow stack. Treated as an operational system, not a collection of scripts.

🧬
Self-Improving Skills

The longer-horizon goal: skills that evaluate their own output against Quality Gate scores over time and iteratively refine their instructions to perform better. Static knowledge encoded once is a starting point β€” dynamic knowledge that updates based on observed performance is the destination. The Quality Gate exists today; closing the feedback loop into the skills layer is what comes next.

Self-hosted stack

n8n runs on a Mac Mini M1 in Docker via OrbStack. Zero dependency on third-party cloud automation services β€” full control, full observability, no subscription lock-in.

The bigger picture

Every pattern here β€” orchestration, evaluation, modular skills, self-improvement β€” maps directly to how enterprise AI systems are designed and operated at scale. This is the same thinking, applied personally first.