Boulder Startup Week 2026 · Thu May 7 · RegenHub

Architecting
Agentic Workflows
for the Lean 2026 Startup

Sophia Stein · AI Architect · Local LLM & Agentic Workflows

agenticarchitect.ai/blog · github.com/sudosoph

By the end of this hour, you'll have an agent running tomorrow morning that does customer discovery while you sleep.

Open source · ~$8–$15/mo · forkable · yours when you walk out.

LIVE

Switching to n8n + Gmail · the agent is mid-flight · narrating drafts as they populate

⌘+tab to browser · stay here ~90 seconds

PART 01 · 5 MIN · FOUNDATIONS

What is an agent, really?

Same loop in every framework. Same three layers in every system. Worth getting on the same page before we go further.

Foundations · the loop every agent runs

The REACT loop.

┌──────────────────────┐ │ REASON │ what should I do next? └──────────┬───────────┘ ↓ ┌──────────────────────┐ │ ACTION │ call a tool · query data · make a draft └──────────┬───────────┘ ↓ ┌──────────────────────┐ │ OBSERVATION │ what came back · was it enough └──────────┬───────────┘ ↓ (loop until stop condition is met)

Why this matters when you debug

Wrong reasoning → bad prompt or model. Wrong action → tool wired wrong. Wrong observation → context engineering issue. Three steps. Same loop in every framework.

Foundations · three layers in every agent

An agent has three layers. Pick each one deliberately.

Layer	What it is	Today's pick	If you outgrow it
Model	The reasoning engine. Decides what to do, drafts output, decides when to stop.	Sonnet 4.6 + Haiku 4.5	Qwen 3.6 (local) · DeepSeek (frontier OSS)
Runtime	Manages the loop, state, tool calls, retries, audit log.	n8n (visual, OSS)	LangGraph (code-first) · Temporal (durable)
Tools	External capabilities the model can invoke.	Claude `web_search` · Firecrawl · Gmail · Sheets	+ MCP servers as you grow (HubSpot · Linear · Stripe)

For the next 25 minutes we're going to make exactly these choices for our discovery agent.

PART 02 · 8 MIN · THE 2026 LANDSCAPE

What changed in 2026
that makes your agent possible?

Five forces. Each one is something specific that landed in the last 18 months and that you can use today.

Landscape · Force 03 · The model market · May 2026

Models available today.

Model	In/M	Out/M	Released	Use for
Claude Opus 4.7	$5	$25	Apr 16	Hard edges only
Claude Sonnet 4.6	$3	$15	—	Drafting · nuance · voice match
Claude Haiku 4.5	$1	$5	Oct 2025	Routing · classify · extract
GPT-5.5	$5	$30	Apr 23	Doubled price · OAI-locked
Gemini 3.1 Pro / Flash	$2 / $0.50	$12 / $3	—	If on GCP / budget tier
Qwen 3.6-27B (local)	~$0	~$0	Apr 2026	Classification · claims Sonnet parity
DeepSeek V3.2	$0.14	$1.10	—	Cheapest frontier reasoning

Today's stack: Sonnet 4.6 + Haiku 4.5. Two models, one cascade — cost reasoning in Part 5.

Landscape · The shape of the lean 2026 team

The new founder math.

2023

4 engineers · product · ops · marketing · support

$40K/mo loaded

2026

1 founder + agent stack · same surface area

$300/mo total

Real proof, not theory

Pieter Levels: $3M+ ARR, zero employees · Ben Broca / Polsia: $1M+ ARR, 1,100 client cos solo. The wall is around $50–150K MRR. That's a runway, not a problem.

Stop asking AI questions.
Start giving AI tasks.

Adapted from Henry Ward, CEO Carta · AI Agent Conference NYC · May 4, 2026

Tasking, concretely

1. Define a typed JSON schema 2. Give 5 in-context examples 3. Set a human checkpoint 4. Hit Run.

Landscape · The contract for this hour

What this hour delivers.

Blueprint for Autonomy

From the description

The 3-axis matrix · 90-sec worksheet · Green/Yellow/Red for your tasks.

Live build-along

From the description

The Founder's Discovery Engine · clonable n8n JSON · MIT licensed on GitHub.

HITL standard

HITL = Human-in-the-Loop

The 4-stage trust ladder · voice.md pattern · 10× bandwidth math.

Plus — 12 transferable patterns & 7 other agents on the same stack.

PART 03 · 8 MIN · BLUEPRINT FOR AUTONOMY

Which 20% of your manual tasks
should be 80% automated?

The same question every founder has. We answer it in eight minutes.

Blueprint · The framework · three questions to ask of any task

The 20/80 Blueprint.

Volume

How often does this task come up?

High — daily/weekly. Low — occasional. Don't automate low-volume; build cost will exceed time saved.

Determinism

Are the steps the same every time?

Deterministic — rules apply. Ambiguous — needs judgment. Ambiguous tasks need a human checkpoint.

Reversibility

If the agent's wrong, what breaks?

Low-stakes — recoverable. Irreversible — not. Irreversible tasks should never be fully automated.

GREEN = high vol × deterministic × reversible · YELLOW = high vol × ambiguous × reversible · HITL · RED = anything irreversible · don't

Blueprint · Examples for a lean startup

Green / Yellow / Red — pick your first agent.

GREEN · automate now

Lead enrichment
Competitor monitoring
Content repurposing
Form/signup triage
Meeting notes → CRM
Daily metric digests

YELLOW · with HITL

Outbound copy
Pricing analysis
Support drafts
Investor updates
Cold outreach
Refund decisions

RED · don't automate

Hiring decisions
Product roadmap
Customer escalations
Strategic pivots
Cofounder conflicts
Legal terms

Today's demo lives in YELLOW — cold outreach with a human checkpoint.

Blueprint · 90 seconds · I'll be quiet · don't watch me · write

Your turn. Five tasks.

Task you do every week	Volume H/L	Determinism D/A	Reversibility Low/High	Color G/Y/R

Pick one Green from your list. That's your first agent.

Before we build · pick your track

Two axes · four tracks. Everyone in this room can build this.

🆓 FREE · zero card

Stack: n8n.cloud trial · Groq Llama 4 Scout · HN Algolia + Reddit JSON · Jina Reader. Cost: $0 for 14 days. Good enough to evaluate; voice match below Sonnet.

💎 PAID · ~$6/mo

Stack: n8n.cloud · Anthropic Sonnet 4.6 + Haiku 4.5 · Firecrawl. Cost: ~$0.21/run · ~$6/mo at default frequency. Best voice match — what the live demo just used.

🧑‍💻 MANUAL build

Walk through 17 steps in TUTORIAL.md. ~45 minutes self-paced. You learn the architecture deeply. Best for technical founders who want to own the stack.

⚡ AUTOPILOT setup

Paste a prompt into Claude for Chrome. The browser agent does signup, OAuth, sheet creation, workflow import. ~10 min hands-off. Best for non-technical visionaries.

You'll get the slides · the speaker notes · the workflow JSONs · the autopilot prompts · the tutorials. Don't memorize anything. Reference. Pick your track this weekend.

PART 04 · 25 MIN · LIVE BUILD-ALONG

The Founder's
Discovery Engine.

A customer-discovery agent that runs while you sleep. n8n + Claude + Gmail. Open source. ~$8–$15/mo.

Build · One-screen spec

What we're building.

DAILY 7AM agent wakes up via cron ↓ LISTENS Hacker News · Reddit · Product Hunt · X for ICP signals ↓ RESEARCHES each lead · pain mentions · hiring posts · complaints ↓ DRAFTS customer-discovery email in YOUR voice (interview ask · not pitch) ↓ DROPS Gmail Drafts · never sends · you approve and send ↓ FOLLOWS UP after 5 days if no reply ↓ DIGEST 7:30am · "5 drafts · 3 follow-ups · 1 reply yesterday"

Cost per run: ~$0.21 · Monthly: ~$6 paid · ~$8 max-DIY · Build today: 25 min

Build · The architecture · the centerpiece

Nine steps · three sub-agents · one workflow.

1 · TRIGGER

Cron daily 7am · Manual webhook for the live demo

↓

2 · READ

Sheets ICP + Drive voice.md · config-as-files pattern

↓

3 · DISCOVERY

Haiku 4.5 + web_search · sub-agent #1 · returns typed JSON [{ person, signal_type, score, … }]

↓

4 · DEDUP

vs Sheets Sent log · idempotency = re-run safety

↓

5 · ENRICH

Firecrawl on top 5 only · Haiku writes a 2-line company summary

↓

6 · DRAFT

Sonnet 4.6 + voice.md · sub-agent #2 · the only premium-token step in the workflow

↓

7 · HITL GATE

Gmail.createDraft · NEVER sends · you approve, you send

↓

8 · DIGEST

Sonnet · sub-agent #3 · daily 7:30am summary email to you

↓

9 · LOG

Append to Sheets Runs · audit + cost meter

Three sub-agents · each gets only the context it needs. None see the whole conversation. Sub-agent decomposition is how you cut cost without losing quality.

Build · Step 1/8 · ~3 min

Triggers — how your agent wakes up.

Trigger	Plain English	When it fires	Founder use case
Cron	A timer · "run this every X"	Schedule · e.g. 7am daily	Recurring jobs · daily / weekly
Webhook	A doorbell · another system rings · agent answers	Event from elsewhere	Form fill · signup · support ticket
Manual	You · clicking Run	You decide	Testing · ad-hoc demos

Today's choice: cron(0 13 * * *) — daily at 7am Boulder time (13:00 UTC during MDT) — plus a manual trigger for the live demo. Almost every SMB agent starts as cron.

Build · Step 2/8 · the most important pattern in this hour

Config-as-files. Not config-as-prompts.

Every input that drives the agent — voice, ICP, schemas, examples, exclusion lists — lives as a versioned editable file in Drive or Sheets. Not buried in workflow nodes.

File	Lives in	Controls	Edited by
`voice.md`	Drive	Tone · words · 5 example emails	Founder · rarely
`icp.md`	Drive	Who we look for · keywords	Founder · monthly
`do-not-contact.csv`	Sheets	Exclusion list	Anyone · any time
`schemas.md`	Drive	Structured JSON output shapes	Architect · rarely

The shorthand: look at every place in your workflow where a string of English drives the agent's behavior. That's a file. Not a prompt. A file. With version history.

Build · Step 3/8 · sub-agent #1

Discovery: one node, one API, one call.

// Claude Haiku 4.5 with web_search tool enabled
{
  "model": "claude-haiku-4-5",
  "tools": [{ "type": "web_search_20250305" }],
  "system": "[ICP keywords from icp.md]",
  "messages": [{
    "role": "user",
    "content": "Find founders publicly displaying ICP signals on
                HN, Reddit, Product Hunt this week. Return JSON:
                [{ person, signal_type, source_url,
                   evidence_quote, score 0-10 }]"
  }]
}

Why Haiku, not Sonnet

Sub-agent #1 is classification. $1/$5 per million tokens is fine. Save Sonnet for nuance. This is the cascade pattern.

Why Claude search, not Perplexity

One vendor · one key · one node. ~$1.50/mo more at our scale. The simplicity is worth it.

Build · Step 4/8 · idempotency = re-run safety

Dedup against state.

Idempotent = running the agent twice produces the same outcome as running it once. Without it: bug fix triggers a re-run · missed cron triggers a re-run · demo on stage triggers a re-run · and you double-email five prospects who block you and tell their friends.

THE PATTERN · three lines in an n8n Code node 1. Maintain a Sheets("Sent") log of every contact. 2. Filter discovery results against the log. 3. Only proceed with the new ones. const sent = await readSheet('Sent', 'email'); const fresh = candidates.filter(c => !sent.includes(c.email)); return fresh;

Plumbing. The unsexy stuff that separates "demo agent" from "agent that runs every day."

Build · Step 5/8 · cheap-fast first

Progressive enrichment.

Discovery (cheap, fast) → all 30 candidates ↓ score · keep top N=5 Enrich (expensive, deep) → top 5 only ↓ Firecrawl on company.com → markdown extract ↓ Haiku 4.5 · "in 2 sentences, what does this co. do?" ↓ Pass to drafting node

About Firecrawl

Apache 2.0 OSS. Self-host on Docker for free or Cloud at $83/mo. Best signal-to-noise on JS-heavy sites.

Build · Step 6/8 · sub-agent #2 · the only premium-token step

Drafting — Sonnet 4.6 + voice.md.

{
  "model": "claude-sonnet-4-6",
  "system": "[contents of voice.md] + [5 example emails]",
  "messages": [{
    "role": "user",
    "content": "Draft a customer-discovery email to {person}.
                Their signal: {evidence_quote}
                Source: {source_url}
                Their company context: {enriched_summary}

                Follow voice.md exactly. Interview ask, not pitch.
                Soft CTA. 80-110 words."
  }]
}

Why Sonnet here

Voice match is where quality matters. Haiku writes generic-friendly. Sonnet reads voice.md and writes like the person who wrote those examples.

Why 5 examples

In-context examples beat prompt instructions. The discipline of writing 5 good emails is what makes voice work. Few-shot > zero-shot.

Build · what's in voice.md

The file that does the heavy lifting.

# Sophia's voice — for outbound writing

## Posture
- Direct, no fluff. Short sentences. Cut adverbs.
- Specific over abstract. Numbers over adjectives.
- Confident but not arrogant. Assume the reader is smart.

## Words I use
"architecture", "compounding", "leverage", "the math here is"

## Words I never use
"synergy", "unlock", "revolutionary", "I hope this finds you well"

## Email structure (cold)
1. One-line context: how I found you (specific)
2. Two-line observation: something specific about your situation
3. One question I'd ask
4. One sentence on what I do
5. Soft CTA: "worth 15 minutes Thursday?"

## Examples
[5 real emails — see voice-md-template handout]

The agent inherits your taste through examples, not adjectives. If you can't write 5 good examples, the agent can't either.

Build · Step 7/8 · the HITL gate

Human-in-the-Loop (HITL): the agent drafts. You send.

What it does

Drafts land in Gmail Drafts folder
   ↓
You skim · edit ~2 words · hit Send
   ↓
~10 seconds per draft
   ↓
~300 personalized emails / month
   ↓
~50 minutes of YOUR time / month

HITL = a checkpoint where a human reviews agent output before it goes external.

What it gives you

The 10× output math the workshop description promised.

Bandwidth scales · 90 min/day → 5–10 min/day on outbound
Voice never drifts · every send goes through your eyes
Track record builds · approved drafts = data for graduating up the trust ladder

10 hours/week back. Goes to product · calls · fundraising.

Build · Step 8/8 · sub-agent #3 · daily wrap

Daily digest + audit log.

The digest · 7:30am

Subject: Discovery Pulse — May 8

Today:
  · 5 new drafts queued
  · 3 follow-ups ready
  · 1 reply yesterday from @kbrant

Cost today:  $0.21
Top signal:  pricing-frustration (n=2)
Your move:   approve before 5pm

The audit log · Sheets

date · leads_found · drafts ·
  cost_usd · avg_score ·
  top_signal · errors

2026-05-07 · 30 · 5 · 0.21 ·
  7.4 · pricing · 0
2026-05-08 · 28 · 5 · 0.19 ·
  6.9 · hiring  · 0

Trust gets built through visible track record. After two weeks of high approval rates, you can promote the agent to Stage 3.

Build · Bonus · same architecture · different inputs

Follow-up sub-workflow.

SECOND CRON · daily 8am Query Gmail label "Sent — Discovery" ↓ For each thread: · elapsed days since last message · any reply received ↓ If 5+ days AND no reply: ↓ Read full thread context ↓ Sonnet 4.6 · draft a follow-up that references the original thread (NOT a copy-paste reminder) ↓ Gmail.createDraft → you approve

Same architecture as Step 6. Different inputs. Once you've built one agent, you've built five.

Build · Recap · zoomed-out view of what we just connected

What you just built · in plain terms.

1 · The brain

Claude API — Haiku 4.5 routes & scores · Sonnet 4.6 drafts in your voice. Cascade cuts cost 60–70%. Free: Llama 4 Scout on Groq.

2 · The eyes

web_search + Firecrawl — Claude finds prospects, Firecrawl extracts company sites. Free: HN Algolia + Reddit JSON + Jina Reader, no auth.

3 · The body

n8n — open-source orchestrator. Visual nodes wire up the steps. Self-host on a $5 VPS or use n8n.cloud trial.

4 · The memory

Google Sheets — three tabs: ICP (your config), Sent (idempotency), Runs (audit log). Edit the sheet, agent inherits.

5 · The voice

voice.md in Drive — your writing samples. Cached on every Sonnet call. Edit it; the next run sounds more like you.

6 · The gate

Gmail Drafts — the agent NEVER sends. You read each draft, edit, hit send. Stage 2 of the trust ladder.

To run · 4 free accounts: n8n.cloud · Anthropic ($5 credit) · Firecrawl (500 free) · Google. · To customize for your business · 4 things in your Sheet: voice.md (style) · icp_description (target) · signal_keywords (listen for) · subreddits (where they hang out). No redeploy.

Build · Summary · what we just used

Plumbing primitives — reusable building blocks every agent needs.

Primitive	Purpose	Where
Trigger types (cron · webhook · manual)	Wake the agent up	Step 1
Config-as-files	Editable behavior, no redeploys	Step 2
Sub-agent decomposition	Cheaper, narrower context per step	Steps 3 · 6 · 8
Cascade (Haiku → Sonnet → Opus)	60-70% bill cut	Steps 3 & 6
Schema-first JSON outputs	Connections that don't break	Steps 3 · 5
Idempotent dedup	Re-run safety	Step 4
HITL via Gmail drafts	Brand voice protection	Step 7
Audit log + cost meter	Trust through track record	Step 8

The spine of every agent you'll build for the next 5 years. Tools change. Patterns don't.

Build · 12 transferable patterns · yours to keep

Walked in for one agent. Walking out with 12 Lego pieces.

1Cron + webhook triggersStep 1

2Config-as-filesStep 2

3Cascade model selectionSteps 3 & 6

4Web search as agent eyesStep 3

5Targeted enrichmentStep 5

6Schema-first JSONSteps 3 · 5

7Idempotent dedupStep 4

8HITL via draftsStep 7

9Sub-agent decompositionSteps 3 · 6 · 8

10Audit log on every actionStep 8

11Cost metering per runStep 8

12Follow-up via thread contextSub-workflow

Combine them. Recombine them. Tools change. Patterns don't.

Build · The multiplier · same stack, different agents

7 other agents you can build < 30 minutes each.

Agent	What it does	Trigger	Output
Inbound Triage	Form fill → research → draft response	Webhook	Gmail draft
Competitor Pulse	Daily diff competitor changelogs	Cron daily	Email digest
Support Draft	Ticket → KB → drafted reply	Webhook	Helpdesk draft
Content Repurposer	Blog → tweet thread + LinkedIn + newsletter	Webhook	Drafts in Drive
Call Notes → CRM	Recording → structured notes → CRM update	Webhook	CRM entry
Investor Update	Metrics → narrative → draft email	Cron monthly	Gmail draft
Onboarding Personalizer	Signup → enriched welcome series	Webhook	Gmail sequence

Same architecture. Different trigger · search/extract · output. Ship a different one every week.

PART 05 · 6 MIN · TOKEN ECONOMICS + OSS

Same agent. $135/month or $8/month.
The difference is one weekend.

Three cost levers stacked. Plus the open-source growth playbook your tools used.

Cost · Lever 01 · the cascade

Sub-agent decomposition for cost.

Naive pattern

Sonnet for everything.

$18/month

at 100K input / 20K output daily

Cascade pattern

Haiku for routing · classify · extract · score · dedup
Sonnet for nuanced drafting
Opus only for hard edge cases

$6/month

60–70% reduction. Same workflow, same output quality. The most underused cost lever in 2026. Free path follows the same logic — Llama 3.1 8B Instant for cheap, Llama 4 Scout for the heavy work.

Cost · Roadmap · from FREE to fully paid

Same workflow · five cost tiers.

Tier 0 · FREE · zero installn8n.cloud trial · Groq Llama 4 Scout + 3.1 8B · HN+Reddit+Jina · 14 days · no card

$0/mo

Tier 1 · All-cloud SaaSn8n.cloud + Perplexity + Firecrawl + Sonnet API

~$135

$135/mo

Tier 2 · Hybridn8n self + Perplexity + Firecrawl Cloud + Sonnet

~$95

$95/mo

Tier 3 · Lean self-hostedn8n + SearXNG + Firecrawl all self · Sonnet API

~$15

$15/mo

Tier 4 · Maximum DIY+ Qwen 3.6-27B local · Ollama install required

~$8

$8/mo

Try Tier 0 free this week. No credit card. No install. Move to Tier 1 or 3 when you outgrow the trial.

Cost · Lever 03 · the local inflection

2026 changed the local hardware math.

Build	Price	Runs	Throughput
AMD Strix Halo (Framework Desktop)	$1,499 — $2,500	Llama 3 70B Q4	14–18 t/s
Mac Studio M3 Ultra 96GB	$3,999	Qwen 3.6-27B	17–18 t/s
RTX 5090 32GB	~$3,600	Qwen 3.6-27B Q4 · Gemma 4 31B	5,200 t/s prefill

Crossover math

Break-even vs API at ~2–3M tokens/day · ~12-month payback. Then 40–200× cheaper ongoing. Stack: llama.cpp (HIP / Vulkan / Metal) + Ollama or vLLM.

Revisit local when API bill > $200/mo · PII workloads · predictable throughput.

PART 06 · 3 MIN · HITL + PRODUCTIONIZATION

Maintain brand voice and ethical oversight
while scaling output 10×.

Direct from the workshop description. We covered HITL in the build · now what comes after, and how to graduate.

HITL · From human-in-the-loop today to full auto tomorrow

The 4-stage trust ladder.

Stage	What the agent does	What you do	Guardrails to add	Promote when
1 · Shadow	Runs alongside · no external action	Compare agent output to your own work	Schema validation · cost ceiling per run	≥ 90% agreement over 50+ runs
2 · Pending Review (today)	Drafts only · never sends	Approve / edit / reject each output	+ voice-distance check · audit log · do-not-contact filter	≥ 95% approval over 2 weeks
3 · Conditional Auto	Auto-fires high-confidence (score ≥ 8 + voice match ≥ 0.85)	Review low-confidence + 10% sample audit	+ rate limit per recipient · anomaly detection · sandbox isolation	Auto-error rate < 1% sustained 30+ days
4 · Full Auto	Fires all · alerts on exceptions	Weekly audit log review	+ A/B replay testing · drift detection · automatic rollback	Trusted like a senior teammate

Discovery Engine ships at Stage 2. Each row adds the guardrails of the row before. Graduate one stage at a time.

HITL · Real proof · Bain HR Services payroll agent · April 2026

From 70% to 98% accuracy through shadow testing.

Context

8 LangGraph subgraphs · 1M+ emails/year
Customer-facing autonomous
3K+ live shadow emails

Numbers

Started: 70% real-world accuracy
After 6 mo shadow: 98%
80% of engineering = scaffolding

The lesson

"The secret to successful agents in Enterprise is not the agents — it's the scaffolding around the agents."

— Sanjin Bicanic, Bain & Co · Apr 29, 2026

Same lesson at startup scale: the agent is 20%. Your dedup, HITL gate, voice.md, audit log — that's the 80%.

Close · 5 things to remember

What to remember when you walk out.

The REACT loop is in every agent.

Reason → Action → Observation → loop. When you debug an agent, one of these three is broken. Memorize the loop, not the framework.

Config-as-files. Not config-as-prompts.

voice.md in Drive. ICP description in a Sheet cell. Edit a file — the next run inherits. The single most important pattern in this hour.

Cascade your models. Cut 60–70% off the bill.

Haiku for cheap classification. Sonnet for the one step that needs voice nuance. Free path: Llama 3.1 8B Instant for cheap, Llama 4 Scout for the heavy work.

HITL via Gmail drafts.

The agent drafts. You send. ~10 seconds per draft. ~10 hours/week back. Brand voice protected. Stage 2 of the trust ladder is the default — earn higher autonomy through the audit log over weeks.

Same architecture · 7 other agents.

Inbound triage · competitor pulse · support draft · content repurposer · call notes → CRM · investor update · onboarding personalizer. Different trigger, different search, different output. Same six pieces.

Close · two ways to keep going · both free

Take this further.

1 · Architect's Office Hours

5 free 30-min audits for BSW attendees.

You bring your bottleneck. I'll tell you what to automate, what to leave alone, and which 3 tools to use.

cal.com/sophia-stein/architect-audit-bsw

2 · Agentic Architect Blog

agenticarchitect.ai/blog

Weekly deep-dives on agentic architecture for lean founders. n8n templates · cost teardowns · what's actually working in 2026.

agenticarchitect.ai/blog

Repo: github.com/sudosoph/bsw26-agentic-workflows · MIT · fork · ship yours

Bring an agent next month.

Boulder is the lean-founder agentic capital, and you're already here.

Boulder AI Builders · Boulder Startup Week · Silicon Flatirons (CU Boulder)

Thanks for the hour.
Sophia Stein · AI Architect
Local LLM & Agentic Workflows
agenticarchitect.ai/blog
sophia@agenticarchitect.ai

ArchitectingAgentic Workflowsfor the Lean 2026 Startup

LIVE

What is an agent, really?

The REACT loop.

Why this matters when you debug

An agent has three layers. Pick each one deliberately.

What changed in 2026that makes your agent possible?

Models available today.

The new founder math.

2023

2026

Real proof, not theory

Tasking, concretely

What this hour delivers.

Blueprint for Autonomy

Live build-along

HITL standard

Which 20% of your manual tasksshould be 80% automated?

The 20/80 Blueprint.

Volume

Determinism

Reversibility

Green / Yellow / Red — pick your first agent.

GREEN · automate now

YELLOW · with HITL

RED · don't automate

Your turn. Five tasks.

Two axes · four tracks. Everyone in this room can build this.

🆓 FREE · zero card

💎 PAID · ~$6/mo

🧑‍💻 MANUAL build

⚡ AUTOPILOT setup

The Founder'sDiscovery Engine.

What we're building.

Nine steps · three sub-agents · one workflow.

Triggers — how your agent wakes up.

Config-as-files. Not config-as-prompts.

Discovery: one node, one API, one call.

Why Haiku, not Sonnet

Why Claude search, not Perplexity

Dedup against state.

Progressive enrichment.

About Firecrawl

Drafting — Sonnet 4.6 + voice.md.

Why Sonnet here

Why 5 examples

The file that does the heavy lifting.

Human-in-the-Loop (HITL): the agent drafts. You send.

What it does

What it gives you

Daily digest + audit log.

The digest · 7:30am

The audit log · Sheets

Follow-up sub-workflow.

What you just built · in plain terms.

1 · The brain

2 · The eyes

3 · The body

4 · The memory

5 · The voice

6 · The gate

Plumbing primitives — reusable building blocks every agent needs.

Walked in for one agent. Walking out with 12 Lego pieces.

7 other agents you can build < 30 minutes each.

Same agent. $135/month or $8/month.The difference is one weekend.

Sub-agent decomposition for cost.

Naive pattern

Cascade pattern

Same workflow · five cost tiers.

2026 changed the local hardware math.

Crossover math

Maintain brand voice and ethical oversightwhile scaling output 10×.

The 4-stage trust ladder.

From 70% to 98% accuracy through shadow testing.

Context

Numbers

The lesson

What to remember when you walk out.

The REACT loop is in every agent.

Config-as-files. Not config-as-prompts.

Architecting
Agentic Workflows
for the Lean 2026 Startup

What changed in 2026
that makes your agent possible?

Which 20% of your manual tasks
should be 80% automated?

The Founder's
Discovery Engine.

Same agent. $135/month or $8/month.
The difference is one weekend.

Maintain brand voice and ethical oversight
while scaling output 10×.