tinyctl.dev

Paperclip Pricing Guide (2026): What Running AI Agents Actually Costs

A real-numbers breakdown of what a Paperclip agent company costs to operate monthly. Model selection, heartbeat frequency, agent count — we cover the levers that determine your bill and the mistakes that inflate it.

Published 5/12/2026

Paperclip is free. Your AI model bill is not.

That distinction matters because most people evaluate Paperclip, see “open-source” and “self-hosted,” and assume the whole thing is free. It is — right up until your agents start running. Then you’re paying Claude or OpenAI for every heartbeat cycle, every issue pickup, every article draft, every status report.

The real question isn’t “what does Paperclip cost?” It’s “what does running 5 agents on a continuous heartbeat schedule cost, and how do I keep that number sane?”

We’ve been running a 5-agent Paperclip company for this site. Here’s what we’ve learned about the actual economics.


The Four Cost Levers

Your monthly Paperclip bill is determined by exactly four variables. Every optimization comes back to tuning one of these.

1. Agent Count

Every agent you add multiplies your base cost. A 3-agent company costs roughly 60% of what a 5-agent company costs. A 10-agent company costs roughly double.

The temptation is to add specialized agents for every function — a dedicated SEO auditor, a separate link checker, a deployment monitor. Each one sounds useful in isolation. Combined, they compound your bill.

The right number for most operations is 3-5. Below 3, you don’t have enough specialization to produce quality output. Above 5, you’re paying for coordination overhead that doesn’t translate to proportionally more output.

2. Model Tier Per Agent

This is the single biggest cost lever and the one most people get wrong.

Not every agent needs the most expensive model. Your CEO — the agent that delegates, reviews, and makes strategic decisions — benefits from the most capable model available. Your SEO Writer, which executes against a well-defined brief, does not need the same level of reasoning capability.

The cost difference is dramatic:

Model TierApproximate Input CostApproximate Output CostBest For
Claude Haiku~$0.25/M tokens~$1.25/M tokensStatus checks, simple lookups
Claude Sonnet 4.6~$3/M tokens~$15/M tokensContent production, code generation
Claude Opus 4.6~$15/M tokens~$75/M tokensCEO orchestration, complex reasoning

Running all 5 agents on Opus costs roughly 5x what a smart split costs. We don’t do that.

The exact split — which model on which agent, and why the obvious assignment is wrong — is one of the highest-leverage optimizations we’ve found. It’s baked into our company templates.

3. Heartbeat Frequency

Heartbeat frequency is how often each agent wakes up and checks for work. This directly multiplies your token usage.

An agent with a 30-minute heartbeat runs 48 times per day. The same agent with a 2-hour heartbeat runs 12 times per day. That’s a 4x cost difference for the same agent doing the same quality of work — just checking less frequently.

The tricky part: heartbeats that are too infrequent create pipeline stalls. If your CEO only checks every 4 hours, a Writer that finishes an article at 9 AM won’t get its next assignment until 1 PM. Four hours of idle compute.

Finding the right frequency for each agent’s role — not too fast (wasting money on empty checks), not too slow (creating bottlenecks) — is a calibration problem. The wrong default settings can cost you 3-5x more than necessary.

4. Context Window Size

Each heartbeat, an agent loads its context: open issues, recent comments, workspace state, instructions. More context means more tokens consumed per cycle.

A board with 50 open issues costs more per heartbeat than a board with 5. An agent with a long AGENTS.md instruction file costs more per cycle than one with a concise one.

This is a subtle cost driver because it grows over time. Your first week is cheap — few issues, short history. By month three, your board has hundreds of resolved issues and the context window is loading more data per cycle.

Managing this effectively requires understanding how Paperclip constructs agent context and what you can do to keep it lean. This is one of those things that’s not obvious until you see a cost spike at month two.


What Our Company Actually Costs

We run a 5-agent company producing this site: CEO, Content Strategist, SEO Writer, Site Engineer, Revenue Ops.

Monthly Cost Range

ScenarioMonthly CostNotes
Optimized (our current setup)$300-450Smart model split, tuned heartbeats
Naive defaults$800-1,200All agents on Opus, frequent heartbeats
Aggressive optimization$150-250Minimum viable heartbeats, all Sonnet

The optimized setup produces 15-30 articles/month with automated deployment, editorial pipeline management, and revenue tracking. The aggressive optimization produces similar volume but with slower pipeline throughput — articles take longer to move from brief to published because agents check in less frequently.

Cost Per Article

At our current production rate and cost structure:

  • Cost per published article: $10-15 (including all agent overhead, not just the Writer)
  • Freelance equivalent: $150-500 per article of comparable depth and SEO optimization
  • Break-even: 1-2 affiliate commissions per article covers the production cost

The economics work because the marginal cost of each additional article is low once the company infrastructure is running.


The Expensive Mistakes

These are the cost patterns we discovered the hard way. Each one inflated our bill significantly before we identified and fixed it.

Running the Wrong Model on the CEO

The CEO agent makes every delegation decision. A weaker model on the CEO creates cascading quality issues — bad briefs lead to bad articles lead to rewrites lead to more heartbeats. The CEO is the one role where model quality has a multiplier effect on total cost.

Heartbeat Intervals That Don’t Match Workload

Running all agents at the same frequency is the default, and it’s wrong. The CEO needs frequent heartbeats to keep the pipeline moving. The Revenue Ops agent, which runs weekly reports, does not need to check in every 30 minutes.

Matching heartbeat frequency to actual work cadence is one of the simplest optimizations and one of the most impactful.

Duplicate Issue Storms

Without proper guardrails, a CEO agent can create the same issue multiple times across heartbeat cycles. We had a cycle where 19 duplicate article assignments were created in a single heartbeat. Each duplicate generates downstream work — agents pick them up, start writing, and produce near-identical output.

Preventing this requires specific CEO configuration. The cost of not preventing it: 5-10x your expected compute for that cycle.

Idle Heartbeats on Blocked Agents

An agent that’s blocked on external input (waiting for a credential, waiting for human approval) still runs heartbeat cycles. Each cycle, it checks the board, finds it’s still blocked, and goes back to sleep — consuming tokens for nothing.

Configuring agents to reduce or pause heartbeats when blocked is an optimization that most setups miss.


Comparison: Paperclip vs. Alternatives

PlatformMonthly Infrastructure CostOutput ModelBest For
Paperclip$300-450 (model APIs)Continuous autonomous operationOngoing content/dev businesses
CrewAIPer-run ($0.50/run on AMP)On-demand task executionBounded workflows
Hiring freelancers$2,000-5,000Human-quality, slow throughputOne-off projects
Manual ChatGPT$20-200/month subscriptionOne article at a timeIndividual creators

Paperclip’s cost structure makes sense when you need continuous, autonomous output. If you only need to run a pipeline occasionally, the heartbeat model works against you — you’re paying for idle cycles. See our Paperclip vs CrewAI comparison for a detailed architecture breakdown.


How to Optimize Your Setup

The difference between a $1,200/month Paperclip company and a $300/month one producing the same output comes down to three decisions:

  1. Model assignment per agent — which agent gets which tier
  2. Heartbeat calibration — frequency matched to each role’s actual work cadence
  3. CEO guardrails — preventing the duplicate-issue and coordination-loop failure modes

Getting these right on the first try saves weeks of expensive trial and error. Getting them wrong costs real money — we spent over $200 in wasted compute during our first week alone diagnosing and fixing these exact problems.

We’ve packaged the specific configurations that solved each of these into ready-to-import company templates. One import, optimized cost structure from day one.

Get pre-optimized Paperclip company templates →


Getting Started

If you’re evaluating Paperclip and want to understand the full picture before committing:

The fastest way to get the cost structure right from day one: browse our Paperclip company templates →