The Honest Costs of an AI Outbound System: What $5K/Month Actually Buys
White Paper

The Honest Costs of an AI Outbound System: What $5K/Month Actually Buys

Jake McCluskey
Back to white papers

The pitch you've been hearing

You've seen the LinkedIn DM, or sat through the discovery call. The setup is consistent across maybe forty agencies running the same play right now. "We book 30 to 120 qualified meetings per month using AI-powered cold outreach. No SDR hires, no learning curve, you just show up to the calls. Done-for-you. Three to seven thousand a month, six month commitment, results in 60 days." The deck has the same screenshots. Smartlead inbox views. A Clay table with green checkmarks. A Calendly link with five booked slots in one week. Sometimes a testimonial from a client who closed $180K in pipeline in 90 days.

It is compelling for a real reason. Cold outbound used to require a five-person team: a list-builder, two SDRs, an ops person to babysit deliverability, and a manager. The AI tooling stack has compressed that team down to roughly 1.5 humans plus software. The agencies running this play are arbitraging the gap between what the tools cost and what an in-house team would cost. That arbitrage is real. The question is how much of the savings the agency keeps versus how much shows up in your CAC.

The thing you'll notice on every discovery call: the answer to "how much does it actually cost to run this?" is always vague. The agency tells you the retail price. They do not tell you their cost stack. They do not tell you what your blended cost-per-meeting is at the volume you'd actually buy. They do not tell you the conversion math from "qualified meeting" to closed revenue at your specific ACV. That is the gap this paper closes. If you're a founder or COO looking at one of these pitches and wondering whether $5K a month is a smart spend or a slow leak, the next 4,000 words are the math you didn't get on the call.

The cost stack underneath the agency price

Here is what an AI outbound agency is actually buying with your $5K. Numbers are mid-2026 retail, USD, for a real-volume operation sending 30,000 to 80,000 emails per month from 8 to 15 mailboxes. Smaller volumes drop these numbers proportionally. Larger ones break the upper end of every tool's pricing.

Line item Monthly cost What it does
Smartlead or Instantly (sender) $97-397 Sends the emails, manages warmup schedule, rotates inboxes
Apollo (data + intent) $59-199 per seat Contact data, technographics, intent signals
Clay (enrichment + waterfall) $349-2,700 Multi-source enrichment, list cleaning, AI-personalized lines
Secondary domains (registration) $4-8 (5-10 domains, amortized monthly) Burner domains for cold sending so the primary stays clean
Google Workspace seats $42-315 Mailboxes at $7-21 each, 6-15 seats typical
Domain warmup tools $30-100 Inbox-rotator services that send conversational mail to age the domains
Dialer or AI voice (optional) $50-150 For accounts that pair email with phone follow-up
List enrichment overage $300-1,500 Clay credits run out fast at 50K+ contacts/mo. Budget extra.
Copywriter (real human) $1,500-4,000 If they have one. Many don't.
VA or junior SDR (inbox triage) $800-2,500 Reads replies, filters interested from out-of-office, books the call
Recurring monthly total $3,234-11,861 Real-volume operation, not a starter setup
Tech setup labor (one-time) $1,500-9,000 20-60 hours at $75-150/hr to build the stack

Two things to notice. First, the recurring number for a low-volume client is around $3,200 a month. The retail price is $5,000. That is a 36% gross margin, which after the agency's own overhead, fixed staff, and opportunity cost is a single-digit net margin. Most of the agencies running this play are not getting rich on a $5K client. They make their money on volume and on the clients who renew past month six.

Second, the high end of the recurring stack is almost $12,000 a month for a real-volume operation. If an agency is charging you $5K and promising 80 to 120 booked meetings, one of two things is true. Either the volume promise is fiction (the meetings won't show up), or they are losing money on you (which means they will under-deliver to manage the loss, or churn you at month three). Both outcomes look the same from the buyer side: the meetings dry up by month four.

The honest middle is this. A boutique agency running you at 30 to 50 meetings a month is probably spending $4,000 to $5,500 of recurring cost to deliver, plus the one-time setup that they amortize across the first six months. Their retail of $5K to $7K leaves a 20% to 50% gross margin. That is a reasonable margin for a service business. The marketing implies it is much higher, because the marketing is selling you on a software arbitrage that does not actually exist. The arbitrage is on the labor side: they have one operations lead managing eight clients, instead of you hiring one ops person managing only your stack.

That is the value. You are buying labor scale, not software scale. Once you see it that way, the rest of the decision becomes simpler.

What "30-120 qualified meetings" actually means

The word "qualified" is doing very heavy lifting in that sentence. In agency-speak, "qualified meeting" usually means: someone replied positively to a cold email, the agency's inbox VA confirmed a time, and the prospect actually clicked the calendar link. That is the entire qualification bar. It does not mean the prospect has budget. It does not mean they are in-market. It does not mean they fit your ICP beyond having a job title and a company size that matched the filters.

Run the realistic funnel from there. If the agency books you 30 meetings in a month, here is what typically happens:

  • Show-up rate: 50% to 70%. Cold-booked calls have terrible attendance. People agree to take a meeting because the email was clever, then they reread their week on Sunday night and the cold call is the first thing that gets cancelled. Realistic: 18 to 21 calls actually held out of 30 booked.
  • Sales-qualified rate: 15% to 30% of attended calls. The rest are tire-kickers, wrong-fit, or polite "we're not in market right now" conversations. Realistic: 3 to 6 SQLs per month from 18 to 21 calls.
  • Close rate: 10% to 25% of SQLs over the next 60 to 120 days. Cold-sourced opportunities close worse than referrals or inbound. Realistic: zero to two closed deals per month from cold AI outbound, with most of the actual closes happening in months three through six.

So the honest version of "30 booked meetings a month" is roughly: zero to two closed deals per month, on a 60 to 120 day lag, with the first deal probably landing in month three or four of the engagement.

This is fine if your average contract value is $20K to $50K. Two closes a month at $30K ACV is $60K of new business against $5K of agency spend. That is a 12:1 return, and the math gets better as the program matures and you optimize the offer.

It is a disaster if your ACV is $5K. Two closes a month at $5K ACV is $10K of new business against $5K of agency spend. Your gross margin on the deal has to be over 50% just to break even on the agency, before you've paid for any other GTM cost, before you've paid yourself, and before you've paid for the AE who has to actually take all those calls. At $5K ACV, the agency is a money pit unless you are using it for top-of-funnel signal generation feeding a stronger nurture motion, which is a different business model than what is being sold.

The single most important number in this whole conversation is your blended fully-loaded ACV against your blended fully-loaded cost-per-meeting. If a meeting costs you $300 (fifteen meetings a month, $5K agency, $1K of your time), and your ACV is $30K with a 12% close rate from cold meetings, your cost per closed deal is $2,500 against $30K of revenue. Healthy. Same numbers at $5K ACV, you're at $2,500 cost per $5K deal. Half your gross margin gone before you've delivered the work.

The four things the agency does that you actually can't replicate cheaply

Push back hard on the agency on margin and contract terms. But do not pretend the work is trivial. There are four pieces of this that matter and that are genuinely hard to replicate in-house in your first six months. These are the real reasons to consider DFY at all.

Domain reputation management at scale

This is the single hardest part of cold email in 2026, and the part that breaks most in-house attempts in months two and three. Cold sending burns domain reputation. Once a domain's reputation drops below the threshold Gmail and Outlook use, your messages start landing in spam. Once a domain is on a real blacklist (Spamhaus, SURBL), it is dead and you have to register a new one and warm it for 4 to 6 weeks before sending. A real cold operation runs 8 to 15 domains in rotation, with continuous warmup, tight per-domain volume caps (40 to 60 sends per mailbox per day, no more), and active monitoring of placement scores. The agency has built infrastructure to do this. They know which warmup tools actually work versus which are scams. They have SOPs for what to do when a domain starts dropping. You will not figure this out in week one. If you try to and you mess it up, you blacklist your primary corporate domain by accident and your sales team can't email anyone for 30 days. That is a real risk worth paying to avoid.

List quality and de-duplication across mailboxes

The boring expensive part. A bad list is the single biggest predictor of a campaign tanking, and most cold lists are bad in the same predictable ways: stale data, duplicate contacts at the same company, wrong-title hits, wrong-size companies, GDPR-protected EU contacts mixed in with North American ones. The agencies with the cleanest lists are running multi-source waterfall enrichment in Clay, then layering AI verification on title and seniority, then de-duping across all 15 mailboxes so the same prospect doesn't get the same email twice from "Sarah at Acme" and "Mike at Acme." This work is unsexy and easy to get wrong. The agency has done it 200 times. You are doing it for the first time.

Inbox triage at volume

When 15 mailboxes are sending 800 messages a day each, you get back 200 to 400 replies a day. Maybe 5% of those are interested. The other 95% are "wrong person, try Bob," "remove me," automated out-of-office, hostile responses, and bounces. Someone has to read all of them, route the interested ones to the calendar, route the "try Bob" ones into the next campaign, suppress the unsubscribes (legally required), and not lose track. A good agency has a VA doing this for two hours every morning. Doing it yourself or with your salespeople doing it pulls them off selling.

Compliance discipline

CAN-SPAM in the US, CASL in Canada, GDPR in the EU, the new state-level laws in California and Colorado. Every cold message has to honor unsubscribes within 10 business days, has to include a real physical mailing address, can't have misleading subject lines, can't email EU residents without explicit basis. The agency has SOPs for all of this. You'd write them from scratch, probably miss something, and the first time you get a CAN-SPAM complaint your bank gets a letter. Buying their compliance posture is buying down a real legal risk.

These four things are why "just run it in-house, it's just software" is a lie. The software is the easy part. The operational discipline around the software is what you are paying for.

The four things the agency does that you should NOT pay them for

Here is where you push back hard. There are four pieces of the agency offering where you are being charged a premium for work that is either trivial or that you should be doing yourself anyway. If an agency tries to tell you these are part of their value, you have a margin lever.

Writing the offer or value proposition

This one matters most. Nobody on the agency's team understands your ICP, your differentiation, or your buyer's actual pain better than you do. The offer they write for you will be the same generic three-sentence pitch they wrote for the 12 other clients in your category. "Help [persona] [achieve outcome] without [pain point]." It is a Mad Lib. The clients with the best cold campaigns wrote the offer themselves, gave it to the agency to run, and treated the agency as a distribution channel for messaging the founder owns. If the agency is leading with "we'll write your messaging for you," you are paying agency rates for the lowest-value part of their service.

Booking calls onto your calendar

This is a $30 a month problem dressed up as a $400 a month service. A Calendly link, a Zapier flow that pings your CRM when a slot is booked, and a confirmation email template is the entire stack. The agency charges as if they were a full SDR doing live booking. They are not. They are using the same Calendly you would use, plus a VA pasting the link into reply emails. Insist on either using your own Calendly or paying a clearly broken-out fee for booking that doesn't exceed $200 per month at any volume.

Reporting dashboards

Every agency in this space sells a "client portal" or a "weekly reporting dashboard" as if it were proprietary infrastructure. It is a Notion page or a Google Sheet that pulls open rates, reply rates, and meetings-booked from Smartlead's existing dashboard. The data is already in the tool. The dashboard is reformatted, not generated. If the agency wants to charge you an "analytics fee," the answer is no. Ask for raw export access to the underlying inbox data and your sales ops person can do it in two hours.

"Strategy calls" with you each week

The 60-minute weekly strategy call is the most overcharged piece of the entire engagement. You are paying somewhere between $400 and $800 for that hour, in the form of allocated agency time. What happens on the call: they read the dashboard back to you, ask if you want to tweak the subject line, and tell you they are testing a new opening hook next week. None of that requires a meeting. A 5-bullet email update once a week, plus a real working session every six weeks when something is actually changing, gets you the same result. Push back on the cadence. Most agencies will fold to "monthly check-ins plus async updates" if you ask.

The cost-side of the engagement looks very different once you strip these four lines out. You are buying domain reputation, list operations, inbox triage, and compliance, all of which are real. You are not buying offer development, calendar booking, dashboard formatting, or check-in meetings. The $5K pitch usually rolls all of that into one number on purpose, because the soft-margin items are where the actual gross margin sits.

Decision framework: when DFY makes sense vs. when in-house wins

Three variables decide this question. Ignore everything else. ACV, your existing sales team's bandwidth, and your time horizon.

DFY wins when: ACV is over $15K, you have never run cold outbound and have nobody who knows how, you can commit to at least 6 months of spend before you judge results, and you have at least one closer who can absorb an extra 8 to 12 booked meetings per week without dropping their existing pipeline. If three of those four are true, DFY is probably the right call. The agency's labor scale and their existing operational muscle will get you to first revenue faster than building it yourself.

In-house wins when: ACV is between $5K and $15K, you have one person who can own outbound as their primary job (not a side responsibility), you want compounding learning over speed-to-first-meeting, and your ICP is unusual enough that a generalist agency will not understand the buyer. The compounding learning point matters most. An in-house program in month 12 is worth 3x what an in-house program in month 3 is worth, because the operator learns the buyer language, the objections, and the timing windows. A DFY agency does not give you that learning. They give you meetings. When you cancel them, the meetings stop and you have nothing.

Neither makes sense when: ACV is under $5K, you have less than 6 months of runway, or your offer has not closed any prospects yet at all. In all three cases the right answer is to go work on the offer or the runway, not on outbound distribution. Cold outbound multiplies whatever you've got. If the offer doesn't close warm prospects, it will close zero cold ones. If you have less than 6 months of runway, you cannot wait the 90-day ramp window before outbound starts producing real revenue.

Three worked examples to make this concrete.

Example one. SaaS at $200/mo ACV. Annual contract value $2,400. Realistic close rate from cold meetings: 8%. To pay for $5K/mo of agency spend, you need 26 closed deals per month. To get 26 closed deals you need ~325 SQLs, which means 1,600+ booked meetings, which is 5x what any agency promises. The math does not work at any volume. Cold outbound is not the right channel for sub-$500/mo SaaS. Use content, partnerships, or PLG instead.

Example two. Consulting practice at $50K/yr per engagement. Realistic close rate from cold meetings: 15%. One closed deal a month covers the agency 10x over. The question is whether you have the delivery capacity to take on one new $50K engagement per month. If yes, DFY at $5K/mo is one of the highest-ROI marketing spends available to you. If your delivery is already maxed at three concurrent engagements and you can't take on more without hiring, you are buying meetings you can't service, which will burn referrals and reputation. Build delivery capacity before turning on cold.

Example three. Agency or services firm at $250K average annual contract. Two closed deals per year covers the agency program plus all overhead. The risk is not ROI. The risk is brand: cold outbound at this ACV is sometimes a negative signal to enterprise buyers. The right call is usually a hybrid. Run the agency on a tighter, more curated list (1,000 to 3,000 contacts max), with messaging the founder writes personally, treating outbound as one of three channels alongside referrals and content. Pay the agency primarily for the operational discipline, not the volume.

What honest pricing looks like (and the 5 questions to ask any DFY agency)

If you decide DFY is the right call, the pricing conversation needs to look different from the pitch deck. Get these five questions answered in writing before you sign anything. The answers tell you whether you're buying labor scale or buying a marketing illusion.

1. What is your average client's ACV, and how many of those convert from your meetings?

The honest answer names a specific ACV range and a specific conversion rate (e.g., "our typical client closes 12-18% of cold meetings at a $25K to $60K ACV range"). The dishonest answer is a case study about one anomalous client who closed $300K in 90 days. Push for the median, not the highlight. If they refuse to share the median, walk.

2. Can I see the deliverability scores on the domains I'd be using? What's your churn-out-of-domain rate?

Every cold operation kills domains. Healthy rate is 1 to 2 domains per quarter per client. If they tell you they "never lose domains" they are lying or they are not sending real volume. If they refuse to share placement scores, they are hiding a deliverability problem. Honest answer looks like: "we run 12 domains per client, we replace 1-2 per quarter, current placement score across the fleet is 8.2 out of 10, here's a screenshot." That is what you are looking for.

3. What software is included vs. what gets billed back to me?

Many agencies pass the Smartlead, Apollo, and Clay costs through to the client at retail or a markup. That is reasonable if disclosed. It is hostile if hidden. Ask line by line. If Clay credits or domain registration costs are pass-through, get the cap in writing.

4. What's your cancellation policy if my numbers don't hit at month 3?

The standard six-month commitment exists because it takes 60 to 90 days to know if a campaign is working. That part is fair. What is not fair is no off-ramp at month three if the meeting count is half of what was promised. Honest agencies will agree to a "results check" at day 60 or 90 with a defined number that triggers a reduced-fee continuation or a clean exit. If they refuse any version of this conversation, the contract is structured for them, not for you.

5. Who actually writes the copy: a human, an AI, or a templated prompt?

The honest answers, in descending order of value: "a senior copywriter on our team writes it from scratch with input from your founder," "we use a templated framework with AI-personalized first lines from Clay and human review," "we have a copywriter who customizes templates for each client." All three can work. The dishonest answer is silence about the actual process while implying senior human work. If the copy is being generated by ChatGPT off a generic prompt, you are paying agency rates for prompt-engineering you could do in an afternoon.

How to read each answer. Sales people on the agency side are trained to deflect questions one and four with case studies and "every situation is different" hedges. The right move is to ask the question, listen to the answer, and ask a tighter follow-up if the answer was vague. "What was the median ACV across the last 10 clients" is harder to dodge than "what is your typical client like." "What is the contract clause for under-performance at day 90" is harder to dodge than "what's your cancellation policy."

The one-page action plan

Three paths depending on where you are.

If you are considering an agency: Get the five questions above answered in writing. Ask for two reference calls with current clients in your ACV range, not their highlight reel. Ask for a 60-day results check with a defined "reduce fee or exit" trigger built into the contract. Insist on raw access to the inbox data, not just the agency's dashboard. If they push back on any of this, they are not the right partner. Good agencies welcome these questions because the answers are a competitive moat for the ones running operations honestly.

If you want to go in-house, a 90-day rollout:

  • Weeks 1-2: stack setup. Buy 8 secondary domains, set up 8 Google Workspace mailboxes, configure SPF/DKIM/DMARC, install Smartlead or Instantly, start warmup. Do not send anything in week 1.
  • Weeks 3-4: list build. Pull 5,000 to 10,000 contacts in Clay or Apollo against tight ICP filters. De-dupe. Verify. Write three offer variants you genuinely believe in. Have a friendly customer read them.
  • Weeks 5-8: send and iterate. Launch at low volume (50 sends per mailbox per day) and watch placement, reply rate, and meetings booked weekly. Adjust copy on what's failing, kill the worst variant, double the volume on the best.
  • Weeks 9-12: scale. Expand to full volume (1,000 to 1,200 sends/day total), add a second list, hire a part-time VA for inbox triage. By day 90 you should have ~10 to 25 booked meetings per month and the operational rhythm to keep going.

If you are not sure: book a free 30-minute scoping call at /schedule. We will walk through the math for your specific ACV, sales team capacity, and runway, and give you a direct call on whether DFY, in-house, or neither is the right move for the next two quarters. We do not run cold outbound as a service, so there is no incentive to push you in either direction.

The shortest version of this paper: AI outbound is a real channel that costs real money to run. The agencies pitching $5K/mo are not lying about the meetings. They are being vague about the conversion math from meeting to revenue at your specific ACV, and they are bundling four genuinely valuable services with four genuinely overcharged ones. Get the math right before you sign, and the channel is one of the best you have. Get the math wrong, and you'll spend $30K over six months for a list of names that never closed.

Common questions

Frequently asked

Is a $5K/month AI outbound agency worth it for a $50K-ACV consulting practice?

Almost always yes, if your delivery has capacity for the inbound. At $50K ACV with a 12-15% close rate from cold meetings, one closed deal a month covers the agency roughly 10x over. The constraint is delivery, not lead-gen. Build delivery capacity to handle 8-12 incremental engagements per year before turning the program on, otherwise you'll burn referrals when you can't service the new pipeline.

What's the actual cost to run AI cold email in-house?

Around $1,200 to $2,500 per month for a real-volume operation, plus 15-25 hours per week of one operator's time. The software stack (Smartlead, Apollo, Clay, Google Workspace seats, domain warmup) runs $800-1,500 monthly. Domains and registrations add $50-100. The gap between in-house and DFY ($3,500-7,000/mo retail) is paying for someone else's labor scale and operational SOPs, not for the software.

How do I tell if a lead-gen agency is overcharging?

Ask for the median ACV and median conversion rate across their last 10 clients (not the highlight reel). Ask whether software costs are pass-through or marked-up. Ask what the contract clause says about under-performance at day 90. If they deflect any of those three questions with case studies, the price is structured for them. A fair retail margin on real-volume cold outbound is 25-40% above their actual cost stack, so $4,500-6,500 retail on $3,500 of recurring cost is honest.

What questions should I ask before hiring an AI lead-gen agency?

Five: (1) What's your median client ACV and conversion rate? (2) Can I see deliverability scores on the domains I'd use, plus your domain churn rate? (3) What software is included vs. billed back? (4) What's the contract clause if numbers miss at month 3? (5) Who writes the copy: a human, AI, or a templated prompt? The honest answers name specific numbers and processes. Vague answers mean you're being sold marketing, not operations.

Can I do AI cold outreach without getting my domain blacklisted?

Yes, but never send cold from your primary corporate domain. Buy 5-10 secondary domains (e.g., yourcompany-team.com, getcompany.com), warm them for 4-6 weeks, and keep per-mailbox volume under 50-60 sends per day. Use a warmup tool to keep reputation healthy. Monitor Google Postmaster scores weekly. If a domain drops below the threshold, pull it from rotation and replace it. This is the single hardest operational discipline in cold email and where most in-house attempts fail.

How long until AI outbound starts producing meetings?

Days 1-30: zero, because domains are warming and lists are building. Days 30-60: first 5-15 meetings if the offer is right, often messy and unconverted. Days 60-90: a real cadence of 15-40 meetings per month with 2-6 SQLs. The first closed deal usually lands in months 3-4, sometimes month 5 for longer sales cycles. Anyone promising meetings in week one is either lying or sending from already-burned domains.

When does Done-For-You lead generation NOT make sense?

Three cases. First, ACV under $5K: the math doesn't work at any volume the agency can deliver. Second, less than 6 months of runway: cold outbound has a 60-90 day ramp and you can't wait. Third, an offer that hasn't closed any prospects yet: cold multiplies what you have, so a non-converting offer produces zero closes from cold just as it does from referrals. Fix the offer or the runway first. Outbound second.

READY TO IMPLEMENT

Want to talk through this in your business?

The paper above is the thinking. Let's spend 30 minutes on what it would actually look like to ship in your shop, no pitch, just a real scoping conversation.

The Honest Costs of an AI Outbound System