They Quoted 85K, Then Ate 57K in Overruns

A boutique AI agency in Austin quoted a client $85,000 for a document classification system. The project ended up costing $142,000 to deliver. The data was messier than expected, the model required three additional training iterations, and the client's integration environment had undocumented dependencies that doubled the deployment timeline. The agency ate $57,000 in overruns and nearly lost their most experienced engineer, who was frustrated by the constant firefighting. The worst part: every single one of those overruns was predictable if they had used a budgeting process designed for AI work.

This is not an unusual story. Research from McKinsey suggests that AI projects run over budget 60-80% of the time, and the average overrun is 45% of the original estimate. The problem is not that agencies are bad at estimating. The problem is that they are using budgeting processes designed for predictable software development on inherently unpredictable AI work.

Let us build a better process.

Why Standard Project Budgets Fail for AI Work

Traditional project budgeting works like this: define the scope, estimate the hours, multiply by the rate, add a margin, and quote the client. This works reasonably well for building a web application or implementing a CRM integration because the work is well-understood and relatively predictable.

AI projects break this model in three fundamental ways.

The Data Variable

In traditional software, your input is well-defined. A user submits a form, and you process the fields. In AI projects, your input is data, and data is inherently messy. You might budget 40 hours for data preparation and discover that 30% of the records have inconsistent formatting that requires custom parsing logic. Or you might find that the training dataset is biased in ways that require collecting additional data. These are not edge cases. They are the norm.

The Iteration Variable

Building a traditional feature follows a roughly linear path: design, build, test, deploy. Building an AI model follows an experimental path: try an approach, evaluate results, adjust the approach, evaluate again. You might try five different model architectures before finding one that meets the performance threshold. Each iteration takes time and compute resources that are difficult to predict before you start.

The Integration Variable

AI models do not exist in isolation. They need to integrate with existing systems, handle edge cases gracefully, and perform reliably under production conditions. The gap between a working prototype and a production deployment is often larger for AI systems than for traditional software, and that gap is notoriously hard to estimate.

The Three-Layer Budget Framework

Instead of a single budget number, build every AI project budget with three layers: a base budget, a contingency layer, and a scope expansion reserve.

Layer 1: The Base Budget

The base budget covers the work you can estimate with reasonable confidence. This includes:

Discovery and requirements gathering. Meeting with stakeholders, understanding the business problem, defining success criteria. This is predictable work.
Data assessment. An initial review of available data, quality checks, and feasibility analysis. Budget enough time to actually look at the data, not just hear about it.
Known engineering work. Infrastructure setup, API development, monitoring and logging implementation, deployment pipeline creation. This is standard software work that you can estimate normally.
Project management and communication. Status meetings, documentation, client presentations. Budget this explicitly rather than hoping it gets absorbed.
Testing and QA. Functional testing, performance testing, security review. Again, predictable and estimable.

For the base budget, use your normal estimation process. Break the work into tasks, estimate hours per task, multiply by your blended rate. This layer should feel solid.

Layer 2: The Contingency Layer

The contingency layer covers the work that is inherently uncertain. This is where AI projects differ from traditional projects, and this is where most agencies underbudget.

Data preparation contingency (30-50% of estimated data prep time). Whatever you think data cleaning will take, add 30-50%. If the client's data has never been used for ML before, go with 50%. If you have worked with similar data before, 30% might suffice.

Model development contingency (40-60% of estimated modeling time). Model training and iteration is the most unpredictable phase. If you are building a novel model architecture, budget 60% contingency. If you are fine-tuning an established approach, 40% may be enough.

Integration contingency (20-30% of estimated integration time). Production deployment surprises are common but usually bounded. Legacy system APIs that behave differently than documented, latency requirements that force architectural changes, or data pipeline reliability issues.

Compute cost contingency (25-40% of estimated compute costs). Training runs, especially for large models, can surprise you. GPU costs accumulate fast when you need extra iterations.

Add these contingency amounts to get your Layer 2 total. This is not padding or sandbagging. It is an honest accounting of uncertainty.

Layer 3: The Scope Expansion Reserve

During any AI project, new opportunities and requirements emerge. The client sees early results and wants to add a feature. The data reveals an adjacent use case. The production environment requires capabilities that were not in the original scope.

Reserve 10-15% of your total budget (Layer 1 + Layer 2) as a scope expansion reserve. This is not for fixing problems. It is for capturing opportunities. When the client asks for something new, you can say yes without blowing the budget, or you can propose a change order for anything that exceeds the reserve.

Putting It All Together

Here is how the math works for a hypothetical project:

Base budget: $80,000 (predictable work)
Data prep contingency: $8,000 (40% of $20,000 estimated data prep)
Model development contingency: $15,000 (50% of $30,000 estimated modeling)
Integration contingency: $4,500 (25% of $18,000 estimated integration)
Compute contingency: $2,000 (30% of estimated $6,500 compute costs)
Subtotal with contingency: $109,500
Scope expansion reserve (12%): $13,140
Total project budget: $122,640

Compare this to the naive estimate of $80,000. The three-layer budget is 53% higher, which might feel aggressive. But remember: AI projects run over budget by 45% on average. The three-layer approach is not inflating the budget. It is sizing it correctly.

Phase-Gated Budgeting

For larger projects (over $100,000), do not budget the entire project upfront. Instead, use phase-gated budgeting where you budget each phase separately and the client approves the next phase based on results from the current one.

Phase 1: Discovery and Feasibility ($15,000-$25,000)

Budget this phase as a fixed-price engagement. The deliverable is a feasibility report that includes:

Data quality assessment with specific findings
Recommended technical approach with rationale
Detailed budget estimate for the full project (using the three-layer framework)
Risk assessment with mitigation strategies
Timeline with milestones and decision points

This phase gives both you and the client the information needed to budget the remaining phases accurately. It also gives the client an off-ramp if the project is not feasible.

Phase 2: Development and Iteration (Budgeted Based on Phase 1 Findings)

After Phase 1, you have actually seen the data and validated the approach. Your budget for Phase 2 will be significantly more accurate than any estimate you could have given before Phase 1.

Budget Phase 2 using the three-layer framework, but with lower contingency percentages because your uncertainty has decreased. Typical contingencies for Phase 2 after a thorough Phase 1:

Data prep contingency: 15-25%
Model development contingency: 25-40%
Integration contingency: 15-20%

Phase 3: Deployment and Optimization

Budget this phase after Phase 2 delivers a working model. You now know the model's performance characteristics, integration requirements, and operational constraints. Budgeting is nearly as predictable as traditional software deployment at this point.

Why Clients Prefer Phase-Gated Budgets

You might worry that clients will balk at a phased approach. In practice, the opposite happens. Clients prefer phase-gated budgets because:

Lower initial commitment. They invest $15,000-$25,000 to validate the project before committing six figures.
More control. They can adjust scope, priorities, or budget at each gate based on real results.
Better visibility. They see tangible progress at each phase instead of waiting months for a final deliverable.
Reduced risk. If the project is not feasible, they find out after $20,000 instead of $150,000.

Bottom-Up vs. Top-Down Estimation

Use both approaches and reconcile them. Bottom-up estimation alone misses systemic factors. Top-down estimation alone misses task-level details.

Bottom-Up Process

Break the project into work packages. Break each work package into tasks. Estimate each task in hours. Multiply by the appropriate rate for the role performing the task.

For AI projects, your work packages typically include:

Data engineering: Collection, cleaning, transformation, pipeline development
Model development: Architecture selection, training, evaluation, iteration
Application engineering: API development, UI development, integration
Infrastructure: Cloud setup, monitoring, CI/CD, security
Quality assurance: Testing, validation, performance benchmarking
Project management: Planning, communication, documentation, reviews

Have the person doing the work estimate the hours whenever possible. The team lead who will build the model gives a better estimate than the sales lead who sold the project.

Top-Down Process

Look at similar projects you have completed. What did they actually cost? Not what you estimated, but what you spent. Use these actuals as your reference point.

If you have not completed a similar project, use industry benchmarks:

Simple AI integration (pre-trained model, API-based, limited customization): $20,000-$50,000
Custom model development (moderate data, established techniques): $75,000-$200,000
Complex AI system (large data, novel approaches, multiple models): $200,000-$500,000+

Reconciliation

Compare your bottom-up and top-down estimates. If they are within 15% of each other, you are in good shape. If they diverge significantly, investigate why.

Common reasons for divergence:

Bottom-up is much lower: You probably forgot a work package or underestimated the data and iteration work.
Bottom-up is much higher: You might be over-engineering the solution. Look at whether simpler approaches could meet the requirements.
Top-down comparison projects are not relevant: Your project might be genuinely different from your historical data. Lean on bottom-up in this case but increase your contingency percentages.

Tracking Budgets During Execution

A budget that is only checked at the end of the project is not a budget. It is a post-mortem. Build real-time budget tracking into your project process.

Weekly Budget Reviews

Every week, compare actual spend to planned spend for the current phase. Calculate these three numbers:

Spend to date: Total hours logged multiplied by rates, plus any direct costs
Estimated spend at completion: Current spend rate extrapolated to project end
Budget remaining: Total budget minus spend to date

If your estimated spend at completion exceeds the budget (including contingency), you have a problem. Catch it early when you can still adjust.

Earned Value Tracking (Simplified)

Earned value management sounds complicated, but the simplified version is powerful:

Planned Value (PV): How much work should be done by now according to the schedule?
Earned Value (EV): How much work is actually done?
Actual Cost (AC): How much have we spent?

If EV is less than PV, you are behind schedule. If AC is greater than EV, you are over budget per unit of work. Both are early warning signals.

You do not need fancy tools for this. A simple spreadsheet updated weekly provides the visibility you need.

Contingency Burn Rate

Track how fast you are consuming your contingency layer. If you burn through 50% of contingency in the first 25% of the project, you are on a trajectory to blow the budget. This metric gives you lead time to have a conversation with the client about scope adjustments or additional investment.

Presenting Budgets to Clients

How you present the budget matters as much as the numbers. Here are presentation strategies that work.

The Range Approach

Instead of presenting a single number, present a range:

Best case (base budget): $80,000 if data is clean, model trains efficiently, and integration is straightforward.
Expected case (base + partial contingency): $105,000 based on typical project patterns.
Worst case (base + full contingency + reserve): $125,000 if we encounter significant data issues or the model requires extensive iteration.

Clients appreciate this honesty. They budget for the expected case and are prepared for the worst case.

The Phase-Gated Approach

Present Phase 1 as a fixed price and communicate that Phase 2 will be scoped based on Phase 1 findings:

"We propose a $20,000 discovery phase that will give us both the data needed to budget the full project accurately. Based on similar projects, we expect the total investment to be in the range of $100,000 to $140,000, but Phase 1 findings will allow us to narrow that range significantly."

This approach is honest, professional, and reduces the client's upfront risk.

What Not to Do

Do not present a single fixed price for the entire project unless you have extremely high confidence in the scope and have built enough margin to absorb surprises.
Do not hide contingency inside inflated task estimates. Clients and their procurement teams are smart. They will find the padding and lose trust.
Do not promise the best-case budget and then ask for more money later. This destroys client relationships faster than almost anything else.

Building a Historical Budget Database

Your budgeting process gets better over time only if you track actuals and learn from them. After every project, capture:

Original budget by category (data, modeling, engineering, PM, etc.)
Actual spend by category
Variance by category with explanation
Contingency consumed and what triggered it
Scope changes and their budget impact
Client satisfaction with the budgeting process

After 10-15 projects, patterns emerge. You will learn that your data prep estimates are consistently 30% low, or that your integration estimates are actually pretty good. This empirical data makes every subsequent estimate more accurate.

Store this data in a simple spreadsheet or your project management tool. The format does not matter. What matters is that you capture it consistently and reference it when estimating new projects.

Budgeting for Internal Projects

Not all work at your agency is client-facing. Internal projects, like building tools, improving processes, or developing intellectual property, also need budgets. Apply the same discipline.

Estimate the hours and assign a cost even if no one is billing externally
Set a contingency because internal projects face the same uncertainties
Track actuals against the budget weekly
Kill or descope internal projects that exceed their budget without a clear justification

Agencies that budget internal projects rigorously are consistently more profitable than those who treat internal work as free because no one is billing for it.

Your Next Step

Pull up the last three AI projects your agency delivered. For each one, calculate the original budget and the actual cost. Divide the actual by the original to get your overrun ratio. Average the three ratios. This is your agency's historical overrun factor, and it tells you exactly how much contingency you need to build into future budgets. If your average overrun is 1.4x, your contingency layer needs to be at least 40% of your base budget. Start using this number on your next proposal, and watch your project profitability stabilize.

Let us build a better process.

Why Standard Project Budgets Fail for AI Work

AI projects break this model in three fundamental ways.

The Data Variable

The Iteration Variable

The Integration Variable

The Three-Layer Budget Framework

Instead of a single budget number, build every AI project budget with three layers: a base budget, a contingency layer, and a scope expansion reserve.

Layer 1: The Base Budget

The base budget covers the work you can estimate with reasonable confidence. This includes:

Discovery and requirements gathering. Meeting with stakeholders, understanding the business problem, defining success criteria. This is predictable work.
Data assessment. An initial review of available data, quality checks, and feasibility analysis. Budget enough time to actually look at the data, not just hear about it.
Known engineering work. Infrastructure setup, API development, monitoring and logging implementation, deployment pipeline creation. This is standard software work that you can estimate normally.
Project management and communication. Status meetings, documentation, client presentations. Budget this explicitly rather than hoping it gets absorbed.
Testing and QA. Functional testing, performance testing, security review. Again, predictable and estimable.

For the base budget, use your normal estimation process. Break the work into tasks, estimate hours per task, multiply by your blended rate. This layer should feel solid.

Layer 2: The Contingency Layer

The contingency layer covers the work that is inherently uncertain. This is where AI projects differ from traditional projects, and this is where most agencies underbudget.

Compute cost contingency (25-40% of estimated compute costs). Training runs, especially for large models, can surprise you. GPU costs accumulate fast when you need extra iterations.

Add these contingency amounts to get your Layer 2 total. This is not padding or sandbagging. It is an honest accounting of uncertainty.

Layer 3: The Scope Expansion Reserve

Putting It All Together

Here is how the math works for a hypothetical project:

Base budget: $80,000 (predictable work)
Data prep contingency: $8,000 (40% of $20,000 estimated data prep)
Model development contingency: $15,000 (50% of $30,000 estimated modeling)
Integration contingency: $4,500 (25% of $18,000 estimated integration)
Compute contingency: $2,000 (30% of estimated $6,500 compute costs)
Subtotal with contingency: $109,500
Scope expansion reserve (12%): $13,140
Total project budget: $122,640

Phase-Gated Budgeting

Phase 1: Discovery and Feasibility ($15,000-$25,000)

Budget this phase as a fixed-price engagement. The deliverable is a feasibility report that includes:

Data quality assessment with specific findings
Recommended technical approach with rationale
Detailed budget estimate for the full project (using the three-layer framework)
Risk assessment with mitigation strategies
Timeline with milestones and decision points

This phase gives both you and the client the information needed to budget the remaining phases accurately. It also gives the client an off-ramp if the project is not feasible.

Phase 2: Development and Iteration (Budgeted Based on Phase 1 Findings)

After Phase 1, you have actually seen the data and validated the approach. Your budget for Phase 2 will be significantly more accurate than any estimate you could have given before Phase 1.

Budget Phase 2 using the three-layer framework, but with lower contingency percentages because your uncertainty has decreased. Typical contingencies for Phase 2 after a thorough Phase 1:

Data prep contingency: 15-25%
Model development contingency: 25-40%
Integration contingency: 15-20%

Phase 3: Deployment and Optimization

Why Clients Prefer Phase-Gated Budgets

You might worry that clients will balk at a phased approach. In practice, the opposite happens. Clients prefer phase-gated budgets because:

Lower initial commitment. They invest $15,000-$25,000 to validate the project before committing six figures.
More control. They can adjust scope, priorities, or budget at each gate based on real results.
Better visibility. They see tangible progress at each phase instead of waiting months for a final deliverable.
Reduced risk. If the project is not feasible, they find out after $20,000 instead of $150,000.

Bottom-Up vs. Top-Down Estimation

Use both approaches and reconcile them. Bottom-up estimation alone misses systemic factors. Top-down estimation alone misses task-level details.

Bottom-Up Process

Break the project into work packages. Break each work package into tasks. Estimate each task in hours. Multiply by the appropriate rate for the role performing the task.

For AI projects, your work packages typically include:

Data engineering: Collection, cleaning, transformation, pipeline development
Model development: Architecture selection, training, evaluation, iteration
Application engineering: API development, UI development, integration
Infrastructure: Cloud setup, monitoring, CI/CD, security
Quality assurance: Testing, validation, performance benchmarking
Project management: Planning, communication, documentation, reviews

Have the person doing the work estimate the hours whenever possible. The team lead who will build the model gives a better estimate than the sales lead who sold the project.

Top-Down Process

Look at similar projects you have completed. What did they actually cost? Not what you estimated, but what you spent. Use these actuals as your reference point.

If you have not completed a similar project, use industry benchmarks:

Simple AI integration (pre-trained model, API-based, limited customization): $20,000-$50,000
Custom model development (moderate data, established techniques): $75,000-$200,000
Complex AI system (large data, novel approaches, multiple models): $200,000-$500,000+

Reconciliation

Compare your bottom-up and top-down estimates. If they are within 15% of each other, you are in good shape. If they diverge significantly, investigate why.

Common reasons for divergence:

Bottom-up is much lower: You probably forgot a work package or underestimated the data and iteration work.
Bottom-up is much higher: You might be over-engineering the solution. Look at whether simpler approaches could meet the requirements.
Top-down comparison projects are not relevant: Your project might be genuinely different from your historical data. Lean on bottom-up in this case but increase your contingency percentages.

Tracking Budgets During Execution

A budget that is only checked at the end of the project is not a budget. It is a post-mortem. Build real-time budget tracking into your project process.

Weekly Budget Reviews

Every week, compare actual spend to planned spend for the current phase. Calculate these three numbers:

Spend to date: Total hours logged multiplied by rates, plus any direct costs
Estimated spend at completion: Current spend rate extrapolated to project end
Budget remaining: Total budget minus spend to date

If your estimated spend at completion exceeds the budget (including contingency), you have a problem. Catch it early when you can still adjust.

Earned Value Tracking (Simplified)

Earned value management sounds complicated, but the simplified version is powerful:

Planned Value (PV): How much work should be done by now according to the schedule?
Earned Value (EV): How much work is actually done?
Actual Cost (AC): How much have we spent?

If EV is less than PV, you are behind schedule. If AC is greater than EV, you are over budget per unit of work. Both are early warning signals.

You do not need fancy tools for this. A simple spreadsheet updated weekly provides the visibility you need.

Contingency Burn Rate

Presenting Budgets to Clients

How you present the budget matters as much as the numbers. Here are presentation strategies that work.

The Range Approach

Instead of presenting a single number, present a range:

Best case (base budget): $80,000 if data is clean, model trains efficiently, and integration is straightforward.
Expected case (base + partial contingency): $105,000 based on typical project patterns.
Worst case (base + full contingency + reserve): $125,000 if we encounter significant data issues or the model requires extensive iteration.

Clients appreciate this honesty. They budget for the expected case and are prepared for the worst case.

The Phase-Gated Approach

Present Phase 1 as a fixed price and communicate that Phase 2 will be scoped based on Phase 1 findings:

This approach is honest, professional, and reduces the client's upfront risk.

What Not to Do

Do not present a single fixed price for the entire project unless you have extremely high confidence in the scope and have built enough margin to absorb surprises.
Do not hide contingency inside inflated task estimates. Clients and their procurement teams are smart. They will find the padding and lose trust.
Do not promise the best-case budget and then ask for more money later. This destroys client relationships faster than almost anything else.

Building a Historical Budget Database

Your budgeting process gets better over time only if you track actuals and learn from them. After every project, capture:

Original budget by category (data, modeling, engineering, PM, etc.)
Actual spend by category
Variance by category with explanation
Contingency consumed and what triggered it
Scope changes and their budget impact
Client satisfaction with the budgeting process

Store this data in a simple spreadsheet or your project management tool. The format does not matter. What matters is that you capture it consistently and reference it when estimating new projects.

Budgeting for Internal Projects

Not all work at your agency is client-facing. Internal projects, like building tools, improving processes, or developing intellectual property, also need budgets. Apply the same discipline.

Estimate the hours and assign a cost even if no one is billing externally
Set a contingency because internal projects face the same uncertainties
Track actuals against the budget weekly
Kill or descope internal projects that exceed their budget without a clear justification

Agencies that budget internal projects rigorously are consistently more profitable than those who treat internal work as free because no one is billing for it.

They Quoted 85K, Then Ate 57K in Overruns

Why Standard Project Budgets Fail for AI Work

The Data Variable

The Iteration Variable

The Integration Variable

The Three-Layer Budget Framework

Layer 1: The Base Budget

Layer 2: The Contingency Layer

Layer 3: The Scope Expansion Reserve

Putting It All Together

Phase-Gated Budgeting

Phase 1: Discovery and Feasibility ($15,000-$25,000)

Phase 2: Development and Iteration (Budgeted Based on Phase 1 Findings)

Phase 3: Deployment and Optimization

Why Clients Prefer Phase-Gated Budgets

Bottom-Up vs. Top-Down Estimation

Bottom-Up Process

Top-Down Process

Reconciliation

Tracking Budgets During Execution

Weekly Budget Reviews

Earned Value Tracking (Simplified)

Contingency Burn Rate

Presenting Budgets to Clients

The Range Approach

The Phase-Gated Approach

What Not to Do

Building a Historical Budget Database

Budgeting for Internal Projects

Your Next Step

Agency Script Editorial

Related Articles

Understaffed or Overstaffed? Both Camps Were Right.

Optimizing Daily Standups for Distributed AI Agency Teams

Complete Utilization Rate Management Guide — The Metric That Makes or Breaks Agency Profitability

Ready to certify your AI capability?

They Quoted 85K, Then Ate 57K in Overruns

Why Standard Project Budgets Fail for AI Work

The Data Variable

The Iteration Variable

The Integration Variable

The Three-Layer Budget Framework

Layer 1: The Base Budget

Layer 2: The Contingency Layer

Layer 3: The Scope Expansion Reserve

Putting It All Together

Phase-Gated Budgeting

Phase 1: Discovery and Feasibility ($15,000-$25,000)

Phase 2: Development and Iteration (Budgeted Based on Phase 1 Findings)

Phase 3: Deployment and Optimization

Why Clients Prefer Phase-Gated Budgets

Bottom-Up vs. Top-Down Estimation

Bottom-Up Process

Top-Down Process

Reconciliation

Tracking Budgets During Execution

Weekly Budget Reviews

Earned Value Tracking (Simplified)

Contingency Burn Rate

Presenting Budgets to Clients

The Range Approach

The Phase-Gated Approach

What Not to Do

Building a Historical Budget Database

Budgeting for Internal Projects

Your Next Step

Agency Script Editorial

Related Articles

Understaffed or Overstaffed? Both Camps Were Right.

Optimizing Daily Standups for Distributed AI Agency Teams

Complete Utilization Rate Management Guide — The Metric That Makes or Breaks Agency Profitability

Ready to certify your AI capability?