Four Lighting Conditions Blew Up a 95K Vision Project

A 16-person AI agency in Atlanta scoped a computer vision project at $95,000 and 12 weeks. The client needed to detect defects in manufactured parts using photos from their production line. Straightforward enough. The agency had done similar work before. Twelve weeks later, the project was only 60% complete and had consumed $130,000 in labor costs. What went wrong? The client's production line had four different lighting conditions that nobody had assessed. The defect categories turned out to be far more nuanced than the initial samples suggested. And the edge computing deployment target had hardware constraints that required model architecture changes not budgeted in the original scope.

None of these issues were unforeseeable. Every one of them would have been caught by a thorough scoping process that included on-site assessment, data analysis, and infrastructure evaluation before committing to a timeline and budget. The agency had skipped these steps to close the deal faster. That shortcut cost them $35,000 in margin erosion and six weeks of team overtime.

Scoping is the highest-leverage activity in AI agency operations. Every hour invested in scoping saves five to ten hours in execution because it prevents the rework, scope changes, and timeline extensions that destroy project profitability.

Why AI Projects Are Harder to Scope

AI project scoping fails more often than traditional software scoping because of three fundamental challenges.

Challenge 1: Data Uncertainty

In traditional software, the inputs are defined by the specification. In AI projects, the inputs are data, and data is unpredictable until you examine it. A client might describe their data as "clean and well-structured," only for your data engineers to discover that 20% of records have missing fields, the date formats are inconsistent across systems, and there are duplicate records that will contaminate model training.

Implication for scoping: You cannot accurately scope data work without looking at the actual data. Descriptions of data quality are unreliable. You need samples.

Challenge 2: Performance Uncertainty

Traditional software either works or it does not. AI models work on a spectrum of performance that depends on data quality, algorithm selection, hyperparameter tuning, and sometimes luck. You might achieve 95% accuracy quickly or spend weeks trying to get past 85%.

Implication for scoping: You cannot guarantee a specific performance level without first understanding the data and the problem's inherent difficulty. Scope should include performance targets with clear caveats about achievability.

Challenge 3: Integration Uncertainty

Deploying an AI model into a production environment often reveals requirements that did not surface during development: latency constraints, data pipeline reliability, security requirements, user interface expectations, and integration with legacy systems.

Implication for scoping: You need to understand the deployment environment before scoping. "We will deploy to your cloud" is not specific enough. Which cloud? Which services? What latency requirements? What data flow?

The Five-Phase Scoping Process

Phase 1: Problem Definition (1-2 hours)

Before any technical assessment, ensure you understand the business problem thoroughly.

Questions to answer:

What business outcome does the client want to achieve?
How is this problem currently being solved (manually, with existing software, not at all)?
What is the quantifiable value of solving this problem? (Revenue gained, cost saved, risk reduced)
Who will use the AI system? What is their technical sophistication?
What does "success" look like in business terms?
What are the consequences of the AI system being wrong? (High-stakes decisions versus low-stakes automation)

Why this matters for scoping: The business context determines the appropriate level of investment in accuracy, reliability, and polish. A system that recommends product descriptions can tolerate occasional errors. A system that flags potential fraud needs much higher accuracy and reliability, which means more development time.

Deliverable: A one-page problem definition that both you and the client agree on. This document becomes the reference point for all subsequent scoping decisions.

Phase 2: Data Assessment (4-8 hours)

Request and examine actual data samples before committing to any timeline or budget.

What to request:

Representative data samples (not cherry-picked examples)
Data documentation (schema definitions, field descriptions, valid values)
Information about data volume (total records, new records per day/week)
Information about data quality (known issues, cleaning that has been done)
Data access mechanisms (API, database access, file transfer)

What to evaluate:

Volume: Is there enough data to train a model? Rule of thumb: at least 1,000 examples per class for classification tasks, more for complex problems.
Quality: What percentage of records have missing, inconsistent, or incorrect values? What cleaning is required?
Representation: Does the data represent the full range of real-world scenarios? Are there edge cases or rare events that need special handling?
Labeling: Is the data labeled? If not, who labels it, and how long does labeling take?
Bias: Does the data exhibit biases that could cause problematic model behavior?
Accessibility: Can you access the data efficiently? Are there security, privacy, or compliance constraints?

Deliverable: A data assessment summary that includes quality findings, estimated data preparation effort, and identified data risks.

Phase 3: Technical Assessment (4-8 hours)

Evaluate the technical approach, infrastructure requirements, and deployment environment.

Technical approach evaluation:

What model architecture or approach is best suited for this problem?
Are there pre-trained models or transfer learning opportunities that reduce development time?
What is the current state of the art for this type of problem, and what performance is realistically achievable?
What evaluation metrics are appropriate? (Accuracy, precision, recall, F1, RMSE, etc.)
What baseline performance can we establish quickly? (Simple heuristic, basic model)

Infrastructure evaluation:

Where will the model be deployed? (Cloud, edge, on-premise)
What are the latency requirements?
What are the throughput requirements?
What existing infrastructure does the client have that we can leverage?
What are the security and compliance requirements for the deployment?
What monitoring and maintenance will be needed post-deployment?

Integration evaluation:

What systems does the AI need to integrate with?
What are the APIs, data formats, and protocols for those systems?
Who on the client side manages those systems and will they be available during integration?
What are the authentication and authorization requirements?

Deliverable: A technical approach document that specifies the recommended architecture, expected performance range, infrastructure requirements, and integration approach.

Phase 4: Effort Estimation (2-4 hours)

With problem definition, data assessment, and technical assessment complete, you can now estimate effort with reasonable accuracy.

Break the work into work packages:

Data preparation: Cleaning, transformation, feature engineering, labeling (if needed)
Model development: Architecture selection, training, evaluation, iteration
Application development: API creation, UI development (if applicable), business logic
Integration: Connecting to client systems, data pipeline development, authentication
Infrastructure: Cloud setup, deployment pipeline, monitoring, security
Testing and validation: Unit tests, integration tests, performance benchmarks, user acceptance testing
Documentation: Technical documentation, user documentation, handoff materials
Project management: Planning, communication, status reporting, meetings

For each work package, estimate three values:

Best case: Everything goes smoothly. No surprises. Minimum realistic effort.
Most likely: Typical effort based on your experience with similar work.
Worst case: Significant challenges in data, performance, or integration.

Calculate the weighted estimate using PERT: Weighted estimate = (Best case + 4 times most likely + worst case) divided by 6

This gives a more realistic estimate than simply using the most likely value because it accounts for the fact that overruns are more common than underruns.

Apply contingency by category:

Data preparation: add 30-50% to the PERT estimate
Model development: add 40-60%
Integration: add 20-30%
Other categories: add 10-20%

These contingencies are not padding. They are empirically based adjustments for the inherent uncertainty in AI work.

Phase 5: Scope Document Creation (2-4 hours)

Compile everything into a scope document that serves as the foundation for the proposal and the project plan.

Scope document structure:

Section 1: Problem statement (from Phase 1) Section 2: Proposed approach (from Phase 3) Section 3: Data requirements and assessment findings (from Phase 2) Section 4: Deliverables (specific, measurable outputs) Section 5: Timeline (phased with milestones) Section 6: Effort estimate (hours by category and role) Section 7: Assumptions (what must be true for this scope to hold) Section 8: Exclusions (what is explicitly out of scope) Section 9: Risks (identified risks and mitigation strategies) Section 10: Success criteria (measurable conditions for project completion)

The Assumptions Section Is Critical

The assumptions section protects your scope from common failure modes. Clearly document:

Data quality assumptions based on your sample review
Client availability assumptions (response times, decision-making timelines)
Infrastructure assumptions (access, permissions, capabilities)
Performance target assumptions (achievability based on current data assessment)
Scope boundary assumptions (what happens if the client wants to add features)

Example assumptions:

"Data quality is consistent with the samples reviewed during scoping. If production data differs significantly, additional data preparation time will be needed."
"Client will provide feedback on deliverables within 5 business days. Delays in client feedback will extend the timeline proportionally."
"Model accuracy of 88-92% is achievable based on the assessed data. Higher accuracy may require additional training data or model complexity beyond the current scope."

Each assumption is a potential scope change trigger. When an assumption proves false, you have documented justification for a timeline extension or budget adjustment.

Preventing Scope Creep During Execution

Accurate scoping is necessary but not sufficient. You also need processes that prevent scope from expanding during the project.

The Change Request Process

Every change to scope, no matter how small, goes through a formal process:

Request: The client or team member identifies a change.
Assessment: The project manager evaluates the impact on timeline, budget, and dependencies.
Communication: The PM shares the impact assessment with the client before any work begins.
Decision: The client approves or declines the change, understanding the impact.
Documentation: Approved changes are documented as amendments to the scope document.

The critical discipline: No work on out-of-scope items begins until the change request is approved. This sounds rigid, but it is the only way to prevent scope from expanding gradually through small accommodations that individually seem harmless but collectively blow the budget.

The "Nice to Have" Parking Lot

During project execution, ideas for improvements and extensions will surface constantly. Rather than rejecting them or scope-creeping them in, maintain a "parking lot" list:

Document every idea
Acknowledge the value of the suggestion
Explain that it will be considered for a future phase
Review the parking lot with the client at project milestones

This approach respects the client's input without compromising the current scope.

Weekly Scope Health Check

Every week, the project manager should answer:

Are we working on anything that is not in the scope document?
Have any assumptions proven false?
Are we trending toward needing a change request?

If the answer to any of these is yes, address it immediately rather than waiting until the impact is severe.

Scoping for Different Engagement Models

Fixed-Price Projects

Scoping accuracy is most critical for fixed-price projects because you bear the risk of overruns.

Use the five-phase scoping process in full
Apply higher contingency percentages
Be explicit about assumptions and exclusions
Include a provision for how out-of-scope work is handled (change orders with separate pricing)
Consider a paid scoping phase before committing to a fixed price

Time and Materials Projects

Scoping for T&M is less about commitment and more about expectation setting.

Provide an effort estimate and likely range (not a commitment)
Be transparent about uncertainty areas
Agree on regular budget check-ins with the client
Set expectations that the estimate may change as the project progresses

Retainer Engagements

Scoping for retainers focuses on capacity allocation and deliverable expectations.

Define the hours per month included in the retainer
Define the types of work covered
Establish a process for handling requests that exceed the retainer scope
Review the retainer scope quarterly and adjust as needed

Building Your Scoping Knowledge Base

Every completed project is a scoping data point. After each project, capture:

Original scope estimate by category
Actual effort by category
What assumptions proved false and the impact
What was missed in scoping and why
Lessons learned for future scoping

After 10-15 projects, you will have a scoping knowledge base that dramatically improves the accuracy of your estimates. You will know that your data preparation estimates are consistently 25% low, or that your integration estimates are actually pretty good. This empirical data replaces gut feel with evidence.

Your Next Step

Pull up the last project that went over budget. Reverse-engineer the scoping: what was estimated for each work category, and what was actually spent? Identify the specific categories where the overrun occurred and the root cause for each. Was it a data surprise? A performance issue? An integration challenge? An assumption that proved false? Document these findings. Then compare them to your next active scoping effort. Are you about to make the same mistake? This retrospective analysis, applied consistently to every completed project, is the fastest way to improve scoping accuracy at your agency.

Why AI Projects Are Harder to Scope

AI project scoping fails more often than traditional software scoping because of three fundamental challenges.

Challenge 1: Data Uncertainty

Implication for scoping: You cannot accurately scope data work without looking at the actual data. Descriptions of data quality are unreliable. You need samples.

Challenge 2: Performance Uncertainty

Challenge 3: Integration Uncertainty

The Five-Phase Scoping Process

Phase 1: Problem Definition (1-2 hours)

Before any technical assessment, ensure you understand the business problem thoroughly.

Questions to answer:

What business outcome does the client want to achieve?
How is this problem currently being solved (manually, with existing software, not at all)?
What is the quantifiable value of solving this problem? (Revenue gained, cost saved, risk reduced)
Who will use the AI system? What is their technical sophistication?
What does "success" look like in business terms?
What are the consequences of the AI system being wrong? (High-stakes decisions versus low-stakes automation)

Deliverable: A one-page problem definition that both you and the client agree on. This document becomes the reference point for all subsequent scoping decisions.

Phase 2: Data Assessment (4-8 hours)

Request and examine actual data samples before committing to any timeline or budget.

What to request:

Representative data samples (not cherry-picked examples)
Data documentation (schema definitions, field descriptions, valid values)
Information about data volume (total records, new records per day/week)
Information about data quality (known issues, cleaning that has been done)
Data access mechanisms (API, database access, file transfer)

What to evaluate:

Volume: Is there enough data to train a model? Rule of thumb: at least 1,000 examples per class for classification tasks, more for complex problems.
Quality: What percentage of records have missing, inconsistent, or incorrect values? What cleaning is required?
Representation: Does the data represent the full range of real-world scenarios? Are there edge cases or rare events that need special handling?
Labeling: Is the data labeled? If not, who labels it, and how long does labeling take?
Bias: Does the data exhibit biases that could cause problematic model behavior?
Accessibility: Can you access the data efficiently? Are there security, privacy, or compliance constraints?

Deliverable: A data assessment summary that includes quality findings, estimated data preparation effort, and identified data risks.

Phase 3: Technical Assessment (4-8 hours)

Evaluate the technical approach, infrastructure requirements, and deployment environment.

Technical approach evaluation:

What model architecture or approach is best suited for this problem?
Are there pre-trained models or transfer learning opportunities that reduce development time?
What is the current state of the art for this type of problem, and what performance is realistically achievable?
What evaluation metrics are appropriate? (Accuracy, precision, recall, F1, RMSE, etc.)
What baseline performance can we establish quickly? (Simple heuristic, basic model)

Infrastructure evaluation:

Where will the model be deployed? (Cloud, edge, on-premise)
What are the latency requirements?
What are the throughput requirements?
What existing infrastructure does the client have that we can leverage?
What are the security and compliance requirements for the deployment?
What monitoring and maintenance will be needed post-deployment?

Integration evaluation:

What systems does the AI need to integrate with?
What are the APIs, data formats, and protocols for those systems?
Who on the client side manages those systems and will they be available during integration?
What are the authentication and authorization requirements?

Deliverable: A technical approach document that specifies the recommended architecture, expected performance range, infrastructure requirements, and integration approach.

Phase 4: Effort Estimation (2-4 hours)

With problem definition, data assessment, and technical assessment complete, you can now estimate effort with reasonable accuracy.

Break the work into work packages:

Data preparation: Cleaning, transformation, feature engineering, labeling (if needed)
Model development: Architecture selection, training, evaluation, iteration
Application development: API creation, UI development (if applicable), business logic
Integration: Connecting to client systems, data pipeline development, authentication
Infrastructure: Cloud setup, deployment pipeline, monitoring, security
Testing and validation: Unit tests, integration tests, performance benchmarks, user acceptance testing
Documentation: Technical documentation, user documentation, handoff materials
Project management: Planning, communication, status reporting, meetings

For each work package, estimate three values:

Best case: Everything goes smoothly. No surprises. Minimum realistic effort.
Most likely: Typical effort based on your experience with similar work.
Worst case: Significant challenges in data, performance, or integration.

Calculate the weighted estimate using PERT: Weighted estimate = (Best case + 4 times most likely + worst case) divided by 6

This gives a more realistic estimate than simply using the most likely value because it accounts for the fact that overruns are more common than underruns.

Apply contingency by category:

Data preparation: add 30-50% to the PERT estimate
Model development: add 40-60%
Integration: add 20-30%
Other categories: add 10-20%

These contingencies are not padding. They are empirically based adjustments for the inherent uncertainty in AI work.

Phase 5: Scope Document Creation (2-4 hours)

Compile everything into a scope document that serves as the foundation for the proposal and the project plan.

Scope document structure:

The Assumptions Section Is Critical

The assumptions section protects your scope from common failure modes. Clearly document:

Data quality assumptions based on your sample review
Client availability assumptions (response times, decision-making timelines)
Infrastructure assumptions (access, permissions, capabilities)
Performance target assumptions (achievability based on current data assessment)
Scope boundary assumptions (what happens if the client wants to add features)

Example assumptions:

"Data quality is consistent with the samples reviewed during scoping. If production data differs significantly, additional data preparation time will be needed."
"Client will provide feedback on deliverables within 5 business days. Delays in client feedback will extend the timeline proportionally."
"Model accuracy of 88-92% is achievable based on the assessed data. Higher accuracy may require additional training data or model complexity beyond the current scope."

Each assumption is a potential scope change trigger. When an assumption proves false, you have documented justification for a timeline extension or budget adjustment.

Preventing Scope Creep During Execution

Accurate scoping is necessary but not sufficient. You also need processes that prevent scope from expanding during the project.

The Change Request Process

Every change to scope, no matter how small, goes through a formal process:

Request: The client or team member identifies a change.
Assessment: The project manager evaluates the impact on timeline, budget, and dependencies.
Communication: The PM shares the impact assessment with the client before any work begins.
Decision: The client approves or declines the change, understanding the impact.
Documentation: Approved changes are documented as amendments to the scope document.

The "Nice to Have" Parking Lot

During project execution, ideas for improvements and extensions will surface constantly. Rather than rejecting them or scope-creeping them in, maintain a "parking lot" list:

Document every idea
Acknowledge the value of the suggestion
Explain that it will be considered for a future phase
Review the parking lot with the client at project milestones

This approach respects the client's input without compromising the current scope.

Weekly Scope Health Check

Every week, the project manager should answer:

Are we working on anything that is not in the scope document?
Have any assumptions proven false?
Are we trending toward needing a change request?

If the answer to any of these is yes, address it immediately rather than waiting until the impact is severe.

Scoping for Different Engagement Models

Fixed-Price Projects

Scoping accuracy is most critical for fixed-price projects because you bear the risk of overruns.

Use the five-phase scoping process in full
Apply higher contingency percentages
Be explicit about assumptions and exclusions
Include a provision for how out-of-scope work is handled (change orders with separate pricing)
Consider a paid scoping phase before committing to a fixed price

Time and Materials Projects

Scoping for T&M is less about commitment and more about expectation setting.

Provide an effort estimate and likely range (not a commitment)
Be transparent about uncertainty areas
Agree on regular budget check-ins with the client
Set expectations that the estimate may change as the project progresses

Retainer Engagements

Scoping for retainers focuses on capacity allocation and deliverable expectations.

Define the hours per month included in the retainer
Define the types of work covered
Establish a process for handling requests that exceed the retainer scope
Review the retainer scope quarterly and adjust as needed

Building Your Scoping Knowledge Base

Every completed project is a scoping data point. After each project, capture:

Original scope estimate by category
Actual effort by category
What assumptions proved false and the impact
What was missed in scoping and why
Lessons learned for future scoping

Four Lighting Conditions Blew Up a 95K Vision Project

Why AI Projects Are Harder to Scope

Challenge 1: Data Uncertainty

Challenge 2: Performance Uncertainty

Challenge 3: Integration Uncertainty

The Five-Phase Scoping Process

Phase 1: Problem Definition (1-2 hours)

Phase 2: Data Assessment (4-8 hours)

Phase 3: Technical Assessment (4-8 hours)

Phase 4: Effort Estimation (2-4 hours)

Phase 5: Scope Document Creation (2-4 hours)

The Assumptions Section Is Critical

Preventing Scope Creep During Execution

The Change Request Process

The "Nice to Have" Parking Lot

Weekly Scope Health Check

Scoping for Different Engagement Models

Fixed-Price Projects

Time and Materials Projects

Retainer Engagements

Building Your Scoping Knowledge Base

Your Next Step

Agency Script Editorial

Related Articles

Understaffed or Overstaffed? Both Camps Were Right.

Optimizing Daily Standups for Distributed AI Agency Teams

Complete Utilization Rate Management Guide — The Metric That Makes or Breaks Agency Profitability

Ready to certify your AI capability?

Four Lighting Conditions Blew Up a 95K Vision Project

Why AI Projects Are Harder to Scope

Challenge 1: Data Uncertainty

Challenge 2: Performance Uncertainty

Challenge 3: Integration Uncertainty

The Five-Phase Scoping Process

Phase 1: Problem Definition (1-2 hours)

Phase 2: Data Assessment (4-8 hours)

Phase 3: Technical Assessment (4-8 hours)

Phase 4: Effort Estimation (2-4 hours)

Phase 5: Scope Document Creation (2-4 hours)

The Assumptions Section Is Critical

Preventing Scope Creep During Execution

The Change Request Process

The "Nice to Have" Parking Lot

Weekly Scope Health Check

Scoping for Different Engagement Models

Fixed-Price Projects

Time and Materials Projects

Retainer Engagements

Building Your Scoping Knowledge Base

Your Next Step

Agency Script Editorial

Related Articles

Understaffed or Overstaffed? Both Camps Were Right.

Optimizing Daily Standups for Distributed AI Agency Teams

Complete Utilization Rate Management Guide — The Metric That Makes or Breaks Agency Profitability

Ready to certify your AI capability?