AGENCYSCRIPT
CoursesEnterpriseBlog
👑FoundersSign inJoin Waitlist
AGENCYSCRIPT

Governed Certification Framework

The operating system for AI-enabled agency building. Certify judgment under constraint. Standards over scale. Governance over shortcuts.

Stay informed

Governance updates, certification insights, and industry standards.

Products

  • Platform
  • Certification
  • Launch Program
  • Vault
  • The Book

Certification

  • Foundation (AS-F)
  • Operator (AS-O)
  • Architect (AS-A)
  • Principal (AS-P)

Resources

  • Blog
  • Verify Credential
  • Enterprise
  • Partners
  • Pricing

Company

  • About
  • Contact
  • Careers
  • Press
© 2026 Agency Script, Inc.·
Privacy PolicyTerms of ServiceCertification AgreementSecurity

Standards over scale. Judgment over volume. Governance over shortcuts.

On This Page

Why AI Projects Are Harder to ScopeChallenge 1: Data UncertaintyChallenge 2: Performance UncertaintyChallenge 3: Integration UncertaintyThe Five-Phase Scoping ProcessPhase 1: Problem Definition (1-2 hours)Phase 2: Data Assessment (4-8 hours)Phase 3: Technical Assessment (4-8 hours)Phase 4: Effort Estimation (2-4 hours)Phase 5: Scope Document Creation (2-4 hours)The Assumptions Section Is CriticalPreventing Scope Creep During ExecutionThe Change Request ProcessThe "Nice to Have" Parking LotWeekly Scope Health CheckScoping for Different Engagement ModelsFixed-Price ProjectsTime and Materials ProjectsRetainer EngagementsBuilding Your Scoping Knowledge BaseYour Next Step
Home/Blog/Four Lighting Conditions Blew Up a 95K Vision Project
Operations

Four Lighting Conditions Blew Up a 95K Vision Project

A

Agency Script Editorial

Editorial Team

·March 20, 2026·12 min read
ai agency scopingproject estimationscope managementagency project planning

A 16-person AI agency in Atlanta scoped a computer vision project at $95,000 and 12 weeks. The client needed to detect defects in manufactured parts using photos from their production line. Straightforward enough. The agency had done similar work before. Twelve weeks later, the project was only 60% complete and had consumed $130,000 in labor costs. What went wrong? The client's production line had four different lighting conditions that nobody had assessed. The defect categories turned out to be far more nuanced than the initial samples suggested. And the edge computing deployment target had hardware constraints that required model architecture changes not budgeted in the original scope.

None of these issues were unforeseeable. Every one of them would have been caught by a thorough scoping process that included on-site assessment, data analysis, and infrastructure evaluation before committing to a timeline and budget. The agency had skipped these steps to close the deal faster. That shortcut cost them $35,000 in margin erosion and six weeks of team overtime.

Scoping is the highest-leverage activity in AI agency operations. Every hour invested in scoping saves five to ten hours in execution because it prevents the rework, scope changes, and timeline extensions that destroy project profitability.

Why AI Projects Are Harder to Scope

AI project scoping fails more often than traditional software scoping because of three fundamental challenges.

Challenge 1: Data Uncertainty

In traditional software, the inputs are defined by the specification. In AI projects, the inputs are data, and data is unpredictable until you examine it. A client might describe their data as "clean and well-structured," only for your data engineers to discover that 20% of records have missing fields, the date formats are inconsistent across systems, and there are duplicate records that will contaminate model training.

Implication for scoping: You cannot accurately scope data work without looking at the actual data. Descriptions of data quality are unreliable. You need samples.

Challenge 2: Performance Uncertainty

Traditional software either works or it does not. AI models work on a spectrum of performance that depends on data quality, algorithm selection, hyperparameter tuning, and sometimes luck. You might achieve 95% accuracy quickly or spend weeks trying to get past 85%.

Implication for scoping: You cannot guarantee a specific performance level without first understanding the data and the problem's inherent difficulty. Scope should include performance targets with clear caveats about achievability.

Challenge 3: Integration Uncertainty

Deploying an AI model into a production environment often reveals requirements that did not surface during development: latency constraints, data pipeline reliability, security requirements, user interface expectations, and integration with legacy systems.

Implication for scoping: You need to understand the deployment environment before scoping. "We will deploy to your cloud" is not specific enough. Which cloud? Which services? What latency requirements? What data flow?

The Five-Phase Scoping Process

Phase 1: Problem Definition (1-2 hours)

Before any technical assessment, ensure you understand the business problem thoroughly.

Questions to answer:

  • What business outcome does the client want to achieve?
  • How is this problem currently being solved (manually, with existing software, not at all)?
  • What is the quantifiable value of solving this problem? (Revenue gained, cost saved, risk reduced)
  • Who will use the AI system? What is their technical sophistication?
  • What does "success" look like in business terms?
  • What are the consequences of the AI system being wrong? (High-stakes decisions versus low-stakes automation)

Why this matters for scoping: The business context determines the appropriate level of investment in accuracy, reliability, and polish. A system that recommends product descriptions can tolerate occasional errors. A system that flags potential fraud needs much higher accuracy and reliability, which means more development time.

Deliverable: A one-page problem definition that both you and the client agree on. This document becomes the reference point for all subsequent scoping decisions.

Phase 2: Data Assessment (4-8 hours)

Request and examine actual data samples before committing to any timeline or budget.

What to request:

  • Representative data samples (not cherry-picked examples)
  • Data documentation (schema definitions, field descriptions, valid values)
  • Information about data volume (total records, new records per day/week)
  • Information about data quality (known issues, cleaning that has been done)
  • Data access mechanisms (API, database access, file transfer)

What to evaluate:

  • Volume: Is there enough data to train a model? Rule of thumb: at least 1,000 examples per class for classification tasks, more for complex problems.
  • Quality: What percentage of records have missing, inconsistent, or incorrect values? What cleaning is required?
  • Representation: Does the data represent the full range of real-world scenarios? Are there edge cases or rare events that need special handling?
  • Labeling: Is the data labeled? If not, who labels it, and how long does labeling take?
  • Bias: Does the data exhibit biases that could cause problematic model behavior?
  • Accessibility: Can you access the data efficiently? Are there security, privacy, or compliance constraints?

Deliverable: A data assessment summary that includes quality findings, estimated data preparation effort, and identified data risks.

Phase 3: Technical Assessment (4-8 hours)

Evaluate the technical approach, infrastructure requirements, and deployment environment.

Technical approach evaluation:

  • What model architecture or approach is best suited for this problem?
  • Are there pre-trained models or transfer learning opportunities that reduce development time?
  • What is the current state of the art for this type of problem, and what performance is realistically achievable?
  • What evaluation metrics are appropriate? (Accuracy, precision, recall, F1, RMSE, etc.)
  • What baseline performance can we establish quickly? (Simple heuristic, basic model)

Infrastructure evaluation:

  • Where will the model be deployed? (Cloud, edge, on-premise)
  • What are the latency requirements?
  • What are the throughput requirements?
  • What existing infrastructure does the client have that we can leverage?
  • What are the security and compliance requirements for the deployment?
  • What monitoring and maintenance will be needed post-deployment?

Integration evaluation:

  • What systems does the AI need to integrate with?
  • What are the APIs, data formats, and protocols for those systems?
  • Who on the client side manages those systems and will they be available during integration?
  • What are the authentication and authorization requirements?

Deliverable: A technical approach document that specifies the recommended architecture, expected performance range, infrastructure requirements, and integration approach.

Phase 4: Effort Estimation (2-4 hours)

With problem definition, data assessment, and technical assessment complete, you can now estimate effort with reasonable accuracy.

Break the work into work packages:

  1. Data preparation: Cleaning, transformation, feature engineering, labeling (if needed)
  2. Model development: Architecture selection, training, evaluation, iteration
  3. Application development: API creation, UI development (if applicable), business logic
  4. Integration: Connecting to client systems, data pipeline development, authentication
  5. Infrastructure: Cloud setup, deployment pipeline, monitoring, security
  6. Testing and validation: Unit tests, integration tests, performance benchmarks, user acceptance testing
  7. Documentation: Technical documentation, user documentation, handoff materials
  8. Project management: Planning, communication, status reporting, meetings

For each work package, estimate three values:

  • Best case: Everything goes smoothly. No surprises. Minimum realistic effort.
  • Most likely: Typical effort based on your experience with similar work.
  • Worst case: Significant challenges in data, performance, or integration.

Calculate the weighted estimate using PERT: Weighted estimate = (Best case + 4 times most likely + worst case) divided by 6

This gives a more realistic estimate than simply using the most likely value because it accounts for the fact that overruns are more common than underruns.

Apply contingency by category:

  • Data preparation: add 30-50% to the PERT estimate
  • Model development: add 40-60%
  • Integration: add 20-30%
  • Other categories: add 10-20%

These contingencies are not padding. They are empirically based adjustments for the inherent uncertainty in AI work.

Phase 5: Scope Document Creation (2-4 hours)

Compile everything into a scope document that serves as the foundation for the proposal and the project plan.

Scope document structure:

Section 1: Problem statement (from Phase 1) Section 2: Proposed approach (from Phase 3) Section 3: Data requirements and assessment findings (from Phase 2) Section 4: Deliverables (specific, measurable outputs) Section 5: Timeline (phased with milestones) Section 6: Effort estimate (hours by category and role) Section 7: Assumptions (what must be true for this scope to hold) Section 8: Exclusions (what is explicitly out of scope) Section 9: Risks (identified risks and mitigation strategies) Section 10: Success criteria (measurable conditions for project completion)

The Assumptions Section Is Critical

The assumptions section protects your scope from common failure modes. Clearly document:

  • Data quality assumptions based on your sample review
  • Client availability assumptions (response times, decision-making timelines)
  • Infrastructure assumptions (access, permissions, capabilities)
  • Performance target assumptions (achievability based on current data assessment)
  • Scope boundary assumptions (what happens if the client wants to add features)

Example assumptions:

  • "Data quality is consistent with the samples reviewed during scoping. If production data differs significantly, additional data preparation time will be needed."
  • "Client will provide feedback on deliverables within 5 business days. Delays in client feedback will extend the timeline proportionally."
  • "Model accuracy of 88-92% is achievable based on the assessed data. Higher accuracy may require additional training data or model complexity beyond the current scope."

Each assumption is a potential scope change trigger. When an assumption proves false, you have documented justification for a timeline extension or budget adjustment.

Preventing Scope Creep During Execution

Accurate scoping is necessary but not sufficient. You also need processes that prevent scope from expanding during the project.

The Change Request Process

Every change to scope, no matter how small, goes through a formal process:

  1. Request: The client or team member identifies a change.
  2. Assessment: The project manager evaluates the impact on timeline, budget, and dependencies.
  3. Communication: The PM shares the impact assessment with the client before any work begins.
  4. Decision: The client approves or declines the change, understanding the impact.
  5. Documentation: Approved changes are documented as amendments to the scope document.

The critical discipline: No work on out-of-scope items begins until the change request is approved. This sounds rigid, but it is the only way to prevent scope from expanding gradually through small accommodations that individually seem harmless but collectively blow the budget.

The "Nice to Have" Parking Lot

During project execution, ideas for improvements and extensions will surface constantly. Rather than rejecting them or scope-creeping them in, maintain a "parking lot" list:

  • Document every idea
  • Acknowledge the value of the suggestion
  • Explain that it will be considered for a future phase
  • Review the parking lot with the client at project milestones

This approach respects the client's input without compromising the current scope.

Weekly Scope Health Check

Every week, the project manager should answer:

  • Are we working on anything that is not in the scope document?
  • Have any assumptions proven false?
  • Are we trending toward needing a change request?

If the answer to any of these is yes, address it immediately rather than waiting until the impact is severe.

Scoping for Different Engagement Models

Fixed-Price Projects

Scoping accuracy is most critical for fixed-price projects because you bear the risk of overruns.

  • Use the five-phase scoping process in full
  • Apply higher contingency percentages
  • Be explicit about assumptions and exclusions
  • Include a provision for how out-of-scope work is handled (change orders with separate pricing)
  • Consider a paid scoping phase before committing to a fixed price

Time and Materials Projects

Scoping for T&M is less about commitment and more about expectation setting.

  • Provide an effort estimate and likely range (not a commitment)
  • Be transparent about uncertainty areas
  • Agree on regular budget check-ins with the client
  • Set expectations that the estimate may change as the project progresses

Retainer Engagements

Scoping for retainers focuses on capacity allocation and deliverable expectations.

  • Define the hours per month included in the retainer
  • Define the types of work covered
  • Establish a process for handling requests that exceed the retainer scope
  • Review the retainer scope quarterly and adjust as needed

Building Your Scoping Knowledge Base

Every completed project is a scoping data point. After each project, capture:

  • Original scope estimate by category
  • Actual effort by category
  • What assumptions proved false and the impact
  • What was missed in scoping and why
  • Lessons learned for future scoping

After 10-15 projects, you will have a scoping knowledge base that dramatically improves the accuracy of your estimates. You will know that your data preparation estimates are consistently 25% low, or that your integration estimates are actually pretty good. This empirical data replaces gut feel with evidence.

Your Next Step

Pull up the last project that went over budget. Reverse-engineer the scoping: what was estimated for each work category, and what was actually spent? Identify the specific categories where the overrun occurred and the root cause for each. Was it a data surprise? A performance issue? An integration challenge? An assumption that proved false? Document these findings. Then compare them to your next active scoping effort. Are you about to make the same mistake? This retrospective analysis, applied consistently to every completed project, is the fastest way to improve scoping accuracy at your agency.

Search Articles

Categories

OperationsSalesDeliveryGovernance

Popular Tags

prompt engineeringai fundamentalsai toolsthe difference between AIMLagency operationsagency growthenterprise sales

Share Article

A

Agency Script Editorial

Editorial Team

The Agency Script editorial team delivers operational insights on AI delivery, certification, and governance for modern agency operators.

Related Articles

Operations

Understaffed or Overstaffed? Both Camps Were Right.

You cannot manage what you cannot see. Here is how to build a team capacity dashboard that prevents burnout, eliminates bench time, and keeps projects staffed correctly.

A
Agency Script Editorial
March 21, 2026·12 min read
Operations

Optimizing Daily Standups for Distributed AI Agency Teams

Optimized standups keep distributed AI agency teams aligned without consuming the focused work time that engineers need to ship quality deliverables.

A
Agency Script Editorial
March 21, 2026·10 min read
Operations

Complete Utilization Rate Management Guide — The Metric That Makes or Breaks Agency Profitability

A 5% shift in utilization can swing agency profit by 30% or more. Here is the definitive guide to measuring, managing, and optimizing the most important metric in your agency.

A
Agency Script Editorial
March 21, 2026·13 min read

Ready to certify your AI capability?

Join the professionals building governed, repeatable AI delivery systems.

Explore Certification