Managing Intellectual Property Across AI Agency Client Projects
Your agency just finished building a custom document extraction system for a legal tech client. The system works beautifully โ it extracts key provisions from contracts with 94% accuracy. Your team is proud of the architecture, and your tech lead has already identified a financial services prospect who could use something similar. Then your account manager pulls up the contract and reads the IP clause aloud: "All work product created in connection with this engagement, including all models, code, training data, documentation, and derivative works, shall be the sole and exclusive property of Client." Your custom extraction architecture, the data augmentation techniques your team invented, the evaluation framework they built โ it all belongs to the client. Legally, you cannot use any of it for the financial services prospect without building everything from scratch.
Meanwhile, on a different engagement, a client is demanding that you hand over the internal ML pipeline tools your team used to build their model. They argue that since the tools were used "in connection with" their engagement, the IP clause covers them. If they are right, every client would own a piece of your internal infrastructure, and you would need to rebuild your tools from scratch for every new engagement.
IP management is one of the highest-stakes operational challenges for AI agencies. Get it wrong, and you either give away the assets that make your agency valuable or expose yourself to legal action from clients who believe they own more than they actually do. Get it right, and you build a growing library of reusable components that make every engagement more profitable while respecting your clients' legitimate ownership rights.
Why IP Is Uniquely Complex in AI
Traditional service firms deal with IP too, but AI agencies face several additional layers of complexity.
Models are not like code. A model is a mathematical representation trained on data. It does not contain the data, but it encodes patterns from the data. If a model is trained on a client's proprietary data, does the client own the model? What about a model that was pre-trained on public data and then fine-tuned on client data? What about the weights of a foundation model that your team adapted?
Data transforms create derivative works. When your team cleans, augments, labels, and transforms client data, the transformed data is a derivative work. Who owns it? The answer depends on the contract, and many contracts do not address this clearly.
Techniques and approaches are hard to segregate. When your ML engineer develops a novel training technique while working on a client project, is that technique the client's IP or the engineer's (and thus the agency's) general knowledge? The answer often depends on whether the technique is specific to the client's problem or generally applicable.
Open source complicates ownership. AI work routinely uses open source libraries, pre-trained models, and public datasets. If your deliverable incorporates open source components, the client's ownership of the deliverable is constrained by the open source licenses โ something many clients do not realize or accept.
Generative AI adds new questions. If your team uses a large language model to generate code or content that becomes part of a client deliverable, the IP status of that generated material is legally uncertain. Copyright law in most jurisdictions has not fully resolved whether AI-generated content is protectable, and who holds any rights that do exist.
Defining Your IP Categories
The first step in managing IP is creating clear categories that your contracts and your team can reference.
Category One: Pre-Existing IP
This is intellectual property that existed before the client engagement began. It includes your internal tools, frameworks, libraries, pipelines, and any other assets your agency built independently.
Examples:
- Your custom ML pipeline framework
- Internal data quality assessment tools
- Reusable model evaluation harnesses
- Project templates and automation scripts
- Internal training data or benchmarks
- Proprietary algorithms or techniques developed on prior non-client work
Default ownership: The agency. Pre-existing IP should always remain the agency's property. No client engagement should transfer ownership of pre-existing IP. If pre-existing IP is used in a client deliverable, the client should receive a license to use it, not ownership.
Category Two: Client-Specific Deliverables
This is intellectual property created specifically for the client during the engagement. It is the work product the client is paying for.
Examples:
- Models trained on the client's data for the client's specific use case
- Custom integrations with the client's systems
- Client-specific data pipelines and transformations
- Analysis reports and documentation specific to the client's business
- Client-specific configurations and tuning
Default ownership: The client, or negotiated. Most clients expect to own the work product they pay for. This is reasonable for client-specific deliverables. However, some agencies negotiate to retain ownership and grant the client a perpetual license, which preserves the agency's ability to reuse approaches.
Category Three: General Knowledge and Skills
This is the expertise, knowledge, and skills that your team develops through their work. It includes general techniques, approaches, and understanding that are not specific to any client.
Examples:
- General ML engineering skills and best practices
- Knowledge of which algorithms work well for certain types of problems
- Familiarity with cloud infrastructure and deployment patterns
- Understanding of industry-specific data patterns (without specific client data)
- General software engineering practices and patterns
Default ownership: Not ownable. General knowledge and skills belong to the individuals who possess them and cannot be assigned or restricted by contract. Your contracts should explicitly carve out general knowledge from the definition of work product.
Category Four: Improvements to Pre-Existing IP
When your team uses pre-existing tools on a client engagement and improves those tools in the process, who owns the improvements?
Examples:
- Bug fixes to your internal framework discovered during client work
- Performance improvements to your pipeline tools
- New features added to your evaluation harness to meet client needs
- Extensions to your data quality tools
Default ownership: The agency. Improvements to pre-existing IP should remain with the agency. If the improvement was funded by the client's engagement (the work was done on their time), the client may have a legitimate claim. Address this in your contracts by specifying that improvements to pre-existing IP remain the agency's property, with the client receiving the benefit of the improvements during their engagement.
Category Five: Jointly Developed IP
Sometimes you and the client collaborate on creating something new that neither party could have created alone โ the client contributes domain expertise and data while your team contributes technical expertise and engineering.
Examples:
- A novel algorithm designed in collaboration with the client's research team
- A proprietary dataset created through a joint data collection effort
- A new product concept developed during a workshop or innovation sprint
Default ownership: Negotiated. Joint IP is the most complex category. Common approaches include joint ownership (each party can use it independently), client ownership with agency license for non-competing uses, or agency ownership with client license for their specific use case. The right approach depends on the relative contributions and the strategic importance of the IP to each party.
Structuring Your Contracts for IP Protection
Your contracts are your primary IP management tool. Here is how to structure the IP provisions.
The Three-Layer Approach
Layer one โ define the categories. Your contract should explicitly define pre-existing IP, client-specific deliverables, general knowledge, and any other relevant categories. Use the categories above as a starting point.
Layer two โ assign ownership by category. For each category, specify who owns the IP. Be explicit and unambiguous. "Agency retains all right, title, and interest in Pre-Existing IP. Client shall own all Client-Specific Deliverables upon full payment. General Knowledge is not work product and is not subject to assignment."
Layer three โ define licenses. Where ownership and usage diverge, define licenses. "Agency grants Client a perpetual, non-exclusive, royalty-free license to use Pre-Existing IP as incorporated in the Deliverables, solely in connection with Client's internal business operations."
Key Clauses to Include
Pre-existing IP reservation. "Nothing in this Agreement transfers ownership of Agency's Pre-Existing IP to Client. Agency retains all rights in its tools, frameworks, methodologies, and other pre-existing materials."
General knowledge carve-out. "Notwithstanding any other provision of this Agreement, Agency and its personnel shall be free to use and employ their general knowledge, skills, and experience, and any ideas, concepts, know-how, or techniques that are retained in the unaided memory of Agency's personnel, in providing services to other clients."
Open source disclosure. "Agency shall disclose to Client any open source software incorporated in the Deliverables and the applicable license terms. Client acknowledges that its use of such open source components is subject to the terms of the applicable open source licenses."
Model ownership specifics. "Models trained on Client data are Client-Specific Deliverables and are owned by Client. Model architectures, training techniques, and hyperparameter configurations that are generally applicable are General Knowledge and are not subject to assignment."
Operational IP Management
Contracts set the rules. Operations ensure the rules are followed.
The IP Register
Maintain an internal register of your agency's IP assets. This register should list every reusable component, tool, framework, and technique that constitutes your pre-existing IP.
For each asset, document:
- Name and description
- Date of creation
- Creator (employee or team)
- Client engagement context (if created or significantly improved during client work)
- Current version and location
- License terms (if any restrictions apply)
- Clients who have been granted licenses to use it
The IP register serves two purposes. First, it gives your team a library of reusable components for new engagements. Second, it provides evidence of pre-existing IP in case of a dispute. If a client claims ownership of a tool, the register shows that the tool existed before their engagement.
Code and Asset Separation
Maintain strict separation between client-specific code and your reusable assets.
Use separate repositories. Client-specific code should live in client-specific repositories. Your internal tools and frameworks should live in separate, agency-owned repositories. Do not mix client code with internal tools in the same repository.
Use dependency management. When a client project uses your internal tools, import them as dependencies (via package managers, Git submodules, or similar mechanisms) rather than copying the code into the client's repository. This maintains clear separation and makes it easy to demonstrate that the tool is a separate, pre-existing asset.
Version control is your friend. Git history provides a clear record of when code was created and by whom. If a dispute arises about whether a component is pre-existing IP, the Git history shows that it was committed to your internal repository before the client engagement began.
Team Training and Awareness
Your team needs to understand IP management practices to avoid inadvertent IP transfers.
Train engineers on IP categories. Every engineer should understand the difference between pre-existing IP, client-specific deliverables, and general knowledge. They should know that internal tools used on a client engagement remain the agency's property, and that techniques they develop can generally be applied to future work.
Establish contribution guidelines. When an engineer improves an internal tool during client work, the improvement should be contributed back to the internal repository, not embedded in the client's codebase. Define a workflow for this.
Address data handling. Engineers must understand that client data is the client's property. Models trained on client data may be the client's property depending on the contract. Training data must never be mixed between client engagements unless explicitly permitted.
Discuss generative AI usage. If your team uses AI coding assistants or generative AI tools, establish guidelines about disclosure and IP implications. Some clients have specific requirements about AI-generated code in their deliverables.
Common IP Disputes and How to Handle Them
Dispute: Client claims ownership of your internal tools. This happens when a broad work-for-hire clause in the contract potentially covers tools used during the engagement. Resolution: Point to the pre-existing IP reservation in the contract, show evidence of the tool's existence before the engagement (Git history, IP register), and offer the client a license to use the tool as delivered. Prevention: Ensure your contracts have a clear pre-existing IP reservation.
Dispute: Client wants the training data. If you created training data during the engagement (labeled datasets, augmented data), the client may claim ownership. Resolution: Review the contract โ if the training data is a deliverable, deliver it. If it is a byproduct of the process, negotiate. Prevention: Clarify in the SOW whether training data is a deliverable.
Dispute: Former client sees similar work in your portfolio. A client notices that work you did for them looks similar to work you are doing for another client and suspects IP transfer. Resolution: Demonstrate that the similarities reflect general techniques and approaches, not specific IP. Show that the new work was built independently. Prevention: Be thoughtful about what you share publicly, and maintain clear separation between engagements.
Dispute: Employee leaves and takes knowledge. A departing employee who worked on client projects takes general knowledge with them. A client may be concerned that their confidential information is leaving too. Resolution: Distinguish between confidential information (covered by NDA) and general knowledge (protected by the carve-out). Remind the departing employee of their NDA obligations. Prevention: Include general knowledge carve-outs in both client contracts and employment agreements.
IP as a Strategic Asset
For AI agencies, intellectual property is not just a legal concern โ it is a strategic asset that compounds in value over time.
Every engagement should grow your IP portfolio. Not by taking client-specific work, but by developing reusable tools, frameworks, and techniques that make future engagements faster and more profitable. If you build a data quality assessment tool for one client and refine it across ten more, that tool becomes a significant competitive advantage.
Your IP portfolio increases your agency's valuation. When agencies are valued for acquisition or investment, proprietary tools and frameworks that reduce delivery costs and improve quality command a premium. An agency that delivers every engagement from scratch is worth less than one with a robust library of battle-tested components.
Communicate your IP value to clients. When a client benefits from your pre-existing tools โ faster delivery, higher quality, lower risk โ make sure they understand the value. "We are able to deliver this in eight weeks instead of twelve because we have a proven data pipeline framework that we have refined across dozens of engagements" justifies your rates and demonstrates the value of your experience.
Managing IP across client projects requires discipline, clear contracts, and operational systems that maintain separation while enabling reuse. The agencies that get this right build compounding advantages โ every engagement makes them faster, better, and more valuable. The agencies that get it wrong give away their most valuable assets one contract at a time.