AGENCYSCRIPT
CoursesEnterpriseBlog
đź‘‘FoundersSign inJoin Waitlist
AGENCYSCRIPT

Governed Certification Framework

The operating system for AI-enabled agency building. Certify judgment under constraint. Standards over scale. Governance over shortcuts.

Stay informed

Governance updates, certification insights, and industry standards.

Products

  • Platform
  • Certification
  • Launch Program
  • Vault
  • The Book

Certification

  • Foundation (AS-F)
  • Operator (AS-O)
  • Architect (AS-A)
  • Principal (AS-P)

Resources

  • Blog
  • Verify Credential
  • Enterprise
  • Partners
  • Pricing

Company

  • About
  • Contact
  • Careers
  • Press
© 2026 Agency Script, Inc.·
Privacy PolicyTerms of ServiceCertification AgreementSecurity

Standards over scale. Judgment over volume. Governance over shortcuts.

On This Page

Starting From Scratch: What Is a Model Learning?Two Ways to FailUnderfitting: Too Simple to LearnWhat Causes ItOverfitting: Memorizing Instead of LearningHow You Recognize ItThe Train-and-Test SplitThe Balance You Are Always ChasingA Simple Routine to Start WithA Picture to Keep In Your HeadFrequently Asked QuestionsWhich is worse, overfitting or underfitting?Do I need to know math to understand this?How much data should go in the test set?Can I just collect more data to avoid both problems?What is the very first thing to check on a new model?Key Takeaways
Home/Blog/Two Students, One Exam, and the Whole Idea of Overfitting
General

Two Students, One Exam, and the Whole Idea of Overfitting

A

Agency Script Editorial

Editorial Team

·May 21, 2025·8 min read
ai model overfitting and underfittingai model overfitting and underfitting for beginnersai model overfitting and underfitting guideai fundamentals

Imagine a student studying for an exam. One student memorizes every answer in last year's practice test word for word. They score perfectly on those exact questions, then fail the real exam because the questions changed. A second student barely studies, skims the material once, and fails both the practice and the real exam. The first student overfit. The second underfit. That is the entire concept, and you already understand it.

This guide assumes you know nothing about machine learning. We will define every term as it comes up, build from the ground floor, and avoid math beyond simple comparisons. By the end you will understand what these two words mean, why every model builder worries about them, and how to tell which one you are facing.

The reason beginners need to learn this early is that overfitting and underfitting are not edge cases. They are the default. Getting a model to generalize well is the central challenge, and most early mistakes trace back to one of these two failures.

Starting From Scratch: What Is a Model Learning?

A machine learning model learns patterns from examples. You show it data, it adjusts itself to predict an answer, and then you hope it does well on new data it has never seen. That last part is the whole point. We do not care whether the model can repeat what it already saw. We care whether it can handle what comes next.

The word for handling new data well is generalization. A model that generalizes has learned the real pattern. A model that fails to generalize has either learned too little or learned the wrong things.

Two Ways to Fail

There are exactly two ways generalization breaks down:

  • The model is too simple to capture the real pattern. This is underfitting.
  • The model is so flexible it memorized the specific examples, including their random quirks. This is overfitting.

Almost every model problem you will encounter is one of these two, or a step on the road between them.

Underfitting: Too Simple to Learn

Underfitting is the easier one to picture. Suppose the true relationship in your data is a gentle curve, but you force a straight line to describe it. No matter how you adjust the line, it will miss large parts of the curve. The line is too simple for the job.

You spot underfitting because the model does badly everywhere, on the data it trained on and on new data alike. It never got good at anything.

What Causes It

  • The model is not powerful enough for the task.
  • You did not train it long enough.
  • The inputs you gave it do not contain enough useful information.

The fix is to give the model more power, more training time, or better inputs. We cover the full repair process in A Step-by-Step Approach to Ai Model Overfitting and Underfitting.

Overfitting: Memorizing Instead of Learning

Overfitting is sneakier because at first it looks like success. The model does fantastically on the training data. The trap is that it achieved this by memorizing, not by understanding. It absorbed the random noise in the examples as if that noise were a real pattern.

Real data always has noise. A house price dataset has one home that sold cheap because the seller was in a hurry. An overfit model treats that fluke as a rule. When new data arrives without that fluke, the model stumbles.

How You Recognize It

The signature of overfitting is a gap. The model scores high on training data and noticeably lower on new data. The bigger that gap, the worse the overfitting.

The Train-and-Test Split

Here is the simplest tool in the whole field, and it solves a real problem: how do you measure generalization before you ship a model?

You split your data. Most of it becomes the training set, which the model learns from. A smaller part becomes the test set, which you hide from the model until the end. Then you check performance on the test set, which stands in for new data.

  • High training score, high test score: the model learned well.
  • High training score, low test score: overfitting.
  • Low training score, low test score: underfitting.

This three-way check is something you will use on every project. For the recurring slip-ups beginners make with it, see 7 Common Mistakes with Ai Model Overfitting and Underfitting.

The Balance You Are Always Chasing

If you make a model simpler, you push it toward underfitting. If you make it more flexible, you push it toward overfitting. The skill is finding the middle, where the model is flexible enough to learn the real pattern but not so flexible that it memorizes noise.

This middle ground has a formal name, the bias-variance trade-off, which the The Complete Guide to Ai Model Overfitting and Underfitting explains in full. For now, hold the picture of a dial. Turn it one way and you underfit. Turn it the other and you overfit. Your job is to find where the dial points to good performance on new data.

A Simple Routine to Start With

  1. Split your data into training and test sets.
  2. Train a simple model first and check both scores.
  3. If both scores are bad, your model is too simple; add power.
  4. If training is great but test is poor, your model is too flexible; simplify it or add more data.
  5. Adjust and repeat until the test score is as good as you can get it.

A Picture to Keep In Your Head

If formulas feel intimidating, hold onto an image instead. Imagine plotting dots on paper and trying to draw a line through them that captures the trend.

  • Draw a straight line through clearly curved dots and it misses most of them. That is underfitting: too simple.
  • Draw a wild squiggle that touches every single dot exactly, including the ones that are just random scatter, and it will look ridiculous the moment a new dot appears. That is overfitting: too flexible.
  • Draw a smooth curve that follows the general trend without chasing every dot. That is the good fit you are after.

The squiggle scores perfectly on the dots it was drawn for and fails on new ones, which is exactly why overfitting fools people. The smooth curve scores a little worse on the original dots but handles new ones gracefully. That trade, slightly worse on what you have seen for much better on what you have not, is the heart of the whole subject.

Frequently Asked Questions

Which is worse, overfitting or underfitting?

Neither is universally worse, but overfitting is more dangerous because it disguises itself as success. An underfit model is obviously bad on every measure, so you catch it immediately. An overfit model looks great in development and only reveals its weakness on real data, sometimes after you have already deployed it.

Do I need to know math to understand this?

No. The intuition behind overfitting and underfitting requires no math at all. The exam-student analogy carries you most of the way. Math becomes useful later when you want to measure these effects precisely, but you can build correct instincts without it.

How much data should go in the test set?

A common starting point is keeping 20 to 30 percent of your data for testing, though this varies with how much data you have overall. The key rule is that the model must never see the test set during training, because the moment it does, your measurement of generalization becomes meaningless.

Can I just collect more data to avoid both problems?

More data helps overfitting a lot, because it gives the model more examples and makes memorizing noise harder. But more data does not help underfitting, since the problem there is a model too simple to use the data it already has. Match the fix to the failure.

What is the very first thing to check on a new model?

Compare the training score to the test score. If they are both low, you are underfitting. If training is high but test is low, you are overfitting. That single comparison tells you which direction to move.

Key Takeaways

  • Underfitting means the model is too simple and does poorly everywhere.
  • Overfitting means the model memorized the training data, including its noise, and does poorly on new data.
  • Generalization, doing well on new data, is the only thing that matters.
  • Split your data into training and test sets to measure generalization honestly.
  • More data helps overfitting but not underfitting; match the fix to the problem.
  • Finding the balance between too simple and too flexible is the core skill you are building.

Search Articles

Categories

OperationsSalesDeliveryGovernance

Popular Tags

prompt engineeringai fundamentalsai toolsthe difference between AIMLagency operationsagency growthenterprise sales

Share Article

A

Agency Script Editorial

Editorial Team

The Agency Script editorial team delivers operational insights on AI delivery, certification, and governance for modern agency operators.

Related Articles

General

Prompt Quality Decides Whether AI Earns Its Keep

Prompt quality is the single biggest variable in whether AI delivers real work or expensive noise. The model matters, the platform matters — but the prompt you write determines whether you get a first

A
Agency Script Editorial
June 1, 2026·10 min read
General

Counting the Real Cost of Every Token You Send

Tokens and context windows sit at the intersection of AI capability and operational cost—yet most business cases treat them as technical footnotes. That's a mistake that costs real money. Every time y

A
Agency Script Editorial
June 1, 2026·10 min read
General

Rolling Out AI Hallucinations Across a Team

Most teams discover AI hallucinations the hard way — a confident-sounding wrong answer makes it into a client deliverable, a legal brief, or a published report. The damage isn't just to the output; it

A
Agency Script Editorial
June 1, 2026·11 min read

Ready to certify your AI capability?

Join the professionals building governed, repeatable AI delivery systems.

Explore Certification