Overfitting

When your model is too smart for its own good and memorizes the training data instead of learning useful patterns.

Overfitting is a phenomenon in machine learning and data science where a model learns the details and noise in the training data so closely that its performance on new data suffers. This typically occurs when the model is excessively complex, for example when it has too many parameters relative to the number of observations. As a result, the model may perform exceptionally well on the training dataset yet fail to generalize to unseen data, leading to poor predictive performance. Overfitting is particularly critical in fields such as artificial intelligence, where the ability to generalize from training data to real-world applications is essential.
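
To make the gap between training and unseen-data performance concrete, here is a minimal sketch in Python using NumPy. Everything in it is an assumption chosen for illustration: the sine-shaped ground truth, the noise level, the 12 training points, and the two polynomial degrees. The degree-11 polynomial has as many coefficients as there are training points, so it can pass through every noisy observation.

```python
import numpy as np
from numpy.polynomial import Polynomial

rng = np.random.default_rng(0)

def true_fn(x):
    # Hypothetical "real" pattern the model should learn.
    return np.sin(2 * np.pi * x)

# Small, noisy training set and a larger set of unseen points.
x_train = np.linspace(0.0, 1.0, 12)
y_train = true_fn(x_train) + rng.normal(scale=0.3, size=x_train.size)
x_test = rng.uniform(0.0, 1.0, 200)
y_test = true_fn(x_test) + rng.normal(scale=0.3, size=x_test.size)

def mse(model, x, y):
    # Mean squared error of a fitted polynomial on the given points.
    return float(np.mean((model(x) - y) ** 2))

# degree -> (training error, error on unseen data)
results = {}
for degree in (3, 11):
    model = Polynomial.fit(x_train, y_train, degree)
    results[degree] = (mse(model, x_train, y_train), mse(model, x_test, y_test))
    print(f"degree {degree:2d}: train MSE {results[degree][0]:.4f}, "
          f"test MSE {results[degree][1]:.4f}")
```

The more flexible model always achieves the lower training error here, because it can reproduce the noise. Its error on the unseen points is what reveals whether that flexibility helped or hurt.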

Overfitting is often identified by comparing a model's performance on the training data with its performance on data held out from training. Cross-validation is a common way to do this: the model is repeatedly trained on part of the data and evaluated on the remaining part, and a large gap between training and validation performance is the usual warning sign. It is important for data scientists, machine learning engineers, and data analysts to recognize and mitigate overfitting to ensure that their models are robust and reliable. Common strategies to avoid overfitting include simplifying the model, using regularization techniques, and employing dropout in neural networks.
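
As a rough illustration of cross-validation and regularization working together, the sketch below scores three polynomial models with 5-fold cross-validation, using NumPy only. The data, the helper names (`design`, `fit_ridge`, `cv_mse`), the degrees, and the penalty strength `alpha` are all hypothetical choices for this example. The regularization shown is ridge (an L2 penalty on the weights), which is one technique among several.

```python
import numpy as np

rng = np.random.default_rng(0)
x = rng.uniform(-1.0, 1.0, 40)
y = np.sin(np.pi * x) + rng.normal(scale=0.3, size=x.size)

def design(x, degree):
    # Polynomial feature matrix with columns 1, x, x^2, ..., x^degree.
    return np.vander(x, degree + 1, increasing=True)

def fit_ridge(X, y, alpha):
    # Ridge regression as an augmented least-squares problem:
    # minimizes ||Xw - y||^2 + alpha * ||w||^2 (alpha = 0 is plain least squares).
    # For simplicity the intercept is penalized too.
    n_features = X.shape[1]
    X_aug = np.vstack([X, np.sqrt(alpha) * np.eye(n_features)])
    y_aug = np.concatenate([y, np.zeros(n_features)])
    w, *_ = np.linalg.lstsq(X_aug, y_aug, rcond=None)
    return w

def cv_mse(x, y, degree, alpha, folds):
    # Average validation error: each fold is held out once while the
    # model is trained on all remaining points.
    errors = []
    for held_out in folds:
        train = np.setdiff1d(np.arange(x.size), held_out)
        w = fit_ridge(design(x[train], degree), y[train], alpha)
        pred = design(x[held_out], degree) @ w
        errors.append(np.mean((pred - y[held_out]) ** 2))
    return float(np.mean(errors))

# The same 5 folds are reused so every configuration is scored on equal terms.
folds = np.array_split(rng.permutation(x.size), 5)

scores = {}
for degree, alpha in [(3, 0.0), (12, 0.0), (12, 1e-2)]:
    scores[(degree, alpha)] = cv_mse(x, y, degree, alpha, folds)
    print(f"degree {degree:2d}, alpha {alpha:g}: CV MSE {scores[(degree, alpha)]:.4f}")
```

The configuration with the lowest cross-validated error is the one expected to generalize best on this data. Raising `alpha` shrinks the weights, which limits how far a flexible model can bend to fit noise.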

Example in the Wild

It's like training for a marathon by only running in your living room; you might ace the treadmill, but good luck on the actual pavement!

Alternative Names

  • Model Memorization
  • Excessive Fitting
  • Overtraining

Fun Fact

Overfitting was first recognized in the early days of statistical modeling, but it gained significant attention in the 1990s as machine learning began to flourish, leading to the development of various techniques aimed at preventing this common pitfall.
