What is Overfitting?

Overfitting is a modeling error which occurs when a function is too closely fit to a limited set of data points. Overfitting the model generally takes the form of making an overly complex model to explain idiosyncrasies in the data under study.

In reality, the data often studied has some degree of error or random noise within it. Thus attempting to make the model conform too closely to slightly inaccurate data can infect the model with substantial errors and reduce its predictive power.

Understanding Overfitting

[Important: Financial professionals must always be aware of the dangers of overfitting a model based on limited data]

For instance, a common problem is using computer algorithms to search extensive databases of historical market data in order to find patterns. Given enough study, it is often possible to develop elaborate theorems which appear to predict things such as returns in the stock market with close accuracy.

However, when applied to data outside of the sample, such theorems may likely prove to be merely an overfitting of a model to what were in reality just chance occurrences. In all cases, it is important to test a model against data which is outside of the sample used to develop it.

Key Takeaways

  • Overfitting is a modeling error which occurs when a function is too closely fit to a limited set of data points.
  • Financial professionals must always be aware of the dangers of overfitting a model based on limited data.