When considering the addition of more data, it is important to calculate the cost, if, that is, such data is available. If the data available is data from an earlier date than what is currently owned, it is not clear that the data will lead to improvements, as data gets “stale” over time. As the size of the dataset increases, cross validation becomes a better indicator.