However, this would almost always overfit the data (e.g., grow the tree to fit noise) and produce a classifier that does not generalize well to new data [4]. To decide whether to continue splitting, we can use some combination of (i) a minimum number of points in a node, (ii) a purity or error threshold of a […]
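As a rough illustration of the first two criteria only, the check below is a minimal sketch that decides whether a node should become a leaf. The function names, the default values `min_points=5` and `purity_threshold=0.95`, and the choice of majority-class fraction as the purity measure are all assumptions made for this example, not values or definitions taken from the text.

```python
from collections import Counter


def node_purity(labels):
    """Fraction of points in the node that belong to the majority class.

    Assumption: purity is measured as the majority-class fraction; other
    measures (e.g., Gini impurity or entropy) could be substituted.
    """
    if not labels:
        raise ValueError("node has no points")
    majority_count = Counter(labels).most_common(1)[0][1]
    return majority_count / len(labels)


def should_stop_splitting(labels, min_points=5, purity_threshold=0.95):
    """Return True if the node should be made a leaf instead of being split.

    Both default thresholds are hypothetical and would normally be tuned,
    for example by cross-validation.
    """
    # Criterion (i): too few points in the node to justify another split.
    if len(labels) < min_points:
        return True
    # Criterion (ii): the node is already pure enough.
    return node_purity(labels) >= purity_threshold


if __name__ == "__main__":
    mixed_node = ["a"] * 10 + ["b"] * 10
    nearly_pure_node = ["a"] * 99 + ["b"]
    print(should_stop_splitting(mixed_node))
    print(should_stop_splitting(nearly_pure_node))
```

Combining the tests with a logical OR means a node stops growing as soon as either condition holds; requiring both would give a more permissive rule that grows deeper trees.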