[D] is overfitting on a VERY small data set a necessary condition for a neural network to overfit or generalize on a big data set? • /r/MachineLearning
DRANK
How useful is it to test a new architecture on a very small data set, e.g. maybe only 1-5 samples? Let's say the architecture I want to test does...