What Happened
In July 2012, Geoffrey Hinton and colleagues proposed “dropout,” a training-time technique in which hidden units are randomly omitted on each training case, reducing co-adaptation among units and thereby overfitting in neural networks.
Why It Matters
Dropout became a widely adopted regularization method and a canonical example of “cheap, effective” training tricks that improved reliability—especially important as models grew larger and more expressive.
Technical Details
Dropout can be interpreted as implicitly training an exponentially large ensemble of subnetworks: each training pass samples a thinned network by dropping units independently with some probability. At inference time, running the full network with weights scaled by the keep probability cheaply approximates averaging the predictions of all those subnetworks.
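A minimal NumPy sketch of this train/inference asymmetry (an illustration, not the authors' implementation; the function name and fixed seed are for the example only):

```python
import numpy as np

def dropout_forward(x, p_drop=0.5, train=True, rng=None):
    """Dropout in the original 2012 formulation: drop units at train
    time; at inference, scale by the keep probability to approximate
    averaging over the ensemble of subnetworks."""
    keep = 1.0 - p_drop
    if train:
        rng = rng if rng is not None else np.random.default_rng(0)
        mask = rng.random(x.shape) < keep  # keep each unit independently
        return x * mask                    # a random thinned subnetwork
    return x * keep                        # weight-scaling approximation

x = np.ones(4)
train_out = dropout_forward(x, p_drop=0.5, train=True)   # some units zeroed
infer_out = dropout_forward(x, p_drop=0.5, train=False)  # all units at half strength
```

Modern frameworks usually apply the equivalent “inverted” dropout, dividing by the keep probability at train time instead, so inference needs no scaling.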