Linear Digressions

SMOTE: makin' yourself some fake minority data

July 30, 2016

Machine learning on imbalanced classes: surprisingly tricky. Many (most?) algorithms tend to just assign the majority class label to all the data and call it a day. SMOTE is an algorithm for manufacturing new minority class examples for yourself, to help your algorithm better identify them in the wild.
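For the curious, here is a rough sketch of the core idea in plain Python: pick a minority example, find its nearest minority neighbors, and drop a new point somewhere on the line segment between the example and one of those neighbors. The function name `smote_sketch`, its parameters (`n_new`, `k`, `seed`), and the toy data are made up for illustration; this is a minimal sketch of the interpolation step, not a reference implementation of the published algorithm.

```python
import math
import random


def smote_sketch(minority, n_new, k=5, seed=0):
    """Return n_new synthetic points built from a list of minority samples.

    Hypothetical helper for illustration: each synthetic point lies on the
    segment between a randomly chosen minority sample and one of its k
    nearest minority neighbors (Euclidean distance).
    """
    if len(minority) < 2:
        raise ValueError("need at least two minority samples to interpolate")
    rng = random.Random(seed)
    k = min(k, len(minority) - 1)  # cannot use more neighbors than exist

    synthetic = []
    for _ in range(n_new):
        # Pick a minority sample to start from.
        i = rng.randrange(len(minority))
        base = minority[i]

        # Rank the other minority samples by distance and keep the k closest.
        others = [p for j, p in enumerate(minority) if j != i]
        others.sort(key=lambda p: math.dist(base, p))
        neighbor = rng.choice(others[:k])

        # Interpolate: gap = 0 gives the base point, gap near 1 the neighbor.
        gap = rng.random()
        synthetic.append([b + gap * (q - b) for b, q in zip(base, neighbor)])
    return synthetic


# Toy minority class with two features (made-up numbers).
minority = [[1.0, 1.0], [1.2, 0.9], [0.8, 1.1], [1.1, 1.3]]
new_points = smote_sketch(minority, n_new=6, k=2)
```

The fake points get appended to the training set with the minority label, so the classifier sees a less lopsided class balance. Because each one sits between two real minority examples, they fill in the minority region instead of just duplicating existing rows.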

Relevant links:

  • SMOTE: synthetic minority over-sampling technique