Some classification problems do not have a balanced number of examples for each class label. For example, for a simple classification task, there might be 20 samples labelled frog, and 80 samples labelled airplane.

Stratification preserves this proportion of the each labels in each of the sets.

Stratified split is the correct choice when the data is skewed in terms of the labels, like the aforementioned example.

Last updated on Nov 07, 2022

Get to production reliably.

Hasty is a unified agile ML platform for your entire Vision AI pipeline — with minimal integration effort for you.

Start for free Check out our services