CI-MNIST

Correlated and Imbalanced MNIST

ImagesTabularTextsOpenIntroduced 2021-06-07

CI-MNIST (Correlated and Imbalanced MNIST) is a variant of MNIST dataset with introduced different types of correlations between attributes, dataset features, and an artificial eligibility criterion. For an input image xx, the label y1,0y \in \\{1, 0\\} indicates eligibility or ineligibility, respectively, given that xx is even or odd. The dataset defines the background colors as the protected or sensitive attribute s0,1s \in \\{0, 1\\}, where blue denotes the unprivileged group and red denotes the privileged group. The dataset was designed in order to evaluate bias-mitigation approaches in challenging setups and be capable of controlling different dataset configurations.