A synthetic dataset composed of two distinct clusters (rows 1:500 and 501:1000) whose values do not overlap on either predictor. One variable is informative of the other. The clusters are defined so that they have a specific shape.

Usage

dataset_6

Format

`dataset_6` A data frame with 1,000 rows and 2 columns:

X1

Continuous variable. By cluster: mean = 1, sd = 1 (rows 1:500); mean = 8, sd = 1 (rows 501:1000)

X2

Continuous variable. By cluster: mean = 2, sd = 1 (rows 1:500); mean = 9, sd = 1 (rows 501:1000)
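
The per-cluster means and standard deviations listed above can be checked with a short base R summary. The sketch below is illustrative only: it builds a stand-in data frame named `sim` (a hypothetical name) by drawing each column independently from a normal distribution within each cluster. That is an assumption, not the package's generating code, and it may not reproduce the shape or dependence structure of the real data. The assignment of rows 1:500 to the first cluster and 501:1000 to the second follows the description.

```r
# Stand-in for dataset_6. ASSUMPTION: independent normal draws per cluster,
# using the documented means and sds. This is not the package's own code.
set.seed(1)
n <- 500
sim <- data.frame(
  X1 = c(rnorm(n, mean = 1, sd = 1), rnorm(n, mean = 8, sd = 1)),
  X2 = c(rnorm(n, mean = 2, sd = 1), rnorm(n, mean = 9, sd = 1))
)

# Cluster membership is taken from row position (1:500, 501:1000).
cluster <- rep(c(1L, 2L), each = n)

# Per-cluster mean and standard deviation of each column.
cluster_means <- aggregate(sim, by = list(cluster = cluster), FUN = mean)
cluster_sds <- aggregate(sim, by = list(cluster = cluster), FUN = sd)

print(cluster_means)
print(cluster_sds)
```

With the package loaded, replacing `sim` with `dataset_6` should apply the same summary to the real data.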

Source

Synthetic