A synthetic dataset composed of two clusters (rows 1:500 and rows 501:1000). X1 and X2 each differ between the two clusters, so both variables are informative of cluster membership.

Usage

dataset_3

Format

`dataset_3`

A data frame with 1,000 rows and 3 columns:

X1

Categorical variable with levels A and B, distinct for each cluster

X2

Categorical variable with levels 1 and 2, distinct for each cluster
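
To illustrate what "distinct for each cluster" could look like in practice, here is a minimal sketch in R. It does not load the real `dataset_3`. It builds a hypothetical stand-in (`toy`) with only the two documented columns, and it assumes that "distinct" means each cluster takes exactly one level of each variable. Which level belongs to which cluster is also an assumption, since the documentation does not say.

```r
# Hypothetical stand-in for illustration only; not the packaged dataset_3.
# Cluster membership follows the documented row ranges: 1:500 and 501:1000.
cluster <- rep(c(1L, 2L), each = 500)

# ASSUMPTION: cluster 1 takes levels A / 1 and cluster 2 takes levels B / 2.
# The level-to-cluster mapping is not stated in the documentation.
toy <- data.frame(
  X1 = factor(ifelse(cluster == 1L, "A", "B"), levels = c("A", "B")),
  X2 = factor(ifelse(cluster == 1L, 1, 2), levels = c(1, 2))
)

# Cross-tabulate each variable against cluster membership.
# Under the assumption above, all counts fall on the diagonal.
tab_x1 <- table(cluster, toy$X1)
tab_x2 <- table(cluster, toy$X2)
print(tab_x1)
print(tab_x2)
```

The same cross-tabulation can be run on the real data by replacing `toy` with `dataset_3`, provided the rows are in the documented order.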

Source

Synthetic