A synthetic dataset comprised of two clusters (1:500, 501:1000), where X1 values overlap for both clusters, X2 values do not overlap for both clusters and X3 (categorical) is distinct between both clusters. Both X2 and X3 are informative of the cluster (not X1).