Same Stats, Different Graphs: Generating Datasets with Varied Appearance and Identical Statistics through Simulated Annealing | Autodesk Research
autodeskresearch.com/publications/samestats
These 13 datasets (the Datasaurus, plus 12 others) each have the same summary statistics (x/y mean, x/y standard deviation, and Pearson's correlation) to two decimal places, while being drastically different in appearance. This work describes the technique we developed to create this dataset, and others like it.
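As a rough illustration of what "the same summary statistics to two decimal places" means, the sketch below computes the five statistics named above for a set of (x, y) points and compares two result tuples. It is not the authors' code: the function names `summary_stats` and `same_stats` are hypothetical, and the use of the sample standard deviation and of rounding (rather than truncation) are assumptions.

```python
import math

def summary_stats(xs, ys):
    """Return (mean_x, mean_y, sd_x, sd_y, pearson_r) for paired samples."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Sums of squared deviations from the mean.
    ss_x = sum((x - mean_x) ** 2 for x in xs)
    ss_y = sum((y - mean_y) ** 2 for y in ys)
    # Assumption: sample standard deviation (n - 1 in the denominator).
    sd_x = math.sqrt(ss_x / (n - 1))
    sd_y = math.sqrt(ss_y / (n - 1))
    # Pearson's correlation: co-deviation over the product of deviations.
    co_dev = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    r = co_dev / math.sqrt(ss_x * ss_y)
    return (mean_x, mean_y, sd_x, sd_y, r)

def same_stats(a, b, places=2):
    """True if every statistic agrees after rounding to `places` decimals.

    Assumption: agreement is judged by rounding; truncation would be a
    slightly different criterion.
    """
    return all(round(p, places) == round(q, places) for p, q in zip(a, b))
```

Two datasets would count as matching when `same_stats(summary_stats(x1, y1), summary_stats(x2, y2))` returns `True`, however different their scatter plots look.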
jvns/pandas-cookbook: Recipes for using Python's pandas library
github.com/jvns/pandas-cookbook
pandas-cookbook - Recipes for using Python's pandas library