Same Stats, Different Graphs: Generating Datasets with Varied Appearance and Identical Statistics through Simulated Annealing | Autodesk Research
autodeskresearch.com/publications/samestatsThese 13 datasets (the Datasaurus, plus 12 others) each have the same summary statistics (x/y mean, x/y standard deviation, and Pearson's correlation) to two decimal places, while being drastically different in appearance. This work describes the technique we developed to create this dataset, and others like it.
jvns/pandas-cookbook: Recipes for using Python's pandas library
github.com/jvns/pandas-cookbookpandas-cookbook - Recipes for using Python's pandas library