Same Stats, Different Graphs: Generating Datasets with Varied Appearance and Identical Statistics through Simulated Annealing | Autodesk Researchautodeskresearch.com/publications/samestats
These 13 datasets (the Datasaurus, plus 12 others) each have the same summary statistics (x/y mean, x/y standard deviation, and Pearson's correlation) to two decimal places, while being drastically different in appearance. This work describes the technique we developed to create this dataset, and others like it.
A visual introduction to probability and statistics.
Surprise Maps: Showing the Unexpected – Mediummedium.com/@uwdata/surprise-maps-showing-the-unexpected-e92b67398865
In 1977, Jerry Ehman — an astronomer working with the SETI project to seek out alien life — came across an interesting radio signal, one needle in the haystack of all of the electromagnetic signals…
Anthropometrics of Fit – Research – Herman Millerhermanmiller.com/research/solution-essays/anthropometrics-of-fit.html
How the Aeron chair's three sizes and accommodating seat and backrest material provide a personal fit for any individual.
Testing hypotheses suggested by the data - Wikipedia, the free encyclopediaen.wikipedia.org/wiki/Testing_hypotheses_suggested_by_the_data
Hypotheses suggested by the data, if tested using the data set that suggested them, are likely to be accepted even when they are not true. This is because circular reasoning (double dipping) would be involved: something seems true in the limited data set, therefore we hypothesize that it is true in general, therefore we (wrongly) test it on the same limited data set, which seems to confirm that it is true.