The #RStats version of rickrolling is hiding the #DatasaurusDozen in a sample dataset.

@edwiebe @science

If you are interested, please see also this thread on the importance of visualizing data (the Anscombe's quartet, Simpson's paradox are also included in @LabPlot):

https://mstdn.social/@onemoment/109692198312380103

#Anscombe #SimpsonsParadox #DatasaurusDozen #Visualization #DataViz

Onemoment :verified: (@[email protected])

Attached: 1 image The importance of visualizing data. Part 1. Anscombe's quartet comprises four data sets that have nearly identical simple descriptive statistics, yet have very different distributions and appear very different when graphed. The quartet is intended to counter the impression that "numerical calculations are exact, but graphs are rough." The example is available in @[email protected] via File > Open Example. #LabPlot #OpenSource #DataViz #Visualization #Statistics #DataAnalysis #AnscombesQuartet

Mastodon ๐Ÿ˜

The importance of visualizing data. Part 2.

The Datasaurus Dozen contains 12 datasets that are equal in standard measures: mean, standard deviation, and Pearson's correlation.

Matejka, J., & Fitzmaurice, G. (2017). Same Stats, Different Graphs: Generating Datasets with Varied Appearance and Identical Statistics through Simulated Annealing.

The example is available in @LabPlot via File > Open Example.

#LabPlot #OpenSource #DataViz #Visualization #Statistics #DataAnalysis #DatasaurusDozen