Adjusting Point Size to Facilitate More Accurate Correlation Perception in Scatterplots

Gabriel Strain, Andrew James Stewart, Paul A Warren, Caroline Jay

Examples of the experimental stimuli, shown here with an r value of 0.6. When 150 participants rated the correlation in scatterplots spanning 45 r values, they were most accurate in the non-linear decay condition.

Viewers consistently underestimate correlation in positively correlated scatterplots. We use a novel point-size manipulation to correct for this bias. In a high-powered and fully reproducible study, we demonstrate that decreasing the size of each point in a scatterplot as a function of its distance from the regression line corrects a systematic perceptual bias long documented in the literature. We recommend using our technique when designing scatterplots that aim to communicate positive correlations.
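
The manipulation described above, in which points shrink as they lie farther from the regression line, can be illustrated with a minimal sketch. This is not the study's implementation. The function names (`fit_line`, `point_sizes`), the use of vertical residuals as the distance measure, the exponential form of the decay, and the `max_size` and `decay` parameters are all assumptions made for illustration.

```python
def fit_line(xs, ys):
    """Ordinary least-squares slope and intercept of y on x."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x


def point_sizes(xs, ys, max_size=40.0, decay=0.25):
    """Return one marker size per point, shrinking with distance from the fit.

    Assumptions (hypothetical, not taken from the study):
    - distance is the absolute vertical residual from the OLS line;
    - size falls off exponentially, from max_size on the line
      to max_size * decay at the largest residual.
    """
    slope, intercept = fit_line(xs, ys)
    residuals = [abs(y - (slope * x + intercept)) for x, y in zip(xs, ys)]
    largest = max(residuals) or 1.0  # avoid dividing by zero on a perfect fit
    return [max_size * decay ** (r / largest) for r in residuals]


# Small illustrative data set: the last point pulls away from the trend.
xs = [0.0, 1.0, 2.0, 3.0, 4.0]
ys = [0.0, 1.0, 2.0, 3.0, 10.0]
sizes = point_sizes(xs, ys)
```

The resulting list can be passed to a plotting library's marker-size argument, for example the `s` parameter of matplotlib's `scatter`. Other decay shapes, such as a linear falloff, can be substituted by changing the final expression in `point_sizes`.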