{"id":62871,"date":"2012-05-31T00:00:00","date_gmt":"2012-05-31T00:00:00","guid":{"rendered":"https:\/\/rockcontent.com\/blog\/the-pros-and-cons-of-scatterplots\/"},"modified":"2025-09-15T15:35:09","modified_gmt":"2025-09-15T18:35:09","slug":"the-pros-and-cons-of-scatterplots","status":"publish","type":"post","link":"https:\/\/pingback.com\/en\/resources\/the-pros-and-cons-of-scatterplots\/","title":{"rendered":"The Pros and Cons of Scatterplots"},"content":{"rendered":"<p>Scatterplots may not be used too often in infographics, but they definitely have their place. <\/p>\n<p>They can show large quantities of data and make it easy to see correlations between variables and clustering effects. <\/p>\n<p>As a quick overview and analytical tool, scatterplots are invaluable and work with almost any continuous scale data. <\/p>\n<p>Unfortunately, scatterplots aren&#8217;t always great for presentation. Several problems occur frequently, and it&#8217;s best to be aware of each when using scatterplots for analysis or presentation. <\/p>\n<p>A scatterplot works by placing one dimension on the vertical axis and a different dimension on the horizontal axis. <\/p>\n<p>Each piece of data is represented by a point on the chart. Variations on scatterplots introduce differently shaped or colored points for categories and differently sized points for quantitative data. <\/p>\n<p>Occasionally, people use pie charts as the points in scatterplots to show even more data with a <a href=\"https:\/\/blog.visual.ly\/the-whole-story-on-part-to-whole-relationships\/\" target=\"_blank\" rel=\"noopener noreferrer\">part-whole<\/a> relationship.<\/p>\n<figure class=\"wp-block-image\"><a href=\"https:\/\/s3.amazonaws.com\/scribblelive-com-prod\/wp-content\/uploads\/2012\/05\/header3.png\" target=\"_blank\" rel=\"noopener noreferrer\"><img decoding=\"async\" src=\"https:\/\/s3.amazonaws.com\/scribblelive-com-prod\/wp-content\/uploads\/2012\/05\/header3-618x294.png\" alt=\"\" class=\"wp-image-4783\" title=\"Scatterplots\" \/><\/a><\/figure>\n<p>The major cause of problems with scatterplots is discretization of values. <\/p>\n<p>This happens when decimal places are rounded off, measurements are not accurate enough, or a data field is categorical. <\/p>\n<p>The scatterplot below uses a <a href=\"https:\/\/lib.stat.cmu.edu\/datasets\/cars.data\" target=\"_blank\" rel=\"noopener noreferrer\">standardized dataset about cars<\/a>. <\/p>\n<p>The problems with this scatterplot all derive from the x-axis; number of cylinders. There are so few values that cylinders is really a categorical scale being represented using numbers. <\/p>\n<p>This causes overplotting problems so there are hundreds of values all stacked on top of each other. <\/p>\n<p>This makes it difficult to see the full quantity of values in the dataset, and correlation and clustering is harder to find with so few possible values on the x-axis.<\/p>\n<figure class=\"wp-block-image\"><a href=\"https:\/\/s3.amazonaws.com\/scribblelive-com-prod\/wp-content\/uploads\/2012\/05\/discretization.png\" target=\"_blank\" rel=\"noopener noreferrer\"><img decoding=\"async\" src=\"https:\/\/s3.amazonaws.com\/scribblelive-com-prod\/wp-content\/uploads\/2012\/05\/discretization.png\" alt=\"\" class=\"wp-image-4764\" title=\"discretization\" \/><\/a><\/figure>\n<p>If you are dead-set on a scatterplot, there is not much you can do to remedy such a severe case of discretization, but in slightly better cases, there are some possible fixes. <\/p>\n<p>Translucency is a powerful tool for dealing with overplotting. <\/p>\n<p>Another possible mitigation technique is removing the fill of the mark. Both methods have advantages and disadvantages, and the combination of the two can also be useful. <\/p>\n<p>In practice, it\u2019s often a bit of trial and error. You fiddle around with the transparency slider and sometimes it just&#8230; doesn\u2019t help as much as you hoped, especially when points are stuck right on top of each other. And if you\u2019re working with some clunky data visualization software (not naming names), \u201cremoving the fill\u201d might turn your dataset into a blob of ghostly rings, which can be more confusing than enlightening. Still\u2014every so often, using both translucency and hollow points will at least give you a fighting chance to see overlapping data, even if it never feels perfect.<\/p>\n<p>There\u2019s also the reality that audiences don\u2019t always \u201cget\u201d scatterplots straight away. Folks used to bar charts or line graphs might find themselves squinting at a mass of tiny dots, wondering if they missed the point (pun intended). If I had a dollar for every time someone asked where the trend line is, I could probably buy better graphing software. But sometimes that\u2019s the cost of nuance\u2014scatterplots reveal complicated stuff, only if you know where to look and what questions to ask.<\/p>\n<p>Unfortunately, these methods are not a cure-all solution. It is still possible to have so many points or perfectly aligned points that pile up beyond the opacity range.<\/p>\n<figure class=\"wp-block-image\"><a href=\"https:\/\/s3.amazonaws.com\/scribblelive-com-prod\/wp-content\/uploads\/2012\/05\/overplottingMitigation.png\" target=\"_blank\" rel=\"noopener noreferrer\"><img decoding=\"async\" src=\"https:\/\/s3.amazonaws.com\/scribblelive-com-prod\/wp-content\/uploads\/2012\/05\/overplottingMitigation.png\" alt=\"\" class=\"wp-image-4765\" title=\"Overplotting Mitigation\" \/><\/a><\/figure>\n<p>Ideally, avoiding data dimensions with low precision or few unique values is the best way to prevent these problems. <\/p>\n<p>Sometimes data just doesn&#8217;t belong in a scatterplot and you should visualize another dimension instead.<\/p>\n<p>In the case below, two continuous scales are shown and the overall shape of the group indicates negative correlation between the two dimensions.<\/p>\n<p><a href=\"https:\/\/s3.amazonaws.com\/scribblelive-com-prod\/wp-content\/uploads\/2012\/05\/correlationContinuous.png\" target=\"_blank\" rel=\"noopener noreferrer\"><img fetchpriority=\"high\" fetchpriority=\"high\" fetchpriority=\"high\" fetchpriority=\"high\" decoding=\"async\" title=\"correlationContinuous\" width=\"618\" height=\"595\" class=\"alignnone size-full wp-image-4777\" src=\"https:\/\/s3.amazonaws.com\/scribblelive-com-prod\/wp-content\/uploads\/2012\/05\/correlationContinuous.png\" alt=\"\"><\/a> &nbsp; <\/p>\n<p>If you really need to show categorical data, consider visually encoding it as color. <\/p>\n<p>The following chart does have dimensions with lower unique value counts (data from <a href=\"https:\/\/en.wikipedia.org\/wiki\/Iris_flower_data_set\" target=\"_blank\" rel=\"noopener noreferrer\">Fisher&#8217;s Iris Data<\/a>), however it does a good job of showing how color can help call out clusters. <\/p>\n<p><a href=\"https:\/\/s3.amazonaws.com\/scribblelive-com-prod\/wp-content\/uploads\/2012\/05\/clustering.png\"><img decoding=\"async\" title=\"clustering\" width=\"616\" height=\"595\" class=\"alignnone size-full wp-image-4773\" src=\"https:\/\/s3.amazonaws.com\/scribblelive-com-prod\/wp-content\/uploads\/2012\/05\/clustering.png\" alt=\"\"><\/a> &nbsp;<\/p>\n<p>Scatterplots definitely have limitations, most of which come from characteristics of the data. <\/p>\n<p>When used correctly, however, they are great for overviews, finding outliers, and for showing patterns between some dimensions. For a data visualizer, a responsibly used scatterplot can be a very valuable tool. &nbsp; <\/p>\n<p><em><a href=\"https:\/\/twitter.com\/#!\/SeeingStructure\" target=\"_blank\" rel=\"noopener noreferrer\">Drew Skau<\/a> is a scatterbrained PhD Computer Science Visualization student at <a href=\"https:\/\/www.uncc.edu\/\" target=\"_blank\" rel=\"noopener noreferrer\">UNCC<\/a>, with an undergraduate degree in Architecture.<\/em><\/p>\n<div class=\"wp-block-image\">\n<figure class=\"aligncenter size-large\"><a href=\"https:\/\/resources.rockcontent.com\/communicate-complex-data?utm_source=rockcontent-blog&amp;utm_medium=referral\" target=\"_blank\" rel=\"noreferrer noopener\"><img decoding=\"async\" src=\"https:\/\/s3.amazonaws.com\/scribblelive-com-prod\/wp-content\/uploads\/2020\/11\/Communicate-complex-data.png\" alt=\"Guide to visually communicate complex data - Promotional Banner\" class=\"wp-image-13563\"><\/a><\/figure>\n<\/div>\n<p> }}<\/p>\n","protected":false},"excerpt":{"rendered":"<p>Scatterplots may not be used too often in infographics, but they definitely have their place. They can show large quantities of data and make it easy to see correlations between variables and clustering effects. As a quick overview and analytical tool, scatterplots are invaluable and work with almost any continuous scale data. Unfortunately, scatterplots aren&#8217;t [&hellip;]<\/p>\n","protected":false},"author":1,"featured_media":53027,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[26],"tags":[],"class_list":["post-62871","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-uncategorized"],"yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v25.5 - https:\/\/yoast.com\/wordpress\/plugins\/seo\/ -->\n<title>The Pros and Cons of Scatterplots - Pingback<\/title>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/pingback.com\/en\/resources\/the-pros-and-cons-of-scatterplots\/\" \/>\n<meta property=\"og:locale\" content=\"pt_BR\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"The Pros and Cons of Scatterplots - Pingback\" \/>\n<meta property=\"og:description\" content=\"Scatterplots may not be used too often in infographics, but they definitely have their place. They can show large quantities of data and make it easy to see correlations between variables and clustering effects. As a quick overview and analytical tool, scatterplots are invaluable and work with almost any continuous scale data. Unfortunately, scatterplots aren&#8217;t [&hellip;]\" \/>\n<meta property=\"og:url\" content=\"https:\/\/pingback.com\/en\/resources\/the-pros-and-cons-of-scatterplots\/\" \/>\n<meta property=\"og:site_name\" content=\"Pingback\" \/>\n<meta property=\"article:published_time\" content=\"2012-05-31T00:00:00+00:00\" \/>\n<meta property=\"article:modified_time\" content=\"2025-09-15T18:35:09+00:00\" \/>\n<meta name=\"author\" content=\"Carolina\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:label1\" content=\"Escrito por\" \/>\n\t<meta name=\"twitter:data1\" content=\"Carolina\" \/>\n\t<meta name=\"twitter:label2\" content=\"Est. tempo de leitura\" \/>\n\t<meta name=\"twitter:data2\" content=\"4 minutos\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\/\/schema.org\",\"@graph\":[{\"@type\":\"WebPage\",\"@id\":\"https:\/\/pingback.com\/en\/resources\/the-pros-and-cons-of-scatterplots\/\",\"url\":\"https:\/\/pingback.com\/en\/resources\/the-pros-and-cons-of-scatterplots\/\",\"name\":\"The Pros and Cons of Scatterplots - Pingback\",\"isPartOf\":{\"@id\":\"https:\/\/pingback.com\/en\/resources\/#website\"},\"primaryImageOfPage\":{\"@id\":\"https:\/\/pingback.com\/en\/resources\/the-pros-and-cons-of-scatterplots\/#primaryimage\"},\"image\":{\"@id\":\"https:\/\/pingback.com\/en\/resources\/the-pros-and-cons-of-scatterplots\/#primaryimage\"},\"thumbnailUrl\":\"\",\"datePublished\":\"2012-05-31T00:00:00+00:00\",\"dateModified\":\"2025-09-15T18:35:09+00:00\",\"author\":{\"@id\":\"https:\/\/pingback.com\/en\/resources\/#\/schema\/person\/5931a4533700c840b9f38199581abc33\"},\"breadcrumb\":{\"@id\":\"https:\/\/pingback.com\/en\/resources\/the-pros-and-cons-of-scatterplots\/#breadcrumb\"},\"inLanguage\":\"pt-BR\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\/\/pingback.com\/en\/resources\/the-pros-and-cons-of-scatterplots\/\"]}]},{\"@type\":\"ImageObject\",\"inLanguage\":\"pt-BR\",\"@id\":\"https:\/\/pingback.com\/en\/resources\/the-pros-and-cons-of-scatterplots\/#primaryimage\",\"url\":\"\",\"contentUrl\":\"\"},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\/\/pingback.com\/en\/resources\/the-pros-and-cons-of-scatterplots\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"In\u00edcio\",\"item\":\"https:\/\/pingback.com\/blog\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"The Pros and Cons of Scatterplots\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\/\/pingback.com\/en\/resources\/#website\",\"url\":\"https:\/\/pingback.com\/en\/resources\/\",\"name\":\"Pingback\",\"description\":\"Marketing for builders\",\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\/\/pingback.com\/en\/resources\/?s={search_term_string}\"},\"query-input\":{\"@type\":\"PropertyValueSpecification\",\"valueRequired\":true,\"valueName\":\"search_term_string\"}}],\"inLanguage\":\"pt-BR\"},{\"@type\":\"Person\",\"@id\":\"https:\/\/pingback.com\/en\/resources\/#\/schema\/person\/5931a4533700c840b9f38199581abc33\",\"name\":\"Carolina\",\"image\":{\"@type\":\"ImageObject\",\"inLanguage\":\"pt-BR\",\"@id\":\"https:\/\/pingback.com\/en\/resources\/#\/schema\/person\/image\/\",\"url\":\"https:\/\/secure.gravatar.com\/avatar\/70cde532238b4f8bf4a6e7e589ff0a259eda38fa966564ca7ed7d23e61c27774?s=96&d=mm&r=g\",\"contentUrl\":\"https:\/\/secure.gravatar.com\/avatar\/70cde532238b4f8bf4a6e7e589ff0a259eda38fa966564ca7ed7d23e61c27774?s=96&d=mm&r=g\",\"caption\":\"Carolina\"},\"sameAs\":[\"https:\/\/pingback.com\"],\"url\":\"https:\/\/pingback.com\/en\/resources\/author\/adm1n\/\"}]}<\/script>\n<!-- \/ Yoast SEO plugin. -->","yoast_head_json":{"title":"The Pros and Cons of Scatterplots - Pingback","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/pingback.com\/en\/resources\/the-pros-and-cons-of-scatterplots\/","og_locale":"pt_BR","og_type":"article","og_title":"The Pros and Cons of Scatterplots - Pingback","og_description":"Scatterplots may not be used too often in infographics, but they definitely have their place. They can show large quantities of data and make it easy to see correlations between variables and clustering effects. As a quick overview and analytical tool, scatterplots are invaluable and work with almost any continuous scale data. Unfortunately, scatterplots aren&#8217;t [&hellip;]","og_url":"https:\/\/pingback.com\/en\/resources\/the-pros-and-cons-of-scatterplots\/","og_site_name":"Pingback","article_published_time":"2012-05-31T00:00:00+00:00","article_modified_time":"2025-09-15T18:35:09+00:00","author":"Carolina","twitter_card":"summary_large_image","twitter_misc":{"Escrito por":"Carolina","Est. tempo de leitura":"4 minutos"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"WebPage","@id":"https:\/\/pingback.com\/en\/resources\/the-pros-and-cons-of-scatterplots\/","url":"https:\/\/pingback.com\/en\/resources\/the-pros-and-cons-of-scatterplots\/","name":"The Pros and Cons of Scatterplots - Pingback","isPartOf":{"@id":"https:\/\/pingback.com\/en\/resources\/#website"},"primaryImageOfPage":{"@id":"https:\/\/pingback.com\/en\/resources\/the-pros-and-cons-of-scatterplots\/#primaryimage"},"image":{"@id":"https:\/\/pingback.com\/en\/resources\/the-pros-and-cons-of-scatterplots\/#primaryimage"},"thumbnailUrl":"","datePublished":"2012-05-31T00:00:00+00:00","dateModified":"2025-09-15T18:35:09+00:00","author":{"@id":"https:\/\/pingback.com\/en\/resources\/#\/schema\/person\/5931a4533700c840b9f38199581abc33"},"breadcrumb":{"@id":"https:\/\/pingback.com\/en\/resources\/the-pros-and-cons-of-scatterplots\/#breadcrumb"},"inLanguage":"pt-BR","potentialAction":[{"@type":"ReadAction","target":["https:\/\/pingback.com\/en\/resources\/the-pros-and-cons-of-scatterplots\/"]}]},{"@type":"ImageObject","inLanguage":"pt-BR","@id":"https:\/\/pingback.com\/en\/resources\/the-pros-and-cons-of-scatterplots\/#primaryimage","url":"","contentUrl":""},{"@type":"BreadcrumbList","@id":"https:\/\/pingback.com\/en\/resources\/the-pros-and-cons-of-scatterplots\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"In\u00edcio","item":"https:\/\/pingback.com\/blog\/"},{"@type":"ListItem","position":2,"name":"The Pros and Cons of Scatterplots"}]},{"@type":"WebSite","@id":"https:\/\/pingback.com\/en\/resources\/#website","url":"https:\/\/pingback.com\/en\/resources\/","name":"Pingback","description":"Marketing for builders","potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/pingback.com\/en\/resources\/?s={search_term_string}"},"query-input":{"@type":"PropertyValueSpecification","valueRequired":true,"valueName":"search_term_string"}}],"inLanguage":"pt-BR"},{"@type":"Person","@id":"https:\/\/pingback.com\/en\/resources\/#\/schema\/person\/5931a4533700c840b9f38199581abc33","name":"Carolina","image":{"@type":"ImageObject","inLanguage":"pt-BR","@id":"https:\/\/pingback.com\/en\/resources\/#\/schema\/person\/image\/","url":"https:\/\/secure.gravatar.com\/avatar\/70cde532238b4f8bf4a6e7e589ff0a259eda38fa966564ca7ed7d23e61c27774?s=96&d=mm&r=g","contentUrl":"https:\/\/secure.gravatar.com\/avatar\/70cde532238b4f8bf4a6e7e589ff0a259eda38fa966564ca7ed7d23e61c27774?s=96&d=mm&r=g","caption":"Carolina"},"sameAs":["https:\/\/pingback.com"],"url":"https:\/\/pingback.com\/en\/resources\/author\/adm1n\/"}]}},"_links":{"self":[{"href":"https:\/\/pingback.com\/en\/resources\/wp-json\/wp\/v2\/posts\/62871","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/pingback.com\/en\/resources\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/pingback.com\/en\/resources\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/pingback.com\/en\/resources\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/pingback.com\/en\/resources\/wp-json\/wp\/v2\/comments?post=62871"}],"version-history":[{"count":3,"href":"https:\/\/pingback.com\/en\/resources\/wp-json\/wp\/v2\/posts\/62871\/revisions"}],"predecessor-version":[{"id":128712,"href":"https:\/\/pingback.com\/en\/resources\/wp-json\/wp\/v2\/posts\/62871\/revisions\/128712"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/pingback.com\/en\/resources\/wp-json\/"}],"wp:attachment":[{"href":"https:\/\/pingback.com\/en\/resources\/wp-json\/wp\/v2\/media?parent=62871"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/pingback.com\/en\/resources\/wp-json\/wp\/v2\/categories?post=62871"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/pingback.com\/en\/resources\/wp-json\/wp\/v2\/tags?post=62871"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}