appeal to all of the hype lately inside information science, however I’d argue they’re each secondary to a extra necessary—and often-ignored—part of the sphere.
When coping with information, there are two important steps:
- Processing and analyzing the info to extract significant insights.
- Conveying these insights to others.
The second level is essential and infrequently neglected. The world’s most superior algorithm or useful perception is ineffective if nobody can perceive it. As a knowledge scientist, you need to be taught to convey your insights to others. There may be multiple motive for this, with the obvious one being that if the best folks perceive the info, the world at giant will profit. Nonetheless, there may be one other equally necessary motive: It’s typically in describing our findings to others that we uncover errors, extra profound information, or additional areas for exploration.
On this article, we’ll look at a robust and efficient instrument which will help obtain the second step above: information visualization. That is the primary in a collection of articles that may take absolute newbies deep into the realm of information visualization. This primary article is basic and lightweight, meant as an introduction to the sphere as a complete. In later articles, I’ll get into the extra technical facets, finally concluding by educating you the best way to construct your individual information visualizations.
With that information, you’ll be armed to deal with your information in new, thrilling methods.
“The best worth of an image is when it forces us to note what we by no means anticipated to see.” –John Tukey
What Counts as a Knowledge Visualization?
Many individuals view information visualization via a restricted lens, solely classifying normal graphs, resembling bar charts, line charts, and the like, as true information visualizations. Seen from this attitude, information visualization didn’t materialize till the center of the 18th century. (We’ll see some examples under.)
Nonetheless, we might do properly to broaden our minds. Visible transformations of information are not at all restricted to our conventional concepts. They’ve been round for 1000’s of years. For instance, right here is the Imago Mundi [1], the oldest identified map on the earth, found as a relic of the traditional metropolis of Babylon:
This map locations Babylon on the heart and was doubtless a particularly useful gizmo for visualizing what we now formally name geospatial information. It is among the world’s earliest information visualizations.
There are a plethora of comparable figures and pictures from numerous historical civilizations—cave work, calendars, stone carvings, even Egyptian hieroglyphics—these are all successfully visible representations of information that had been obscure of their preliminary type. Viewing these examples as information visualizations leads us to an necessary precept:
At its core, information visualization is nothing greater than taking some information—be it numerical, textual, or in any other case—and making use of a change to signify it visually.
This foundational precept results in a number of associated matters primarily involving the simplest strategies to conduct these transformations, the place efficient loosely interprets to “sincere, straightforward to grasp, and informative.”
Early Examples of Knowledge Visualizations
Now that now we have broadened our views regarding what constitutes a knowledge visualization, allow us to check out some trendy examples. Under is a chart from 1644 developed by Michael Florent Van Langren [2]. It is among the earliest graphical representations of what we contemplate to be conventional statistical information, depicting estimates of the distinction in longitude between Rome and Toledo.

Let’s contemplate a extra concerned instance subsequent—one which immediately highlights Tukey’s quote above.
Under is a map of London’s Soho District in 1854 [3]. It was designed by John Snow so as to decide if there have been any patterns within the cholera outbreak that was debilitating the city on the time:

Trying towards the middle of the map, we will see an exceptionally giant variety of deaths close to the water pump on Broad Road. An investigation decided that this pump was contaminated and was a serious explanation for the unfold of the illness.
This instance highlights precisely the precept from John Tukey we famous above: Among the best makes use of of information visualization is to rapidly see insights which can be tough to search out within the information’s preliminary type.
Precision and Flexibility
Knowledge visualization is a broad and deep subject that may be approached in some ways. That stated, there are two ideas that it is best to remember regardless of the particular type of information visualization you interact in: precision and flexibility.
A very good information visualization doesn’t attempt to accomplish ill-defined duties, resembling displaying the essence of or summarizing the whole lot necessary a couple of information set. Statements like these are subjective and basically unimaginable to attain.
Quite, information visualization highlights a selected and well-defined side of the related information in a method that makes it simpler to grasp for the person. It is best to at all times articulate precisely what you need to specific about your information earlier than you even start designing a visualization.
To internalize this precept, it’s useful to recall what the aim of a knowledge visualization is to start with: to show insights from a knowledge set in a transparent and helpful method. We need to make the info simpler to grasp. Being exact ensures we obtain this objective. A visualization that makes an attempt to do an excessive amount of may find yourself complicated the viewer much more. It’s significantly better to provide a visualization which covers much less information in a clearer method. High quality is extra necessary than amount.
Check out the info desk under, which comprises details about salaries from completely different cities round america.
Title | Metropolis | Earnings | Occupation |
---|---|---|---|
Sarah Mitchell | Denver, CO | $72,500 | Advertising Supervisor |
Jamal Rodriguez | Houston, TX | $58,300 | Electrician |
Priya Desai | Seattle, WA | $91,200 | Software program Engineer |
Thomas Nguyen | Chicago, IL | $64,800 | Nurse |
Which of the next is the higher visualization alternative for the above information?
- A visualization that makes an attempt to simplify the knowledge within the information desk utilizing a bar chart that has names on one axis and salaries on the opposite axis, makes use of colour to distinguish amongst cities, and makes use of a texture on the bars (dashed strains, diagonal strains, and so forth.) to tell apart amongst careers.
- The identical visualization as above, however this time excluding the majors. In different phrases, a bar chart of names and salaries which colours the bars based mostly on location.
It’s tempting to decide on the primary one, however the truth is, it tries to do an excessive amount of. Higher to show restricted, focused data than to confuse your viewers.
Along with being exact, sustaining flexibility can be necessary. There isn’t a such factor as an ideal information visualization. There may be at all times room for enchancment, and information visualizations usually turn out to be higher with every revision. After all, in some unspecified time in the future, a knowledge visualization should be shared with others and serve its objective.
This results in a quandary—how a lot revision is sufficient revision? There isn’t a definitive reply to this query. The method of revising a visualization should be undertaken with care. Asking too many individuals for recommendation will doubtless lead to a bunch of half-baked, conflicting opinions. However, publishing the primary draft of a visualization—i.e., not revising it in any respect—is prone to result in a subpar outcome.
Though there isn’t any excellent resolution, there are just a few tips you’ll be able to comply with:
- Determine 2-3 folks to offer you suggestions in your visualization.
- Strive to make sure your record of individuals encompasses the next:
- A reviewer who’s proficient in designing information visualizations
- A reviewer who has a powerful understanding of the info that’s getting used to develop the visualization (e.g., a political scientist for election information)
- A reviewer who’s a part of the meant viewers for the visualization
- Undergo 2-3 rounds of suggestions and revision with this similar record of individuals. This may be certain that enhancements to the visualization are steady and logical.
Closing Ideas and Trying Ahead
In some ways, information visualization is akin to writing. Even probably the most prolific and gifted authors have editors, and their books undergo in depth revision earlier than being permitted for publishing. Why? For the straightforward motive that good writing is essentially depending on the viewers, and thoroughly curated revision ensures the perfect expertise for the eventual readers of a guide. The identical concept applies to information visualization.
By following these tips, you’ll be able to make sure you develop a strong information visualization which is grounded in greatest practices, accurately shows the info at hand, and is comprehensible for the meant viewers.
They’re the important thing to efficient information visualization, and the muse for superior visualization methods that will likely be mentioned in future articles. Till then.
References
[1] https://commons.wikimedia.org/wiki/File:The_Babylonian_map_of_the_world,_from_Sippar,_Mesopotamia..JPG
[2] The Visible Show of Quantitative Info, Edward Tufte
[3] https://picryl.com/media/snow-cholera-map-1-cbadea