Graphics are one of the primary tools for science and data communication. They are powerful because they make use of our visual system in a way that off-loads much of the work of processing the data, freeing up cognitive resources to consider the content rather than the representation. Unfortunately, not everyone can leverage their visual system in this way, due to differences in the visual system itself or in information processing within the brain. A wide range of issues may impact how effectively people can use graphics, including colorblindness, poor visual acuity, blindness, dyslexia, dyscalculia, and even differences in data literacy and numerical literacy.

It can be useful to introspect while looking at a chart and consider what perceptual and cognitive resources are required to complete each step of reading it. Ultimately, as scientists and people working with data, we need to work to make our data representations accessible. Which populations we focus on, and how we adapt existing representations (or create new ones), depends on the target audience(s) and will differ across disciplines. For instance, if you are designing graphics to be used in Air Traffic Controller training, you likely do not need to accommodate Blind and Low Vision (BLV) individuals or even consider colorblindness. However, if you are creating graphics for a general web audience, it is important to consider a range of visual impairments and accommodations.
This is an active area of research in data science, information visualization, and design. New developments appear regularly, so it is always worth checking for solutions that have come out recently.
Adapting Existing Graphics
One of the simplest ways to increase accessibility of graphics is to ensure that they meet basic guidelines for discoverability and distinguishability. Discoverability ensures that screen-reader users know that the graphic exists; this requires designing the entire web page with these users in mind, but is critical to ensuring equal access to information for blind and low-vision users. Creating graphics which adhere to contrast, color selection, font size, and other distinguishability guidelines helps low-vision people, those with sensory processing issues, and people with colorblindness use existing data representations effectively.
Discoverability
Alt Text
When retrofitting an existing page for accessibility, it may not be possible to make charts and graphics fully accessible to individuals who are blind or using screen-readers. In these cases, it is important to write good alt-text for each graph that is intended to convey information (it is fine to skip alt-text for purely decorative images).
Good alt-text is:

- concise
- accurate
- relevant
- context-dependent: the same image may require different alt-text depending on the broader context of the web page.
Graphs are some of the hardest images to fully describe in alt-text, in part because providing the same information that the image provides may require thousands of words, full data tables, or other accommodations. The alt text field in HTML does not allow for paragraphs, line breaks, and other structural elements; as a result, it is often better to include a short description in the alt-text field and a longer description (or table) as part of the web page source. Linking the alt-text and the longer description together may facilitate keyboard navigation between the two, making the navigation process less cognitively intensive for screen reader users.
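One common pattern is to keep the alt-text short and point to the longer description with aria-describedby. Here is a sketch in R using the htmltools package; the image file, element id, and description text are hypothetical:

```r
library(htmltools)

# Short alt text on the <img>, longer prose linked via aria-describedby
chart <- div(
  img(
    src = "penguin-scatter.png",  # hypothetical image file
    alt = "Scatterplot of penguin bill length vs. bill depth, colored by species.",
    `aria-describedby` = "penguin-scatter-desc"
  ),
  p(
    id = "penguin-scatter-desc",
    "Bill length (x axis) is plotted against bill depth (y axis) for 342
    penguins. Within each of the three species, the relationship is positive,
    and each species occupies a distinct region of the plot."
  )
)
browsable(chart)  # preview the generated HTML
```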
The BrailleR R package integrates with many common R plotting functions (including base graphics and ggplot2) and can generate some functional alt-text automatically.
BrailleR alt-text demo
```r
library(BrailleR)
data <- read.csv("../data/penguins.csv")
library(ggplot2)
scatterplot <- ggplot(data, aes(x = bill_length_mm, y = bill_depth_mm,
                                color = species)) +
  geom_point()
scatterplot
```
Warning: Removed 2 rows containing missing values or values outside the scale range
(`geom_point()`).
This is an untitled chart with no subtitle or caption.
It has x-axis 'bill_length_mm' with labels 40, 50 and 60.
It has y-axis 'bill_depth_mm' with labels 15.0, 17.5 and 20.0.
There is a legend indicating colour is used to show species, with 3 levels:
Adelie shown as strong reddish orange colour,
Chinstrap shown as vivid yellowish green colour and
Gentoo shown as brilliant blue colour.
The chart is a set of 342 big solid circle points of which about 97% can be seen.
Page Structure
The entire web-page structure is important to consider when designing to include screen-reader users. Sighted users may be able to take in the web page structure visually and determine which elements to focus on; screen reader users have to take in the page structure audibly, and in sequence. Some users describe it as “viewing a web page through a straw”.
It is important to provide a contextual overview first, and then provide specific data and details in a structured hierarchy. We can update the “information seeking mantra” of overview, zoom and filter, details on demand to “gist”, “supporting methods” (contextual information), and “details” (actual data content). The structure of the page needs to support understanding when navigated hierarchically, and information embedded in visual design elements (font sizes, colors, nesting, space) needs to be explicitly exposed via aria-* attributes to be accessible to screen reader users.
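As a rough sketch of this gist-first hierarchy (in R, using the htmltools package; the content is hypothetical but mirrors the penguins example used throughout):

```r
library(htmltools)

# Gist first, then supporting methods, then details:
# the order in which a screen-reader user encounters the page.
overview <- div(
  h2("Penguin bill dimensions"),
  p("Gist: within each species, bill length and bill depth are positively related."),
  h3("Supporting methods"),
  p("Measurements of 342 penguins from three species near Palmer Station, Antarctica."),
  h3("Details"),
  p(a(href = "#penguin-data-table", "Full data table"))
)
browsable(overview)
```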
Navigating with a Screen Reader
Distinguishability
In addition to the information below, which includes some web references, there is a lovely series of Observable posts on accessibility, contrast, and color choice for data visualization. Check it out!
Color Selection
We’ll first approach color selection with color impairment (aka “colorblindness”, though most color-impaired people can see some colors) in mind; many of these considerations also factor into the contrast discussion later. There are several approaches to accommodating color impairment:
- Avoid red and green combinations. This helps, but is not sufficient, particularly for those who have trouble with red and blue rather than green.
- Use palettes designed to be “colorblind-friendly”, such as those by David Nichols, Okabe and Ito, or Paul Tol. ColorBrewer’s colorblind-friendly palettes are less useful than these options.
- Design your graphic so that it is functional in greyscale. This will make it safe for all types of color impairment.
- Dual-encode colors with other attributes, such as shape or linetype (see the sketch below).
It can be difficult to fully accommodate those with color impairment, particularly when working with graphics that use many different hues. Keep in mind that even people with full color vision cannot keep more than \(7 \pm 2\) items in working memory - so using many different colors is problematic for everyone, not just for those with impaired color vision.
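As a concrete illustration, here is a short ggplot2 sketch using the penguins data from earlier. It applies the Okabe-Ito palette, which is built into base R (>= 4.0) via palette.colors(), and dual-encodes species with point shape so the plot still works in greyscale:

```r
library(ggplot2)
data <- read.csv("../data/penguins.csv")

# Dual-encode species with both color and shape, using the
# colorblind-friendly Okabe-Ito palette from base R
ggplot(data, aes(x = bill_length_mm, y = bill_depth_mm,
                 color = species, shape = species)) +
  geom_point() +
  scale_color_manual(values = unname(palette.colors(3, palette = "Okabe-Ito")))
```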
Contrast
It can be hard to see content that does not have much contrast against the background. People with low vision rely on contrast even more than the rest of the population; in addition, individuals with color impairment tend to rely on contrast cues to determine whether ambiguous colors are, in fact, different.
The W3C (World Wide Web Consortium) maintains the Web Content Accessibility Guidelines (WCAG), which provide a standard for accessible online content. These guidelines include recommendations for writing alt-text, ensuring the accessibility of different types of media, and making content distinguishable.
WCAG guidelines are provided on a scale from A (basic accessibility) to AAA (most accessible).
The distinguishability criteria most relevant to graphics include the following (paraphrased from WCAG 2.1):

Contrast (Minimum), Level AA: Text and images of text have a contrast ratio of at least 4.5:1, except for the following:

- Large Text: Large-scale text and images of large-scale text have a contrast ratio of at least 3:1;
- Incidental: Text or images of text that are part of an inactive user interface component, that are pure decoration, that are not visible to anyone, or that are part of a picture that contains significant other visual content, have no contrast requirement.
- Logotypes: Text that is part of a logo or brand name has no contrast requirement.

Images of Text, Level AA: Where possible, text (rather than images of text) is used to convey information, except:

- Customizable: The image of text can be visually customized to the user’s requirements;
- Essential: A particular presentation of text is essential to the information being conveyed. Logotypes (text that is part of a logo or brand name) are considered essential.

Non-text Contrast, Level AA: The following have a contrast ratio of at least 3:1 against adjacent colors:

- User Interface Components: Visual information required to identify user interface components and states, except for inactive components or where the appearance of the component is determined by the user agent and not modified by the author;
- Graphical Objects: Parts of graphics required to understand the content, except when a particular presentation of graphics is essential to the information being conveyed.

Text Spacing, Level AA: No loss of content or functionality occurs when users set all of the following:

- Line height (line spacing) to at least 1.5 times the font size;
- Spacing following paragraphs to at least 2 times the font size;
- Letter spacing (tracking) to at least 0.12 times the font size;
- Word spacing to at least 0.16 times the font size.

Exception: Human languages and scripts that do not make use of one or more of these text style properties in written text can conform using only the properties that exist for that combination of language and script.

Content on Hover or Focus, Level AA: Where pointer hover or keyboard focus triggers additional content to appear and disappear, the following are true:

- Dismissible: A mechanism is available to dismiss the additional content without moving pointer hover or keyboard focus, unless the additional content communicates an input error or does not obscure or replace other content;
- Hoverable: If pointer hover can trigger the additional content, then the pointer can be moved over the additional content without the additional content disappearing;
- Persistent: The additional content remains visible until the hover or focus trigger is removed, the user dismisses it, or its information is no longer valid.

Exception: The visual presentation of the additional content is controlled by the user agent and is not modified by the author.

Contrast (Enhanced), Level AAA: Text and images of text have a contrast ratio of at least 7:1, except for the following:

- Large Text: Large-scale text and images of large-scale text have a contrast ratio of at least 4.5:1;
- Incidental: Text or images of text that are part of an inactive user interface component, that are pure decoration, that are not visible to anyone, or that are part of a picture that contains significant other visual content, have no contrast requirement.
- Logotypes: Text that is part of a logo or brand name has no contrast requirement.

Low or No Background Audio, Level AAA: For prerecorded audio-only content that contains primarily speech, at least one of the following is true:

- No Background: The audio does not contain background sounds.
- Turn Off: The background sounds can be turned off.
- 20 dB: The background sounds are at least 20 decibels lower than the foreground speech content, with the exception of occasional sounds that last for only one or two seconds.

Visual Presentation, Level AAA: For blocks of text, a mechanism is available to achieve the following:

- Foreground and background colors can be selected by the user.
- Width is no more than 80 characters or glyphs (40 if CJK).
- Text is not justified (aligned to both the left and the right margins).
- Line spacing (leading) is at least space-and-a-half within paragraphs, and paragraph spacing is at least 1.5 times larger than the line spacing.
- Text can be resized without assistive technology up to 200 percent in a way that does not require the user to scroll horizontally to read a line of text on a full-screen window.
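The contrast ratio WCAG uses is computed from relative luminance: each sRGB channel is linearized, the channels are combined with perceptual weights, and the ratio compares the lighter color to the darker one. Here is a minimal base-R sketch of that computation (the function names are mine, not from WCAG or any package):

```r
# Relative luminance of a color, following the WCAG 2.x definition
relative_luminance <- function(color) {
  rgb <- col2rgb(color)[, 1] / 255            # sRGB channels in [0, 1]
  lin <- ifelse(rgb <= 0.03928, rgb / 12.92,  # linearize each channel
                ((rgb + 0.055) / 1.055)^2.4)
  sum(c(0.2126, 0.7152, 0.0722) * lin)        # perceptual channel weights
}

# Contrast ratio: (L_lighter + 0.05) / (L_darker + 0.05), ranging from 1 to 21
contrast_ratio <- function(fg, bg) {
  l <- sort(c(relative_luminance(fg), relative_luminance(bg)), decreasing = TRUE)
  (l[1] + 0.05) / (l[2] + 0.05)
}

contrast_ratio("black", "white")   # 21, the maximum possible
contrast_ratio("grey60", "white")  # ~2.8, fails the 4.5:1 AA threshold for body text
```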
Chartability is a set of heuristics for ensuring the accessibility of data visualizations (and the pages that contain them). It was created by BLV designers to help you locate accessibility barriers in data visualizations, and the team maintains an audit workbook of tests that help identify design failures.
Creating More Accessible Graphics
In general, charts created as images, which are the default in many systems such as ggplot2, matplotlib, and SAS, require alt-text, inclusion of data tables, and other modifications, and even then do not achieve full accessibility. By contrast, D3, Observable.js, Highcharts.js, and other SVG-based web graphics allow for some navigation within the chart by screen reader users. However, these tools still require extra planning to design charts that are accessible and well-formatted for screen-reader users.
The Olli project works with Observable and Vega/Vega-Lite visualizations to create a navigable, hierarchical tree that supports keyboard navigation and descriptions.
Let’s consider three different pages produced using quarto to showcase the penguins data. Explore the graphics yourself first, then play the video below to see how my screen reader handled each one. Note the level of detail available to the user.
Warning: Removed 2 rows containing missing values or values outside the scale range
(`geom_point()`).
This is an untitled chart with no subtitle or caption.
It has x-axis 'bill_length_mm' with labels 40, 50 and 60.
It has y-axis 'bill_depth_mm' with labels 15.0, 17.5 and 20.0.
There is a legend indicating colour is used to show species, with 3 levels:
Adelie shown as strong reddish orange colour,
Chinstrap shown as vivid yellowish green colour and
Gentoo shown as brilliant blue colour.
The chart is a set of 342 big solid circle points of which about 97% can be seen.
```js
// Create plot with specification
Plot.plot(penguinChart);
```
Figure 3: A plot of the palmer penguins data, showing bill length in mm on the x axis and bill depth in mm on the y axis. Points are colored by species. For each species, there is a positive relationship between bill depth and bill length, with each species occupying a different region of the space. Adelie penguins have shorter, deeper bills. Chinstrap penguins have longer, deeper bills. Gentoo penguins have longer, shallower bills.
Olli is a fantastic project, but support for Observable is limited; there is better support for Vega charts, but even then, not all chart types are supported.
Sonification
There are other methods of communicating data that do not rely primarily on vision, as well as ways of adapting existing visual representations to remove their reliance on vision. Zong et al. (2024) developed Umwelt, which allows for editing of multimodal data representations and provides some support for sonification and non-visual data communication. Existing visualizations can also be adapted into sonified equivalents using R tools like sonify or Python tools like Strauss and miditools (Russo 2024).
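As a quick R-side illustration, here is a minimal sketch with the sonify package (assuming it is installed; it plays audio through the default output device), mapping bill length to time and bill depth to pitch, much like the MIDI example below:

```r
library(sonify)
penguins <- na.omit(read.csv("../data/penguins.csv"))
penguins <- penguins[order(penguins$bill_length_mm), ]

# x is mapped to time, y to pitch; duration is in seconds
sonify(penguins$bill_length_mm, penguins$bill_depth_mm, duration = 10)
```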
Sonifying Data With Python
Here’s an example of how to create a data sonification using the penguins data, adapted from Russo (2024).
```python
# Code adapted from https://hub.ovh2.mybinder.org/user/systemsounds-so-ation-tutorials-vr3cdobo/doc/tree/data2midi-part1.ipynb
import pandas as pd

penguins = pd.read_csv("../data/penguins.csv").dropna()

# Define a general mapping function
def map_value(value, min_value, max_value, min_result, max_result):
    '''maps value (or array of values) from one range to another'''
    result = min_result + (value - min_value)/(max_value - min_value)*(max_result - min_result)
    return result

penguins.bill_length_mm.describe()  # get info on penguin bill lengths
```
```python
penguins = penguins.sort_values(by=['species', 'bill_length_mm'], ascending=True)  # sort data by species and bill length

# Set desired duration: 15 seconds/beats
duration_beats = 15
bpm = 60
duration_sec = duration_beats*60/bpm

# Scale x axis
penguins["t_data"] = map_value(penguins.bill_length_mm, min(penguins.bill_length_mm), max(penguins.bill_length_mm), 0, duration_beats)
# Scale y axis
penguins["y_data"] = map_value(penguins.bill_depth_mm, min(penguins.bill_depth_mm), max(penguins.bill_depth_mm), 0, 1)

# May want to transform data a bit - example uses sqrt
# y_scale = 0.5
# penguins["y_data"] = penguins.y_data**y_scale

import matplotlib.pyplot as plt
plt.scatter(penguins.t_data, penguins.y_data)
plt.xlabel('time (bill length, mm, normalized)')
plt.ylabel('bill depth, mm, normalized')
plt.show()
```
```python
plt.clf()

# Scale penguin species to integer track numbers
penguins["track"] = penguins.species.replace({"Adelie": 0, "Gentoo": 1, "Chinstrap": 2})
for track_i in range(3):
    dat = penguins.query("track==@track_i")
    plt.scatter(dat.t_data, dat.y_data)
plt.xlabel('time (bill length, mm, normalized)')
plt.ylabel('bill depth, mm, normalized')
plt.show()
```
Now that we’ve transformed the data, we can map data values to tones. MIDI tones and velocities are integers, so we must transform and then round our values to match the requirements of the medium we’re using.
```python
# In this case, I'm happy with just 2 octaves of chromatic notes, from C3 to C5.
# These correspond to MIDI notes 48:72
penguins["midi_y"] = round(map_value(penguins.y_data, 0, 1, 48, 72))
penguins["midi_y"] = penguins.midi_y.convert_dtypes()

for track_i in range(3):
    dat = penguins.query("track==@track_i")
    plt.scatter(dat.t_data, dat.midi_y)
plt.xlabel('time (bill length, mm, normalized)')
plt.ylabel('MIDI note number')
plt.show()

# Map data to note velocity - velocity is a combination of volume and intensity
# We could dual-encode pitch and velocity:
# vel_min, vel_max = 35, 127
# penguins["midi_vel"] = round(map_value(penguins.y_data, 0, 1, vel_max, vel_min))
# penguins["midi_vel"] = penguins.midi_vel.convert_dtypes()
```
Finally, we create the midi file. We add 3 tracks, one for each species - this would allow us to change the “program” (instrument) to correspond to each species group. For the moment, I’ve commented the program changes out because the addition of instruments makes the end result sound like elementary school band warm-up time (complete chaos).
```python
from midiutil.MidiFile import MIDIFile  # library to make midi files, https://midiutil.readthedocs.io/en/1.2.1/

# Create midi file object, add tempo for each track
my_midi_file = MIDIFile(3, deinterleave=False)  # three tracks, one for each species
my_midi_file.addTempo(track=0, time=0, tempo=bpm)
my_midi_file.addTempo(track=1, time=0, tempo=bpm)
my_midi_file.addTempo(track=2, time=0, tempo=bpm)

# .addProgramChange(track, channel, time, program)
# my_midi_file.addProgramChange(0, 0, 0, 71)  # first set of penguins as clarinet
# my_midi_file.addProgramChange(1, 0, 0, 75)  # second set of penguins as pan flute
# my_midi_file.addProgramChange(2, 0, 0, 59)  # third set of penguins as muted trumpets

# Add midi notes
for i in penguins.index:
    my_midi_file.addNote(track=penguins.track[i], channel=0, pitch=penguins.midi_y[i],
                         time=penguins.t_data[i], duration=0.25, volume=35)

# Create and save the midi file itself
filename = 'penguins_sonification.mid'
with open(filename, "wb") as f:
    my_midi_file.writeFile(f)

# Listen
# import pygame  # library for playing midi files, https://pypi.org/project/pygame/
# pygame.init()
# pygame.mixer.music.load(filename)
# pygame.mixer.music.play()
```
Results of sonification of Figure 1 using Python and MIDI audio encoding.
Physicalization
There are a number of ways to create accessible tactile charts using embossing machines, capsule paper (Brauner 2023), or 3D printers. Tactile graphics yield higher performance than tactile tables or electronic tables accessed via screen reader (Watanabe and Mizukami 2018); in addition, tactile bar charts presented either alone or alongside auditory information outperform audio-only presentation (Goncu, Marriott, and Hurst 2010).
R packages like rayshader can be used to convert ggplot2 plots into 3D-printable STL files (Morgan-Wall 2024). This produces an STL file with some tactile information without requiring much specialized software; it could be made more accessible by using a Braille font. One downside is that the height of the plot object is mapped to color/fill, which does not accommodate categorical mappings.
Rayshader demo
```r
data <- read.csv("../data/penguins.csv")
library(ggplot2)
plot <- ggplot(data) +
  stat_density_2d(
    aes(x = bill_length_mm, y = bill_depth_mm,
        fill = after_stat(!!str2lang("density"))),
    contour = F, geom = "raster"
  ) +
  scale_x_continuous(expand = c(0, 0)) +
  scale_y_continuous(expand = c(0, 0))
library(rayshader)
plot_gg(plot, emboss_text = .05)
```
Warning: Removed 2 rows containing non-finite outside the scale range
(`stat_density2d()`).
This is an untitled chart with no subtitle or caption.
It has x-axis 'bill_length_mm' with labels 40 and 50.
It has y-axis 'bill_depth_mm' with labels 14, 16, 18 and 20.
There is a legend indicating fill is used to show density, ranging from 5.50919365996529e-09 represented by fill dark purplish blue to 0.0160915333509369 shown as fill brilliant blue.
The chart is a raster graph that VI cannot process.
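To turn the scene rendered by plot_gg() into a printable file, rayshader provides save_3dprint(), which exports the current rgl scene as an STL; a minimal sketch (the file name is hypothetical):

```r
# Export the currently rendered rayshader scene as an STL file;
# maxwidth controls the physical width of the printed object
save_3dprint("penguins_density.stl", maxwidth = 120, unit = "mm")
```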