An Introduction to Datasets

By Divya Manian

Introduction

HTML5 includes a means to set custom attributes on elements using the data- prefix. Called data attributes, they can be scripted to define and store data as well as increase options for attribute selection when styling with CSS. You can use as many data attributes as you require, providing greater control when manipulating and rendering data. For a first look, here's an example that generates annotated music sheets.

Dataset basics

Here is an example of data attributes on an element:

<div id="day2-meal-expense" 
  data-drink="coffee" 
  data-food="sushi" 
  data-meal="lunch">$20.12</div>

To get the value of of an attribute, you can use the dataset object as follows:

var expenseday2 = document.getElementById('day2-meal-expense');  
var typeOfDrink = expenseday2.dataset.drink;

Note that hyphenated names become camel-cased. For example, if there was an attribute data-meal-time in the above markup, then its value would be retrieved by using expenseday2.dataset.mealTime.

Data attributes have been supported in almost all browsers for some time, but support for dataset has only recently been added. In Opera 11.10+ you can use dataset to access your custom data attributes via JavaScript. It is also supported in Chrome 9+, Safari 6+ (upcoming) and Firefox 6+ (upcoming).

Why do we need dataset?

The traditional way of accessing values of an attribute is as follows below:

var typeOfDrink = document.getElementById('day2-meal-expense').getAttribute('data-drink');

Now, trying to access more than a few custom attributes could easily get messy:

var attrs = expenseday2.attributes,
expense = {}, i, j;  
for (i = 0, j = attrs.length; i < j; i++) {
  if(attrs[i].name.substring(0, 5) == 'data-') {
    expense[attrs[i].name.substring(5)] = attrs[i].value;
  }
}

With the dataset attribute, you do not need to use any kind of looping to get the values you want to manipulate. You can use it immediately:

expense = document.getElementById('day2-meal-expense').dataset;

dataset is not your typical object in JavaScript; it is a DOMStringMap object. DOMStringMap is a new interface that is available in HTML5 for a set of name-value pairs.

Manipulating a dataset

You could manipulate these name-value pairs like so:

chartInput = [];
for (var item in expense) {
  chartInput.push(expense[item]);
}

If you want to delete a data attribute, you can do that with:

delete expenseday2.dataset.meal;

And to add an attribute to an element:

expenseday2.dataset.dessert = 'icecream';

How fast is it compared to getAttribute?

Using dataset to manipulate data is slightly slower than doing so with getAttribute, although if you are manipulating only a handful of data-attributes, the impact is not that significant.

Then again, dataset is much less of a headache to manipulate and use compared to other forms of manipulating attributes, and much more readable.

Where can I use it?

Every time you use a custom data attribute, using dataset to access the name-value pair is a good way forward. You can also feature-detect for dataset support and use dataset when supported, like this:

if(expenseday2.dataset) {
  expenseday2.dataset.dessert = 'icecream';
} else {
  expenseday2.setAttribute('data-dessert', 'icecream');
}

Note: If you have more intensive applications that require frequent updating of data attributes, I would recommend you use a JavaScript object to maintain data rather than manipulating the data attributes every time.

Data Attributes in CSS

Using data attributes is also pretty handy if you want to selectively apply styles based on the attribute value. For example, if you want to style each kind of food-related expense based on the kind of meal that was being had, you could do this:

div[data-meal="lunch"] {
  background-image: url('lunch.png');
}

Using Data Attributes to create charts

Now I'd like to present to you an example of using data attributes to render charts. In this example, data are added to the elements using the dataset, which are then rendered via generated content.

In summary

dataset is an easy way to access data attributes on an element. Support is slowly increasing, with Firefox nightlies also supporting the dataset attribute. Using this attribute does not improve code performance, but it does make it shorter, easier and more readable.

All snippets that I used in this article are available in this fiddle (or as a gist) for you to play with. The Charts demo is also up on the Opera Github repository for you to play with.

Divya Manian made the jump from developing device drivers for Motorola phones to designing websites and has not looked back since. She loves reading books and making light of things on Twitter.


This article is licensed under a Creative Commons Attribution 3.0 Unported license.

Comments

The forum archive of this article is still available on My Opera.

No new comments accepted.