MAMA: Phrase, block, list, and other elements
- Previous article—MAMA: Hyperlinks
- Next article—MAMA: Images - elements and formats
- Table of contents
Index:
- Introduction
- Phrase elements
- FONT element: Markup dedicated to visual formatting
- Basic block elements
- List elements
- Ruby elements
- Obsolete elements
- Browser extension elements/attributes
- Miscellaneous elements
Introduction
This section is a catch-all, covering the "rest" of popular markup in use that did not easily fall into the other markup categories. It covers basic phrase and block markup, list-related elements, browser extensions, and other elements and attributes. While none of these elements are among the 10 most popular, they do constitute a majority among the next 10 and the following 20 as well.
Phrase elements
The purpose of many markup elements is to declare the appearance or nature of
the embedded text. The following 22 elements either describe some simple
formatting instructions for text content or have a loftier goal of describing
the intrinsic nature of the content. Applying a bold effect is the most popular
usage for this type of element (STRONG
almost always
equates to bold formatting). The almost eight-fold preference of SUP
over the comparable SUB
element is puzzling.
ELEMENT | Frequency | ELEMENT | Frequency | |
---|---|---|---|---|
B | 1,805,495 | CITE | 12,562 | |
SPAN | 1,527,964 | ACRONYM | 10,983 | |
STRONG | 1,102,056 | SUB | 9,378 | |
I | 668,742 | S | 7,767 | |
EM | 351,594 | CODE | 7,595 | |
U | 342,612 | STRIKE | 5,991 | |
SMALL | 155,962 | DFN | 3,584 | |
BIG | 76,946 | Q | 1,785 | |
SUP | 73,309 | SAMP | 1,609 | |
TT | 15,480 | KBD | 1,313 | |
ABBR | 14,902 | VAR | 1,258 |
FONT
element: markup dedicated to visual formatting
This element was originally a Netscape extension that later was included in
HTML 3.2 and then immediately deprecated—much greater control is attainable
using CSS. That does not stop it from remaining extremely popular with authors.
In fact, it is THE most popular phrase markup element, even
more popular than the B
or SPAN
elements.
ELEMENT/Attribute | Frequency |
---|---|
FONT | 2,061,417 |
Size | 1,709,405 |
Color | 1,634,714 |
Face | 1,379,110 |
Attribute values
Because of the popularity of this element, and other studies focusing their
attentions on other areas of markup (Hickson) or CSS (Saarsoo), it was instructive
to look at values for some HTML attributes that had not been discussed
before. The 3 main FONT
attributes were ripe for this
type of analysis.
Size
attribute
With the Size
attribute, absolute font size values
are preferred over relative font size changes. Font size "2"
is the most popular, with "1" also having high representation.
Attribute value | Frequency | Attribute value | Frequency | |
---|---|---|---|---|
2 | 967,193 | -1 | 311,017 | |
1 | 785,227 | 6 | 204,675 | |
3 | 488,919 | -2 | 182,372 | |
4 | 485,621 | +1 | 170,310 | |
5 | 332,907 | +2 | 115,593 |
Color
attribute
The hexadecimal (#RRGGBB syntax) method is preferred over the friendly color names. White and black have the clear edge in the frequency list, while blue, red, yellow, and gray are also very popular.
Attribute value | Frequency | Attribute value | Frequency | |
---|---|---|---|---|
#ffffff | 535,698 | red | 90,996 | |
#000000 | 442,291 | white | 90,452 | |
#ff0000 | 318,323 | #666666 | 84,941 | |
#0000ff | 188,611 | #ffff00 | 81,481 | |
#000080 | 101,950 | black | 77,329 |
Face
attribute
This attribute allows you to specify the type of font used, in the same manner
as the font-family
CSS property. The "Arial" font is a runaway
favorite here, while "Helvetica", "Verdana" and "sans-serif" are only half as
popular. This attribute value allows a comma-separated list of font names, so
"Arial" may be very popular as a primary font choice, or it may also be a popular
fallback font used when designating other specific fonts.
Attribute value | Frequency | Attribute value | Frequency | |
---|---|---|---|---|
Arial | 1,036,962 | times | 101,094 | |
Helvetica | 660,926 | tahoma | 93,718 | |
Verdana | 548,563 | geneva | 86,016 | |
sans-serif | 486,824 | serif | 72,203 | |
Times New Roman | 197,881 | comic sans ms | 68,708 |
About Face!: Fonts used in Japan
This is a good time to make use of MAMA's country tracking in order to locate
regional differences in specific font use. Japanese font usage on Web pages,
as a break-out example, would likely have specific trends very different than
that of the rest of the Web-at-large. There were 124,976 URLs tracked in MAMA
identified as being from Japan, and only 23,466 used the FONT
Face
attribute. In this restricted population, the
"Arial" font surprisingly takes the top spot again. This is a bit odd because
"Arial" does not contain any Japanese character glyphs. Perhaps "Arial" is again
being used as a fallback font. Japan-specific fonts "Osaka" and "MS UI Gothic"
also have high representation here.
Note: Some of the values in the full frequency table are meaningless gibberish, due to a bug in MAMA's storage of multi-byte Asian characters.
Attribute value | Frequency | Attribute value | Frequency | |
---|---|---|---|---|
Arial | 3,333 | verdana | 1,303 | |
Osaka | 2,615 | sans-serif | 1,235 | |
Helvetica | 1,738 | times | 607 | |
Times New Roman | 1,677 | comic sans ms | 567 | |
MS UI Gothic | 1,627 | century | 542 |
Basic block elements
The P
and DIV
elements are the
workhorses of HTML, as expected, but the popularity of CENTER
is higher than expected, being encountered more often than any of the
Hx
heading levels. The decreasing popularity of the
heading levels from H1
-H6
is not a surprise.
ELEMENT/Attribute | Frequency | ELEMENT/Attribute | Frequency | |
---|---|---|---|---|
P | 2,702,935 | H4 | 185,110 | |
DIV | 2,499,779 | H5 | 103,060 | |
CENTER | 1,076,535 | ADDRESS | 50,269 | |
H1 | 769,344 | H6 | 45,676 | |
H2 | 573,002 | PRE | 36,620 | |
H3 | 438,496 | Width | 63 | |
BLOCKQUOTE | 188,947 | |||
Cite | 1,402 |
A common attribute for block elements: Align
Many of the basic block elements have the Align
attribute
in common, which controls horizontal alignment of the content within the block.
More than half of all URLs with P
elements also use the
Align
attribute on at least one of the element instances, but
the prize for highest usage goes to the DIV
element:
67.82% of the URLs that use DIV
also have at least
one of the DIV
s using the Align
attribute. The Heading elements (H1
-H6
)
all have similar usage rates for the Align
attribute: ~15-20%.
ELEMENT | Frequency | % Total element usage | ELEMENT | Frequency | % Total element usage | |
---|---|---|---|---|---|---|
DIV | 1,695,287 | 67.82% | H3 | 60,477 | 13.79% | |
P | 1,375,278 | 50.88% | H4 | 30,682 | 16.58% | |
H1 | 110,342 | 14.34% | H5 | 21,302 | 20.67% | |
H2 | 76,291 | 13.31% | H6 | 10,342 | 22.64% |
List elements
It is surprising that the unordered list (UL
) is
almost 20 times as popular as the ordered list (OL
)—one would think that ranking things would be a more popular activity on the
Web (hotornot.com, anyone?). The UL
and OL
elements are used in the same documents in 29,564 cases (~2/3 of the OL
total). The DL
list also finishes the race far in
advance of the OL
element, at almost a 2-to-1 ratio.
The deprecated MENU
and DIR
elements are both definitely on the way out—if usage of
OL
compared to UL
seemed rare,
the situation for MENU
and DIR
is dire.
ELEMENT/Attribute | Frequency | ELEMENT/Attribute | Frequency | |
---|---|---|---|---|
LI | 843,869 | DT | 74,984 | |
Type | 5854 | DD | 71,477 | |
Value | 597 | OL | 47,196 | |
UL | 809,915 | Start | 2,266 | |
Type | 23,996 | Type | 3,425 | |
DL | 84,257 | DIR | 4,397 | |
Compact | 583 | MENU | 1,906 |
The Type
attribute of LI
,
UL
and OL
MAMA stored the values of the Type
attribute for the
LI
element and combined the Type
values for UL
and OL
elements.
The expected keywords are dominant in both cases, but a prominent swap occurs
between the frequency tables for this attribute of the
LI
element and the
UL
/OL
elements. The most popular value for LI
/Type
is "square", followed by "disc".
The top OL
-UL
/Type
values are the same, but reversed. There may be an obvious explanation for this
that I am missing.
Ruby elements
The RUBY
elements were introduced to HTML in
XHTML 1.1 for primary use in select Asian language situations. The various
Ruby-related elements still appear to suffer from low adoption. As expected,
the clear majority of cases (265 of 289) are Japanese URLs.
ELEMENT | Frequency |
---|---|
RUBY | 289 |
RT | 278 |
RB | 216 |
RP | 204 |
Obsolete elements
These elements were obsolete back when HTML 2.0 was the latest and greatest version in the land. As you can see, they have all but disappeared from author's development lexicons.
ELEMENT | Frequency |
---|---|
XMP | 311 |
PLAINTEXT | 189 |
LISTING | 32 |
Browser extension elements/attributes
Some elements and attributes have been created by browser vendors in the past
but have not been embraced in the standards. The functionality of the oft-maligned
BLINK
element was absorbed in CSS to become
"text-decoration
: blink".
With time, usage of the BLINK
element will disappear,
but currently BLINK
's usage by 26,807 URLs is ~36%
of the "text-decoration
: blink"
version. Some extensions catch on, while others do not. The LAYER
and ILAYER
elements—both Netscape 4 specific elements—are
fairly popular, while usage of MULTICOL
, an element
introduced and only supported in the same Netscape version is almost nonexistent.
Of all the browser extensions that have not gained standards traction, the crown
definitely belongs to MARQUEE
—an element originating
in Microsoft's Internet Explorer 2.0 and now embraced by all major browsers.
ELEMENT/Attribute | Frequency | ELEMENT/Attribute | Frequency | |
---|---|---|---|---|
MARQUEE | 140,328 | SPACER | 34,470 | |
Scrollamount | 73,551 | BLINK | 26,807 | |
Width | 67,071 | LAYER | 26,305 | |
Scrolldelay | 66,185 | ILAYER | 21,391 | |
Height | 58,552 | NOLAYER | 12,389 | |
Direction | 43,831 | MULTICOL | 75 | |
Behavior | 37,835 | |||
Bgcolor | 32,466 | |||
Align | 23,804 | |||
Loop | 16,055 | |||
Border | 9,059 | |||
Truespeed | 6,182 | |||
Hspace | 2,159 | |||
Vspace | 2,052 |
Miscellaneous elements
These elements are a grab bag of markup that do not really fit in other categories,
but as you can see by the popularity of BR
(the 11th
most popular element overall), they cannot be ignored. The revision control
elements INS
and DEL
, along
with the bi-directional control element BDO
were all
introduced in HTML 4.0, but none of them have any significant authoring traction
"in the wild" even though they have been around for over 10 years.
Note: The value in parentheses for the BR
element indicates the frequency of "<br />
",
which was detected separately from "<br>
"
ELEMENT/Attribute | Frequency | ELEMENT/Attribute | Frequency | |
---|---|---|---|---|
BR | 2,859,662 (168,642) | WBR | 4,883 | |
Clear | 107,624 | INS | 1,344 | |
HR | 729,380 | Datetime | 139 | |
Size | 227,745 | Cite | 82 | |
Width | 226,657 | DEL | 1,243 | |
Noshade | 117,978 | Datetime | 165 | |
Align | 100,044 | Cite | 33 | |
NOBR | 89,903 | BDO | 167 | |
BASEFONT | 13,158 | Dir | 147 | |
Size | 10,113 | |||
Color | 7,231 | |||
Face | 1,810 |
- Previous article—MAMA: Hyperlinks
- Next article—MAMA: Images - elements and formats
- Table of contents
This article is licensed under a Creative Commons Attribution, Non Commercial - Share Alike 2.5 license.
Comments
The forum archive of this article is still available on My Opera.