With the advent of desktop publishing in the 1980's, typesetting suddenly changed from a spectator sport to participatory. Along with this, a lot of the subtleties got lost for many. A lot of people didn't know they were missing these details, and a lot of programs didn't let you do anything about it even if you did.
With the creation of Unicode, this has changed for many applications. Unicode includes many characters which are traditionally combined in computer-generated works. This is a list of some of those, including their proper usage.
Some similar topics are discussed in 10 Typographical Blunders and in The Trouble With EM 'n EN at A List Apart.
Thanks to Toby Thain for sending me the Mac keystrokes that go with each character.
Coders of various stripes should read the following:
- The Absolute Minimum Every Software Developer Absolutely, Positively Must Know About Unicode and Character Sets (No Excuses!)
- Named versus Numeric Entities. I have included the named HTML entites here, but please read this article before using them.
Contents
Single Quote-like Characters
- ASCII Apostrophe
-
This is the character that you type on a standard (US layout) keyboard with the key that's beside the semicolon. It shouldn't really ever be used in proper typography, but is often used because it's easy to type and well supported. It is superseded by one of the below, depending on meaning.
- Unicode character: 0027 hex = 39 dec
- Unicode character name: APOSTROPHE
- Unicode Reference glyph:
- Your browser's representation (in the default serif and sans-serif fonts): ' '
- XML/HTML numeric entity:
'
or'
- TeX/LaTeX: When you type an apostrophe, TeX produces one of the below characters, depending on the current mode. The typewriter text font produces an apostrophe character if you type it.
- Typing: apostrophe key
- The ASCII Apostrophe on MyFonts.com.
- The ASCII Apostrophe on zvon.org
- The ASCII Apostrophe on eki.ee
- Display of ASCII Apostrophe in various fonts
- Backtick
-
This is the character that's typically on a US layout keyboard below the tilde (~). It shouldn't be used in place of the opening single quote, or for any other discernible typographic purpose.
- Unicode character: 0060 hex = 96 dec
- Unicode character name: GRAVE ACCENT
- Unicode Reference glyph:
- Your browser's representation (in the default serif and sans-serif fonts): ` `
- XML/HTML numeric entity:
`
or`
- TeX/LaTeX: Can be produced by typing it in typewriter text. Typing the backtick generally produces one of the below characters.
- Typing: backtick key
- The Backtick on MyFonts.com.
- The Backtick on zvon.org
- The Backtick on eki.ee
- Display of Backtick in various fonts
- Opening Single Quote
-
This is the symbol that should be used to start a quotation that's delimited with single quotes. For example, ‘Over here!’
- Unicode character: 2018 hex = 8216 dec
- Unicode character name: LEFT SINGLE QUOTATION MARK
- Unicode Reference glyph:
- Your browser's representation (in the default serif and sans-serif fonts): ‘ ‘
- XML/HTML numeric entity:
‘
or‘
- HTML named entity:
‘
- TeX/LaTeX: Type a backtick, `.
- Typing: in Ubuntu/Gnome, Ctrl-Shift-u 2 0 1 8 Enter; in Windows, hold Alt and type + 2 0 1 8; on a Mac, hold Option and type 2 0 1 8
- Mac keystroke: Option-]
- The Opening Single Quote on MyFonts.com.
- The Opening Single Quote on zvon.org
- The Opening Single Quote on eki.ee
- Display of Opening Single Quote in various fonts
- Closing Single Quote
-
This is the symbol that should be used to end a quotation that's delimited with single quotes. For example, ‘Over here!’ This is also the preferred character to use as an apostrophe, as in I’m coming, or He’s with me.
- Unicode character: 2019 hex = 8217 dec
- Unicode character name: RIGHT SINGLE QUOTATION MARK
- Unicode Reference glyph:
- Your browser's representation (in the default serif and sans-serif fonts): ’ ’
- XML/HTML numeric entity:
’
or’
- HTML named entity:
’
- TeX/LaTeX: Type an apostrophe, '.
- Typing: in Ubuntu/Gnome, Ctrl-Shift-u 2 0 1 9 Enter; in Windows, hold Alt and type + 2 0 1 9; on a Mac, hold Option and type 2 0 1 9
- Mac keystroke: Shift-Option-]
- The Closing Single Quote on MyFonts.com.
- The Closing Single Quote on zvon.org
- The Closing Single Quote on eki.ee
- Display of Closing Single Quote in various fonts
- Prime
-
Used in mathematics, as in xx′ + yy′. It's also used as the symbol for feet, as in I am 6′ tall.
- Unicode character: 2032 hex = 8242 dec
- Unicode character name: PRIME
- Unicode Reference glyph:
- Your browser's representation (in the default serif and sans-serif fonts): ′ ′
- XML/HTML numeric entity:
′
or′
- TeX/LaTeX: Type an apostrophe in math mode.
- Typing: in Ubuntu/Gnome, Ctrl-Shift-u 2 0 3 2 Enter; in Windows, hold Alt and type + 2 0 3 2; on a Mac, hold Option and type 2 0 3 2
- Mac keystroke: Option-4 in Symbol font
- The Prime on MyFonts.com.
- The Prime on zvon.org
- The Prime on eki.ee
- Display of Prime in various fonts
- Acute Accent
-
This character is provided so it can be placed over others, as an accent. It should take up zero horizontal space, because it's designed to overlap another character.
- Unicode character: 0301 hex = 769 dec
- Unicode character name: COMBINING ACUTE ACCENT
- Unicode Reference glyph:
- Your browser's representation (in the default serif and sans-serif fonts): ́ ́
- XML/HTML numeric entity:
́
oŕ
- TeX/LaTeX: Use the \' (backslash-apostrophe) macro before the character you want to accent.
- Typing: in Ubuntu/Gnome, Ctrl-Shift-u 0 3 0 1 Enter; in Windows, hold Alt and type + 0 3 0 1; on a Mac, hold Option and type 0 3 0 1
- Mac keystroke: Option-e then accented character
- The Acute Accent on MyFonts.com.
- The Acute Accent on zvon.org
- The Acute Accent on eki.ee
- Display of Acute Accent in various fonts
- Grave Accent
-
This character is provided so it can be placed over others, as an accent. It should take up zero horizontal space, because it's designed to overlap another character.
- Unicode character: 0300 hex = 768 dec
- Unicode character name: COMBINING GRAVE ACCENT
- Unicode Reference glyph:
- Your browser's representation (in the default serif and sans-serif fonts): ̀ ̀
- XML/HTML numeric entity:
̀
or̀
- TeX/LaTeX: Use the \` (backslash-backtick) macro before the character you want to accent.
- Typing: in Ubuntu/Gnome, Ctrl-Shift-u 0 3 0 0 Enter; in Windows, hold Alt and type + 0 3 0 0; on a Mac, hold Option and type 0 3 0 0
- Mac keystroke: Option-` (backtick) then accented character
- The Grave Accent on MyFonts.com.
- The Grave Accent on zvon.org
- The Grave Accent on eki.ee
- Display of Grave Accent in various fonts
Double Quote-like Characters
- ASCII Double Quote
-
This is the character that you type on a standard (US layout) keyboard with the key that's beside the semicolon. It shouldn't really ever be used in proper typography, but is often used because it's easy to type and well supported. It is superseded by one of the below, depending on meaning.
- Unicode character: 0022 hex = 34 dec
- Unicode character name: QUOTATION MARK
- Unicode Reference glyph:
- Your browser's representation (in the default serif and sans-serif fonts): " "
- XML/HTML numeric entity:
"
or"
- HTML named entity:
"
- TeX/LaTeX: Typing a double quote will produce one, but you should usually use the below. The typewriter text font produces a double quote character if you type it.
- Typing: double-quote key
- The ASCII Double Quote on MyFonts.com.
- The ASCII Double Quote on zvon.org
- The ASCII Double Quote on eki.ee
- Display of ASCII Double Quote in various fonts
- Opening Double Quote
-
This is the symbol that should be used to start a quotation that's delimited with double quotes. For example, “Over here!”
- Unicode character: 201C hex = 8220 dec
- Unicode character name: LEFT DOUBLE QUOTATION MARK
- Unicode Reference glyph:
- Your browser's representation (in the default serif and sans-serif fonts): “ “
- XML/HTML numeric entity:
“
or“
- HTML named entity:
“
- TeX/LaTeX: Type two backticks, ``.
- Typing: in Ubuntu/Gnome, Ctrl-Shift-u 2 0 1 C Enter; in Windows, hold Alt and type + 2 0 1 C; on a Mac, hold Option and type 2 0 1 C
- Mac keystroke: Option-[
- The Opening Double Quote on MyFonts.com.
- The Opening Double Quote on zvon.org
- The Opening Double Quote on eki.ee
- Display of Opening Double Quote in various fonts
- Closing Double Quote
-
This is the symbol that should be used to end a quotation that's delimited with double quotes. For example, “Over here!”
- Unicode character: 201D hex = 8221 dec
- Unicode character name: RIGHT DOUBLE QUOTATION MARK
- Unicode Reference glyph:
- Your browser's representation (in the default serif and sans-serif fonts): ” ”
- XML/HTML numeric entity:
”
or”
- HTML named entity:
”
- TeX/LaTeX: Type two apostrophes, ''.
- Typing: in Ubuntu/Gnome, Ctrl-Shift-u 2 0 1 D Enter; in Windows, hold Alt and type + 2 0 1 D; on a Mac, hold Option and type 2 0 1 D
- Mac keystroke: Shift-Option-[
- The Closing Double Quote on MyFonts.com.
- The Closing Double Quote on zvon.org
- The Closing Double Quote on eki.ee
- Display of Closing Double Quote in various fonts
- Double Prime
-
A doubled version of the prime symbol. Used in mathematics like xx′x″ + yy′y″. It is also used to indicate inches, The table is 5′ 6″ long. Unicode also defines a triple prime symbol at hex 2034.
- Unicode character: 2033 hex = 8243 dec
- Unicode character name: DOUBLE PRIME
- Unicode Reference glyph:
- Your browser's representation (in the default serif and sans-serif fonts): ″ ″
- XML/HTML numeric entity:
″
or″
- TeX/LaTeX: Type two apostrophes in math mode.
- Typing: in Ubuntu/Gnome, Ctrl-Shift-u 2 0 3 3 Enter; in Windows, hold Alt and type + 2 0 3 3; on a Mac, hold Option and type 2 0 3 3
- Mac keystroke: Option-, (comma) in Symbol font
- The Double Prime on MyFonts.com.
- The Double Prime on zvon.org
- The Double Prime on eki.ee
- Display of Double Prime in various fonts
- Ditto Mark
-
Used to indicate that the text or other material is identical to that above it.
- Unicode character: 3003 hex = 12291 dec
- Unicode character name: DITTO MARK
- Unicode Reference glyph:
- Your browser's representation (in the default serif and sans-serif fonts): 〃 〃
- XML/HTML numeric entity:
〃
or〃
- TeX/LaTeX: Can't be produced? Use one of the double quotes instead.
- Typing: in Ubuntu/Gnome, Ctrl-Shift-u 3 0 0 3 Enter; in Windows, hold Alt and type + 3 0 0 3; on a Mac, hold Option and type 3 0 0 3
- The Ditto Mark on MyFonts.com.
- The Ditto Mark on zvon.org
- The Ditto Mark on eki.ee
- Display of Ditto Mark in various fonts
Dashes
- ASCII Hyphen
-
The hyphen produced by the key on your keyboard. It can be used in place of the hyphen or minus sign (as described below), but the more specific characters are preferred.
- Unicode character: 002D hex = 45 dec
- Unicode character name: HYPHEN-MINUS
- Unicode Reference glyph:
- Your browser's representation (in the default serif and sans-serif fonts): - -
- XML/HTML numeric entity:
-
or-
- TeX/LaTeX: Can be produced by typing a hyphen in typewriter text.
- Typing: hyphen key
- The ASCII Hyphen on MyFonts.com.
- The ASCII Hyphen on zvon.org
- The ASCII Hyphen on eki.ee
- Display of ASCII Hyphen in various fonts
- Hyphen
-
Used in hyphenating a word, as in pre‐fabricated or e‐mail. This is also the symbol that should be used when breaking a word across a line.
- Unicode character: 2010 hex = 8208 dec
- Unicode character name: HYPHEN
- Unicode Reference glyph:
- Your browser's representation (in the default serif and sans-serif fonts): ‐ ‐
- XML/HTML numeric entity:
‐
or‐
- TeX/LaTeX: Type a hyphen, -.
- Typing: in Ubuntu/Gnome, Ctrl-Shift-u 2 0 1 0 Enter; in Windows, hold Alt and type + 2 0 1 0; on a Mac, hold Option and type 2 0 1 0
- The Hyphen on MyFonts.com.
- The Hyphen on zvon.org
- The Hyphen on eki.ee
- Display of Hyphen in various fonts
- En-Dash
-
The en-dash is used to indicate a range, like I'll need 100–150 units or John Doe, 1914–2001.
- Unicode character: 2013 hex = 8211 dec
- Unicode character name: EN DASH
- Unicode Reference glyph:
- Your browser's representation (in the default serif and sans-serif fonts): – –
- XML/HTML numeric entity:
–
or–
- HTML named entity:
–
- TeX/LaTeX: Type two hyphens, --.
- Typing: in Ubuntu/Gnome, Ctrl-Shift-u 2 0 1 3 Enter; in Windows, hold Alt and type + 2 0 1 3; on a Mac, hold Option and type 2 0 1 3
- Mac keystroke: Option-hyphen
- The En-Dash on MyFonts.com.
- The En-Dash on zvon.org
- The En-Dash on eki.ee
- Display of En-Dash in various fonts
- Em-Dash
-
This is used to break up a sentence. Two dashes, --, are often used in its place when typing, but this is the proper way to do it. [In fact, if your browser doesn't have a real em-dash character, you might see the double-hyphen instead.] For example, He said he had it under control—I could see that wasn't true.
- Unicode character: 2014 hex = 8212 dec
- Unicode character name: EM DASH
- Unicode Reference glyph:
- Your browser's representation (in the default serif and sans-serif fonts): — —
- XML/HTML numeric entity:
—
or—
- HTML named entity:
—
- TeX/LaTeX: Type three hyphens, ---.
- Typing: in Ubuntu/Gnome, Ctrl-Shift-u 2 0 1 4 Enter; in Windows, hold Alt and type + 2 0 1 4; on a Mac, hold Option and type 2 0 1 4
- Mac keystroke: Shift-Option-hyphen
- The Em-Dash on MyFonts.com.
- The Em-Dash on zvon.org
- The Em-Dash on eki.ee
- Display of Em-Dash in various fonts
- Minus Sign
-
Used to indicate subtraction, as in 4 − 1 = 3 or z = x − y.
- Unicode character: 2212 hex = 8722 dec
- Unicode character name: MINUS SIGN
- Unicode Reference glyph:
- Your browser's representation (in the default serif and sans-serif fonts): − −
- XML/HTML numeric entity:
−
or−
- HTML named entity:
−
- TeX/LaTeX: Type a hyphen in math mode.
- Typing: in Ubuntu/Gnome, Ctrl-Shift-u 2 2 1 2 Enter; in Windows, hold Alt and type + 2 2 1 2; on a Mac, hold Option and type 2 2 1 2
- Mac keystroke: hyphen in Symbol font
- The Minus Sign on MyFonts.com.
- The Minus Sign on zvon.org
- The Minus Sign on eki.ee
- Display of Minus Sign in various fonts
- Soft Hyphen
-
This isn't quite a "character" in the usual sense. The soft hyphen is used to indicate a place that a word can be broken if it's near the end of a line. Usually, this character shouldn't appear (if you see nothing in the example below, that's right). It displays as a hyphen if the line is broken at that point.
- Unicode character: 00AD hex = 173 dec
- Unicode character name: SOFT HYPHEN
- Unicode Reference glyph:
- Your browser's representation (in the default serif and sans-serif fonts):
- XML/HTML numeric entity:
­
or­
- HTML named entity:
­
- TeX/LaTeX: Use the macro \-.
- Typing: in Ubuntu/Gnome, Ctrl-Shift-u 0 0 A D Enter; in Windows, hold Alt and type + 0 0 A D; on a Mac, hold Option and type 0 0 A D
- Mac keystroke: application dependent; command-hyphen in Quark XPress
- The Soft Hyphen on MyFonts.com.
- The Soft Hyphen on zvon.org
- The Soft Hyphen on eki.ee
- Display of Soft Hyphen in various fonts
- Non-breaking Hyphen
-
The appearance of this character is exactly the same as the regular hyphen. The difference is that the line should not be broken at this point.
- Unicode character: 2011 hex = 8209 dec
- Unicode character name: NON-BREAKING HYPHEN
- Unicode Reference glyph:
- Your browser's representation (in the default serif and sans-serif fonts): ‑ ‑
- XML/HTML numeric entity:
‑
or‑
- TeX/LaTeX: You can force TeX to not break a line with the \hbox macro, as in \hbox{no-break}.
- Typing: in Ubuntu/Gnome, Ctrl-Shift-u 2 0 1 1 Enter; in Windows, hold Alt and type + 2 0 1 1; on a Mac, hold Option and type 2 0 1 1
- The Non-breaking Hyphen on MyFonts.com.
- The Non-breaking Hyphen on zvon.org
- The Non-breaking Hyphen on eki.ee
- Display of Non-breaking Hyphen in various fonts
- Hyphen Bullet
-
This character is intended to be used as the bullet in a bulleted list.
- Unicode character: 2043 hex = 8259 dec
- Unicode character name: HYPHEN BULLET
- Unicode Reference glyph:
- Your browser's representation (in the default serif and sans-serif fonts): ⁃ ⁃
- XML/HTML numeric entity:
⁃
or⁃
- TeX/LaTeX: LaTeX usually does the bullets for you. Typically, the second level of bullets is set to {\bfseries -}.
- Typing: in Ubuntu/Gnome, Ctrl-Shift-u 2 0 4 3 Enter; in Windows, hold Alt and type + 2 0 4 3; on a Mac, hold Option and type 2 0 4 3
- The Hyphen Bullet on MyFonts.com.
- The Hyphen Bullet on zvon.org
- The Hyphen Bullet on eki.ee
- Display of Hyphen Bullet in various fonts
Spaces
The rules for spacing are a lot looser than for the other characters above. There's a lot of style associated with the space between various elements. So, I'll present the important spacing characters and give the best description I can of what they're commonly used for.
- Space
-
This is the regular space that should appear between words. It's what you get when you press the space-bar on your keyboard.
- Unicode character: 0020 hex = 32 dec
- Unicode character name: SPACE
- Unicode Reference glyph:
- Your browser's representation (in the default serif and sans-serif fonts):
- XML/HTML numeric entity:
 
or 
- TeX/LaTeX: This can usually be produced by just typing a space. TeX guesses about sentence endings and puts a longer space there. You can override this or produce multiple spaces with the \ (backslash-space) macro.
- Typing: space bar
- The Space on MyFonts.com.
- The Space on zvon.org
- The Space on eki.ee
- Display of Space in various fonts
- Non-breaking Space
-
This is used is indicate a space where the line should not be broken. For example, all of the spaces in Mr. John Doe should be non-breaking since it's incorrect to split a line in the middle of someone's name.
- Unicode character: 00A0 hex = 160 dec
- Unicode character name: NO-BREAK SPACE
- Unicode Reference glyph:
- Your browser's representation (in the default serif and sans-serif fonts):
- XML/HTML numeric entity:
 
or 
- HTML named entity:
- TeX/LaTeX: Type a tilde, ~
- Typing: in Ubuntu/Gnome, Ctrl-Shift-u 0 0 A 0 Enter; in Windows, hold Alt and type + 0 0 A 0; on a Mac, hold Option and type 0 0 A 0
- Mac keystroke: application dependent; Command-space in Quark XPress
- The Non-breaking Space on MyFonts.com.
- The Non-breaking Space on zvon.org
- The Non-breaking Space on eki.ee
- Display of Non-breaking Space in various fonts
- Em-Space
-
A space with the same width as the height of the font, approximately the width of a capital letter 'M'.
- Unicode character: 2003 hex = 8195 dec
- Unicode character name: EM SPACE
- Unicode Reference glyph:
- Your browser's representation (in the default serif and sans-serif fonts):
- XML/HTML numeric entity:
 
or 
- HTML named entity:
 
- TeX/LaTeX: Use \hspace{1em} or \quad.
- Typing: in Ubuntu/Gnome, Ctrl-Shift-u 2 0 0 3 Enter; in Windows, hold Alt and type + 2 0 0 3; on a Mac, hold Option and type 2 0 0 3
- The Em-Space on MyFonts.com.
- The Em-Space on zvon.org
- The Em-Space on eki.ee
- Display of Em-Space in various fonts
- En-Space
-
Half an em-space. This is about the amount of space that's commonly put between sentences by typesetters, although it's typically a little less in modern texts.
- Unicode character: 2002 hex = 8194 dec
- Unicode character name: EN SPACE
- Unicode Reference glyph:
- Your browser's representation (in the default serif and sans-serif fonts):
- XML/HTML numeric entity:
 
or 
- HTML named entity:
 
- TeX/LaTeX: Use \hspace{.5em}. The \@ macro is used to indicate a between-sentences space.
- Typing: in Ubuntu/Gnome, Ctrl-Shift-u 2 0 0 2 Enter; in Windows, hold Alt and type + 2 0 0 2; on a Mac, hold Option and type 2 0 0 2
- Mac keystroke: Option-space in most applications
- The En-Space on MyFonts.com.
- The En-Space on zvon.org
- The En-Space on eki.ee
- Display of En-Space in various fonts
Others
- Ellipsis
-
The ellipsis is used to indicate an omission. For example, 1, 2, …, 10. The dots in an ellipsis should be spaced a little further apart than three periods would typically be, so they have been assigned this character.
- Unicode character: 2026 hex = 8230 dec
- Unicode character name: HORIZONTAL ELLIPSIS
- Unicode Reference glyph:
- Your browser's representation (in the default serif and sans-serif fonts): … …
- XML/HTML numeric entity:
…
or…
- HTML named entity:
…
- TeX/LaTeX: Use the \ldots macro. There are several others that can be used in math mode to produce a higher ellipsis, vertical ellipsis, etc.
- Typing: in Ubuntu/Gnome, Ctrl-Shift-u 2 0 2 6 Enter; in Windows, hold Alt and type + 2 0 2 6; on a Mac, hold Option and type 2 0 2 6
- Mac keystroke: Option-; (semicolon)
- The Ellipsis on MyFonts.com.
- The Ellipsis on zvon.org
- The Ellipsis on eki.ee
- Display of Ellipsis in various fonts