HTML 4.01 Character Entity References


The greatly extended range of entities supported in HTML 4.0, HTML 4.01 and XHTML (252, up from 66 in HTML 3.2), plus the greater number of characters in the WGL4 versions of Microsoft’s TrueType fonts for Windows, should make it much easier and more reliable to display a wide range of characters than relying on the unsupported technique of using the FACE attribute of the FONT tag and the Symbol font.

As at March 2009, Internet Explorer 7, Firefox 3 and Safari 3 can show all of the proper characters. Opera 9 can show nearly all of the characters, but shows all of the space characters as the same width. Older versions of these browsers are less capable.

XML does not support character entity references by default, but Dave Pawson has a page on Special Characters that includes instructions for modifying the DOCTYPE declaration in XML DTDs and XSLT scripts.

Even browsers that are capable of showing the correct characters cannot do so unless you use a font with an extended character set. Fortunately, the range of fonts with Unicode support is increasing steadily, with Microsoft’s core fonts (Arial, Courier New and Times New Roman), Arial Unicode MS, Lucida Sans Unicode, Tahoma, Bitstream CyberBit, Code2000 and Lucida Grande containing hundreds or even thousands of characters.

In the following lists, each character should appear twice. The first occurrence is generated from the numeric character reference and the second from the character entity reference.


Unicode Ranges

Basic Latin

Unicode U+0000 – U+007F   (0-127)

" or "
quotation mark, APL quote (" or ")
& or &
ampersand (& or &)
< or <
less than (‹ is left single angle quotation mark) (&#60; or &lt;)
> or >
greater than (› is right single angle quotation mark) (&#62; or &gt;)

Latin-1 Supplement

Unicode U+0080 – U+00FF   (128-255)

 ’ or ‘ 
non-breaking space (&#160; or &nbsp;)
¡ or ¡
inverted exclamation mark (&#161; or &iexcl;)
¢ or ¢
cent sign (&#162; or &cent;)
£ or £
pound sterling sign, Irish Punt (&#163; or &pound;)
¤ or ¤
general currency sign (&#164; or &curren;)
¥ or ¥
yen sign (&#165; or &yen;)
¦ or ¦
broken (vertical) bar (&#166; or &brvbar;)
§ or §
section sign (&#167; or &sect;)
¨ or ¨
umlaut (dieresis) (&#168; or &uml;)
© or ©
copyright sign (&#169; or &copy;)
ª or ª
ordinal indicator, feminine (&#170; or &ordf;)
« or «
angle quotation mark (left) (&#171; or &laquo;)
¬ or ¬
not sign (&#172; or &not;)
­ or ­
soft hyphen (&#173; or &shy;)
® or ®
registered sign (&#174; or &reg;)
¯ or ¯
macron, APL overbar (&#175; or &macr;)
° or °
degree sign (&#176; or &deg;)
± or ±
plus or minus sign (&#177; or &plusmn;)
² or ²
superscript two (&#178; or &sup2;)
³ or ³
superscript three (&#179; or &sup3;)
´ or ´
acute accent (&#180; or &acute;)
µ or µ
micro sign (&#181; or &micro;)
¶ or ¶
pilcrow (paragraph sign) (&#182; or &para;)
· or ·
middle dot (⋅ is dot operator) (&#183; or &middot;)
¸ or ¸
cedilla (&#184; or &cedil;)
¹ or ¹
superscript one (&#185; or &sup1;)
º or º
ordinal indicator, masculine (&#186; or &ordm;)
» or »
angle quotation mark (right) (&#187; or &raquo;)
¼ or ¼
fraction one-quarter (&#188; or &frac14;)
½ or ½
fraction one-half (&#189; or &frac12;)
¾ or ¾
fraction three-quarters (&#190; or &frac34;)
¿ or ¿
inverted question mark (&#191; or &iquest;)
À or À
capital A, grave (&#192; or &Agrave;)
Á or Á
capital A, acute (&#193; or &Aacute;)
 or Â
capital A, circumflex (&#194; or &Acirc;)
à or Ã
capital A, tilde (&#195; or &Atilde;)
Ä or Ä
capital A, umlaut (&#196; or &Auml;)
Å or Å
capital A, ring (&#197; or &Aring;)
Æ or Æ
capital AE diphthong (ligature) (&#198; or &AElig;)
Ç or Ç
capital C, cedilla (&#199; or &Ccedil;)
È or È
capital E, grave (&#200; or &Egrave;)
É or É
capital E, acute (&#201; or &Eacute;)
Ê or Ê
capital E, circumflex (&#202; or &Ecirc;)
Ë or Ë
capital E, umlaut (&#203; or &Euml;)
Ì or Ì
capital I, grave (&#204; or &Igrave;)
Í or Í
capital I, acute (&#205; or &Iacute;)
Î or Î
capital I, circumflex (&#206; or &Icirc;)
Ï or Ï
capital I, umlaut (&#207; or &Iuml;)
Ð or Ð
capital Eth (Icelandic) (&#208; or &ETH;)
Ñ or Ñ
capital N, tilde (&#209; or &Ntilde;)
Ò or Ò
capital O, grave (&#210; or &Ograve;)
Ó or Ó
capital O, acute (&#211; or &Oacute;)
Ô or Ô
capital O, circumflex (&#212; or &Ocirc;)
Õ or Õ
capital O, tilde (&#213; or &Otilde;)
Ö or Ö
capital O, umlaut (&#214; or &Ouml;)
× or ×
multiply sign, times (&#215; or &times;)
Ø or Ø
capital O, slash (&#216; or &Oslash;)
Ù or Ù
capital U, grave (&#217; or &Ugrave;)
Ú or Ú
capital U, acute (&#218; or &Uacute;)
Û or Û
capital U, circumflex (&#219; or &Ucirc;)
Ü or Ü
capital U, umlaut (&#220; or &Uuml;)
Ý or Ý
capital Y, acute (&#221; or &Yacute;)
Þ or Þ
capital Thorn (Icelandic) (&#222; or &THORN;)
ß or ß
small sharp s (German) (sz ligature) (&#223; or &szlig;)
à or à
small a, grave (&#224; or &agrave;)
á or á
small a, acute (&#225; or &aacute;)
â or â
small a, circumflex (&#226; or &acirc;)
ã or ã
small a, tilde (&#227; or &atilde;)
ä or ä
small a, umlaut (&#228; or &auml;)
å or å
small a, ring (&#229; or &aring;)
æ or æ
small ae diphthong (ligature) (&#230; or &aelig;)
ç or ç
small c, cedilla (&#231; or &ccedil;)
è or è
small e, grave (&#232; or &egrave;)
é or é
small e, acute (&#233; or &eacute;)
ê or ê
small e, circumflex (&#234; or &ecirc;)
ë or ë
small e, umlaut (&#235; or &euml;)
ì or ì
small i, grave (&#236; or &igrave;)
í or í
small i, acute (&#237; or &iacute;)
î or î
small i, circumflex (&#238; or &icirc;)
ï or ï
small i, umlaut (&#239; or &iuml;)
ð or ð
small eth (Icelandic) (&#240; or &eth;)
ñ or ñ
small n, tilde (&#241; or &ntilde;)
ò or ò
small o, grave (&#242; or &ograve;)
ó or ó
small o, acute (&#243; or &oacute;)
ô or ô
small o, circumflex (&#244; or &ocirc;)
õ or õ
small o, tilde (&#245; or &otilde;)
ö or ö
small o, umlaut (&#246; or &ouml;)
÷ or ÷
divide sign (&#247; or &divide;)
ø or ø
small o, slash (&#248; or &oslash;)
ù or ù
small u, grave (&#249; or &ugrave;)
ú or ú
small u, acute (&#250; or &uacute;)
û or û
small u, circumflex (&#251; or &ucirc;)
ü or ü
small u, umlaut (&#252; or &uuml;)
ý or ý
small y, acute (&#253; or &yacute;)
þ or þ
small thorn (Icelandic) (&#254; or &thorn;)
ÿ or ÿ
small y, umlaut (&#255; or &yuml;)

Latin Extended-A

Unicode U+0100 – U+017F   (256-383)

Œ or Œ
capital OE ligature (&#338; or &OElig;)
œ or œ
small oe ligature (&#339; or &oelig;)
Š or Š
capital S, caron (&#352; or &Scaron;)
š or š
small s, caron (&#353; or &scaron;)
Ÿ or Ÿ
capital Y, umlaut (&#376; or &Yuml;)

Latin Extended-B

Unicode U+0180 – U+024F   (384-591)

ƒ or ƒ
Florin, Guilder or Gulden (&#402; or &fnof;)

Spacing Modifier Letters

Unicode U+02B0 – U+02FF   (688-767)

ˆ or ˆ
circumflex accent (&#710; or &circ;)
˜ or ˜
small tilde (&#732; or &tilde;)

Greek

Unicode U+0370 – U+03FF   (880-1023)

Α or Α
capital Alpha (&#913; or &Alpha;)
Β or Β
capital Beta (&#914; or &Beta;)
Γ or Γ
capital Gamma (&#915; or &Gamma;)
Δ or Δ
capital Delta (&#916; or &Delta;)
Ε or Ε
capital Epsilon (&#917; or &Epsilon;)
Ζ or Ζ
capital Zeta (&#918; or &Zeta;)
Η or Η
capital Eta (&#919; or &Eta;)
Θ or Θ
capital Theta (&#920; or &Theta;)
Ι or Ι
capital Iota (&#921; or &Iota;)
Κ or Κ
capital Kappa (&#922; or &Kappa;)
Λ or Λ
capital Lambda (&#923; or &Lambda;)
Μ or Μ
capital Mu (&#924; or &Mu;)
Ν or Ν
capital Nu (&#925; or &Nu;)
Ξ or Ξ
capital Xi (&#926; or &Xi;)
Ο or Ο
capital Omicron (&#927; or &Omicron;)
Π or Π
capital Pi (&#928; or &Pi;)
Ρ or Ρ
capital Rho (&#929; or &Rho;)
Σ or Σ
capital Sigma (&#931; or &Sigma;)
Τ or Τ
capital Tau (&#932; or &Tau;)
Υ or Υ
capital Upsilon (&#933; or &Upsilon;)
Φ or Φ
capital Phi (&#934; or &Phi;)
Χ or Χ
capital Chi (&#935; or &Chi;)
Ψ or Ψ
capital Psi (&#936; or &Psi;)
Ω or Ω
capital Omega (&#937; or &Omega;)
α or α
lower case alpha (&#945; or &alpha;)
β or β
lower case beta (&#946; or &beta;)
γ or γ
lower case gamma (&#947; or &gamma;)
δ or δ
lower case delta (&#948; or &delta;)
ε or ε
lower case epsilon (&#949; or &epsilon;)
ζ or ζ
lower case zeta (&#950; or &zeta;)
η or η
lower case eta (&#951; or &eta;)
θ or θ
lower case theta (&#952; or &theta;)
ι or ι
lower case iota (&#953; or &iota;)
κ or κ
lower case kappa (&#954; or &kappa;)
λ or λ
lower case lambda (&#955; or &lambda;)
μ or μ
lower case mu (&#956; or &mu;)
ν or ν
lower case nu (&#957; or &nu;)
ξ or ξ
lower case xi (&#958; or &xi;)
ο or ο
lower case omicron (&#959; or &omicron;)
π or π
lower case pi (&#960; or &pi;)
ρ or ρ
lower case rho (&#961; or &rho;)
ς or ς
lower case sigma (terminal) (&#962; or &sigmaf;)
σ or σ
lower case sigma (&#963; or &sigma;)
τ or τ
lower case tau (&#964; or &tau;)
υ or υ
lower case upsilon (&#965; or &upsilon;)
φ or φ
lower case phi (&#966; or &phi;)
χ or χ
lower case chi (&#967; or &chi;)
ψ or ψ
lower case psi (&#968; or &psi;)
ω or ω
lower case omega (&#969; or &omega;)
ϑ or ϑ
theta symbol (&#977; or &thetasym;)
ϒ or ϒ
upsilon with hook symbol (&#978; or &upsih;)
ϖ or ϖ
pi symbol (&#982; or &piv;)

General Punctuation

Unicode U+2000 – U+206F   (8192-8303)

’ or ‘
en space (&#8194; or &ensp;)
’ or ‘
em space (&#8195; or &emsp;)
’ or ‘
thin space (&#8201; or &thinsp;)
‌ or ‌
zero width non-joiner (&#8204; or &zwnj;)
‍ or ‍
zero width joiner (&#8205; or &zwj;)
‎ or ‎
left-to-right mark (&#8206; or &lrm;)
‏ or ‏
right-to-left mark (&#8207; or &rlm;)
– or –
en dash (&#8211; or &ndash;)
— or —
em dash (&#8212; or &mdash;)
‘ or ‘
left single quotation mark, left single inverted comma (&#8216; or &lsquo;)
’ or ’
right single quotation mark, right single inverted comma, apostrophe (&#8217; or &rsquo;)
‚ or ‚
single low-9 quotation mark (&#8218; or &sbquo;)
“ or “
left double quotation mark, left double inverted commas (&#8220; or &ldquo;)
” or ”
right double quotation mark, right double inverted commas (&#8221; or &rdquo;)
„ or „
double low-9 quotation mark (&#8222; or &bdquo;)
† or †
dagger (&#8224; or &dagger;)
‡ or ‡
double dagger (&#8225; or &Dagger;)
• or •
bullet, black small circle (&#8226; or &bull;)
… or …
horizontal ellipsis, three dot leader (&#8230; or &hellip;)
‰ or ‰
per mille sign (&#8240; or &permil;)
′ or ′
prime, minutes, feet (&#8242; or &prime;)
″ or ″
double prime, seconds, inches (&#8243; or &Prime;)
‹ or ‹
single angle quotation mark (left) (< is less than) (&#8249; or &lsaquo;)
› or ›
single angle quotation mark (right) (> is greater than) (&#8250; or &rsaquo;)
‾ or ‾
overline, spacing overscore (&#8254; or &oline;)
⁄ or ⁄
figure slash (fraction) (/ is normal slash) (&#8260; or &frasl;)

Currency Symbols

Unicode U+20A0 – U+20CF   (8352-8399)

€ or €
euro (&#8364; or &euro;)

Letterlike Symbols

Unicode U+2100 – U+214F   (8448-8527)

ℑ or ℑ
blackletter I, imaginary part, I fraktur (&#8465; or &image;)
℘ or ℘
script capital P, Weierstrass, power set (&#8472; or &weierp;)
ℜ or ℜ
blackletter R, real part, R fraktur (&#8476; or &real;)
™ or ™
trade mark (&#8482; or &trade;)
ℵ or ℵ
alef symbol, first transfinite cardinal (&#8501; or &alefsym;)

Arrows

Unicode U+2190 – U+21FF   (8592-8703)

← or ←
arrow left (&#8592; or &larr;)
↑ or ↑
arrow up (&#8593; or &uarr;)
→ or →
arrow right (&#8594; or &rarr;)
↓ or ↓
arrow down (&#8595; or &darr;)
↔ or ↔
arrow left and right (&#8596; or &harr;)
↵ or ↵
downwards arrow with corner leftwards, carriage return (&#8629; or &crarr;)
⇐ or ⇐
double arrow left (implied by) (&#8656; or &lArr;)
⇑ or ⇑
double arrow up (&#8657; or &uArr;)
⇒ or ⇒
double arrow right (implies) (&#8658; or &rArr;)
⇓ or ⇓
double arrow down (&#8659; or &dArr;)
⇔ or ⇔
double arrow left and right (&#8660; or &hArr;)

Mathematical Operators

Unicode U+2200 – U+22FF   (8704-8959)

∀ or ∀
for all (&#8704; or &forall;)
∂ or ∂
partial differential (&#8706; or &part;)
∃ or ∃
there exists (&#8707; or &exist;)
∅ or ∅
empty set, null set, diameter (&#8709; or &empty;)
∇ or ∇
nabla, backward difference (&#8711; or &nabla;)
∈ or ∈
element of, set membership (&#8712; or &isin;)
∉ or ∉
not an element of, denial of set membership (&#8713; or &notin;)
∋ or ∋
contains as member (&#8715; or &ni;)
∏ or ∏
n-ary product, product sign (&#8719; or &prod;)
∑ or ∑
n-ary summation, sum of (&#8721; or &sum;)
− or −
minus sign (- is hyphen) (&#8722; or &minus;)
∗ or ∗
asterisk operator (&#8727; or &lowast;)
√ or √
square root, radical sign (&#8730; or &radic;)
∝ or ∝
proportional to (&#8733; or &prop;)
∞ or ∞
infinity (&#8734; or &infin;)
∠ or ∠
angle (&#8736; or &ang;)
∧ or ∧
logical and, wedge (&#8743; or &and;)
∨ or ∨
logical or, vee (&#8744; or &or;)
∩ or ∩
intersection, cap (&#8745; or &cap;)
∪ or ∪
union, cup (&#8746; or &cup;)
∫ or ∫
integral (&#8747; or &int;)
∴ or ∴
therefore (&#8756; or &there4;)
∼ or ∼
tilde operator, varies with, similar to (&#8764; or &sim;)
≅ or ≅
congruent, approximately equal to (&#8773; or &cong;)
≈ or ≈
asymptotic to, almost equal to (&#8776; or &asymp;)
The Adobe PostScript name for this character is approxequal
≠ or ≠
not equal to (&#8800; or &ne;)
≡ or ≡
equivalence, identical to (&#8801; or &equiv;)
≤ or ≤
less than or equal to (&#8804; or &le;)
≥ or ≥
greater than or equal to (&#8805; or &ge;)
⊂ or ⊂
subset of, proper subset (&#8834; or &sub;)
⊃ or ⊃
superset of (&#8835; or &sup;)
⊄ or ⊄
not a subset of (&#8836; or &nsub;)
⊆ or ⊆
subset of or equal to (&#8838; or &sube;)
⊇ or ⊇
superset of or equal to (&#8839; or &supe;)
⊕ or ⊕
circled plus, direct sum (&#8853; or &oplus;)
⊗ or ⊗
circled times, vector product (&#8855; or &otimes;)
⊥ or ⊥
up tack, orthogonal to, perpendicular (&#8869; or &perp;)
⋅ or ⋅
dot operator (· is middle dot) (&#8901; or &sdot;)

Miscellaneous Technical

Unicode U+2300 – U+23FF   (8960-9215)

⌈ or ⌈
left ceiling, APL upstile (&#8968; or &lceil;)
⌉ or ⌉
right ceiling (&#8969; or &rceil;)
⌊ or ⌊
left floor, APL downstile (&#8970; or &lfloor;)
⌋ or ⌋
right floor (&#8971; or &rfloor;)
〈 or ⟨
left-pointing angle bracket, bra (< is less than; ‹ is left single angle quotation mark) (&#9001; or &lang;)
〉 or ⟩
right-pointing angle bracket, ket (> is greater than; › is right single angle quotation mark) (&#9002; or &rang;)

Geometric Shapes

Unicode U+25A0 – U+25FF   (9632-9727)

◊ or ◊
lozenge (&#9674; or &loz;)

Miscellaneous Symbols

Unicode U+2600 – U+26FF   (9728-9983)

♠ or ♠
black spade suit (&#9824; or &spades;)
♣ or ♣
black club suit, shamrock (&#9827; or &clubs;)
♥ or ♥
black heart suit, valentine (&#9829; or &hearts;)
♦ or ♦
black diamond suit (&#9830; or &diams;)


Top

Copyright © 1997–2009 Alan Wood

Created 10th May 1997   Last modified 28th March 2009

Send comments or questions to Alan Wood

Portions © International Organization for Standardization 1986

Derived in part from entities.html. Copyright © World Wide Web Consortium, (Massachusetts Institute of Technology, Institut National de Recherche en Informatique et en Automatique, Keio University). All Rights Reserved. http://www.w3.org/Consortium/Legal/

Alan Wood’s Web site