Alan Wood’s Unicode Resources

Test for Unicode support in Web browsers

General Punctuation

U+2000 – U+206F   (8192–8303)

Characters 8211–8213, 8215–8222, 8224–8226, 8230, 8240, 8242, 8243, 8249, 8250, 8252, 8254 and 8260 are present in Microsoft’s WGL4 character set. Characters 8211, 8212, 8216–8218, 8220–8222, 8224–8226, 8230, 8240, 8249, 8250 and 8260 provide Unicode equivalents for some of the characters in Apple’s MacRoman character set.

Characters 8230, 8242, 8243 and 8260 provide Unicode equivalents for some of the characters in Monotype’s Symbol font.

Characters 8211, 8212, 8216–8218, 8220–8222, 8224–8226, 8230, 8240, 8249 and 8250 provide Unicode equivalents for some of the characters in the ANSI character set.

There is another collection of punctuation marks in the Supplemental Punctuation range, and there are many others in the ranges for specific scripts.

The characters that appear in the “Character” columns of the following table depend on the browser that you are using, the fonts installed on your computer, and the browser options you have chosen that determine the fonts used to display particular character sets, encodings or languages. The entries in the “Entity” column are character entity references that can be used in HTML pages.

Almost all fonts contain at least a few characters from this range. You can find some or all of the characters in this range in the Windows Unicode fonts aakar, Aboriginal Sans, Aboriginal Serif, Abyssinica SIL, Aegean, Aegyptus, Aharoni, AiPaiNutaaq, Akhil HE, Akkadian, Akshar Unicode, Aleem Urdu Unicode, Alexander, Alfios, Alice0 MX, Alice1 MX, Alice2 MX, Alice 3 MX, Alice4 MX, Alice5 MX, Alkaios, ALPHABETUM Unicode, Anaktoria, Analecta, Andale Mono, Andalus, Andika Basic, Andron Scriptor Web, Angsana New, AngsanaUPC, AnmolUni, AnmolUniBani, AR PL ShanHeiSun Uni, AR PL ZenKai Uni, Arabic Transparent, Arabic Typesetting, Arev Sans, Arial, Arial Unicode MS, Aroania, Atavyros, AttikaU, Avdira, BabelStone Han, BabelStone Phags-pa Book, Baekmuk Batang, Baekmuk Dotum, Baekmuk Gulim, Baekmuk Headline, Batang, BatangChe, Berling Antiqua, Bitstream CyberBase, Bitstream CyberBit, Bitstream CyberCJK, BosporosU, BPG Classic 99U, BPG Paata Khutsuri U, Browallia New, BrowalliaUPC, Bukyvede, Calibri, Cambria, Cambria Math, Candara, Cardo, Caslon, CDT Khmer, Century, Chandas, Charis SIL, Chrysanthi Unicode, CN-Times, Code2000, Code2001, Consolas, Constantia, Corbel, Cordia New, CordiaUPC, Courier MonoThai, Courier New, Dai Banna SIL Book, Dai Banna SIL Light, Daicing Bible, Daicing Harmony, Daicing Round, Daicing White, Daicing Xiaokai, DaunPenh, David, David Transparent, DejaVu Sans, DejaVu Sans Condensed, DejaVu Sans Mono, DejaVu Serif, DejaVu Serif Condensed, Dialekt Uni, Digohweli, Dilyana, DokChampa, Doulos SIL, Dukor, Ekushey Azad, Ekushey Durga, Ekushey Puja, Ekushey Punarbhaba, Ekushey Saraswatii, Ekushey Sharifa, Ekushey Sumit, e-PhonTranslit UNI, Ethiopia Jiret, Euphemia, Euphemia CAS, EversonMono, Ezra SIL, Ezra SIL SR, Fixed Miriam Transparent, Fixedsys Excelsior, FMBF Bardi, FrankRuehl, Free Idg Serif, Free Monospaced, Free Sans, Free Serif, Frutiger Linotype, Galatia SIL, Galilee Unicode Gk, Gandhari Unicode, Garava, Gargi, Garuda, Geez Unicode, Gentium, GentiumAlt, Georgia, GFS Bodoni, GFS Didot, GFS Neohellenic, Gisha, GlobalScience, Goher Urdu Unicode, Gulim Che, Gungsuh, GungsuhChe, HAN NOM A, HAN NOM B, Hapax Berbère, Hapax Touareg, Hapax Touareg DàG, Hindsight Unicode, HY Shin Myeongjo Std Acro, Impact, Iskoola Pota, Jaipur Unicode NFLC, jGaramond, JG Basic Lao, JG Chantabouli Lao, JG LaoTimes, Jomolhari, Junicode, KadmosU, Kalinga, KaputaUnicode, Kartika, Kayases, Khmer OS, Khmer OS Fasthand, Khmer OS Freehand, Khmer OS Metal Chrieng, Khmer OS Muol, Khmer OS System, Kidprint, Kisiska, Kliment Std, Kochi Gothic, Kochi Mincho, Kozuka Mincho Pro Acro, Kurdish AllAlphabets, Lao Unicode, Lateef, LeedsUni, Leelawadee, Legendum, Levenim MT, Linux Biolinum O, Linux Libertine O, Loma, Lucida Bright, Lucida Console, Lucida Grande, Lucida Sans, Lucida Sans Typewriter, Lucida Sans Unicode, Malgun Gothic, Marin, Masinahikan, Masinahikan Dene, MD King KhammuRabi, MgOldTimes UC Pol, MgOpen Canonica, Microsoft Himalaya, Microsoft Sans Serif, Microsoft Uighur, Microsoft Yi Baiti, Mike Hebrew, Mike Hebrew Web, Ming(for ISO10646), MingLiU, MingLiU_HKSCS, Minion Pro, Miriam, Miriam Fixed, Miriam Transparent, .Mondulkiri U GR 1.5, Mongolian Baiti, Monospace, MoolBoran, MPH 2B Damase, MS Gothic, MS Mincho, MS PGothic, MS PMincho, MS Reference Sans Serif, MS Reference Serif, MS UI Gothic, MSung Std Acro, Musica, MyaZedi_M17N, MyMyanmar, Myriad Pro, Narkisim, Nastaleeq Like, New Athena Unicode, New Gulim, Norasi, NSimSun, NSimSun-18030, Nyala, OpenSymbol, OskiBlackfoot, OskiDakelh, OskiDeneA, OskiDeneB, OskiDeneC, OskiDeneS, OskiEast, OskiWest, Padauk, padmaa, PakType Naqsh, Palatino Linotype, ParabaikSans, Pashto Kror Asiatype, Phetsarath OT, PhnomPenh OT, Pitabek, Plantagenet Cherokee, PMingLiU, Potha, Quivira, Reader Sans, Rekha, Rod, Rod Transparent, RomanCyrillic Std, Roman Unicode, Rotinonhsonni Sans, Rupakara, Samda, Sanskrit 2003, Santipur OT, Saysettha MX, Saysettha OT, Saysettha Unicode, Sazanami Gothic, Sazanami Mincho, SBL Greek, SBL Hebrew, Scheherazade, Segoe Print, Segoe Script, Segoe UI, Siddhanta, SimHei, SImPL, Simplified Arabic, Simplified Arabic Fixed, SimSun, SimSun-18030, sixpack, Sophia Nubian, StarSymbol, STIXGeneral, STSong Std Acro, Summersby, Sun-ExtA, Sylfaen, Symbola, TabAvarangal2, Tahoma, Thryomanes, Tibetan Machine Uni, Times New Roman, TITUS Cyberbit Basic, Traditional Arabic, Trebuchet MS, TSC FMing S TT, TSC JSong S TT, UnBatang, Ugaritic 3.03 Unicode, UniBurma, Unikurd Web, Uqammaq, Urdu Naskh Asiatype, Urdu Naskh Unicode, Uttara, VangVieng MX, Verajja, Verdana, Visual Geez Unicode, Visual Geez Unicode Agazian, Visual Geez Unicode Title, Vrinda, Vusillus Old Face, Wakor, Wangdi29, WenQuanYi Zen Hei, WenQuanYi Zen Hei Mono, XiengThong MX, XTashi, Yigezu Bisrat Gothic Goffer, YOzFontN and Zawgyi-One; in the Macintosh OS 9 Unicode fonts Apple Chancery, Capitals, Charcoal, Charcoal CE, Charcoal CY, Chicago, Chicago CE, Chicago CY, ChuGothic, Courier, Courier CE, Gadget, Geneva, Geneva CE, Geneva CY, HeiseiKakuGothic, HeiseiMincho, Helvetica, Helvetica CE, Helvetica CY, Hoefler Text, Monaco, Monaco CE, Monaco CY, New York, Osaka, Osaka-Mono, Palatino, Palatino CE, Sand, SaiMincho, Skia, Tahoma, Techno, Textile, Times, Times CE and Times CY; in the Macintosh OS X Unicode fonts Alkaios, AppleGothic, Apple LiGothic, Apple LiSung, AppleMyungjo, Apple Symbols, Arial, Ayuthaya, Batang, Beijing, BiauKai, BJCree Uni, Century, Chalkboard, Charcoal, Charcoal CY, Chicago, Conakry, Courier, Didot, Euphemia UCAS, Fang Song, Futura, Geneva, Geneva CY, Gentium, GentiumAlt, #GothicMedium, Gulim, #GungSeo, Hangang, Hei, Helvetica, Helvetica CY, Hiragino Kaku Gothic Pro, Hiragino Kaku Gothic Std, Hiragino Maru Gothic Pro, Hiragino Mincho Pro, Kai, Junicode, Krungthep, Lucida Grande, Monaco, Monaco CY, MS Gothic, Mshtakan, MS Mincho, MS PGothic, MS PMincho, #MyungjoNeue, New Athena Unicode, New York, Osaka, Osaka-Mono, #PCMyungjo, #PilGi, Plantagenet Cherokee, PMingLiU, Sand, Sathu, Seoul, Silom, SimSun, Skia, Song, STFangsong, STHeiti, STKaiti, STSong, #TaeGraphic, Taipei, TektonPro, Thonburi, Times, Times CY, Times New Roman, Trebuchet MS, Verdana, Zapfino and Zuzumbo; and in the Unix Unicode font Caslon.

To see exactly which characters are included in a particular font, you can use a utility such as Andrew West’s BabelMap, Apple’s TrueEdit, or WunderMoosen’s FontChecker.

If you are not familiar with the characters, you can check the characters displayed here with the graphical display at http://www.unicode.org/charts/PDF/U2000.pdf.

Character
(decimal)
DecimalCharacter
(hex)
HexEntityName
 8192 2000 EN QUAD
81932001 EM QUAD
81942002 EN SPACE
81952003 EM SPACE
81962004 THREE-PER-EM SPACE
81972005 FOUR-PER-EM SPACE
81982006 SIX-PER-EM SPACE
81992007 FIGURE SPACE
82002008 PUNCTUATION SPACE
82012009 THIN SPACE
8202200A HAIR SPACE
8203200B ZERO WIDTH SPACE
8204200C‌ZERO WIDTH NON-JOINER
8205200D‍ZERO WIDTH JOINER
8206200E‎LEFT-TO-RIGHT MARK
8207200F‏RIGHT-TO-LEFT MARK
82082010 HYPHEN
82092011 NON-BREAKING HYPHEN
82102012 FIGURE DASH
82112013–EN DASH   (present in WGL4, ANSI and MacRoman)
82122014—EM DASH   (present in WGL4, ANSI and MacRoman)
82132015 HORIZONTAL BAR (present in WGL4)
82142016 DOUBLE VERTICAL LINE
82152017 DOUBLE LOW LINE (present in WGL4)
82162018‘LEFT SINGLE QUOTATION MARK   (present in WGL4, ANSI and MacRoman)
82172019’RIGHT SINGLE QUOTATION MARK   (present in WGL4, ANSI and MacRoman)
8218201A‚SINGLE LOW-9 QUOTATION MARK   (present in WGL4, ANSI and MacRoman)
8219201B SINGLE HIGH-REVERSED-9 QUOTATION MARK (present in WGL4)
8220201C“LEFT DOUBLE QUOTATION MARK   (present in WGL4, ANSI and MacRoman)
8221201D”RIGHT DOUBLE QUOTATION MARK   (present in WGL4, ANSI and MacRoman)
8222201E„DOUBLE LOW-9 QUOTATION MARK   (present in WGL4, ANSI and MacRoman)
8223201F DOUBLE HIGH-REVERSED-9 QUOTATION MARK
82242020†DAGGER   (present in WGL4, ANSI and MacRoman)
82252021‡DOUBLE DAGGER   (present in WGL4, ANSI and MacRoman)
82262022•BULLET   (present in WGL4, ANSI and MacRoman)
82272023 TRIANGULAR BULLET
82282024 ONE DOT LEADER
82292025 TWO DOT LEADER
82302026…HORIZONTAL ELLIPSIS   (present in WGL4, ANSI, MacRoman, and in Symbol font)
82312027 HYPHENATION POINT
82322028 LINE SEPARATOR
82332029 PARAGRAPH SEPARATOR
8234202A LEFT-TO-RIGHT EMBEDDING
8235202B RIGHT-TO-LEFT EMBEDDING
8236202C POP DIRECTIONAL FORMATTING
8237202D LEFT-TO-RIGHT OVERRIDE
8238202E RIGHT-TO-LEFT OVERRIDE
8239202F NARROW NON-BREAK SPACE
82402030‰PER MILLE SIGN   (present in WGL4, ANSI and MacRoman)
82412031 PER TEN THOUSAND SIGN
82422032′PRIME   (present in WGL4 and in Symbol font)
82432033″DOUBLE PRIME   (present in WGL4 and in Symbol font)
82442034 TRIPLE PRIME
82452035 REVERSED PRIME
82462036 REVERSED DOUBLE PRIME
82472037 REVERSED TRIPLE PRIME
82482038 CARET
82492039‹SINGLE LEFT-POINTING ANGLE QUOTATION MARK   (present in WGL4, ANSI and MacRoman)
8250203A›SINGLE RIGHT-POINTING ANGLE QUOTATION MARK   (present in WGL4, ANSI and MacRoman)
8251203B REFERENCE MARK
8252203C DOUBLE EXCLAMATION MARK (present in WGL4)
8253203D INTERROBANG
8254203E‾OVERLINE   (present in WGL4)
8255203F UNDERTIE
82562040 CHARACTER TIE
82572041 CARET INSERTION POINT
82582042 ASTERISM
82592043 HYPHEN BULLET
82602044⁄FRACTION SLASH   (present in WGL4 and MacRoman, and in Symbol font)
82612045 LEFT SQUARE BRACKET WITH QUILL
82622046 RIGHT SQUARE BRACKET WITH QUILL
82632047 DOUBLE QUESTION MARK
82642048 QUESTION EXCLAMATION MARK
82652049 EXCLAMATION QUESTION MARK
8266204A TIRONIAN SIGN ET
8267204B REVERSED PILCROW SIGN
8268204C BLACK LEFTWARDS BULLET
8269204D BLACK RIGHTWARDS BULLET
8270204E LOW ASTERISK
8271204F REVERSED SEMICOLON
82722050 CLOSE UP
82732051 TWO ASTERISKS ALIGNED VERTICALLY
82742052 COMMERCIAL MINUS SIGN
82752053 SWUNG DASH
82762054 INVERTED UNDERTIE
82772055 FLOWER PUNCTUATION MARK
82782056 THREE DOT PUNCTUATION
82792057 QUADRUPLE PRIME
82802058 FOUR DOT PUNCTUATION
82812059 FIVE DOT PUNCTUATION
8282205A TWO DOT PUNCTUATION
8283205B FOUR DOT MARK
8284205C DOTTED CROSS
8285205D TRICOLON
8286205E VERTICAL FOUR DOTS
8287205F MEDIUM MATHEMATICAL SPACE
82882060 WORD JOINER
82892061 FUNCTION APPLICATION
82902062 INVISIBLE TIMES
82912063 INVISIBLE SEPARATOR
82922064 INVISIBLE PLUS
82942066 LEFT-TO-RIGHT ISOLATE
82952067 RIGHT-TO-LEFT ISOLATE
82962068 FIRST STRONG ISOLATE
82972069 POP DIRECTIONAL ISOLATE
8298206A INHIBIT SYMMETRIC SWAPPING
8299206B ACTIVATE SYMMETRIC SWAPPING
8300206C INHIBIT ARABIC FORM SHAPING
8301206D ACTIVATE ARABIC FORM SHAPING
8302206E NATIONAL DIGIT SHAPES
8303206F NOMINAL DIGIT SHAPES

Copyright © 1999–2013 Alan Wood

The hexadecimal numbers and the character names in the above table are taken from the Unicode 6.3.0 Character Database, Copyright © 1991–2013 Unicode, Inc., as contained in UnicodeData.txt on the Unicode Web site (http://www.unicode.org/Public/UNIDATA/) in November 2013.

Created 3rd February 1999   Last updated 24th November 2013

Send comments or questions to Alan Wood

HTML 4.01