A Numeric Character Reference (NCR) is a construct found in SGML and thus HTML and XML which represents a Unicode glyph. NCR may be decimal, hexadecimal, or perhaps other bases. However, decimal references are most widely supported, despite W3C’s hexadecimal recommendation. Character Entity References (CER) use short names rather than numbers to represent glyphs, but are neither exhaustive nor supported in XML without a Document Type Definition (DTD). There are five exceptions: quot, amp, apos, lt, and gt are the only built-in character entities honored by all XML processors. Apos is not explicitly declared in HTML but does exist in XHTML (because it is based on XML).
In other words, use the column(s) on the left (such as Ӓ) for all your funny characters in HTML and XML. Below is a tiny subset of the Universal Character set:
| Numeric Char Ref | Char Entity Ref | Glyph | Char Name | |||
|---|---|---|---|---|---|---|
| " | " | " | quotation mark | |||
| & | & | & | ampersand | |||
| ' | ' | ' | apostrophe | |||
| < | < | < | less-than sign | |||
| > | > | > | greater-than sign | |||
|   | | no-break space | ||||
| ¢ | ¢ | ¢ | cent sign | |||
| £ | £ | £ | pound sterling sign | |||
| ¥ | ¥ | ¥ | yen sign | |||
| © | © | © | copyright sign | |||
| ° | ° | ° | degree sign | |||
| Å | å | Å | å | Å | å | A, ring |
| Æ | æ | Æ | æ | Æ | æ | AE diphthong (ligature) |
| Ð | ð | Ð | ð | Ð | ð | Eth, Icelandic |
| Ñ | ñ | Ñ | ñ | Ñ | ñ | N tilde |
| Ø | ø | Ø | ø | Ø | ø | O, slash |
| Þ | þ | Þ | þ | Þ | þ | Thorn, Icelandic |
| Ā | ā | Ā | ā | |||
| Ē | ē | Ē | ē | |||
| Ī | ī | Ī | ī | |||
| Ō | ō | Ō | ō | |||
| Ū | ū | Ū | ū | |||
| Ḍ | ḍ | Ḍ | ḍ | |||
| Ḥ | ḥ | Ḥ | ḥ | |||
| Ḷ | ḷ | Ḷ | ḷ | |||
| Ṁ | ṁ | Ṁ | ṁ | |||
| Ṃ | ṃ | Ṃ | ṃ | |||
| Ṅ | ṅ | Ṅ | ṅ | |||
| Ṇ | ṇ | Ṇ | ṇ | |||
| Ṭ | ṭ | Ṭ | ṭ | |||
| € | € | € | Euro Sign | |||
| ← | ← | leftwards arrow | ||||
| ↑ | ↑ | upwards arrow | ||||
| → | → | rightwards arrow | ||||
| ↓ | ↓ | downwards arrow | ||||
| ↖ | ↖ | north west arrow | ||||
| ↗ | ↗ | north east arrow | ||||
| ↘ | ↘ | south east arrow | ||||
| ↙ | ↙ | south west arrow | ||||
| ♠ | ♠ | ♠ | black spade suit | |||
| ♣ | ♣ | ♣ | black club suit | |||
| ♥ | ♥ | ♥ | black heart suit | |||
| ♦ | ♦ | ♦ | black diamond suit | |||
| Numeric Char Ref | Char Entity Ref | Glyph | Char Name | |||