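In SGML, HTML, and XML documents, a Unicode character that cannot be represented directly in the document's current encoding (for example ISO-8859-1) can be written with one of two escape sequences: a numeric character reference or a character entity reference. This article lists the character entity references that are valid in HTML and XML documents.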
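The two escape forms can be compared in a short Python sketch. It relies only on the standard library (`html.unescape` and the `xmlcharrefreplace` error handler); the helper names `to_numeric_refs` and `decode_refs` are hypothetical and chosen for illustration.

```python
import html


def to_numeric_refs(text: str) -> str:
    """Replace every non-ASCII character with a decimal numeric character reference."""
    # The "xmlcharrefreplace" error handler emits &#NNN; for unencodable characters.
    return text.encode("ascii", "xmlcharrefreplace").decode("ascii")


def decode_refs(markup: str) -> str:
    """Resolve named, decimal, and hexadecimal references back to characters."""
    return html.unescape(markup)


if __name__ == "__main__":
    print(to_numeric_refs("café €"))
    print(decode_refs("&eacute; &#233; &#xE9;"))
```

The named reference, the decimal reference, and the hexadecimal reference in the second call all denote the same character, U+00E9.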
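Predefined entities in XML
The XML specification does not use the terms "character entity" or "character entity reference". It defines five "predefined entities" to represent special characters, and it also allows each document to declare any number of additional named entities.
The table below lists the five XML predefined entities. An entity is referenced by name in the form &name;, for example &amp; renders as &.
Name | Character | Unicode code point (decimal) | Standard | Description |
---|---|---|---|---|
quot | " | U+0022 (34) | XML 1.0 | quotation mark |
amp | & | U+0026 (38) | XML 1.0 | ampersand |
apos | ' | U+0027 (39) | XML 1.0 | apostrophe |
lt | < | U+003C (60) | XML 1.0 | less-than sign |
gt | > | U+003E (62) | XML 1.0 | greater-than sign |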
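As a minimal sketch of how the five predefined entities are used in practice, the following Python code escapes and unescapes text with the standard library's `xml.sax.saxutils` module and checks that an XML parser resolves all five without any DTD. The wrapper names `xml_escape`, `xml_unescape`, and `parse_text` are hypothetical.

```python
import xml.etree.ElementTree as ET
from xml.sax.saxutils import escape, unescape

# escape()/unescape() handle &, < and > by default; the two quote
# characters are passed explicitly so that all five entities are covered.
_QUOTES_OUT = {'"': "&quot;", "'": "&apos;"}
_QUOTES_IN = {"&quot;": '"', "&apos;": "'"}


def xml_escape(text: str) -> str:
    """Escape text using the five XML predefined entities."""
    return escape(text, _QUOTES_OUT)


def xml_unescape(text: str) -> str:
    """Reverse xml_escape()."""
    return unescape(text, _QUOTES_IN)


def parse_text(document: str) -> str:
    """Return the text content of the root element of a small XML document."""
    return ET.fromstring(document).text


if __name__ == "__main__":
    print(xml_escape("a < b & \"c\" 'd'"))
    print(parse_text("<r>&lt;&amp;&gt;&quot;&apos;</r>"))
```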
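Character entity references in HTML
The HTML 4 DTDs define 252 named entities. The HTML 4 specification requires the use of the standard DTDs and does not allow users to define additional named entities.
In the table below, the "Standard" column indicates the version of the HTML DTD in which the character entity was first defined. HTML 4.01 did not add any new character entities.
Name | Character | Unicode code point (decimal) | Standard | DTD[a] | Old ISO subset[b] | Description[c] |
---|---|---|---|---|---|---|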
quot | " | U+0022 (34) | HTML 2.0 | HTMLspecial | ISOnum | quotation mark |
amp | & | U+0026 (38) | HTML 2.0 | HTMLspecial | ISOnum | ampersand |
apos | ' | U+0027 (39) | XHTML 1.0 | HTMLspecial | ISOnum | apostrophe; see below |
lt | < | U+003C (60) | HTML 2.0 | HTMLspecial | ISOnum | less-than sign |
gt | > | U+003E (62) | HTML 2.0 | HTMLspecial | ISOnum | greater-than sign |
nbsp |   | U+00A0 (160) | HTML 3.2 | HTMLlat1 | ISOnum | non-breaking space[d] |
iexcl | ¡ | U+00A1 (161) | HTML 3.2 | HTMLlat1 | ISOnum | inverted exclamation mark |
cent | ¢ | U+00A2 (162) | HTML 3.2 | HTMLlat1 | ISOnum | cent sign (currency) |
pound | £ | U+00A3 (163) | HTML 3.2 | HTMLlat1 | ISOnum | pound sign (currency) |
curren | ¤ | U+00A4 (164) | HTML 3.2 | HTMLlat1 | ISOnum | generic currency sign |
yen | ¥ | U+00A5 (165) | HTML 3.2 | HTMLlat1 | ISOnum | yen / yuan sign |
brvbar | ¦ | U+00A6 (166) | HTML 3.2 | HTMLlat1 | ISOnum | broken vertical bar |
sect | § | U+00A7 (167) | HTML 3.2 | HTMLlat1 | ISOnum | section sign |
uml | ¨ | U+00A8 (168) | HTML 3.2 | HTMLlat1 | ISOdia | diaeresis; see also umlaut |
copy | © | U+00A9 (169) | HTML 3.2 | HTMLlat1 | ISOnum | copyright sign |
ordf | ª | U+00AA (170) | HTML 3.2 | HTMLlat1 | ISOnum | feminine ordinal indicator |
laquo | « | U+00AB (171) | HTML 3.2 | HTMLlat1 | ISOnum | left-pointing double angle quotation mark (borrowed in Chinese as a book title mark) |
not | ¬ | U+00AC (172) | HTML 3.2 | HTMLlat1 | ISOnum | not sign (logical negation) |
shy |   | U+00AD (173) | HTML 3.2 | HTMLlat1 | ISOnum | soft hyphen |
reg | ® | U+00AE (174) | HTML 3.2 | HTMLlat1 | ISOnum | registered trademark sign |
macr | ¯ | U+00AF (175) | HTML 3.2 | HTMLlat1 | ISOdia | macron (overline) |
deg | ° | U+00B0 (176) | HTML 3.2 | HTMLlat1 | ISOnum | degree sign |
plusmn | ± | U+00B1 (177) | HTML 3.2 | HTMLlat1 | ISOnum | plus-minus sign |
sup2 | ² | U+00B2 (178) | HTML 3.2 | HTMLlat1 | ISOnum | superscript two (squared) |
sup3 | ³ | U+00B3 (179) | HTML 3.2 | HTMLlat1 | ISOnum | superscript three (cubed) |
acute | ´ | U+00B4 (180) | HTML 3.2 | HTMLlat1 | ISOdia | acute accent (= spacing acute) |
micro | µ | U+00B5 (181) | HTML 3.2 | HTMLlat1 | ISOnum | micro sign (SI prefix for one millionth) |
para | ¶ | U+00B6 (182) | HTML 3.2 | HTMLlat1 | ISOnum | pilcrow (paragraph sign) |
middot | · | U+00B7 (183) | HTML 3.2 | HTMLlat1 | ISOnum | middle dot (interpunct) |
cedil | ¸ | U+00B8 (184) | HTML 3.2 | HTMLlat1 | ISOdia | cedilla |
sup1 | ¹ | U+00B9 (185) | HTML 3.2 | HTMLlat1 | ISOnum | superscript one |
ordm | º | U+00BA (186) | HTML 3.2 | HTMLlat1 | ISOnum | masculine ordinal indicator |
raquo | » | U+00BB (187) | HTML 3.2 | HTMLlat1 | ISOnum | right-pointing double angle quotation mark (borrowed in Chinese as a book title mark) |
frac14 | ¼ | U+00BC (188) | HTML 3.2 | HTMLlat1 | ISOnum | fraction one quarter |
frac12 | ½ | U+00BD (189) | HTML 3.2 | HTMLlat1 | ISOnum | fraction one half |
frac34 | ¾ | U+00BE (190) | HTML 3.2 | HTMLlat1 | ISOnum | fraction three quarters |
iquest | ¿ | U+00BF (191) | HTML 3.2 | HTMLlat1 | ISOnum | inverted question mark |
Agrave | À | U+00C0 (192) | HTML 2.0 | HTMLlat1 | ISOlat1 | Latin capital letter A with grave accent |
Aacute | Á | U+00C1 (193) | HTML 2.0 | HTMLlat1 | ISOlat1 | Latin capital letter A with acute accent |
Acirc | Â | U+00C2 (194) | HTML 2.0 | HTMLlat1 | ISOlat1 | Latin capital letter A with circumflex |
Atilde | Ã | U+00C3 (195) | HTML 2.0 | HTMLlat1 | ISOlat1 | Latin capital letter A with tilde |
Auml | Ä | U+00C4 (196) | HTML 2.0 | HTMLlat1 | ISOlat1 | Latin capital letter A with diaeresis |
Aring | Å | U+00C5 (197) | HTML 2.0 | HTMLlat1 | ISOlat1 | Latin capital letter A with ring above |
AElig | Æ | U+00C6 (198) | HTML 2.0 | HTMLlat1 | ISOlat1 | Latin capital letter AE (ligature AE) |
Ccedil | Ç | U+00C7 (199) | HTML 2.0 | HTMLlat1 | ISOlat1 | Latin capital letter C with cedilla |
Egrave | È | U+00C8 (200) | HTML 2.0 | HTMLlat1 | ISOlat1 | Latin capital letter E with grave accent |
Eacute | É | U+00C9 (201) | HTML 2.0 | HTMLlat1 | ISOlat1 | Latin capital letter E with acute accent |
Ecirc | Ê | U+00CA (202) | HTML 2.0 | HTMLlat1 | ISOlat1 | Latin capital letter E with circumflex |
Euml | Ë | U+00CB (203) | HTML 2.0 | HTMLlat1 | ISOlat1 | Latin capital letter E with diaeresis |
Igrave | Ì | U+00CC (204) | HTML 2.0 | HTMLlat1 | ISOlat1 | Latin capital letter I with grave accent |
Iacute | Í | U+00CD (205) | HTML 2.0 | HTMLlat1 | ISOlat1 | Latin capital letter I with acute accent |
Icirc | Î | U+00CE (206) | HTML 2.0 | HTMLlat1 | ISOlat1 | Latin capital letter I with circumflex |
Iuml | Ï | U+00CF (207) | HTML 2.0 | HTMLlat1 | ISOlat1 | Latin capital letter I with diaeresis |
ETH | Ð | U+00D0 (208) | HTML 2.0 | HTMLlat1 | ISOlat1 | Latin capital letter Eth (D with stroke) |
Ntilde | Ñ | U+00D1 (209) | HTML 2.0 | HTMLlat1 | ISOlat1 | Latin capital letter N with tilde |
Ograve | Ò | U+00D2 (210) | HTML 2.0 | HTMLlat1 | ISOlat1 | Latin capital letter O with grave accent |
Oacute | Ó | U+00D3 (211) | HTML 2.0 | HTMLlat1 | ISOlat1 | Latin capital letter O with acute accent |
Ocirc | Ô | U+00D4 (212) | HTML 2.0 | HTMLlat1 | ISOlat1 | Latin capital letter O with circumflex |
Otilde | Õ | U+00D5 (213) | HTML 2.0 | HTMLlat1 | ISOlat1 | Latin capital letter O with tilde |
Ouml | Ö | U+00D6 (214) | HTML 2.0 | HTMLlat1 | ISOlat1 | Latin capital letter O with diaeresis |
times | × | U+00D7 (215) | HTML 3.2 | HTMLlat1 | ISOnum | multiplication sign |
Oslash | Ø | U+00D8 (216) | HTML 2.0 | HTMLlat1 | ISOlat1 | Latin capital letter O with stroke |
Ugrave | Ù | U+00D9 (217) | HTML 2.0 | HTMLlat1 | ISOlat1 | Latin capital letter U with grave accent |
Uacute | Ú | U+00DA (218) | HTML 2.0 | HTMLlat1 | ISOlat1 | Latin capital letter U with acute accent |
Ucirc | Û | U+00DB (219) | HTML 2.0 | HTMLlat1 | ISOlat1 | Latin capital letter U with circumflex |
Uuml | Ü | U+00DC (220) | HTML 2.0 | HTMLlat1 | ISOlat1 | Latin capital letter U with diaeresis |
Yacute | Ý | U+00DD (221) | HTML 2.0 | HTMLlat1 | ISOlat1 | Latin capital letter Y with acute accent |
THORN | Þ | U+00DE (222) | HTML 2.0 | HTMLlat1 | ISOlat1 | Latin capital letter Thorn (corresponds to "th") |
szlig | ß | U+00DF (223) | HTML 2.0 | HTMLlat1 | ISOlat1 | Latin small letter sharp s (a special letter in German) |
agrave | à | U+00E0 (224) | HTML 2.0 | HTMLlat1 | ISOlat1 | Latin small letter a with grave accent |
aacute | á | U+00E1 (225) | HTML 2.0 | HTMLlat1 | ISOlat1 | Latin small letter a with acute accent |
acirc | â | U+00E2 (226) | HTML 2.0 | HTMLlat1 | ISOlat1 | Latin small letter a with circumflex |
atilde | ã | U+00E3 (227) | HTML 2.0 | HTMLlat1 | ISOlat1 | Latin small letter a with tilde |
auml | ä | U+00E4 (228) | HTML 2.0 | HTMLlat1 | ISOlat1 | Latin small letter a with diaeresis |
aring | å | U+00E5 (229) | HTML 2.0 | HTMLlat1 | ISOlat1 | Latin small letter a with ring above |
aelig | æ | U+00E6 (230) | HTML 2.0 | HTMLlat1 | ISOlat1 | Latin small letter ae (ligature ae) |
ccedil | ç | U+00E7 (231) | HTML 2.0 | HTMLlat1 | ISOlat1 | Latin small letter c with cedilla |
egrave | è | U+00E8 (232) | HTML 2.0 | HTMLlat1 | ISOlat1 | Latin small letter e with grave accent |
eacute | é | U+00E9 (233) | HTML 2.0 | HTMLlat1 | ISOlat1 | Latin small letter e with acute accent |
ecirc | ê | U+00EA (234) | HTML 2.0 | HTMLlat1 | ISOlat1 | Latin small letter e with circumflex |
euml | ë | U+00EB (235) | HTML 2.0 | HTMLlat1 | ISOlat1 | Latin small letter e with diaeresis |
igrave | ì | U+00EC (236) | HTML 2.0 | HTMLlat1 | ISOlat1 | Latin small letter i with grave accent |
iacute | í | U+00ED (237) | HTML 2.0 | HTMLlat1 | ISOlat1 | Latin small letter i with acute accent |
icirc | î | U+00EE (238) | HTML 2.0 | HTMLlat1 | ISOlat1 | Latin small letter i with circumflex |
iuml | ï | U+00EF (239) | HTML 2.0 | HTMLlat1 | ISOlat1 | Latin small letter i with diaeresis |
eth | ð | U+00F0 (240) | HTML 2.0 | HTMLlat1 | ISOlat1 | Latin small letter eth (d with stroke) |
ntilde | ñ | U+00F1 (241) | HTML 2.0 | HTMLlat1 | ISOlat1 | Latin small letter n with tilde |
ograve | ò | U+00F2 (242) | HTML 2.0 | HTMLlat1 | ISOlat1 | Latin small letter o with grave accent |
oacute | ó | U+00F3 (243) | HTML 2.0 | HTMLlat1 | ISOlat1 | Latin small letter o with acute accent |
ocirc | ô | U+00F4 (244) | HTML 2.0 | HTMLlat1 | ISOlat1 | Latin small letter o with circumflex |
otilde | õ | U+00F5 (245) | HTML 2.0 | HTMLlat1 | ISOlat1 | Latin small letter o with tilde |
ouml | ö | U+00F6 (246) | HTML 2.0 | HTMLlat1 | ISOlat1 | Latin small letter o with diaeresis |
divide | ÷ | U+00F7 (247) | HTML 3.2 | HTMLlat1 | ISOnum | division sign |
oslash | ø | U+00F8 (248) | HTML 2.0 | HTMLlat1 | ISOlat1 | Latin small letter o with stroke |
ugrave | ù | U+00F9 (249) | HTML 2.0 | HTMLlat1 | ISOlat1 | Latin small letter u with grave accent |
uacute | ú | U+00FA (250) | HTML 2.0 | HTMLlat1 | ISOlat1 | Latin small letter u with acute accent |
ucirc | û | U+00FB (251) | HTML 2.0 | HTMLlat1 | ISOlat1 | Latin small letter u with circumflex |
uuml | ü | U+00FC (252) | HTML 2.0 | HTMLlat1 | ISOlat1 | Latin small letter u with diaeresis |
yacute | ý | U+00FD (253) | HTML 2.0 | HTMLlat1 | ISOlat1 | Latin small letter y with acute accent |
thorn | þ | U+00FE (254) | HTML 2.0 | HTMLlat1 | ISOlat1 | Latin small letter thorn |
yuml | ÿ | U+00FF (255) | HTML 2.0 | HTMLlat1 | ISOlat1 | Latin small letter y with diaeresis |
OElig | Œ | U+0152 (338) | HTML 4.0 | HTMLspecial | ISOlat2 | Latin capital ligature OE[e] |
oelig | œ | U+0153 (339) | HTML 4.0 | HTMLspecial | ISOlat2 | Latin small ligature oe[e] |
Scaron | Š | U+0160 (352) | HTML 4.0 | HTMLspecial | ISOlat2 | Latin capital letter S with caron |
scaron | š | U+0161 (353) | HTML 4.0 | HTMLspecial | ISOlat2 | Latin small letter s with caron |
Yuml | Ÿ | U+0178 (376) | HTML 4.0 | HTMLspecial | ISOlat2 | Latin capital letter Y with diaeresis |
fnof | ƒ | U+0192 (402) | HTML 4.0 | HTMLsymbol | ISOtech | Latin small letter f with hook (function symbol; florin sign) |
circ | ˆ | U+02C6 (710) | HTML 4.0 | HTMLspecial | ISOpub | modifier letter circumflex accent |
tilde | ˜ | U+02DC (732) | HTML 4.0 | HTMLspecial | ISOdia | small tilde |
Alpha | Α | U+0391 (913) | HTML 4.0 | HTMLsymbol | | Greek capital letter Alpha |
Beta | Β | U+0392 (914) | HTML 4.0 | HTMLsymbol | | Greek capital letter Beta |
Gamma | Γ | U+0393 (915) | HTML 4.0 | HTMLsymbol | ISOgrk3 | Greek capital letter Gamma |
Delta | Δ | U+0394 (916) | HTML 4.0 | HTMLsymbol | ISOgrk3 | Greek capital letter Delta |
Epsilon | Ε | U+0395 (917) | HTML 4.0 | HTMLsymbol | | Greek capital letter Epsilon |
Zeta | Ζ | U+0396 (918) | HTML 4.0 | HTMLsymbol | | Greek capital letter Zeta |
Eta | Η | U+0397 (919) | HTML 4.0 | HTMLsymbol | | Greek capital letter Eta |
Theta | Θ | U+0398 (920) | HTML 4.0 | HTMLsymbol | ISOgrk3 | Greek capital letter Theta |
Iota | Ι | U+0399 (921) | HTML 4.0 | HTMLsymbol | | Greek capital letter Iota |
Kappa | Κ | U+039A (922) | HTML 4.0 | HTMLsymbol | | Greek capital letter Kappa |
Lambda | Λ | U+039B (923) | HTML 4.0 | HTMLsymbol | ISOgrk3 | Greek capital letter Lambda |
Mu | Μ | U+039C (924) | HTML 4.0 | HTMLsymbol | | Greek capital letter Mu |
Nu | Ν | U+039D (925) | HTML 4.0 | HTMLsymbol | | Greek capital letter Nu |
Xi | Ξ | U+039E (926) | HTML 4.0 | HTMLsymbol | ISOgrk3 | Greek capital letter Xi |
Omicron | Ο | U+039F (927) | HTML 4.0 | HTMLsymbol | | Greek capital letter Omicron |
Pi | Π | U+03A0 (928) | HTML 4.0 | HTMLsymbol | | Greek capital letter Pi |
Rho | Ρ | U+03A1 (929) | HTML 4.0 | HTMLsymbol | | Greek capital letter Rho |
Sigma | Σ | U+03A3 (931) | HTML 4.0 | HTMLsymbol | ISOgrk3 | Greek capital letter Sigma |
Tau | Τ | U+03A4 (932) | HTML 4.0 | HTMLsymbol | | Greek capital letter Tau |
Upsilon | Υ | U+03A5 (933) | HTML 4.0 | HTMLsymbol | ISOgrk3 | Greek capital letter Upsilon |
Phi | Φ | U+03A6 (934) | HTML 4.0 | HTMLsymbol | ISOgrk3 | Greek capital letter Phi |
Chi | Χ | U+03A7 (935) | HTML 4.0 | HTMLsymbol | | Greek capital letter Chi |
Psi | Ψ | U+03A8 (936) | HTML 4.0 | HTMLsymbol | ISOgrk3 | Greek capital letter Psi |
Omega | Ω | U+03A9 (937) | HTML 4.0 | HTMLsymbol | ISOgrk3 | Greek capital letter Omega |
alpha | α | U+03B1 (945) | HTML 4.0 | HTMLsymbol | ISOgrk3 | Greek small letter alpha |
beta | β | U+03B2 (946) | HTML 4.0 | HTMLsymbol | ISOgrk3 | Greek small letter beta |
gamma | γ | U+03B3 (947) | HTML 4.0 | HTMLsymbol | ISOgrk3 | Greek small letter gamma |
delta | δ | U+03B4 (948) | HTML 4.0 | HTMLsymbol | ISOgrk3 | Greek small letter delta |
epsilon | ε | U+03B5 (949) | HTML 4.0 | HTMLsymbol | ISOgrk3 | Greek small letter epsilon |
zeta | ζ | U+03B6 (950) | HTML 4.0 | HTMLsymbol | ISOgrk3 | Greek small letter zeta |
eta | η | U+03B7 (951) | HTML 4.0 | HTMLsymbol | ISOgrk3 | Greek small letter eta |
theta | θ | U+03B8 (952) | HTML 4.0 | HTMLsymbol | ISOgrk3 | Greek small letter theta |
iota | ι | U+03B9 (953) | HTML 4.0 | HTMLsymbol | ISOgrk3 | Greek small letter iota |
kappa | κ | U+03BA (954) | HTML 4.0 | HTMLsymbol | ISOgrk3 | Greek small letter kappa |
lambda | λ | U+03BB (955) | HTML 4.0 | HTMLsymbol | ISOgrk3 | Greek small letter lambda |
mu | μ | U+03BC (956) | HTML 4.0 | HTMLsymbol | ISOgrk3 | Greek small letter mu |
nu | ν | U+03BD (957) | HTML 4.0 | HTMLsymbol | ISOgrk3 | Greek small letter nu |
xi | ξ | U+03BE (958) | HTML 4.0 | HTMLsymbol | ISOgrk3 | Greek small letter xi |
omicron | ο | U+03BF (959) | HTML 4.0 | HTMLsymbol | NEW | Greek small letter omicron |
pi | π | U+03C0 (960) | HTML 4.0 | HTMLsymbol | ISOgrk3 | Greek small letter pi |
rho | ρ | U+03C1 (961) | HTML 4.0 | HTMLsymbol | ISOgrk3 | Greek small letter rho |
sigmaf | ς | U+03C2 (962) | HTML 4.0 | HTMLsymbol | ISOgrk3 | Greek small letter final sigma (used at the end of a word) |
sigma | σ | U+03C3 (963) | HTML 4.0 | HTMLsymbol | ISOgrk3 | Greek small letter sigma |
tau | τ | U+03C4 (964) | HTML 4.0 | HTMLsymbol | ISOgrk3 | Greek small letter tau |
upsilon | υ | U+03C5 (965) | HTML 4.0 | HTMLsymbol | ISOgrk3 | Greek small letter upsilon |
phi | φ | U+03C6 (966) | HTML 4.0 | HTMLsymbol | ISOgrk3 | Greek small letter phi |
chi | χ | U+03C7 (967) | HTML 4.0 | HTMLsymbol | ISOgrk3 | Greek small letter chi |
psi | ψ | U+03C8 (968) | HTML 4.0 | HTMLsymbol | ISOgrk3 | Greek small letter psi |
omega | ω | U+03C9 (969) | HTML 4.0 | HTMLsymbol | ISOgrk3 | Greek small letter omega |
thetasym | ϑ | U+03D1 (977) | HTML 4.0 | HTMLsymbol | NEW | Greek theta symbol (script form) |
upsih | ϒ | U+03D2 (978) | HTML 4.0 | HTMLsymbol | NEW | Greek upsilon with hook symbol (curled form) |
piv | ϖ | U+03D6 (982) | HTML 4.0 | HTMLsymbol | ISOgrk3 | Greek pi symbol (archaic form) |
ensp |   | U+2002 (8194) | HTML 4.0 | HTMLspecial | ISOpub | en space[d] |
emsp |   | U+2003 (8195) | HTML 4.0 | HTMLspecial | ISOpub | em space[d] |
thinsp |   | U+2009 (8201) | HTML 4.0 | HTMLspecial | ISOpub | thin space[d] |
zwnj |   | U+200C (8204) | HTML 4.0 | HTMLspecial | NEW RFC 2070 | zero-width non-joiner |
zwj |   | U+200D (8205) | HTML 4.0 | HTMLspecial | NEW RFC 2070 | zero-width joiner |
lrm |   | U+200E (8206) | HTML 4.0 | HTMLspecial | NEW RFC 2070 | left-to-right mark |
rlm |   | U+200F (8207) | HTML 4.0 | HTMLspecial | NEW RFC 2070 | right-to-left mark |
ndash | – | U+2013 (8211) | HTML 4.0 | HTMLspecial | ISOpub | en dash |
mdash | — | U+2014 (8212) | HTML 4.0 | HTMLspecial | ISOpub | em dash |
lsquo | ‘ | U+2018 (8216) | HTML 4.0 | HTMLspecial | ISOnum | left single quotation mark |
rsquo | ’ | U+2019 (8217) | HTML 4.0 | HTMLspecial | ISOnum | right single quotation mark |
sbquo | ‚ | U+201A (8218) | HTML 4.0 | HTMLspecial | NEW | single low-9 quotation mark |
ldquo | “ | U+201C (8220) | HTML 4.0 | HTMLspecial | ISOnum | left double quotation mark |
rdquo | ” | U+201D (8221) | HTML 4.0 | HTMLspecial | ISOnum | right double quotation mark |
bdquo | „ | U+201E (8222) | HTML 4.0 | HTMLspecial | NEW | double low-9 quotation mark |
dagger | † | U+2020 (8224) | HTML 4.0 | HTMLspecial | ISOpub | dagger |
Dagger | ‡ | U+2021 (8225) | HTML 4.0 | HTMLspecial | ISOpub | double dagger |
bull | • | U+2022 (8226) | HTML 4.0 | HTMLspecial | ISOpub | bullet (black small circle) |
hellip | … | U+2026 (8230) | HTML 4.0 | HTMLsymbol | ISOpub | horizontal ellipsis |
permil | ‰ | U+2030 (8240) | HTML 4.0 | HTMLspecial | ISOtech | per mille sign |
prime | ′ | U+2032 (8242) | HTML 4.0 | HTMLsymbol | ISOtech | prime (arcminutes) |
Prime | ″ | U+2033 (8243) | HTML 4.0 | HTMLsymbol | ISOtech | double prime (arcseconds) |
lsaquo | ‹ | U+2039 (8249) | HTML 4.0 | HTMLspecial | ISO proposed | single left-pointing angle quotation mark[f] |
rsaquo | › | U+203A (8250) | HTML 4.0 | HTMLspecial | ISO proposed | single right-pointing angle quotation mark[f] |
oline | ‾ | U+203E (8254) | HTML 4.0 | HTMLsymbol | NEW | overline |
frasl | ⁄ | U+2044 (8260) | HTML 4.0 | HTMLsymbol | NEW | fraction slash |
euro | € | U+20AC (8364) | HTML 4.0 | HTMLspecial | NEW | euro sign |
image | ℑ | U+2111 (8465) | HTML 4.0 | HTMLsymbol | ISOamso | black-letter capital I (imaginary part) |
weierp | ℘ | U+2118 (8472) | HTML 4.0 | HTMLsymbol | ISOamso | script capital P (power set in mathematics) |
real | ℜ | U+211C (8476) | HTML 4.0 | HTMLsymbol | ISOamso | black-letter capital R (real part in mathematics) |
trade | ™ | U+2122 (8482) | HTML 4.0 | HTMLsymbol | ISOnum | trademark sign |
alefsym | ℵ | U+2135 (8501) | HTML 4.0 | HTMLsymbol | NEW | alef symbol[g] |
larr | ← | U+2190 (8592) | HTML 4.0 | HTMLsymbol | ISOnum | leftwards arrow |
uarr | ↑ | U+2191 (8593) | HTML 4.0 | HTMLsymbol | ISOnum | upwards arrow |
rarr | → | U+2192 (8594) | HTML 4.0 | HTMLsymbol | ISOnum | rightwards arrow |
darr | ↓ | U+2193 (8595) | HTML 4.0 | HTMLsymbol | ISOnum | downwards arrow |
harr | ↔ | U+2194 (8596) | HTML 4.0 | HTMLsymbol | ISOamsa | left right arrow |
crarr | ↵ | U+21B5 (8629) | HTML 4.0 | HTMLsymbol | NEW | downwards arrow with corner leftwards (= carriage return) |
lArr | ⇐ | U+21D0 (8656) | HTML 4.0 | HTMLsymbol | ISOtech | leftwards double arrow[h] |
uArr | ⇑ | U+21D1 (8657) | HTML 4.0 | HTMLsymbol | ISOamsa | upwards double arrow |
rArr | ⇒ | U+21D2 (8658) | HTML 4.0 | HTMLsymbol | ISOnum | rightwards double arrow[i] |
dArr | ⇓ | U+21D3 (8659) | HTML 4.0 | HTMLsymbol | ISOamsa | downwards double arrow |
hArr | ⇔ | U+21D4 (8660) | HTML 4.0 | HTMLsymbol | ISOamsa | left right double arrow |
forall | ∀ | U+2200 (8704) | HTML 4.0 | HTMLsymbol | ISOtech | for all (universal quantifier) |
part | ∂ | U+2202 (8706) | HTML 4.0 | HTMLsymbol | ISOtech | partial differential |
exist | ∃ | U+2203 (8707) | HTML 4.0 | HTMLsymbol | ISOtech | there exists (existential quantifier) |
empty | ∅ | U+2205 (8709) | HTML 4.0 | HTMLsymbol | ISOamso | empty set |
nabla | ∇ | U+2207 (8711) | HTML 4.0 | HTMLsymbol | ISOtech | nabla (del operator) |
isin | ∈ | U+2208 (8712) | HTML 4.0 | HTMLsymbol | ISOtech | element of |
notin | ∉ | U+2209 (8713) | HTML 4.0 | HTMLsymbol | ISOtech | not an element of |
ni | ∋ | U+220B (8715) | HTML 4.0 | HTMLsymbol | ISOtech | contains as member |
prod | ∏ | U+220F (8719) | HTML 4.0 | HTMLsymbol | ISOamsb | n-ary product[j] |
sum | ∑ | U+2211 (8721) | HTML 4.0 | HTMLsymbol | ISOamsb | n-ary summation[k] |
minus | − | U+2212 (8722) | HTML 4.0 | HTMLsymbol | ISOtech | minus sign |
lowast | ∗ | U+2217 (8727) | HTML 4.0 | HTMLsymbol | ISOtech | asterisk operator |
radic | √ | U+221A (8730) | HTML 4.0 | HTMLsymbol | ISOtech | square root |
prop | ∝ | U+221D (8733) | HTML 4.0 | HTMLsymbol | ISOtech | proportional to |
infin | ∞ | U+221E (8734) | HTML 4.0 | HTMLsymbol | ISOtech | infinity |
ang | ∠ | U+2220 (8736) | HTML 4.0 | HTMLsymbol | ISOamso | angle |
and | ∧ | U+2227 (8743) | HTML 4.0 | HTMLsymbol | ISOtech | logical and (= wedge) |
or | ∨ | U+2228 (8744) | HTML 4.0 | HTMLsymbol | ISOtech | logical or (= vee) |
cap | ∩ | U+2229 (8745) | HTML 4.0 | HTMLsymbol | ISOtech | intersection (= cap) |
cup | ∪ | U+222A (8746) | HTML 4.0 | HTMLsymbol | ISOtech | union (= cup) |
int | ∫ | U+222B (8747) | HTML 4.0 | HTMLsymbol | ISOtech | integral |
there4 | ∴ | U+2234 (8756) | HTML 4.0 | HTMLsymbol | ISOtech | therefore |
sim | ∼ | U+223C (8764) | HTML 4.0 | HTMLsymbol | ISOtech | tilde operator (equivalent to, asymptotic to, approximately equal to, distributed as)[l] |
cong | ≅ | U+2245 (8773) | HTML 4.0 | HTMLsymbol | ISOtech | congruent to |
asymp | ≈ | U+2248 (8776) | HTML 4.0 | HTMLsymbol | ISOamsr | asymptotic to (almost equal to) |
ne | ≠ | U+2260 (8800) | HTML 4.0 | HTMLsymbol | ISOtech | not equal to |
equiv | ≡ | U+2261 (8801) | HTML 4.0 | HTMLsymbol | ISOtech | equivalent to (identical to) |
le | ≤ | U+2264 (8804) | HTML 4.0 | HTMLsymbol | ISOtech | less than or equal to |
ge | ≥ | U+2265 (8805) | HTML 4.0 | HTMLsymbol | ISOtech | greater than or equal to |
sub | ⊂ | U+2282 (8834) | HTML 4.0 | HTMLsymbol | ISOtech | subset of |
sup | ⊃ | U+2283 (8835) | HTML 4.0 | HTMLsymbol | ISOtech | superset of[m] |
nsub | ⊄ | U+2284 (8836) | HTML 4.0 | HTMLsymbol | ISOamsn | not a subset of |
sube | ⊆ | U+2286 (8838) | HTML 4.0 | HTMLsymbol | ISOtech | subset of or equal to |
supe | ⊇ | U+2287 (8839) | HTML 4.0 | HTMLsymbol | ISOtech | superset of or equal to |
oplus | ⊕ | U+2295 (8853) | HTML 4.0 | HTMLsymbol | ISOamsb | circled plus (= direct sum) |
otimes | ⊗ | U+2297 (8855) | HTML 4.0 | HTMLsymbol | ISOamsb | circled times (= vector product) |
perp | ⊥ | U+22A5 (8869) | HTML 4.0 | HTMLsymbol | ISOtech | orthogonal to (= perpendicular)[n] |
sdot | ⋅ | U+22C5 (8901) | HTML 4.0 | HTMLsymbol | ISOamsb | dot operator[o] |
lceil | ⌈ | U+2308 (8968) | HTML 4.0 | HTMLsymbol | ISOamsc | left ceiling |
rceil | ⌉ | U+2309 (8969) | HTML 4.0 | HTMLsymbol | ISOamsc | right ceiling |
lfloor | ⌊ | U+230A (8970) | HTML 4.0 | HTMLsymbol | ISOamsc | left floor |
rfloor | ⌋ | U+230B (8971) | HTML 4.0 | HTMLsymbol | ISOamsc | right floor |
lang | 〈 | U+2329 (9001) | HTML 4.0 | HTMLsymbol | ISOtech | left-pointing angle bracket[p] |
rang | 〉 | U+232A (9002) | HTML 4.0 | HTMLsymbol | ISOtech | right-pointing angle bracket[q] |
loz | ◊ | U+25CA (9674) | HTML 4.0 | HTMLsymbol | ISOpub | lozenge (diamond shape) |
spades | ♠ | U+2660 (9824) | HTML 4.0 | HTMLsymbol | ISOpub | black spade suit[r] |
clubs | ♣ | U+2663 (9827) | HTML 4.0 | HTMLsymbol | ISOpub | black club suit (= shamrock)[r] |
hearts | ♥ | U+2665 (9829) | HTML 4.0 | HTMLsymbol | ISOpub | black heart suit (= valentine)[r] |
diams | ♦ | U+2666 (9830) | HTML 4.0 | HTMLsymbol | ISOpub | black diamond suit[r] |
Notes:
- [a] DTD: the full public name of the DTD in which the character entity is defined is mapped from one of the following three named entities:
  - HTMLlat1 maps to:
    - PUBLIC "-//W3C//ENTITIES Latin 1//EN//HTML" in HTML (the DTD is implicitly defined, no system URI is needed);
    - PUBLIC "-//W3C//ENTITIES Latin 1 for XHTML//EN" "http://www.w3.org/TR/xhtml1/DTD/xhtml-lat1.ent" in XHTML 1.0;
  - HTMLsymbol maps to:
    - PUBLIC "-//W3C//ENTITIES Symbols//EN//HTML" in HTML (the DTD is implicitly defined, no system URI is needed);
    - PUBLIC "-//W3C//ENTITIES Symbols for XHTML//EN" "http://www.w3.org/TR/xhtml1/DTD/xhtml-symbol.ent" in XHTML 1.0;
  - HTMLspecial maps to:
    - PUBLIC "-//W3C//ENTITIES Special//EN//HTML" in HTML (the DTD is implicitly defined, no system URI is needed);
    - PUBLIC "-//W3C//ENTITIES Special for XHTML//EN" "http://www.w3.org/TR/xhtml1/DTD/xhtml-special.ent" in XHTML 1.0.
- [b] Old ISO subset: these are old (documented) character subsets used in legacy encodings before the unification within ISO 10646.
- [c] Description: the ISO 10646 and Unicode character name is given first, followed by other common synonyms.
- [d] Spaces: these are whitespace characters and have no visible glyph of their own.
- [e] ligature: this is a standard misnomer, as this is a separate character in some languages.
- [f] ISO proposed: these characters were standardized in ISO 10646 after the release of HTML 4.0.
- [g] alefsym: the alef symbol is not the same as U+05D0 'Hebrew letter alef', although the two glyphs are almost identical.
- [h] lArr: according to ISO 10646, the leftwards double arrow may also be used as the 'is implied by' arrow.
- [i] rArr: according to ISO 10646, the rightwards double arrow may also be used as the 'implies' arrow.
- [j] prod: 'n-ary product' is not the same as U+03A0 'Greek capital letter Pi', although the two glyphs are almost identical.
- [k] sum: 'n-ary summation' is not the same as U+03A3 'Greek capital letter Sigma', although the two glyphs are almost identical.
- [l] sim: 'tilde operator' is not the same as U+007E 'tilde', although the two glyphs are similar. The tilde, unlike the tilde operator, can be typed directly on a standard keyboard and is encoded in ASCII.
- [m] sup: note that nsup, U+2285 'not a superset of', is not among the HTML character entity references, even though nsub is, so the set is not symmetric. nsup belongs to the ISOamsn subset.
- [n] perp: Unicode defines U+22A5 as "up tack" and U+27C2 as "perpendicular". The two symbols look almost identical, but they are distinct Unicode characters. HTML nevertheless uses U+22A5 for "perpendicular", which creates an inconsistency between HTML and Unicode.
- [o] sdot: 'dot operator' is not the same as U+00B7 'middle dot'.
- [p] lang: 'left-pointing angle bracket' is not the same as U+003C 'less than', U+2039 'single left-pointing angle quotation mark', U+27E8 'mathematical left angle bracket', or U+3008 'left angle bracket', although these characters all look similar.
- [q] rang: 'right-pointing angle bracket' is not the same as U+003E 'greater than', U+203A 'single right-pointing angle quotation mark', U+27E9 'mathematical right angle bracket', or U+3009 'right angle bracket', although these characters all look similar.
- [r] black: here it seems to mean filled as opposed to hollow.
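Rows like those in the table above can be regenerated programmatically. The following is a minimal sketch that uses Python's standard `html.entities.name2codepoint` mapping (documented as covering the HTML 4 entity names) to format the name, character, and code point columns; the helper name `describe` is hypothetical.

```python
from html.entities import name2codepoint


def describe(name: str) -> str:
    """Format one entity as 'name | character | U+XXXX (decimal)'."""
    code_point = name2codepoint[name]  # raises KeyError for unknown names
    return f"{name} | {chr(code_point)} | U+{code_point:04X} ({code_point})"


if __name__ == "__main__":
    for entity in ("copy", "euro", "alpha"):
        print(describe(entity))
```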
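Character entity references in XHTML
The XHTML DTDs explicitly declare 253 character entities (including the 5 predefined entities of XML 1.0). Apart from &apos;, the other 252 are the same as the 252 character entity references in HTML. Each XHTML document instance may also define any number of additional entities. Which entities are actually usable, however, depends on how the document is processed:
- If the document is processed by an HTML processor, only the 252 HTML character entities are available. &apos; and user-defined entity references may not be supported and can produce unpredictable results.
- If the document is processed by an XML parser, only the 5 XML predefined entities are safe to use, although other entities declared in the internal DTD subset may also be available.
- If the XML parser can read external entities, then in addition to the 5 XML predefined entities, the other 248 HTML character entities can be used as long as the parser can retrieve the XHTML DTD. Entities declared in the internal DTD subset can also be used.
Because &apos; cannot be used reliably with HTML processors, in practice only 4 character entities, &quot;, &amp;, &lt;, and &gt;, work in all processing environments.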
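One way to act on this restriction is to rewrite every other named entity as a numeric character reference before publishing. The sketch below is an illustration under stated assumptions, not a prescribed method: the function name `to_portable` and the regular expression are hypothetical, and the lookup table is Python's standard `html.entities.name2codepoint`.

```python
import re
from html.entities import name2codepoint

# The four entities that the text above describes as usable everywhere.
SAFE_NAMES = {"quot", "amp", "lt", "gt"}

# Matches a named reference such as &eacute; (numeric references are skipped).
_NAMED_REFERENCE = re.compile(r"&([A-Za-z][A-Za-z0-9]*);")


def to_portable(markup: str) -> str:
    """Rewrite named entities other than the four safe ones as decimal references."""

    def replace(match: re.Match) -> str:
        name = match.group(1)
        if name in SAFE_NAMES:
            return match.group(0)
        if name == "apos":
            return "&#39;"  # U+0027, handled explicitly
        code_point = name2codepoint.get(name)
        if code_point is None:
            return match.group(0)  # unknown name: leave untouched
        return f"&#{code_point};"

    return _NAMED_REFERENCE.sub(replace, markup)


if __name__ == "__main__":
    print(to_portable("caf&eacute; &amp; &apos;x&apos; &nbsp;&bogus;"))
```

Unknown names are left as they are, so a later validation step can still report them.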
References
- Unicode Consortium
- World Wide Web Consortium
- The normative reference to RFC 2070 (still found in DTDs defining the character entities for HTML or XHTML) is historic; this RFC (along with other RFCs related to different parts of the HTML specification) has been deprecated in favor of the newer informational RFC 2854, which defines the "text/html" MIME type and references the W3C specifications directly for the actual HTML content.
- Numerical Reference of Unicode code points at Wikibooks
External links
- Character entity references in HTML 4 at the W3C
- Multilanguage special character entity list: a list of special characters, entities, and their names.
- HTML entities quick reference table