Latin1的所有字符编码

时间:2020-12-24 00:39:44

ISO-8859-1 (ISO Latin 1) Character Encoding

Contents

The characters at a glance

Here are all the printable characters, in collating order:

        ! " # $ % & ' ( ) * + , - . /
0 1 2 3 4 5 6 7 8 9 : ; < = > ? @
A B C D E F G H I J K L M N O P Q R S T U V W X Y Z
[ \ ] ^ _ `
a b c d e f g h i j k l m n o p q r s t u v w x y z
{ | } ~ ¡ ¢ £ ¤ ¥ ¦ § ¨ © ª « ¬ ­ ® ¯
° ± ² ³ ´ µ ¶ · ¸ ¹ º » ¼ ½ ¾ ¿ À Á Â Ã Ä Å Æ Ç È É Ê Ë Ì Í Î Ï Ð Ñ Ò Ó Ô Õ Ö
×
Ø Ù Ú Û Ü Ý Þ
ß
à á â ã ä å æ ç è é ê ë ì í î ï ð ñ ò ó ô õ ö
÷
ø ù ú û ü ý þ
ÿ

The first six rows are the ASCII character set.

Note the ordinary ASCII space (before `!') and the ISO Latin-1 non-breaking space (before `¡')

Character codes and names


The columns show, in order:

  • HTML: the HTML notation (decimal);
  • OCTL: the C/Modula-3 octal notation;
  • HEX: the hexadecimal code, as used e.g. in MIME quoted-printable encoding;
  • CMP: the Sun/X11 "Compose" key combinations;
  • CHR: the charater itself, when HTML-printable;
  • MEANING: the ISO-8859-1 (and ASCII) definition.

HTML     OCTL   HEX   CMP   CHR   MEANING
------ + ---- + --- + --- + --- + ------------------------------
&#000; | \000 | =00 | | | Invalid (ASCII NUL)
&#001; | \001 | =01 | | | Unused (ASCII SOH)
&#002; | \002 | =02 | | | Unused (ASCII STX)
&#003; | \003 | =03 | | | Unused (ASCII ETX)
&#004; | \004 | =04 | | | Unused (ASCII EOT)
&#005; | \005 | =05 | | | Unused (ASCII ENQ)
&#006; | \006 | =06 | | | Unused (ASCII ACK)
&#007; | \007 | =07 | | | Unused (ASCII BEL, audible bell)
&#008; | \010 | =08 | | | Unused (ASCII BS, backspace)
| \011 | =09 | | | Horizontal tab (ASCII HT)
| \012 | =0A | | | Line feed (ASCII NL, newline)
&#011; | \013 | =0B | | | Unused (ASCII VT, vertical tab)
&#012; | \014 | =0C | | | Unused (ASCII NP, new page)
| \015 | =0D | | | Carriage Return (ASCII CR)
&#014; | \016 | =0E | | | Unused (ASCII SO)
&#015; | \017 | =0F | | | Unused (ASCII SI)
&#016; | \020 | =10 | | | Unused (ASCII DLE)
&#017; | \021 | =11 | | | Unused (ASCII DC1)
&#018; | \022 | =12 | | | Unused (ASCII DC2)
&#019; | \023 | =13 | | | Unused (ASCII DC3)
&#020; | \024 | =14 | | | Unused (ASCII DC4)
&#021; | \025 | =15 | | | Unused (ASCII NAK)
&#022; | \026 | =16 | | | Unused (ASCII SYN)
&#023; | \027 | =17 | | | Unused (ASCII ETB)
&#024; | \030 | =18 | | | Unused (ASCII CAN)
&#025; | \031 | =19 | | | Unused (ASCII EM)
&#026; | \032 | =1A | | | Unused (ASCII SUB)
&#027; | \033 | =1B | | | Unused (ASCII ESC, escape)
&#028; | \034 | =1C | | | Unused (ASCII FS)
&#029; | \035 | =1D | | | Unused (ASCII GS)
&#030; | \036 | =1E | | | Unused (ASCII RS)
&#031; | \037 | =1F | | | Unused (ASCII US)
| \040 | =20 | | ( ) | Space (ASCII SP)
! | \041 | =21 | | (!) | Exclamation mark
" | \042 | =22 | | (") | Quotation mark (&quot;)
# | \043 | =23 | | (#) | Number sign
$ | \044 | =24 | | ($) | Dollar sign
% | \045 | =25 | | (%) | Percent sign
& | \046 | =26 | | (&) | Ampersand (&amp;)
' | \047 | =27 | | (') | Apostrophe (right single quote)
( | \050 | =28 | | (() | Left parenthesis
) | \051 | =29 | | ()) | Right parenthesis
* | \052 | =2A | | (*) | Asterisk
+ | \053 | =2B | | (+) | Plus sign
, | \054 | =2C | | (,) | Comma
- | \055 | =2D | | (-) | Hyphen
. | \056 | =2E | | (.) | Period (fullstop)
/ | \057 | =2F | | (/) | Solidus (slash)
0 | \060 | =30 | | (0) | Digit 0
. . .
9 | \071 | =39 | | (9) | Digit 9
: | \072 | =3A | | (:) | Colon
; | \073 | =3B | | (;) | Semi-colon
< | \074 | =3C | | (<) | Less than (&lt;)
= | \075 | =3D | | (=) | Equals sign
> | \076 | =3E | | (>) | Greater than (&gt;)
? | \077 | =3F | | (?) | Question mark
@ | \100 | =40 | | (@) | Commercial at-sign
A | \101 | =41 | | (A) | Uppercase letter A
. . .
Z | \132 | =5A | | (Z) | Uppercase letter Z
[ | \133 | =5B | | ([) | Left square bracket
\ | \134 | =5C | | (\) | Reverse solidus (backslash)
] | \135 | =5D | | (]) | Right square bracket
^ | \136 | =5E | | (^) | Caret
_ | \137 | =5F | | (_) | Horizontal bar (underscore)
` | \140 | =60 | | (`) | Reverse apostrophe (left single quote)
a | \141 | =61 | | (a) | Lowercase letter a
. . .
z | \172 | =7A | | (z) | Lowercase letter z
{ | \173 | =7B | | ({) | Left curly brace
| | \174 | =7C | | (|) | Vertical bar
} | \175 | =7D | | (}) | Right curly brace
~ | \176 | =7E | | (~) | Tilde
 | \177 | =7F | | | Unused (ASCII DEL)
€ | \200 | =80 | | | Unused
 | \201 | =81 | | | Unused
‚ | \202 | =82 | | | Unused
ƒ | \203 | =83 | | | Unused
„ | \204 | =84 | | | Unused
… | \205 | =85 | | | Unused
† | \206 | =86 | | | Unused
‡ | \207 | =87 | | | Unused
ˆ | \210 | =88 | | | Unused
‰ | \211 | =89 | | | Unused
Š | \212 | =8A | | | Unused
‹ | \213 | =8B | | | Unused
Œ | \214 | =8C | | | Unused
 | \215 | =8D | | | Unused
Ž | \216 | =8E | | | Unused
 | \217 | =8F | | | Unused
 | \220 | =90 | | | Unused
‘ | \221 | =91 | | | Unused
’ | \222 | =92 | | | Unused
“ | \223 | =93 | | | Unused
” | \224 | =94 | | | Unused
• | \225 | =95 | | | Unused
– | \226 | =96 | | | Unused
— | \227 | =97 | | | Unused
˜ | \230 | =98 | | | Unused
™ | \231 | =99 | | | Unused
š | \232 | =9A | | | Unused
› | \233 | =9B | | | Unused
œ | \234 | =9C | | | Unused
 | \235 | =9D | | | Unused
ž | \236 | =9E | | | Unused
Ÿ | \237 | =9F | | | Unused
  | \240 | =A0 | | ( ) | Non-breaking space (&nbsp;)
¡ | \241 | =A1 | ! ! | (¡) | Inverted exclamation
¢ | \242 | =A2 | c / | (¢) | Cent sign
£ | \243 | =A3 | l - | (£) | Pound sterling
¤ | \244 | =A4 | o x | (¤) | General currency sign
¥ | \245 | =A5 | y - | (¥) | Yen sign
¦ | \246 | =A6 | | | | (¦) | Broken vertical bar
§ | \247 | =A7 | s o | (§) | Section sign
¨ | \250 | =A8 | " " | (¨) | Umlaut (dieresis)
© | \251 | =A9 | c o | (©) | Copyright
ª | \252 | =AA | - a | (ª) | Feminine ordinal
« | \253 | =AB | < < | («) | Left angle quote, guillemotleft
¬ | \254 | =AC | - , | (¬) | Not sign
­ | \255 | =AD | - - | (­) | Soft hyphen
® | \256 | =AE | r o | (®) | Registered trademark
¯ | \257 | =AF | ^ - | (¯) | Macron accent
° | \260 | =B0 | ^ * | (°) | Degree sign
± | \261 | =B1 | + - | (±) | Plus or minus
² | \262 | =B2 | ^ 2 | (²) | Superscript two
³ | \263 | =B3 | ^ 3 | (³) | Superscript three
´ | \264 | =B4 | \ \ | (´) | Acute accent
µ | \265 | =B5 | / u | (µ) | Micro sign
¶ | \266 | =B6 | P ! | (¶) | Paragraph sign
· | \267 | =B7 | ^ . | (·) | Middle dot
¸ | \270 | =B8 | , , | (¸) | Cedilla
¹ | \271 | =B9 | ^ 1 | (¹) | Superscript one
º | \272 | =BA | _ o | (º) | Masculine ordinal
» | \273 | =BB | > > | (») | Right angle quote, guillemotright
¼ | \274 | =BC | 1 4 | (¼) | Fraction one-fourth
½ | \275 | =BD | 1 2 | (½) | Fraction one-half
¾ | \276 | =BE | 3 4 | (¾) | Fraction three-fourths
¿ | \277 | =BF | ? ? | (¿) | Inverted question mark
À | \300 | =C0 | A ` | (À) | Capital A, grave accent
Á | \301 | =C1 | A ' | (Á) | Capital A, acute accent
 | \302 | =C2 | A ^ | (Â) | Capital A, circumflex accent
à | \303 | =C3 | A ~ | (Ã) | Capital A, tilde
Ä | \304 | =C4 | A " | (Ä) | Capital A, dieresis or umlaut mark
Å | \305 | =C5 | A * | (Å) | Capital A, ring
Æ | \306 | =C6 | A E | (Æ) | Capital AE dipthong (ligature)
Ç | \307 | =C7 | C , | (Ç) | Capital C, cedilla
È | \310 | =C8 | E ` | (È) | Capital E, grave accent
É | \311 | =C9 | E ' | (É) | Capital E, acute accent
Ê | \312 | =CA | E ^ | (Ê) | Capital E, circumflex accent
Ë | \313 | =CB | E " | (Ë) | Capital E, dieresis or umlaut mark
Ì | \314 | =CC | I ` | (Ì) | Capital I, grave accent
Í | \315 | =CD | I ' | (Í) | Capital I, acute accent
Î | \316 | =CE | I ^ | (Î) | Capital I, circumflex accent
Ï | \317 | =CF | I " | (Ï) | Capital I, dieresis or umlaut mark
Ð | \320 | =D0 | D - | (Ð) | Capital Eth, Icelandic
Ñ | \321 | =D1 | N ~ | (Ñ) | Capital N, tilde
Ò | \322 | =D2 | O ` | (Ò) | Capital O, grave accent
Ó | \323 | =D3 | O ' | (Ó) | Capital O, acute accent
Ô | \324 | =D4 | O ^ | (Ô) | Capital O, circumflex accent
Õ | \325 | =D5 | O ~ | (Õ) | Capital O, tilde
Ö | \326 | =D6 | O " | (Ö) | Capital O, dieresis or umlaut mark
× | \327 | =D7 | x x | (×) | Multiply sign
Ø | \330 | =D8 | O / | (Ø) | Capital O, slash
Ù | \331 | =D9 | U ` | (Ù) | Capital U, grave accent
Ú | \332 | =DA | U ' | (Ú) | Capital U, acute accent
Û | \333 | =DB | U ^ | (Û) | Capital U, circumflex accent
Ü | \334 | =DC | U " | (Ü) | Capital U, dieresis or umlaut mark
Ý | \335 | =DD | Y ' | (Ý) | Capital Y, acute accent
Þ | \336 | =DE | P | | (Þ) | Capital THORN, Icelandic
ß | \337 | =DF | s s | (ß) | Small sharp s, German (sz ligature)
à | \340 | =E0 | a ` | (à) | Small a, grave accent
á | \341 | =E1 | a ' | (á) | Small a, acute accent
â | \342 | =E2 | a ^ | (â) | Small a, circumflex accent
ã | \343 | =E3 | a ~ | (ã) | Small a, tilde
ä | \344 | =E4 | a " | (ä) | Small a, dieresis or umlaut mark
å | \345 | =E5 | a * | (å) | Small a, ring
æ | \346 | =E6 | a e | (æ) | Small ae dipthong (ligature)
ç | \347 | =E7 | c , | (ç) | Small c, cedilla
è | \350 | =E8 | e ` | (è) | Small e, grave accent
é | \351 | =E9 | e ' | (é) | Small e, acute accent
ê | \352 | =EA | e ^ | (ê) | Small e, circumflex accent
ë | \353 | =EB | e " | (ë) | Small e, dieresis or umlaut mark
ì | \354 | =EC | i ` | (ì) | Small i, grave accent
í | \355 | =ED | i ' | (í) | Small i, acute accent
î | \356 | =EE | i ^ | (î) | Small i, circumflex accent
ï | \357 | =EF | i " | (ï) | Small i, dieresis or umlaut mark
ð | \360 | =F0 | d - | (ð) | Small eth, Icelandic
ñ | \361 | =F1 | n ~ | (ñ) | Small n, tilde
ò | \362 | =F2 | o ` | (ò) | Small o, grave accent
ó | \363 | =F3 | o ' | (ó) | Small o, acute accent
ô | \364 | =F4 | o ^ | (ô) | Small o, circumflex accent
õ | \365 | =F5 | o ~ | (õ) | Small o, tilde
ö | \366 | =F6 | o " | (ö) | Small o, dieresis or umlaut mark
÷ | \367 | =F7 | - : | (÷) | Division sign
ø | \370 | =F8 | o / | (ø) | Small o, slash
ù | \371 | =F9 | u ` | (ù) | Small u, grave accent
ú | \372 | =FA | u ' | (ú) | Small u, acute accent
û | \373 | =FB | u ^ | (û) | Small u, circumflex accent
ü | \374 | =FC | u " | (ü) | Small u, dieresis or umlaut mark
ý | \375 | =FD | y ' | (ý) | Small y, acute accent
þ | \376 | =FE | p | | (þ) | Small thorn, Icelandic
ÿ | \377 | =FF | y " | (ÿ) | Small y, dieresis or umlaut mark

Notes for HTML documents

  1. HTML entity names are given in the "MEANING" column only for ampersand, quote, less than, and greater than, which are significant in HTML syntax; and for the non-breaking space, which may be confused with ordinary space. HTML entity names exist for many other characters, but they are superfluous: the ISO-8859-1 eight-bit codes will work, by definition, on any browser.

  2. The characters carriage return (ASCII CR) and line feed (ASCII NL, newline) are equivalent; they are treated as whitespace, except in <pre> contexts, where they force a line break. (However, a line feed is ignored if it immediately follows a carriage return.)

  3. The horizontal tab character (ASCII HT) skips to the next tabbing column in <pre> contexts, and is treated as whitespace elsewhere.

  4. The non-breaking space (&nbsp;) is honored even in non-<pre> contexts, and can be used to insert extra space between words, images, etc., like this: |      |.

Other notes

  1. Alternative Sun/X11 "Compose" sequences for the Icelandic "thorn" are "t h" (þ, lowercase) and "T H" (Þ, uppercase).

  2. Note that the Sun/X11 "Compose" sequence for masculine ordinal (º) uses an underscore, while the feminine ordinal (ª) uses a minus sign. It takes a lot of imagination to come up with such ideas...

Additional references


Composed by J. Stolfi from several sources found throughout the net.

http://www.ic.unicamp.br/~stolfi/EXPORT/www/ISO-8859-1-Encoding.html