ISO-8859-1: CHARACTER LIST AND COMMENTS (auxiliary material for the lecture "Unix for advanced users") ======================================================================== Number Official name of the character according to Glyph ISO-8859-1 or ISO-10646 (Unicode), further names and explanations ("CP1252" ([] means: dec oct hex means MS-Windows codepage 1252) no Glyph) ________________________________________________________________________ 0 000 00 NULL, NUL, Ctrl-@, ^@ [] In C (and thus also in the Unix kernel), this character marks the end of a string. For this reason, it can occur neither in file names nor in command line arguments. Many programs do not even work properly on files containing null characters. 1 001 01 START OF HEADING, SOH, Ctrl-A, ^A [] 2 002 02 START OF TEXT, STX, Ctrl-B, ^B [] 3 003 03 END OF TEXT, ETX, Ctrl-C, ^C [] 4 004 04 END OF TRANSMISSION, EOT, Ctrl-D, ^D [] Ctrl-D signals to the terminal driver that keyboard entry of a file is terminated. This character is therefore also called "End of file". Note that this is restricted to keyboard entry -- the end of a file in the file system is _not_ marked by any terminating character. (The length of a file is stored internally, therefore an end-of-file mark is superfluous.) 5 005 05 ENQUIRY, ENQ, Ctrl-E, ^E [] 6 006 06 ACKNOWLEDGE, ACK, Ctrl-F, ^F [] 7 007 07 BELL, BEL, Ctrl-G, ^G [] 8 010 08 BACKSPACE, BS, Ctrl-H, ^H [] 9 011 09 CHARACTER TABULATION, HT, Ctrl-I, ^I, Tab [] The tabulation character is usually printed as a sequence of space characters (until column 9, 17, 25, etc.). Note that some editors do not only print tabs as sequence of spaces but _replace_ them by a sequence of spaces under certain conditions; some editors also do it vice versa. 10 012 0a LINE FEED, LF, Ctrl-J, ^J, Newline [] This is the end-of-line character in Unix text files. Other operating systems use Ctrl-M (Macintosh) or Ctrl-M Ctrl-J (MS-Windows, MS-DOS) for that purpose. Unix applications that expect text files as input have often a built-in limit. Even worse, this restriction is often poorly documented. 11 013 0b LINE TABULATION, VT, Ctrl-K, ^K [] 12 014 0c FORM FEED, FF, Ctrl-L, ^L [] 13 015 0d CARRIAGE RETURN, CR, Ctrl-M, ^M, Return [] See Ctrl-J. 14 016 0e SHIFT OUT, SO, Ctrl-N, ^N [] 15 017 0f SHIFT IN, SI, Ctrl-O, ^O [] 16 020 10 DATALINK ESCAPE, DLE, Ctrl-P, ^P [] 17 021 11 DEVICE CONTROL ONE, DC1, Ctrl-Q, ^Q [] See Ctrl-S and Ctrl-V. 18 022 12 DEVICE CONTROL TWO, DC2, Ctrl-R, ^R [] 19 023 13 DEVICE CONTROL THREE, DC3, Ctrl-S, ^S [] Ctrl-S is often use to stop screen output temporarily. (This does not hold for Emacs-style editors.) Note: Even if output is stopped, further keystrokes may be accepted as input. Use Ctrl-Q to resume screen output. 20 024 14 DEVICE CONTROL FOUR, DC4, Ctrl-T, ^T [] 21 025 15 NEGATIVE ACKNOWLEDGE, NAK, Ctrl-U, ^U [] 22 026 16 SYNCHRONOUS IDLE, SYN, Ctrl-V, ^V [] Ctrl-V is often used as an escape character to quote the following control character (in particular in the terminal driver and in vi-style editors), so a Ctrl-C can be entered by typing Ctrl-V Ctrl-C. (Emacs-style editors use Ctrl-Q instead.) 23 027 17 END OF TRANSMISSION BLOCK, ETB, Ctrl-W, ^W [] 24 030 18 CANCEL, CAN, Ctrl-X, ^X [] 25 031 19 END OF MEDIUM, EM, Ctrl-Y, ^Y [] 26 032 1a SUBSTITUTE, SUB, Ctrl-Z, ^Z [] 27 033 1b ESCAPE, ESC, Ctrl-[, ^[ [] The Escape character is traditionally used for control sequences for terminals and terminal emulators. For instance, one can change the name of an xterm window to "xyz" by sending the character sequence "Escape ] 2 ; x y z Ctrl-G" to the xterm. Conversely, the xterm sends escape sequences when, e.g., the cursor keys are used. 28 034 1c FILE SEPARATOR, IS4, FS, Ctrl-\, ^\ [] 29 035 1d GROUP SEPARATOR, IS3, GS, Ctrl-], ^] [] 30 036 1e RECORD SEPARATOR, IS2, RS, Ctrl-^, ^^ [] 31 037 1f UNIT SEPARATOR, IS1, US, Ctrl-_, ^_ [] 32 040 20 SPACE 33 041 21 EXCLAMATION MARK ! 34 042 22 QUOTATION MARK, Double Quote " 35 043 23 NUMBER SIGN, Hash # 36 044 24 DOLLAR SIGN $ 37 045 25 PERCENT SIGN % 38 046 26 AMPERSAND & 39 047 27 APOSTROPHE, Single Quote ' This character can be represented by a multitude of glyphs, e.g., a raised short vertical line (possibly slanted to the right, possibly V-shaped), or a raised small nine (possibly slanted to the right). If you use this character in an e-mail or web page, you should not assume that the reader sees the same glyph as you do. The apostrophe key is sometimes used as a dead key to type accented letters (e.g., <'> => é). In this case, it may be necessary to type <'> to get an isolated apostrophe. This holds analogously for other characters, such as grave accent, tilde, or circumflex. 40 050 28 LEFT PARENTHESIS, Opening Parenthesis ( 41 051 29 RIGHT PARENTHESIS, Closing Parenthesis ) 42 052 2a ASTERISK * 43 053 2b PLUS SIGN + 44 054 2c COMMA , 45 055 2d HYPHEN-MINUS - 46 056 2e FULL STOP, Period . 47 057 2f SOLIDUS, Slash / 48 060 30 DIGIT ZERO 0 49 061 31 DIGIT ONE 1 50 062 32 DIGIT TWO 2 51 063 33 DIGIT THREE 3 52 064 34 DIGIT FOUR 4 53 065 35 DIGIT FIVE 5 54 066 36 DIGIT SIX 6 55 067 37 DIGIT SEVEN 7 56 070 38 DIGIT EIGHT 8 57 071 39 DIGIT NINE 9 58 072 3a COLON : 59 073 3b SEMICOLON ; 60 074 3c LESS-THAN SIGN < 61 075 3d EQUALS SIGN = 62 076 3e GREATER-THAN SIGN > 63 077 3f QUESTION MARK ? 64 100 40 COMMERCIAL AT @ 65 101 41 LATIN CAPITAL LETTER A A 66 102 42 LATIN CAPITAL LETTER B B 67 103 43 LATIN CAPITAL LETTER C C 68 104 44 LATIN CAPITAL LETTER D D 69 105 45 LATIN CAPITAL LETTER E E 70 106 46 LATIN CAPITAL LETTER F F 71 107 47 LATIN CAPITAL LETTER G G 72 110 48 LATIN CAPITAL LETTER H H 73 111 49 LATIN CAPITAL LETTER I I 74 112 4a LATIN CAPITAL LETTER J J 75 113 4b LATIN CAPITAL LETTER K K 76 114 4c LATIN CAPITAL LETTER L L 77 115 4d LATIN CAPITAL LETTER M M 78 116 4e LATIN CAPITAL LETTER N N 79 117 4f LATIN CAPITAL LETTER O O 80 120 50 LATIN CAPITAL LETTER P P 81 121 51 LATIN CAPITAL LETTER Q Q 82 122 52 LATIN CAPITAL LETTER R R 83 123 53 LATIN CAPITAL LETTER S S 84 124 54 LATIN CAPITAL LETTER T T 85 125 55 LATIN CAPITAL LETTER U U 86 126 56 LATIN CAPITAL LETTER V V 87 127 57 LATIN CAPITAL LETTER W W 88 130 58 LATIN CAPITAL LETTER X X 89 131 59 LATIN CAPITAL LETTER Y Y 90 132 5a LATIN CAPITAL LETTER Z Z 91 133 5b LEFT SQUARE BRACKET, Opening Bracket [ 92 134 5c REVERSE SOLIDUS, Backslash, Reverse Slash \ 93 135 5d RIGHT SQUARE BRACKET, Closing Bracket ] 94 136 5e CIRCUMFLEX ACCENT, Hat ^ The circumflex key may be configured as a dead key, cf apostrophe. 95 137 5f LOW LINE, Underline, Underscore _ 96 140 60 GRAVE ACCENT, Backquote ` Like the apostrophe, this character can be represented by a multitude of glyphs, e.g., a raised short line slanted to the left (possibly wedge shaped) or a raised small six (possibly slanted). The grave accent key may be configured as a dead key, cf apostrophe. 97 141 61 LATIN SMALL LETTER A a 98 142 62 LATIN SMALL LETTER B b 99 143 63 LATIN SMALL LETTER C c 100 144 64 LATIN SMALL LETTER D d 101 145 65 LATIN SMALL LETTER E e 102 146 66 LATIN SMALL LETTER F f 103 147 67 LATIN SMALL LETTER G g 104 150 68 LATIN SMALL LETTER H h 105 151 69 LATIN SMALL LETTER I i 106 152 6a LATIN SMALL LETTER J j 107 153 6b LATIN SMALL LETTER K k 108 154 6c LATIN SMALL LETTER L l 109 155 6d LATIN SMALL LETTER M m 110 156 6e LATIN SMALL LETTER N n 111 157 6f LATIN SMALL LETTER O o 112 160 70 LATIN SMALL LETTER P p 113 161 71 LATIN SMALL LETTER Q q 114 162 72 LATIN SMALL LETTER R r 115 163 73 LATIN SMALL LETTER S s 116 164 74 LATIN SMALL LETTER T t 117 165 75 LATIN SMALL LETTER U u 118 166 76 LATIN SMALL LETTER V v 119 167 77 LATIN SMALL LETTER W w 120 170 78 LATIN SMALL LETTER X x 121 171 79 LATIN SMALL LETTER Y y 122 172 7a LATIN SMALL LETTER Z z 123 173 7b LEFT CURLY BRACKET, Opening Brace { 124 174 7c VERTICAL LINE, Vertical Bar | 125 175 7d RIGHT CURLY BRACKET, Closing Brace } 126 176 7e TILDE ~ The tilde key may be configured as a dead key, cf apostrophe. 127 177 7f DELETE, Ctrl-?, ^? [] 128 200 80 PADDING CHARACTER, PAD, Meta-Ctrl-@ [] CP1252: Euro sign 129 201 81 HIGH OCTET PRESET, HOP, Meta-Ctrl-A [] 130 202 82 BREAK PERMITTED HERE, BPH, Meta-Ctrl-B [] CP1252: Single low-9 quotation mark 131 203 83 NO BREAK HERE, NBH, Meta-Ctrl-C [] CP1252: Latin small letter f with hook 132 204 84 INDEX, IND, Meta-Ctrl-D [] CP1252: Double low-9 quotation mark 133 205 85 NEXT LINE, NEL, Meta-Ctrl-E [] CP1252: Horizontal ellipsis 134 206 86 START OF SELECTED AREA, SSA, Meta-Ctrl-F [] CP1252: Dagger 135 207 87 END OF SELECTED AREA, ESA, Meta-Ctrl-G [] CP1252: Double dagger 136 210 88 CHARACTER TABULATION SET, HTS, Meta-Ctrl-H [] CP1252: Modifier letter circumflex accent 137 211 89 CHARACTER TABULATION WITH JUSTIFICATION, HTJ, [] Meta-Ctrl-I CP1252: Per mille sign 138 212 8a LINE TABULATION SET, VTS, Meta-Ctrl-J [] CP1252: Latin capital letter S with caron 139 213 8b PARTIAL LINE FORWARD, PLD, Meta-Ctrl-K [] CP1252: Single left-pointing angle quotation mark 140 214 8c PARTIAL LINE BACKWARD, PLU, Meta-Ctrl-L [] CP1252: Latin capital ligature OE 141 215 8d REVERSE LINE FEED, RI, Meta-Ctrl-M [] 142 216 8e SINGLE-SHIFT TWO, SS2, Meta-Ctrl-N [] CP1252: Latin capital letter Z with caron 143 217 8f SINGLE-SHIFT THREE, SS3, Meta-Ctrl-O [] 144 220 90 DEVICE CONTROL STRING, DCS, Meta-Ctrl-P [] 145 221 91 PRIVATE USE ONE, PU1, Meta-Ctrl-Q [] CP1252: Left single quotation mark 146 222 92 PRIVATE USE TWO, PU2, Meta-Ctrl-R [] CP1252: Right single quotation mark 147 223 93 SET TRANSMIT STATE, STS, Meta-Ctrl-S [] CP1252: Left double quotation mark 148 224 94 CANCEL CHARACTER, CCH, Meta-Ctrl-T [] CP1252: Right double quotation mark 149 225 95 MESSAGE WAITING, MW, Meta-Ctrl-U [] CP1252: Bullet 150 226 96 START OF GUARDED AREA, SPA, Meta-Ctrl-V [] CP1252: En dash 151 227 97 END OF GUARDED AREA, EPA, Meta-Ctrl-W [] CP1252: Em dash 152 230 98 START OF STRING, SOS, Meta-Ctrl-X [] CP1252: Small tilde 153 231 99 SINGLE GRAPHIC CHARACTER INTRODUCER, SGCI, Meta-Ctrl-Y [] CP1252: Trade mark sign 154 232 9a SINGLE CHARACTER INTRODUCER, SCI, Meta-Ctrl-Z [] CP1252: Latin small letter s with caron 155 233 9b CONTROL SEQUENCE INTRODUCER, CSI, Meta-Ctrl-[ [] CP1252: Single right-pointing angle quotation mark 156 234 9c STRING TERMINATOR, ST, Meta-Ctrl-\ [] CP1252: Latin small ligature oe 157 235 9d OPERATING SYSTEM COMMAND, OSC, Meta-Ctrl-] [] 158 236 9e PRIVACY MESSAGE, PM, Meta-Ctrl-^ [] CP1252: Latin small letter z with caron 159 237 9f APPLICATION PROGRAM COMMAND, APC, Meta-Ctrl-_ [] CP1252: Latin capital letter Y with diaeresis 160 240 a0 NO-BREAK SPACE   161 241 a1 INVERTED EXCLAMATION MARK (Spanish) ¡ 162 242 a2 CENT SIGN ¢ 163 243 a3 POUND SIGN £ 164 244 a4 CURRENCY SIGN ¤ 165 245 a5 YEN SIGN ¥ 166 246 a6 BROKEN BAR ¦ 167 247 a7 SECTION SIGN, Paragraph § 168 250 a8 DIAERESIS, Umlaut, Trema ¨ Note: may be confused with quotation mark. 169 251 a9 COPYRIGHT SIGN © 170 252 aa FEMININE ORDINAL INDICATOR, Superscript a ª 171 253 ab LEFT-POINTING DOUBLE ANGLE QUOTATION MARK, « Guillemets (in D closing, in F and CH opening) 172 254 ac NOT SIGN ¬ 173 255 ad SOFT HYPHEN ­ The purpose of this character is unclear. One interpretation is "hyphen that has been inserted for line breaking only and that should be removed again if the line breaking is changed", the other interpretation is "character to mark a _possible_ hyphenation for line breaking". None of these interpretations is widely supported; most programs treat "Soft hyphen" like any other character. 174 256 ae REGISTERED SIGN, registered trademark ® 175 257 af MACRON ¯ 176 260 b0 DEGREE SIGN ° 177 261 b1 PLUS-MINUS SIGN ± 178 262 b2 SUPERSCRIPT TWO ² 179 263 b3 SUPERSCRIPT THREE ³ 180 264 b4 ACUTE ACCENT ´ This character can easily be confused with an apostrophe. The glyph is usually a raised line slanted to the right, sometimes V shaped. 181 265 b5 MICRO SIGN, Greek mu µ 182 266 b6 PILCROW SIGN ¶ 183 267 b7 MIDDLE DOT · 184 270 b8 CEDILLA, ¸ 185 271 b9 SUPERSCRIPT ONE, ¹ 186 272 ba MASCULINE ORDINAL INDICATOR, Superscript o º 187 273 bb RIGHT-POINTING DOUBLE ANGLE QUOTATION MARK, » Guillemets (in D opening, in F and CH closing) 188 274 bc VULGAR FRACTION ONE QUARTER ¼ 189 275 bd VULGAR FRACTION ONE HALF ½ 190 276 be VULGAR FRACTION THREE QUARTERS ¾ 191 277 bf INVERTED QUESTION MARK (Spanish) ¿ 192 300 c0 LATIN CAPITAL LETTER A WITH GRAVE (e.g., French) À 193 301 c1 LATIN CAPITAL LETTER A WITH ACUTE (e.g., Spanish) Á 194 302 c2 LATIN CAPITAL LETTER A WITH CIRCUMFLEX (e.g., French) Â 195 303 c3 LATIN CAPITAL LETTER A WITH TILDE (e.g., Portuguese) Ã 196 304 c4 LATIN CAPITAL LETTER A WITH DIAERESIS (e.g., German) Ä If Ä is not available on the keyboard, one can type this character possibly using or <"> . 197 305 c5 LATIN CAPITAL LETTER A WITH RING ABOVE (e.g., Danish) Å 198 306 c6 LATIN CAPITAL LETTER AE, A E ligature (e.g., Danish) Æ 199 307 c7 LATIN CAPITAL LETTER C WITH CEDILLA (e.g., French) Ç 200 310 c8 LATIN CAPITAL LETTER E WITH GRAVE (e.g., French) È 201 311 c9 LATIN CAPITAL LETTER E WITH ACUTE (e.g., French) É 202 312 ca LATIN CAPITAL LETTER E WITH CIRCUMFLEX (e.g., French) Ê 203 313 cb LATIN CAPITAL LETTER E WITH DIAERESIS (e.g., French) Ë 204 314 cc LATIN CAPITAL LETTER I WITH GRAVE (e.g., Italian) Ì 205 315 cd LATIN CAPITAL LETTER I WITH ACUTE (e.g., Spanish) Í 206 316 ce LATIN CAPITAL LETTER I WITH CIRCUMFLEX (e.g., French) Î 207 317 cf LATIN CAPITAL LETTER I WITH DIAERESIS (e.g., French) Ï 208 320 d0 LATIN CAPITAL LETTER ETH (e.g., Icelandic) Ð 209 321 d1 LATIN CAPITAL LETTER N WITH TILDE (e.g., Spanish) Ñ 210 322 d2 LATIN CAPITAL LETTER O WITH GRAVE (e.g., Italian) Ò 211 323 d3 LATIN CAPITAL LETTER O WITH ACUTE (e.g., Spanish) Ó 212 324 d4 LATIN CAPITAL LETTER O WITH CIRCUMFLEX (e.g., French) Ô 213 325 d5 LATIN CAPITAL LETTER O WITH TILDE (e.g., Portuguese) Õ 214 326 d6 LATIN CAPITAL LETTER O WITH DIAERESIS (e.g., German) Ö If Ö is not available on the keyboard, one can type this character possibly using or <"> . 215 327 d7 MULTIPLICATION SIGN × 216 330 d8 LATIN CAPITAL LETTER O WITH STROKE (e.g., Danish) Ø 217 331 d9 LATIN CAPITAL LETTER U WITH GRAVE (e.g., Italian) Ù 218 332 da LATIN CAPITAL LETTER U WITH ACUTE (e.g., Spanish) Ú 219 333 db LATIN CAPITAL LETTER U WITH CIRCUMFLEX (e.g., French) Û 220 334 dc LATIN CAPITAL LETTER U WITH DIAERESIS (e.g., German) Ü If Ü is not available on the keyboard, one can type this character possibly using or <"> . 221 335 dd LATIN CAPITAL LETTER Y WITH ACUTE (e.g., Icelandic) Ý 222 336 de LATIN CAPITAL LETTER THORN (e.g., Icelandic) Þ 223 337 df LATIN SMALL LETTER SHARP S, es-zet (German) ß If ß is not available on the keyboard, one can type this character possibly using or . 224 340 e0 LATIN SMALL LETTER A WITH GRAVE (e.g., French) à 225 341 e1 LATIN SMALL LETTER A WITH ACUTE (e.g., Spanish) á 226 342 e2 LATIN SMALL LETTER A WITH CIRCUMFLEX (e.g., French) â 227 343 e3 LATIN SMALL LETTER A WITH TILDE (e.g., Portuguese) ã 228 344 e4 LATIN SMALL LETTER A WITH DIAERESIS (e.g., German) ä If ä is not available on the keyboard, one can type this character possibly using or <"> . 229 345 e5 LATIN SMALL LETTER A WITH RING ABOVE (e.g., Danish) å 230 346 e6 LATIN SMALL LETTER AE, a e ligature (e.g., Danish) æ 231 347 e7 LATIN SMALL LETTER C WITH CEDILLA (e.g., French) ç 232 350 e8 LATIN SMALL LETTER E WITH GRAVE (e.g., French) è 233 351 e9 LATIN SMALL LETTER E WITH ACUTE (e.g., French) é 234 352 ea LATIN SMALL LETTER E WITH CIRCUMFLEX (e.g., French) ê 235 353 eb LATIN SMALL LETTER E WITH DIAERESIS (e.g., French) ë 236 354 ec LATIN SMALL LETTER I WITH GRAVE (e.g., Italian) ì 237 355 ed LATIN SMALL LETTER I WITH ACUTE (e.g., Spanish) í 238 356 ee LATIN SMALL LETTER I WITH CIRCUMFLEX (e.g., French) î 239 357 ef LATIN SMALL LETTER I WITH DIAERESIS (e.g., French) ï 240 360 f0 LATIN SMALL LETTER ETH (e.g., Icelandic) ð 241 361 f1 LATIN SMALL LETTER N WITH TILDE (e.g., Spanish) ñ 242 362 f2 LATIN SMALL LETTER O WITH GRAVE (e.g., Italian) ò 243 363 f3 LATIN SMALL LETTER O WITH ACUTE (e.g., Spanish) ó 244 364 f4 LATIN SMALL LETTER O WITH CIRCUMFLEX (e.g., French) ô 245 365 f5 LATIN SMALL LETTER O WITH TILDE (e.g., Portuguese) õ 246 366 f6 LATIN SMALL LETTER O WITH DIAERESIS (e.g., German) ö If ö is not available on the keyboard, one can type this character possibly using or <"> . 247 367 f7 DIVISION SIGN ÷ 248 370 f8 LATIN SMALL LETTER O WITH STROKE (e.g., Danish) ø 249 371 f9 LATIN SMALL LETTER U WITH GRAVE (e.g., Italian) ù 250 372 fa LATIN SMALL LETTER U WITH ACUTE (e.g., Spanish) ú 251 373 fb LATIN SMALL LETTER U WITH CIRCUMFLEX (e.g., French) û 252 374 fc LATIN SMALL LETTER U WITH DIAERESIS (e.g., German) ü If ü is not available on the keyboard, one can type this character possibly using or <"> . 253 375 fd LATIN SMALL LETTER Y WITH ACUTE (e.g., Icelandic) ý 254 376 fe LATIN SMALL LETTER THORN (e.g., Icelandic) þ 255 377 ff LATIN SMALL LETTER Y WITH DIAERESIS (e.g., French) ÿ