ISO/IEC 8859-2
ISO/IEC 8859-2:1999, Information technology — 8-bit single-byte coded graphic character sets — Part 2: Latin alphabet No. 2, is part of the ISO/IEC 8859 series of ASCII-based standard character encodings, first edition published in 1987. It is informally referred to as "Latin-2". It is generally intended for Central[1] or "Eastern European" languages that are written in the Latin script. Note that ISO/IEC 8859-2 is very different from code page 852 (MS-DOS Latin 2, PC Latin 2) which is also referred to as "Latin-2" in Czech and Slovak regions.[2]
ISO-8859-2 is the IANA preferred charset name for this standard when supplemented with the C0 and C1 control codes from ISO/IEC 6429. As of January 2016, 0.3% of all web pages used ISO 8859-2.[3] Microsoft has assigned code page 28592, also known as Windows-28592, to ISO-8859-2 in Windows.
Code page 1250, also known as Windows-1250, has many of the same characters but in a different arrangement.
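The difference in arrangement between these code pages can be illustrated with a short Python sketch. It assumes the standard-library codec names `iso8859_2`, `cp1250`, and `cp852`; the sample string and the helper `encode_all` are illustrative only and not part of any standard.

```python
# Encode one Czech sample string with three code pages that are all
# informally associated with "Latin-2" and compare the resulting bytes.
SAMPLE = "Žluťoučký kůň"  # illustrative sample text

def encode_all(text: str) -> dict[str, bytes]:
    """Encode text with ISO-8859-2, Windows-1250 and code page 852."""
    return {name: text.encode(name) for name in ("iso8859_2", "cp1250", "cp852")}

encoded = encode_all(SAMPLE)
for name, raw in encoded.items():
    print(f"{name:10} {raw.hex(' ')}")

# Decoding bytes with the wrong code page raises no error here,
# but it silently produces different characters.
mojibake = encoded["cp1250"].decode("iso8859_2")
print(repr(mojibake))
```

Some letters, such as á, share a code value in ISO-8859-2 and Windows-1250, while others, such as Ž, do not, so mislabelled text is often only partly garbled.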
ISO/IEC 8859-2 can be used for the following languages:
- Bosnian
- Croatian
- Czech
- German (fully compatible with ISO/IEC 8859-1 for German texts)
- Hungarian
- Polish
- Romanian
- Serbian Latin
- Slovak
- Slovene
- Upper Sorbian
- Lower Sorbian
- Turkmen
ISO/IEC 8859-2 can be used for Romanian, but it is poorly suited to that language because it lacks the letters s and t with comma below (ș, ț) and contains s and t with cedilla (ş, ţ) instead. These letters were unified in the first versions of the Unicode standard, meaning that the comma and cedilla forms were treated as a glyph choice rather than as separate characters; fonts intended for use with Romanian should therefore show the comma-below forms at those code points. Even so, ISO/IEC 8859-2 and Windows-1250 (which has the same problem) have been heavily used for Romanian. Unicode, which supports both variants, has since taken the lead for web pages, although these often still use s and t with cedilla.
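The Romanian limitation can be demonstrated in Python. The sketch below assumes the standard-library `iso8859_2` codec. The helpers `can_encode_latin2` and `encode_romanian_latin2` are hypothetical names introduced only for illustration. Folding comma-below letters to their cedilla counterparts is one possible workaround, not a recommended or standardized practice.

```python
# Mapping from the Unicode comma-below letters to the cedilla letters
# that ISO/IEC 8859-2 actually contains (illustrative workaround).
COMMA_TO_CEDILLA = str.maketrans({
    "\u0218": "\u015E",  # Ș -> Ş
    "\u0219": "\u015F",  # ș -> ş
    "\u021A": "\u0162",  # Ț -> Ţ
    "\u021B": "\u0163",  # ț -> ţ
})

def can_encode_latin2(text: str) -> bool:
    """Return True if every character of text exists in ISO-8859-2."""
    try:
        text.encode("iso8859_2")
        return True
    except UnicodeEncodeError:
        return False

def encode_romanian_latin2(text: str) -> bytes:
    """Hypothetical helper: fold comma-below letters to cedilla, then encode."""
    return text.translate(COMMA_TO_CEDILLA).encode("iso8859_2")

print(can_encode_latin2("\u0219"))  # comma-below s
print(can_encode_latin2("\u015F"))  # cedilla s
print(encode_romanian_latin2("\u0219i").hex(" "))
```

The folding step loses information: after decoding, the text contains the cedilla letters, so a round trip does not restore the original comma-below characters.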
Code page layout
In the following table, each character is shown together with its Unicode code point (in hexadecimal) and its decimal code value. Note that code values 00-1F, 7F, and 80-9F are not assigned to characters by ISO/IEC 8859-2. Code 20 is the regular SPACE character, and A0 is the NON-BREAKING SPACE. Code AD is a SOFT HYPHEN, which even in isolation may not appear at all in compliant web browsers.
| | _0 | _1 | _2 | _3 | _4 | _5 | _6 | _7 | _8 | _9 | _A | _B | _C | _D | _E | _F |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0_ | | | | | | | | | | | | | | | | |
| 1_ | | | | | | | | | | | | | | | | |
| 2_ | SP 0020 32 | ! 0021 33 | " 0022 34 | # 0023 35 | $ 0024 36 | % 0025 37 | & 0026 38 | ' 0027 39 | ( 0028 40 | ) 0029 41 | * 002A 42 | + 002B 43 | , 002C 44 | - 002D 45 | . 002E 46 | / 002F 47 | 
| 3_ | 0 0030 48 | 1 0031 49 | 2 0032 50 | 3 0033 51 | 4 0034 52 | 5 0035 53 | 6 0036 54 | 7 0037 55 | 8 0038 56 | 9 0039 57 | : 003A 58 | ; 003B 59 | < 003C 60 | = 003D 61 | > 003E 62 | ? 003F 63 | 
| 4_ | @ 0040 64 | A 0041 65 | B 0042 66 | C 0043 67 | D 0044 68 | E 0045 69 | F 0046 70 | G 0047 71 | H 0048 72 | I 0049 73 | J 004A 74 | K 004B 75 | L 004C 76 | M 004D 77 | N 004E 78 | O 004F 79 | 
| 5_ | P 0050 80 | Q 0051 81 | R 0052 82 | S 0053 83 | T 0054 84 | U 0055 85 | V 0056 86 | W 0057 87 | X 0058 88 | Y 0059 89 | Z 005A 90 | [ 005B 91 | \ 005C 92 | ] 005D 93 | ^ 005E 94 | _ 005F 95 | 
| 6_ | ` 0060 96 | a 0061 97 | b 0062 98 | c 0063 99 | d 0064 100 | e 0065 101 | f 0066 102 | g 0067 103 | h 0068 104 | i 0069 105 | j 006A 106 | k 006B 107 | l 006C 108 | m 006D 109 | n 006E 110 | o 006F 111 | 
| 7_ | p 0070 112 | q 0071 113 | r 0072 114 | s 0073 115 | t 0074 116 | u 0075 117 | v 0076 118 | w 0077 119 | x 0078 120 | y 0079 121 | z 007A 122 | { 007B 123 | \| 007C 124 | } 007D 125 | ~ 007E 126 | |
| 8_ | | | | | | | | | | | | | | | | |
| 9_ | | | | | | | | | | | | | | | | |
| A_ | NBSP 00A0 160 | Ą 0104 161 | ˘ 02D8 162 | Ł 0141 163 | ¤ 00A4 164 | Ľ 013D 165 | Ś 015A 166 | § 00A7 167 | ¨ 00A8 168 | Š 0160 169 | Ş 015E 170 | Ť 0164 171 | Ź 0179 172 | SHY 00AD 173 | Ž 017D 174 | Ż 017B 175 | 
| B_ | ° 00B0 176 | ą 0105 177 | ˛ 02DB 178 | ł 0142 179 | ´ 00B4 180 | ľ 013E 181 | ś 015B 182 | ˇ 02C7 183 | ¸ 00B8 184 | š 0161 185 | ş 015F 186 | ť 0165 187 | ź 017A 188 | ˝ 02DD 189 | ž 017E 190 | ż 017C 191 | 
| C_ | Ŕ 0154 192 | Á 00C1 193 | Â 00C2 194 | Ă 0102 195 | Ä 00C4 196 | Ĺ 0139 197 | Ć 0106 198 | Ç 00C7 199 | Č 010C 200 | É 00C9 201 | Ę 0118 202 | Ë 00CB 203 | Ě 011A 204 | Í 00CD 205 | Î 00CE 206 | Ď 010E 207 | 
| D_ | Đ 0110 208 | Ń 0143 209 | Ň 0147 210 | Ó 00D3 211 | Ô 00D4 212 | Ő 0150 213 | Ö 00D6 214 | × 00D7 215 | Ř 0158 216 | Ů 016E 217 | Ú 00DA 218 | Ű 0170 219 | Ü 00DC 220 | Ý 00DD 221 | Ţ 0162 222 | ß 00DF 223 | 
| E_ | ŕ 0155 224 | á 00E1 225 | â 00E2 226 | ă 0103 227 | ä 00E4 228 | ĺ 013A 229 | ć 0107 230 | ç 00E7 231 | č 010D 232 | é 00E9 233 | ę 0119 234 | ë 00EB 235 | ě 011B 236 | í 00ED 237 | î 00EE 238 | ď 010F 239 | 
| F_ | đ 0111 240 | ń 0144 241 | ň 0148 242 | ó 00F3 243 | ô 00F4 244 | ő 0151 245 | ö 00F6 246 | ÷ 00F7 247 | ř 0159 248 | ů 016F 249 | ú 00FA 250 | ű 0171 251 | ü 00FC 252 | ý 00FD 253 | ţ 0163 254 | ˙ 02D9 255 | 
| | _0 | _1 | _2 | _3 | _4 | _5 | _6 | _7 | _8 | _9 | _A | _B | _C | _D | _E | _F |
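The upper half of the table can be regenerated programmatically as a cross-check. This is a minimal sketch, assuming Python's standard-library `iso8859_2` codec. The function name `latin2_cell` is hypothetical. Each cell is built as character, Unicode code point in hexadecimal, and decimal code value, matching the layout used above.

```python
def latin2_cell(code: int) -> tuple[str, str, int]:
    """Return (character, Unicode code point as 4 hex digits, decimal code value)."""
    ch = bytes([code]).decode("iso8859_2")
    return ch, f"{ord(ch):04X}", code

# Print rows A_ through F_, the part that differs from plain ASCII.
for high in range(0xA, 0x10):
    cells = [latin2_cell(high * 16 + low) for low in range(16)]
    print(f"{high:X}_", " | ".join(f"{c} {u} {d}" for c, u, d in cells))
```

NBSP (A0) and SHY (AD) are printed as the raw characters here, so they may be invisible in the output.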
External links
- ISO 8859-2:1999
- Standard ECMA-94: 8-Bit Single Byte Coded Graphic Character Sets - Latin Alphabets No. 1 to No. 4 2nd edition (June 1986)
- ISO-IR 101 Right-Hand Part of Latin Alphabet No.2 (February 1, 1986)
- ISO 8859-2 (Latin 2) Resources
