Code page 850
MIME / IANA | IBM850 |
---|---|
Alias(es) | cp850, 850, csPC850Multilingual,[1]DOS Latin 1, OEM 850 |
Language(s) | English,various others |
Classification | Extended ASCII,OEM code page |
Extends | US-ASCII |
Based on | OEM-US |
Transforms / Encodes | ISO/IEC 8859-1(reordered) |
Other related encoding(s) | Code page 858(PC DOS 2000's "modified code page 850" ),code page 437 |
Code page 850(CCSID850) (also known as CP 850, IBM 00850,[2]OEM 850,[3]DOS Latin 1[4]) is acode pageused underDOSoperating systems[a]in Western Europe.[5]Depending on the country setting and system configuration, code page 850 is the primary code page and defaultOEM code pagein many countries, including various English-speaking locales (e.g. in the United Kingdom, Ireland, and Canada), whilst other English-speaking locales (like the United States) default to thehardware code page437.[6]
Code page 850 differs fromcode page 437in that many of thebox-drawing characters,Greek letters,and various symbols were replaced with additional Latin letters withdiacritics,thus greatly improving support for Western European languages (all characters fromISO 8859-1are included). At the same time, the changes frequently caused display glitches with programs that made use of the box-drawing characters to display a GUI-like surface in text mode.
After the DOS era, successoroperating systemslargely replaced code page 850 withWindows-1252,[b]laterUCS-2andUTF-16,[c]and finallyUTF-8.However, legacy applications, especiallycommand-line programs,may still depend on support for older code pages.
Character set
[edit]Each non-ASCII character appears with its equivalentUnicodecode-point. Differences fromcode page 437are limited to the second half of the table, the first half being the same.
0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | A | B | C | D | E | F | |
0x 0 |
NUL | ☺ 263A |
☻ 263B |
♥ 2665 |
♦ 2666 |
♣ 2663 |
♠ 2660 |
• 2022 |
◘ 25D8 |
○ 25CB |
◙ 25D9 |
♂ 2642 |
♀ 2640 |
♪ 266A |
♫ 266B |
☼ 263C |
1x 16 |
► 25BA |
◄ 25C4 |
↕ 2195 |
‼ 203C |
¶ 00B6 |
§ 00A7 |
▬ 25AC |
↨ 21A8 |
↑ 2191 |
↓ 2193 |
→ 2192 |
← 2190 |
∟ 221F |
↔ 2194 |
▲ 25B2 |
▼ 25BC |
2x 32 |
SP | ! | " | # | $ | % | & | ' | ( | ) | * | + | , | - | . | / |
3x 48 |
0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | : | ; | < | = | > | ? |
4x 64 |
@ | A | B | C | D | E | F | G | H | I | J | K | L | M | N | O |
5x 80 |
P | Q | R | S | T | U | V | W | X | Y | Z | [ | \ | ] | ^ | _ |
6x 96 |
` | a | b | c | d | e | f | g | h | i | j | k | l | m | n | o |
7x 112 |
p | q | r | s | t | u | v | w | x | y | z | { | | | } | ~ | ⌂ 2302 |
8x 128 |
Ç 00C7 |
ü 00FC |
é 00E9 |
â 00E2 |
ä 00E4 |
à 00E0 |
å 00E5 |
ç 00E7 |
ê 00EA |
ë 00EB |
è 00E8 |
ï 00EF |
î 00EE |
ì 00EC |
Ä 00C4 |
Å 00C5 |
9x 144 |
É 00C9 |
æ 00E6 |
Æ 00C6 |
ô 00F4 |
ö 00F6 |
ò 00F2 |
û 00FB |
ù 00F9 |
ÿ 00FF |
Ö 00D6 |
Ü 00DC |
ø 00F8 |
£ 00A3 |
Ø 00D8 |
× 00D7 |
ƒ 0192 |
Ax 160 |
á 00E1 |
í 00ED |
ó 00F3 |
ú 00FA |
ñ 00F1 |
Ñ 00D1 |
ª 00AA |
º 00BA |
¿ 00BF |
® 00AE |
¬ 00AC |
½ 00BD |
¼ 00BC |
¡ 00A1 |
« 00AB |
» 00BB |
Bx 176 |
░ 2591 |
▒ 2592 |
▓ 2593 |
│ 2502 |
┤ 2524 |
Á 00C1 |
 00C2 |
À 00C0 |
© 00A9 |
╣ 2563 |
║ 2551 |
╗ 2557 |
╝ 255D |
¢ 00A2 |
¥ 00A5 |
┐ 2510 |
Cx 192 |
└ 2514 |
┴ 2534 |
┬ 252C |
├ 251C |
─ 2500 |
┼ 253C |
ã 00E3 |
à 00C3 |
╚ 255A |
╔ 2554 |
╩ 2569 |
╦ 2566 |
╠ 2560 |
═ 2550 |
╬ 256C |
¤ 00A4 |
Dx 208 |
ð 00F0 |
Ð 00D0 |
Ê 00CA |
Ë 00CB |
È 00C8 |
ı 0131 |
Í 00CD |
Î 00CE |
Ï 00CF |
┘ 2518 |
┌ 250C |
█ 2588 |
▄ 2584 |
¦ 00A6 |
Ì 00CC |
▀ 2580 |
Ex 224 |
Ó 00D3 |
ß 00DF |
Ô 00D4 |
Ò 00D2 |
õ 00F5 |
Õ 00D5 |
µ 00B5 |
þ 00FE |
Þ 00DE |
Ú 00DA |
Û 00DB |
Ù 00D9 |
ý 00FD |
Ý 00DD |
¯ 00AF |
´ 00B4 |
Fx 240 |
SHY 00AD |
± 00B1 |
‗ 2017 |
¾ 00BE |
¶ 00B6 |
§ 00A7 |
÷ 00F7 |
¸ 00B8 |
° 00B0 |
¨ 00A8 |
· 00B7 |
¹ 00B9 |
³ 00B3 |
² 00B2 |
■ 25A0 |
NBSP 00A0 |
Code page 858
[edit]MIME / IANA | IBM00858 |
---|---|
Alias(es) | CCSID00858, CP00858, PC-Multilingual-850+euro[1] |
Transforms / Encodes | ISO 8859-1 |
Preceded by | Code page 850 |
In 1998,code page 858(CCSID858)[11](also known as CP 858, IBM 00858, OEM 858[3]) was derived from this code page by changing code point 213 (D5hex) from adotless i⟨ı⟩to theeuro sign⟨€⟩U+20AC.[12][13][14]Unlike most code pages modified to support the euro sign, the genericcurrency signat CFhexwas not chosen as the character to replace (compareISO-8859-15(fromISO-8859-1), code pages808(from 866),848(from 1125),849(from 1131) and872(from 855), ISO-IR-205 (fromISO-8859-4), ISO-IR-206 (fromISO-8859-13), and the changes toMacRomanandMacCyrillic).
Instead of adding support for the new code page 858, IBM'sPC DOS 2000,also released in 1998, changed the definition of the existing code page 850 to what IBM calledmodified code page 850to include the euro sign at code point 213.[15][16][17][18][19]The reason for this might have been due to restrictions inMS-DOS/PC DOS,which limited.CPI files to 64 KB in size or about six codepages maximum. Adding support for codepage 858 might have meant to drop another (e.g. codepage 850) at the same time, which might not have been a viable solution at that time, given that some applications were hard-wired to use codepage 850. More recent IBM/MS products implemented codepage 858 under its own ID.
Code page 1108
[edit]Code page 1108(DITROFF Base Compatibility) is an extension of this codepage which alters some code points in the range 0–32 from their definitions inCode page 437.[20] DITROFF (device independent troff) is an intermediate format of the standard Unix text formatterTroff.
0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | A | B | C | D | E | F | |
0x 0 |
NUL | ☺ 263A |
ff FB00 |
fi FB01 |
fl FB02 |
ffi FB03 |
ffl FB04 |
• 2022 |
– 2013 |
○ 25CB |
† 2020 |
‡ 2021 |
™ 2122 |
— 2014 |
‘ 2018 |
’ 2019 |
1x 16 |
► 25BA |
◄ 25C4 |
⅛ 215B |
⅜ 215C |
⅝ 215D |
⁰ 2070 |
⁴ 2074 |
⁵ 2075 |
↑ 2191 |
↓ 2193 |
→ 2192 |
← 2190 |
⁶ 2076 |
⁷ 2077 |
⁸ 2078 |
⁹ 2079 |
Code page 1109
[edit]Code page 1109(DITROFF Specials Compatibility) contains characters not available in Code page 1108.[21]
0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | A | B | C | D | E | F | |
2x | SP | ⎸ 23B8 |
⎺ 23BA |
⎽ 23BD |
⎼ 23BC |
⎡ 23A1 |
⎣ 23A3 |
⎤ 23A4 |
⎦ 23A6 |
⎧ 23A7 |
⎨ 23A8 |
⎩ 23A9 |
⎫ 23AB |
⎬ 23AC |
⎭ 23AD |
⎪ 23AA |
3x | □ 25A1 |
Code page 1044
[edit]Code page 1044(CCSID1044) is acode pageused underDOSto use in shipping labels. It is a subset of Code page 850.
Each character appears with its equivalentUnicodecode-point.[22]
0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | A | B | C | D | E | F | |
0x | ||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
1x | ||||||||||||||||
2x | SP | " | $ | % | & | ' | ( | ) | * | + | , | - | . | / | ||
3x | 0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | : | ; | = | |||
4x | A | B | C | D | E | F | G | H | I | J | K | L | M | N | O | |
5x | P | Q | R | S | T | U | V | W | X | Y | Z | \ | ||||
6x | ||||||||||||||||
7x | ||||||||||||||||
8x | Ç | Ä | Å | |||||||||||||
9x | É | Æ | Ö | Ü | Ø | × | ||||||||||
Ax | Ñ | |||||||||||||||
Bx | Á | Â | À | |||||||||||||
Cx | Ã | |||||||||||||||
Dx | Ð | Ê | Ë | È | Í | Î | Ï | Ì | ||||||||
Ex | Ó | ß | Ô | Ò | Õ | µ | Þ | Ú | Û | Ù | Ý | |||||
Fx | SHY | ± | ÷ | NBSP |
Code page 1034
[edit]Code page 1034(CCSID1034) is acode pageused underDOSto use in shipping labels. It is the second set used after code page 1044.[23]This is the code page with the fewest characters.
Each character appears with its equivalentUnicodecode-point.[24]
0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | A | B | C | D | E | F | |
2x | SP | |||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
3x | ||||||||||||||||
4x | ⎊ |
Code page 906
[edit]Code page 906(CCSID906) is acode pageused by the IBM 3812, likecode page 907.It is a modification of Code page 850.[25]
Each character appears with its equivalentUnicodecode-point.[26]
0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | A | B | C | D | E | F | |
0x | ||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
1x | ¶ 00B6 |
§ 00A7 |
||||||||||||||
2x | SP | ! | " | # | $ | % | & | ' | ( | ) | * | + | , | - | . | / |
3x | 0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | : | ; | < | = | > | ? |
4x | @ | A | B | C | D | E | F | G | H | I | J | K | L | M | N | O |
5x | P | Q | R | S | T | U | V | W | X | Y | Z | [ | \ | ] | _ | |
6x | a | b | c | d | e | f | g | h | i | j | k | l | m | n | o | |
7x | p | q | r | s | t | u | v | w | x | y | z | { | | | } | ||
8x | Ç 00C7 |
ü 00FC |
é 00E9 |
â 00E2 |
ä 00E4 |
à 00E0 |
å 00E5 |
ç 00E7 |
ê 00EA |
ë 00EB |
è 00E8 |
ï 00EF |
î 00EE |
ì 00EC |
Ä 00C4 |
Å 00C5 |
9x | É 00C9 |
æ 00E6 |
Æ 00C6 |
ô 00F4 |
ö 00F6 |
ò 00F2 |
û 00FB |
ù 00F9 |
ÿ 00FF |
Ö 00D6 |
Ü 00DC |
ø 00F8 |
£ 00A3 |
Ø 00D8 |
× 00D7 |
ƒ 0192 |
Ax | á 00E1 |
í 00ED |
ó 00F3 |
ú 00FA |
ñ 00F1 |
Ñ 00D1 |
ª 00AA |
º 00BA |
¿ 00BF |
® 00AE |
¬ 00AC |
½ 00BD |
¼ 00BC |
¡ 00A1 |
« 00AB |
» 00BB |
Bx | œ 0153 |
Œ 0152 |
Ÿ 0178 |
Á 00C1 |
 00C2 |
À 00C0 |
‘ 2018 |
’ 2019 |
“ 201C |
” 201D |
¢ 00A2 |
¥ 00A5 |
||||
Cx | ã 00E3 |
à 00C3 |
FSP 2007 |
≤ 2264 |
≥ 2265 |
≠ 2260 |
¤ 00A4 | |||||||||
Dx | Ê 00CA |
Ë 00CB |
È 00C8 |
Í 00CD |
Î 00CE |
Ï 00CF |
Ŀ 013F |
ŀ 0140 |
¦ 00A6 |
Ì 00CC |
ij 0133 | |||||
Ex | Ó 00D3 |
ß 00DF |
Ô 00D4 |
Ò 00D2 |
õ 00F5 |
Õ 00D5 |
µ 00B5 |
Ú 00DA |
Û 00DB |
Ù 00D9 |
||||||
Fx | SHY 00AD |
± 00B1 |
¾ 00BE |
¶ 00B6 |
§ 00A7 |
÷ 00F7 |
° 00B0 |
· 00B7 |
¹ 00B9 |
³ 00B3 |
² 00B2 |
■ 25A0 |
NBSP 00A0 |
See also
[edit]Notes
[edit]- ^as well asPsion'sEPOC16operating system
- ^akin to and not always well-distinguished fromISO-8859-1
- ^TheWindows NTline was nativelyUnicodefrom the start, but issues of development tool support and compatibility withWindows 9xkept most applications on the 8-bit code pages.
References
[edit]- ^abCharacter Sets,Internet Assigned Numbers Authority(IANA), 2018-12-12
- ^"00850"(PDF).Code pages by CPGID.IBM.Archived(PDF)from the original on 2012-09-23.Retrieved2020-02-24.
- ^abc"OEM 850".Go Global Developer Center.Microsoft.Archived fromthe originalon 2016-06-06.Retrieved2016-06-06.
- ^"Code Page 850 MS-DOS Latin 1".Developing International Software.Microsoft.Archivedfrom the original on 2016-06-06.Retrieved2016-06-06.
- ^"CCSID 850 information document".Archived fromthe originalon 2016-03-27.
- ^Paul, Matthias R. (1997-07-30). "II.16.iii. Landessprachliche Unterstützung - Landescodes und Keyboard-Kürzel" [II.16.iii. National language support - Country codes and keyboard layout IDs].NWDOS-TIPs — Tips & Tricks rund um Novell DOS 7, mit Blick auf undokumentierte Details, Bugs und Workarounds[NWDOSTIPs — Tips & tricks for Novell DOS 7, with special focus on undocumented details, bugs and workarounds]. MPDOSTIP (in German) (3 ed.). Archived fromthe originalon 2016-06-06.Retrieved2016-06-06.(NB. NWDOSTIP.TXT is a comprehensive work onNovell DOS 7andOpenDOS 7.01,including the description of many undocumented features and internals. It is part of the author's yet larger MPDOSTIP.ZIP collection maintained up to 2001 and distributed on many sites at the time. The provided link points to a HTML-converted older version of the NWDOSTIP.TXT file.)
- ^"cp850_DOSLatin1 to Unicode table"(TXT).The Unicode Consortium.Archivedfrom the original on 2016-06-06.Retrieved2016-06-06.
- ^Code Page CPGID 00850(pdf),IBM, 1986
- ^Code Page (CPGID) 00850(txt),IBM, 1998
- ^"International Components for Unicode (ICU), ibm-850_P100-1995.ucm".GitHub.2002-12-03.Archivedfrom the original on 2022-01-28.Retrieved2022-01-28.
- ^"CCSID 858 information document".IBM.Archived fromthe originalon 2016-03-27.
- ^Code Page (CPGID) 00858(txt),IBM, 1998
- ^"00858".Code pages by CPGID.IBM.Archivedfrom the original on 2016-06-06.Retrieved2016-06-06.
- ^"Code page 858 information document".IBM.Archived fromthe originalon 2016-08-20.
- ^Paul, Matthias R. (2001-08-15)."Changing codepages in FreeDOS"(Technical design specification). Archived fromthe originalon 2016-06-06.Retrieved2016-06-06.
The new official ID for the Multilingual "codepage 850 with EURO SIGN" is 858, not 850. IBM will switch to use 858 instead of their 850 variant with future issues of their products. […] I can only guess why they didn't add 858 to theirEGAx.CPI,COUNTRY.SYS,andKEYBOARD.SYSfiles inPC DOS 2000.Many third-party applications are designed to work with 850 and didn't know about 858 at the time PC DOS 2000 was released, so it's easier for everyone, but unfortunately it's not compatible. […] As explained above, COUNTRY.SYS and KEYBOARD.SYS contain only two codepage entries for a given country in Western issues of DOS. (In Arabic and Hebrew issues there can be up to 8 codepages for one country, in theory there is no limit below the range of allowed codepages 1..65534). […] The problem is that removing support for 850 might have caused compatibility problems with applications which are hard-wired to use 850. Adding 858 as a third choice to all the files would have increased the file and table sizes significantly. The COUNTRY.SYS file parser in MS-DOS/PC DOSIO.SYS/IBMBIO.COMsets aside a 6 Kb (for DOS 6) scratchpad to load all the info. This allows a maximum of 438 entries in a COUNTRY.SYS file to be accepted, otherwise you will get the message "COUNTRY.SYS too large.". TheNLSFUNCparser does not have this limitation, and the file parsers in DR-DOS (kernel and NLSFUNC) also do not know of such a restriction. Older issues of MS-DOS/PC DOS even had a 2 Kb buffer for a maximum of 146 entries.
- ^Paul, Matthias R. (2001-06-10) [1995]."DOS COUNTRY.SYS file format"(COUNTRY.LST file) (1.44 ed.).Archivedfrom the original on 2016-04-20.Retrieved2016-08-20.
- ^Starikov, Yuri (2005-04-11)."15-летию Russian MS-DOS 4.01 посвящается"[15 Years of Russian MS-DOS 4.01] (in Russian).Archivedfrom the original on 2016-06-06.Retrieved2014-05-07.
- ^Paul, Matthias R. (2001-08-27)."Changing codepages in FreeDOS (follow-up)".Archivedfrom the original on 2014-10-01.Retrieved2013-05-08.
[…] one could also create custom.CPI files in the traditional FONT style without difficulties, but you could only store up to […] six codepages in such a file if it should be useable by MS-DOS/PC DOS (some OEM issues and NT can handle files larger than 64 Kb, but MS-DOS/PC DOS can not).
(NB. Based on fd-dev post[1].) - ^Paul, Matthias R. (2001-06-10) [1995]."Format description of DOS, OS/2, and Windows NT.CPI, and Linux.CP files"(CPI.LST file) (1.30 ed.).Archivedfrom the original on 2016-04-20.Retrieved2016-08-20.
- ^"Code Page 1108 DITROFF Base Compatibility"(PDF).Archived fromthe original(PDF)on 2013-01-21.
- ^"Code Page 1109 DITROFF Special Compatibility"(PDF).Archived fromthe original(PDF)on 2013-01-21.
- ^"Code Page 1044"(PDF).Archived fromthe original(PDF)on 2013-01-21.
- ^"IBM i Globalization: Code pages".Archived fromthe originalon 2012-07-16.
- ^"Code Page 1034"(PDF).Archived fromthe original(PDF)on 2013-01-21.
- ^"IBM i Globalization: Code pages".Archived fromthe originalon 2012-07-16.
- ^"Code Page 906"(PDF).Archived fromthe original(PDF)on 2013-01-21.