Code page 936 (IBM)

From Wikipedia, the free encyclopedia
IBM-936
Language(s)Simplified Chinese
Created byIBM
Current statusDeprecated
Transforms / EncodesGB 2312
Succeeded byIBM-1381
Other related encoding(s)Shift JIS

IBM code page 936 was a character encoding for Simplified Chinese including 1880 UDC. It was a combination of the single-byte Code page 903 and the double-byte Code page 928.[1]

Code page 928, the double byte component, included 9,355 characters as double-byte sequences starting with 0x81 through 0xAC and 0xF0 through 0xFA,[2] where lead bytes 0x81–87 were used for non-hanzi, 0x88–9C were used for level 1 hanzi, 0x9C–AC were used for level 2 hanzi, 0xF0 through 0xF9 were used for user-defined characters and 0xFA as used for additional non-hanzi.[3] Its last revision was documented in 1992 and it was superseded in 1993 by Code page 1380, encoding the same characters in a different layout.[3] Chart definitions for Code page 1380 (the document C-H 3-3220-130 1993-11) are provided online by IBM, whereas IBM do not similarly provide the chart definition for the older Code page 928 (the document C-H 3-3220-130 1992-11) and suggest contacting them for more information.[3][4]

IBM code page 936 should not be confused with the identically numbered Windows code page, which is a variant of the GBK encoding and is called Code page 1386 by IBM. International Components for Unicode does not include an IBM-936 codec, and uses the Windows code page for the "cp936" label.[5]

References[]

  1. ^ "CCSID 936 information document". Archived from the original on 2016-03-27.
  2. ^ "CCSID 928 information document". Archived from the original on 2016-03-26.
  3. ^ a b c "Table 1: Registration of GCSGID and CPGID for the IBM CH-S Graphic Character Set". C-H 3-3220-130 1993-11: IBM Simplified Chinese Graphic Character Set (PDF). 1993. p. 6.
  4. ^ "Code page 928 information document". Archived from the original on 2016-03-17.
  5. ^ "windows-936-2000 (alias cp936)". ICU Demonstration - Converter Explorer. International Components for Unicode.

External links[]

  • GB2312: Comparison of conversion tables: the file csets-1.7/SHIFTGB.TXT ("Shifted GB2312.1980. Generated from an algorithm provided with some older Chinese packages.") matches the lead byte ranges documented for level 1 hanzi and level 2 hanzi in Code page 936.
Retrieved from ""