GB 12345

GB 12345,[1] entitled Code of Chinese ideogram set for information interchange supplementary set (Chinese: 信息交换用汉字编码字符集 辅助集), is a Traditional Chinese character set standard established by China, and can be thought of as the traditional counterpart of GB 2312. It is used as an encoding of traditional Chinese characters, although it is not as commonly used as Big5. It has 6,866 characters, and is neither related to nor compatible with Big5 and CNS 11643.

Characters

Characters in GB 12345 are arranged in a 94×94 grid (as in ISO/IEC 2022), and the two-byte code point of each character is expressed in the qu-wei form, which specifies a row (qu 区) and the position of the character within the row (cell, wei 位).

The rows (numbered from 1 to 94) contain characters as follows:[2]

  • 01–09: identical to GB 2312, except that 29 vertical punctuation forms are added in row 06, positions 57–85, and 6 pinyin characters from GB 5007.1–85, the correction of GB 2312, are added in row 08, positions 27–32.
  • 16–87: the traditional character forms, which replace their simplified forms from GB 2312.
  • 88–89: 103 Chinese characters that were merged into other characters by the simplification of Chinese characters.

The rows 10–15 and 90–94 are unassigned.
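The row layout above can be turned into a small lookup. The following is a minimal sketch, not part of the standard: the names `ROW_RANGES`, `parse_quwei` and `describe_row` are hypothetical, and the labels are informal summaries of the list above.

```python
from typing import Tuple

# Row ranges as listed above. The labels are informal descriptions
# written for this sketch, not names taken from the standard.
ROW_RANGES = [
    (1, 9, "as GB 2312, plus vertical punctuation and pinyin additions"),
    (10, 15, "unassigned"),
    (16, 87, "traditional forms replacing GB 2312 simplified forms"),
    (88, 89, "characters merged by simplification"),
    (90, 94, "unassigned"),
]


def parse_quwei(text: str) -> Tuple[int, int]:
    """Parse a qu-wei string such as '57-76' or '57–76' into (row, cell)."""
    row_str, cell_str = text.replace("–", "-").split("-")
    row, cell = int(row_str), int(cell_str)
    if not (1 <= row <= 94 and 1 <= cell <= 94):
        raise ValueError(f"qu-wei outside the 94x94 grid: {text!r}")
    return row, cell


def describe_row(row: int) -> str:
    """Return the informal label of the range that contains the given row."""
    for first, last, label in ROW_RANGES:
        if first <= row <= last:
            return label
    raise ValueError(f"row outside 1..94: {row}")


if __name__ == "__main__":
    row, cell = parse_quwei("57–76")
    print(row, cell, describe_row(row))
```

The sketch only classifies rows. It does not know which individual cells inside an assigned row are actually occupied.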

Encodings

The specification for the ISO-2022-CN-EXT encoding states that the sequence ESC $ ) followed by a yet-undetermined byte (shown by the placeholder <X12345>) can be used to indicate GB 12345 characters, similarly to the sequence ESC $ ) A (which shares the ESC $ ) prefix) indicating GB 2312, but only after the set receives a registration in the ISO-IR registry specifying the final byte of the sequence.[3] As of 2023, no such registration exists.[4] However, the same Request for Comments also defines the encoding label CN-GB-12345 for GB 12345 used with ASCII in a manner analogous to EUC-CN.[3]
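As an illustration of what "analogous to EUC-CN" implies for the byte values, the sketch below converts a qu-wei pair to bytes under the usual convention for 94×94 sets. It assumes CN-GB-12345 follows the same arithmetic as EUC-CN (each byte is 0xA0 plus the row or cell number). The function names are hypothetical, and no escape sequence for GB 12345 is emitted because its final byte has not been registered.

```python
from typing import Tuple


def quwei_to_8bit(row: int, cell: int) -> bytes:
    """EUC-CN-style form: each byte is 0xA0 plus the row or cell number."""
    _check(row, cell)
    return bytes([0xA0 + row, 0xA0 + cell])


def quwei_to_7bit(row: int, cell: int) -> bytes:
    """7-bit ISO/IEC 2022 form: each byte is 0x20 plus the row or cell number."""
    _check(row, cell)
    return bytes([0x20 + row, 0x20 + cell])


def eightbit_to_quwei(data: bytes) -> Tuple[int, int]:
    """Inverse of quwei_to_8bit for a single two-byte character."""
    if len(data) != 2:
        raise ValueError("expected exactly two bytes")
    row, cell = data[0] - 0xA0, data[1] - 0xA0
    _check(row, cell)
    return row, cell


def _check(row: int, cell: int) -> None:
    # Both coordinates must lie inside the 94x94 grid.
    if not (1 <= row <= 94 and 1 <= cell <= 94):
        raise ValueError(f"outside the 94x94 grid: {row}-{cell}")


if __name__ == "__main__":
    print(quwei_to_8bit(57, 76).hex())   # d9ec
    print(quwei_to_7bit(57, 76).hex())   # 596c
    print(eightbit_to_quwei(b"\xd9\xec"))  # (57, 76)
```

Because only the arithmetic is shown, the same functions apply to any 94×94 set. Which character a given pair denotes depends on whether the stream is labelled as GB 2312 or GB 12345.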

Inclusion of non-standard Traditional Chinese characters

GB/T 12345 includes a few traditional characters that differ from the correspondences between Simplified Chinese characters and Traditional Chinese characters given in the standard Table of General Standard Chinese Characters.

  • 鳧 (57–76): the traditional counterpart of 凫 is 鳬 (2013:3620, page 76); 鳧 is not listed as a variant form either.
  • 隸 (33–05): the traditional counterpart of 隶 is 隷 (2013:1305, page 60); however, 隸 is listed as a variant form.
  • 𨻶 (47–22): 隙 has no traditional correspondence in the standard.
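Since rows 16–87 place each traditional form at the position its counterpart occupies in GB 2312, the qu-wei codes in this list can be cross-checked against Python's built-in `gb2312` codec. This is a rough sketch under one assumption: that the GB 2312 characters at these three positions are 凫, 隶 and 隙. The helper name `gb2312_quwei` is hypothetical.

```python
from typing import Tuple


def gb2312_quwei(ch: str) -> Tuple[int, int]:
    """Return the (row, cell) of a single character in GB 2312.

    Python's 'gb2312' codec produces EUC-CN bytes, so subtracting 0xA0
    from each byte gives the qu-wei coordinates.
    """
    data = ch.encode("gb2312")
    if len(data) != 2:
        raise ValueError(f"not a two-byte GB 2312 character: {ch!r}")
    return data[0] - 0xA0, data[1] - 0xA0


if __name__ == "__main__":
    # Assumed GB 2312 characters at the positions listed above.
    for ch in "凫隶隙":
        row, cell = gb2312_quwei(ch)
        print(f"{ch}: {row:02d}-{cell:02d}")
```

Python ships no GB 12345 codec, so the check runs only in this direction. The GB 12345 characters themselves would need a separate mapping table.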

GB 12345 and Unicode

The characters in GB 12345 were taken as one of the sources for the Han unification which led to the unified set of CJK characters in the initial ISO 10646/Unicode standard. All 6,866 Chinese characters were incorporated.

-- References --

  1. "GB/T 12345-1990: Code of Chinese ideogram set for information interchange--Supplementary set". Standardization Administration of the People's Republic of China. Retrieved 2022-10-01.
  2. Lunde, Ken (2009). CJKV Information Processing: Chinese, Japanese, Korean & Vietnamese Computing (2nd ed.). Sebastopol, CA: O'Reilly. pp. 150–151. ISBN 978-0-596-51447-1.
  3. Zhu, HF.; Hu, DY.; Wang, ZG.; Kao, TC.; Chang, WCH.; Crispin, M. (1996). Chinese Character Encoding for Internet Messages. IETF. doi:10.17487/RFC1922. RFC 1922. Note: Currently, there are some GB sets that have not been registered in ISO. Here <X7589>, <X7590>, <X12345>, <X13131> and <X13132> represent the final character that will be assigned by ISO for those sets. These GB sets shall only be used once these final characters are assigned.
  4. ISO-IR: ISO/IEC International Register of Coded Character Sets To Be Used With Escape Sequences (PDF) (Registry Index). ITSCJ/IPSJ.