CJK Unified Ideographs Extension B

CJK Unified Ideographs Extension B
RangeU+20000..U+2A6DF
(42,720 code points)
PlaneSIP
ScriptsHan
Assigned42,720 code points
Unused0 reserved code points
Unicode version history
3.1 (2001)42,711 (+42,711)
13.0 (2020)42,718 (+7)
14.0 (2021)42,720 (+2)
Unicode documentation
Code chart ∣ Web page
Note: [1][2]

CJK Unified Ideographs Extension B is a Unicode block containing rare and historic CJK ideographs for Chinese, Japanese, Korean, and Vietnamese submitted to the Ideographic Research Group between 1998 and 2000, plus seven gongche characters for kunqu added in Unicode 13.0, and two characters for the Macao Supplementary Character Set added in Unicode 14.0.[3]

The block has dozens of variation sequences defined for standardized variants.[4]

It also has thousands of ideographic variation sequences registered in the Unicode Ideographic Variation Database (IVD).[5][6] These sequences specify the desired glyph variant for a given Unicode character.

It was the only CJK Unified Ideographs Extension block with a UCS2003 source identifier. Since Extension B contained too many characters, the original code charts were produced with a single glyph for all regions. The glyphs were designed by Beijing Zhongyi Electronic Ltd. After the introduction of multi-column code charts on Unicode 5.2, the original glyphs were retained under the UCS2003 source identifier; they were then removed in Unicode 14.0, being redundant as well as misleading.[7] The glyphs are packaged in the "SimSun-ExtB" font distributed with the Simplified Chinese versions of Windows, and do not adhere to the glyphs for the Mainland China region.

  1. ^ "Unicode character database". The Unicode Standard. Retrieved 2023-07-26.
  2. ^ "Enumerated Versions of The Unicode Standard". The Unicode Standard. Retrieved 2023-07-26.
  3. ^ "18.1: Han (§ Blocks Containing Han Ideographs)" (PDF). The Unicode Standard: Core Specification. Version 15.0. Unicode Consortium. pp. 741–744. 2022. ISBN 978-1-936213-32-0.
  4. ^ "Unicode Character Database: Standardized Variation Sequences". The Unicode Consortium.
  5. ^ "Ideographic Variation Database". Unicode Consortium.
  6. ^ "UTS #37, Unicode Ideographic Variation Database". Unicode Consortium.
  7. ^ "The Unicode Standard, Version 14.0" (PDF). Unicode Consortium.