CJK Unified Ideographs Extension C

Range: U+2A700..U+2B73F (4,160 code points)
Plane: SIP
Scripts: Han
Assigned: 4,154 code points
Unused: 6 reserved code points
Unicode version history:
5.2 (2009): 4,149 (+4,149)
14.0 (2021): 4,153 (+4)
15.0 (2022): 4,154 (+1)
Note: [1][2]

CJK Unified Ideographs Extension C is a Unicode block containing rare and historic CJK ideographs for Chinese, Japanese, Korean, and Vietnamese that were submitted to the Ideographic Research Group between 2002 and 2006. It also contains five "urgently needed" characters added in Unicode versions 14.0 and 15.0, some of which had previously been mistakenly unified with other characters.[3]

The block has dozens of ideographic variation sequences registered in the Unicode Ideographic Variation Database (IVD).[4][5] These sequences specify the desired glyph variant for a given Unicode character.
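
As a rough illustration of how such a sequence is put together, the Python sketch below appends a variation selector from the range U+E0100..U+E01EF (the selectors used by ideographic variation sequences) to a base character. The helper names `make_ivs` and `split_ivs` are hypothetical, and the base character and selector are chosen only to show the structure: the code does not consult the IVD, so it cannot tell whether a given pair is actually registered.

```python
# Variation selectors used by ideographic variation sequences (VS17..VS256).
IVS_SELECTOR_FIRST = 0xE0100
IVS_SELECTOR_LAST = 0xE01EF


def make_ivs(base: str, selector_index: int = 0) -> str:
    """Append the selector_index-th ideographic variation selector to base.

    Hypothetical helper: it only builds the code point pair and does not
    check whether the pair is registered in the IVD.
    """
    if len(base) != 1:
        raise ValueError("base must be a single code point")
    selector = IVS_SELECTOR_FIRST + selector_index
    if not IVS_SELECTOR_FIRST <= selector <= IVS_SELECTOR_LAST:
        raise ValueError("selector index out of range")
    return base + chr(selector)


def split_ivs(text: str):
    """Return (base, selector) if text is shaped like an IVS, else None."""
    if len(text) == 2 and IVS_SELECTOR_FIRST <= ord(text[1]) <= IVS_SELECTOR_LAST:
        return text[0], text[1]
    return None


# Illustrative pair only: U+2A700 (first code point of the block) plus VS17.
sequence = make_ivs("\U0002A700", 0)
print([f"U+{ord(c):04X}" for c in sequence])
```

A renderer that does not support the sequence would normally ignore the selector and show the default glyph of the base character.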

The katakana ligature 𪜈 (U+2A708) was erroneously encoded in this block as a Han character.[6]
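
The block boundaries and counts quoted for this block can be cross-checked with a little arithmetic. The following Python sketch assumes only the figures stated in this article (the range U+2A700..U+2B73F and 4,154 assigned code points as of Unicode 15.0); the function name `in_extension_c` is made up for the example.

```python
BLOCK_FIRST = 0x2A700
BLOCK_LAST = 0x2B73F
ASSIGNED = 4154  # assigned code points as of Unicode 15.0, per this article


def in_extension_c(ch: str) -> bool:
    """Return True if the single character ch lies in the block's range."""
    return BLOCK_FIRST <= ord(ch) <= BLOCK_LAST


# The range is inclusive at both ends, hence the +1.
block_size = BLOCK_LAST - BLOCK_FIRST + 1
reserved = block_size - ASSIGNED

print(block_size)                    # 4160
print(reserved)                      # 6
print(in_extension_c("\U0002A708"))  # True: the katakana ligature noted above
print(in_extension_c("\u4E00"))      # False: U+4E00 is outside this range
```

A range test like this only says that a code point falls inside the block; it does not say whether that code point is assigned or reserved.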

  1. "Unicode character database". The Unicode Standard. Retrieved 2023-07-26.
  2. "Enumerated Versions of The Unicode Standard". The Unicode Standard. Retrieved 2023-07-26.
  3. "18.1: Han (§ Blocks Containing Han Ideographs)" (PDF). The Unicode Standard: Core Specification. Version 15.0. Unicode Consortium. 2022. pp. 741–744. ISBN 978-1-936213-32-0.
  4. "Ideographic Variation Database". Unicode Consortium.
  5. "UTS #37, Unicode Ideographic Variation Database". Unicode Consortium.
  6. IRG Working Set 2021