Unicode character property

The Unicode Standard assigns various properties to each Unicode character and code point.[1][2]

The properties can be used to handle characters (code points) in processes, like in line-breaking, script direction right-to-left or applying controls. Some "character properties" are also defined for code points that have no character assigned and code points that are labeled like "<not a character>". The character properties are described in Standard Annex #44.[2]

Properties have levels of forcefulness: normative, informative, contributory, or provisional. For simplicity of specification, a character property can be assigned by specifying a continuous range of code points that have the same property.[3]

  1. ^ "The Unicode Standard Version 16". The Unicode Consortium. September 2024. Retrieved 2024-09-13.
  2. ^ a b "Unicode Standard Annex #44: Unicode Character Database". Unicode. 2024-08-27.
  3. ^ "Unicode Standard Annex #44: Unicode Character Database, 4.2.3 Code Point Ranges". Unicode. 2024-08-27.