Script (Unicode)

ழ்
ع‎‎ ש‎‎ Д A‎

In Unicode, a script is a collection of letters and other written signs used to represent textual information in one or more writing systems.[1] Some scripts support one and only one writing system and language, for example, Armenian. Other scripts support many different writing systems; for example, the Latin script supports English, French, German, Italian, Vietnamese, Latin itself, and several other languages. Some languages make use of multiple alternate writing systems and thus also use several scripts; for example, in Turkish, the Arabic script was used before the 20th century but transitioned to Latin in the early part of the 20th century. More or less complementary to scripts are symbols and Unicode control characters.

The unified diacritical characters and unified punctuation characters frequently have the "common" or "inherited" script property. However, the individual scripts often have their own punctuation and diacritics, so that many scripts include not only letters but also diacritic and other marks, punctuation, numerals and even their own idiosyncratic symbols and space characters.

Unicode 16.0 defines 168 separate scripts, including 99 modern scripts and 69 ancient or historic scripts.[2][3] More scripts are in the process for encoding or have been tentatively allocated for encoding in roadmaps.[4]

  1. ^ "Glossary". unicode.org.
  2. ^ "Unicode Character Database: Scripts". unicode.org.
  3. ^ "Chapter 14: Additional Ancient and Historic Scripts". The Unicode Standard, Version 15.0 (PDF). Mountain View, CA: Unicode, Inc. September 2022. ISBN 978-1-936213-32-0.
  4. ^ https://www.unicode.org/roadmaps/ Roadmaps to Unicode