Basic Latin (Unicode block)

Basic Latin
or
C0 Controls and Basic Latin
RangeU+0000..U+007F
(128 code points)
PlaneBMP
ScriptsLatin (52 characters)
Common (76 characters)
Major alphabetsEnglish
French
German
Spanish
Vietnamese
Symbol setsArabic numerals
Punctuation
Assigned128 code points
33 Control or Format
Unused0 reserved code points
Source standardsISO/IEC 8859, ISO 646
Unicode version history
1.0.0 (1991)128 (+128)
Unicode documentation
Code chart ∣ Web page
Note: [1][2]

The Basic Latin Unicode block,[3] sometimes informally called C0 Controls and Basic Latin,[4] is the first block of the Unicode standard, and the only block which is encoded in one byte in UTF-8. The block contains all the letters and control codes of the ASCII encoding. It ranges from U+0000 to U+007F, contains 128 characters and includes the C0 controls, ASCII punctuation and symbols, ASCII digits, both the uppercase and lowercase of the English alphabet and a control character.

The Basic Latin block was included in its present form from version 1.0.0 of the Unicode Standard, without addition or alteration of the character repertoire.[5] Its block name in Unicode 1.0 was ASCII.[6]

  1. ^ "Unicode character database". The Unicode Standard. Retrieved 2023-07-26.
  2. ^ "Enumerated Versions of The Unicode Standard". The Unicode Standard. Retrieved 2023-07-26.
  3. ^ "block.txt". The Unicode Consortium. Retrieved 2023-03-23.
  4. ^ "C0 Controls and Basic Latin" (PDF). The Unicode Standard, Version 15.0. Unicode, Inc. 2022. Retrieved March 22, 2023.
  5. ^ The Unicode Standard Version 1.0, Volume 1. Addison-Wesley Publishing Company, Inc. 1990. ISBN 0-201-56788-1.
  6. ^ "3.8: Block-by-Block Charts" (PDF). The Unicode Standard. version 1.0. Unicode Consortium.