Controlled natural language

Controlled natural languages (CNLs) are subsets of natural languages that are obtained by restricting the grammar and vocabulary in order to reduce or eliminate ambiguity and complexity. Traditionally, controlled languages fall into two major types: those that improve readability for human readers (e.g. non-native speakers), and those that enable reliable automatic semantic analysis of the language.[1][2]

The first type of languages (often called "simplified" or "technical" languages), for example ASD Simplified Technical English, Caterpillar Technical English, IBM's Easy English, are used in the industry to increase the quality of technical documentation, and possibly simplify the semi-automatic translation of the documentation. These languages restrict the writer by general rules such as "Keep sentences short", "Avoid the use of pronouns", "Only use dictionary-approved words", and "Use only the active voice".[3]

The second type of languages have a formal syntax and formal semantics, and can be mapped to an existing formal language, such as first-order logic. Thus, those languages can be used as knowledge representation languages,[4] and writing of those languages is supported by fully automatic consistency and redundancy checks, query answering, etc.

  1. ^ "A Survey and Classification of Controlled Natural Languages". direct.mit.edu. Retrieved 2024-03-27.
  2. ^ "Controlled Natural Languages for language generation in artificial cognition". IEEE. Retrieved 2024-03-27.
  3. ^ O'Brien, Sharon (2003). "Controlling Controlled English – An Analysis of Several Controlled Language Rule Sets" (PDF). Proceedings of EAMT-CLAW. Archived from the original (PDF) on 2016-03-03. Retrieved 2011-12-30.
  4. ^ Schwitter, Rolf. "Controlled natural languages for knowledge representation." Proceedings of the 23rd International Conference on Computational Linguistics: Posters. Association for Computational Linguistics, 2010.