Jump to content

Unicode anomaly

From Wikipedia, the free encyclopedia
This is an old revision of this page, as edited by Benlisquare (talk | contribs) at 13:24, 10 December 2010 (removing PROD: not complete OR. if in disagreement, bring to AfD.). The present address (URL) is a permanent link to this revision, which may differ significantly from the current revision.

The Unicode Standard has imposed for itself strict rules to guarantee stability. This implies that when mistakes against these permanent rules are published, these mistakes can not be corrected. Depending on the grade of strictness of a rule, a change can be prohibited or allowed. For example, a "Name" given to a code point can and will not change. But a "Script" property is more flexible, by Unicode's own rules.

Anomalies

  • U+2118 SCRIPT CAPITAL P (℘, ℘): it is not a capital
The name says "capital", but it is a small letter. The true capital is U+1D4AB 𝒫 MATHEMATICAL SCRIPT CAPITAL P (𝒫)
  • U+0818 SAMARITAN MARK OCCLUSION and U+0819 SAMARITAN MARK DAGESH: Names mixed up.
Corrected text, names swapped:
U+0818 SAMARITAN MARK OCCLUSION ("strengthens" the consonant, for example changing /w/ to /b/) and
U+0819 SAMARITAN MARK DAGESH (indicates consonant gemination)[1]

Stability policy

Version 1.0 versus Version 2.0 Names

In version 2.0, Unicode changed many code point "Names" from version 1. At the same moment, Unicode stated that from then on, an assigned Name to a code point will never change anymore.

References