Unicode anomaly
It is proposed that this article be deleted because of the following concern:
If you can address this concern by improving, copyediting, sourcing, renaming, or merging the page, please edit this page and do so. You may remove this message if you improve the article or otherwise object to deletion for any reason. Although not required, you are encouraged to explain why you object to the deletion, either in your edit summary or on the talk page. If this template is removed, do not replace it. This message has remained in place for seven days, so the article may be deleted without further notice. Find sources: "Unicode anomaly" – news · newspapers · books · scholar · JSTOR Nominator: Please consider notifying the author/project: {{subst:proposed deletion notify|Unicode anomaly|concern=Not notable. Tagged for notability since 2013.}} ~~~~ Timestamp: 20170126233728 23:37, 26 January 2017 (UTC) Administrators: delete |
![]() | The topic of this article may not meet Wikipedia's general notability guideline. (May 2013) |
![]() | This article needs attention from an expert on the subject. Please add a reason or a talk parameter to this template to explain the issue with the article.(November 2010) |
The Unicode Standard has imposed for itself strict rules to guarantee stability.[1] Depending on the grade of strictness of a rule, a change can be prohibited or allowed. For example, a "Name" given to a code point can not and will not change. But a "Script" property is more flexible, by Unicode's own rules. In version 2.0, Unicode changed many code point "Names" from version 1. At the same moment, Unicode stated that from then on, an assigned Name to a code point will never change anymore. This implies that when mistakes are published, these mistakes cannot be corrected, even if they are trivial (as happened in one instance with the spelling BRAKCET for BRACKET in a character name).
Anomalies
In 2006 Unicode has published a list of anomalies in character names.[2]
- U+0818 ࠘ SAMARITAN MARK OCCLUSION and U+0819 ࠙ SAMARITAN MARK DAGESH: Names mixed up.
- Corrected text, names swapped:
- U+0818 ࠘ SAMARITAN MARK OCCLUSION ("strengthens" the consonant, for example changing /w/ to /b/) and
- U+0819 ࠙ SAMARITAN MARK DAGESH (indicates consonant gemination)[3]
- U+2118 ℘ SCRIPT CAPITAL P (℘, ℘): it is not a capital
- The name says "capital", but it is a small letter. The true capital is U+1D4AB 𝒫 MATHEMATICAL SCRIPT CAPITAL P (𝒫)[4]
- U+FE18 ︘ PRESENTATION FORM FOR VERTICAL RIGHT WHITE LENTICULAR BRAKCET: BRAKCET is spelled wrong. Since this is the fixed Character Name by policy, it cannot be changed.[5]