Errata fixed in Unicode 14.0.0

This page contains the definitive listing of all errata of record since the publication of The Unicode Standard, Version 13.0 and considered resolved by the release of Unicode Version 14.0. These errata are listed by date in the table below. For prior errata resolved in Unicode 13.0 and earlier, see Errata Fixed in Unicode 13.0.

For errata still pending subsequent to the release of Unicode 14.0.0, see the list of current Updates and Errata.

Date  Summary 
2021-09-07

A significant number of hieroglyphs have improper glyphs in the Egyptian Hieroglyphs block. After review by Egyptian experts, these glyphs have been corrected, and the corrected glyphs will be displayed in the Unicode 14.0.0 code charts.

See L2/21-028 for discussion.

The exact list of corrected glyphs is shown in the 14.0 delta charts and they are highlighted there in the Egyptian Hieroglyphs block.

2021-09-07

A number of Arabic characters for use in the Kazakh, Kyrgyz and Uyghur languages have suboptimal glyphs in the Unicode 13.0.0 code charts. After review by Arabic script experts, these glyphs have been corrected, and the corrected glyphs will be displayed in the Unicode 14.0.0 code charts.

See L2/20-289 for discussion.

The exact list of corrected glyphs is shown in the 14.0 delta charts and they are highlighted there in the Arabic block and in the Arabic Presentation Forms-A block.

2021-09-07

The kReading and kSrc_NushuDuben values for four Nushu characters are incorrect in Version 13.0.0 of NushuSources.txt in the UCD. The characters with incorrect values are 1B22A, 1B23B, 1B24B, and 1B268. The values will be updated in Version 14.0.0 of NushuSources.txt. In addition, there are typos in the kReading values for 1B1FE, 1B2EB, and 1B2EC. These typos will also be fixed in Version 14.0.0.

See L2/20-205 for discussion.

2020-05-29

EastAsianWidth.txt for Version 13.0 has a typo in its internal date. The date for Version 13.0 reads:

	# Date: 2029-01-21, 18:14:00 GMT [KW, LI]

The intended year in that date was 2020.

2020-05-29

In Version 13.0 of UAX #38, Unicode Han Database (Unihan), there are errors in the regular expressions for three of the IRG source properties.

kIRG_GSource:
  The second line of the pattern reads: | G4K(-\d{4})?
  The correct value should be: | G4K(-\d{5})?

kIRG_HSource:
  The third line of the pattern reads: | HU-[023][0-9A-F]{4})
  The correct value should omit the final, unmatched parenthesis: | HU-[023][0-9A-F]{4}

kIRG_TSource:
  The pattern reads: T([1-7A-F]|13)-[0-9A-F]{4}|U-[023][0-9A-F]{4}
  The correct value should be: T([1-7A-F]|13)-[0-9A-F]{4}|TU-[023][0-9A-F]{4}

These patterns will be corrected in a future version of UAX #38.
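
The effect of the three corrections can be checked mechanically. The following Python sketch is illustrative only. It uses just the pattern fragments quoted above, not the complete UAX #38 patterns, and the sample values (`G4K-12345`, `TU-2A3B4`, `HU-2A3B4`) are made up to exercise the syntax rather than taken from Unihan data. The helper names `accepts` and `compiles` are hypothetical.

```python
import re

# Fragments quoted in the erratum; the complete UAX #38 patterns are longer.
G_OLD = r"G4K(-\d{4})?"
G_NEW = r"G4K(-\d{5})?"
T_OLD = r"T([1-7A-F]|13)-[0-9A-F]{4}|U-[023][0-9A-F]{4}"
T_NEW = r"T([1-7A-F]|13)-[0-9A-F]{4}|TU-[023][0-9A-F]{4}"
H_OLD = r"HU-[023][0-9A-F]{4})"  # stray closing parenthesis
H_NEW = r"HU-[023][0-9A-F]{4}"


def accepts(pattern: str, value: str) -> bool:
    """Return True if the whole value matches the pattern."""
    return re.fullmatch(pattern, value) is not None


def compiles(pattern: str) -> bool:
    """Return True if Python's re module accepts the pattern syntax."""
    try:
        re.compile(pattern)
    except re.error:
        return False
    return True


# Hypothetical sample values, chosen only to show the difference.
print("G old/new:", accepts(G_OLD, "G4K-12345"), accepts(G_NEW, "G4K-12345"))
print("T old/new:", accepts(T_OLD, "TU-2A3B4"), accepts(T_NEW, "TU-2A3B4"))
print("H old/new compile:", compiles(H_OLD), compiles(H_NEW))
```

In Python's regular expression dialect, the unmatched parenthesis in the kIRG_HSource fragment is a syntax error, so that fragment cannot even be compiled until the parenthesis is removed. Other regex engines may report the problem differently.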

2020-04-06

Section 11, Searching and Matching, in Version 13.0 of UTS #10, Unicode Collation Algorithm, contains a small but significant pair of typos in one part of the statement of step DS4a. Sub-step 4 of DS4a currently reads:

4. Find the largest e' ≥ e such that Q[e',e'] is canonically equivalent to P[n', n'] for some n'.
                    

Each of those substring notations as currently written represents an empty string. The correct, intended text should be:

4. Find the largest e' ≥ e such that Q[e,e'] is canonically equivalent to P[n, n'] for some n'.
                    

This pair of typos has been in the text of DS4a for a number of versions, since a mistake in the final editing of Version 9.0.0. It will be corrected in a future version of UTS #10.
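
The difference between the two wordings can be made concrete with a small Python sketch. It isolates only the substring condition of sub-step 4 and is not an implementation of DS4a or of UCA matching. It assumes that Python half-open slices (`Q[e:e_prime]`) can stand in for the boundary-offset notation Q[e,e'], and that canonical equivalence can be tested by comparing NFD forms. The function names and sample strings are hypothetical.

```python
import unicodedata


def canonically_equivalent(a: str, b: str) -> bool:
    """Two strings are canonically equivalent if their NFD forms are identical."""
    return unicodedata.normalize("NFD", a) == unicodedata.normalize("NFD", b)


def largest_equivalent_end(Q: str, e: int, P: str, n: int):
    """Return (e', n') with the largest e' >= e such that Q[e:e'] is
    canonically equivalent to P[n:n'] for some n' >= n."""
    for e_prime in range(len(Q), e - 1, -1):  # try the largest e' first
        for n_prime in range(n, len(P) + 1):
            if canonically_equivalent(Q[e:e_prime], P[n:n_prime]):
                return e_prime, n_prime
    return None  # not reached when e <= len(Q) and n <= len(P): empty slices match


Q = "a\u0301y"  # a + COMBINING ACUTE ACCENT, then y
P = "\u00E1x"   # precomposed a-acute, then x
print(largest_equivalent_end(Q, 0, P, 0))

# With the typo, both substrings start and end at the same offset, so they
# are empty for every choice of e' and n', and the condition is trivially true.
print(repr(Q[2:2]), repr(P[1:1]))
```

Under these assumptions, the corrected wording picks out the two-character decomposed sequence in Q that matches the single precomposed character in P, whereas the wording with the typos compares two empty strings and therefore constrains nothing.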

2020-03-11

Blocks.txt for Version 13.0 has a typo in the code point for the end of the range of the Tangut Supplement block. The entry for Version 13.0 reads:

18D00..18D8F; Tangut Supplement
                    

In a future version of the standard, that typo will be corrected to reflect the intended range for the block:

18D00..18D7F; Tangut Supplement
                    

This correction also implies the need for corresponding updates to one comment line and one data line in VerticalOrientation.txt:

#         Khitan Small Script & Tangut Sup:  U+18B00..U+18D8F

18D09..18D8F   ; U  # Cn   [135] <reserved-18D09>..<reserved-18D8F>
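
The arithmetic behind these entries can be checked with a short Python sketch. It parses the range field of a UCD-style data line and counts the code points in it. The helper names `parse_range` and `range_size` are hypothetical, and the end point 18D7F used for the reserved range is inferred from the corrected block range rather than quoted from the erratum.

```python
from typing import Tuple


def parse_range(line: str) -> Tuple[int, int]:
    """Parse the leading 'XXXX..YYYY' field of a semicolon-delimited UCD line."""
    field = line.split(";", 1)[0].strip()
    start, end = field.split("..")
    return int(start, 16), int(end, 16)


def range_size(line: str) -> int:
    """Number of code points in the inclusive range at the start of the line."""
    start, end = parse_range(line)
    return end - start + 1


# Blocks.txt entry as published in Version 13.0, and the intended entry.
print(range_size("18D00..18D8F; Tangut Supplement"))
print(range_size("18D00..18D7F; Tangut Supplement"))

# VerticalOrientation.txt data line as quoted above; its count matches
# the bracketed [135] in the trailing comment.
print(range_size("18D09..18D8F   ; U  # Cn   [135] <reserved-18D09>..<reserved-18D8F>"))

# Assumed end point 18D7F, inferred from the corrected block range.
print(range_size("18D09..18D7F   ; U"))
```

A block of 128 code points ends on a 16-column chart boundary at xx7F, which is consistent with 18D7F being the intended end of the block.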