2019 “State of the Unification” Report

The UTC #160 meeting took place last week at Microsoft’s HQ in Redmond, Washington.

For CJK enthusiasts, the big news is that the UTC accepted CJK Unified ideographs Extension G (aka IRG Working Set 2015), which includes 4,939 characters. Additional CJK Unified Ideographs were appended to the URO (13), Extension A (10), and Extension B (7). As a result, the Extension A block will be completely full, and the URO and Extension B blocks will be nearly full. The total number of CJK Unified Ideographs in Unicode Version 13.0 is expected to be 92,856, which will represent approximately 65% of its characters.

The Pipeline accurately reflects the additional CJK Unified Ideographs that are targeted for Unicode Version 13.0. I suspect that the image at the beginning of this article will be helpful, and when clicked, will provide a PDF version that also includes a table for CJK Compatibility Ideographs.

These additions reflect two issues about which developers need to be aware:

  • Plane 3 (aka TIP or Tertiary Ideographic Plane) is now open for business, and Extension G represents the first block encoded among its 65,534 available code points.
  • This is the first time that CJK Unified Ideographs have been appended to a block other than the URO.


🐡

Leave a Reply

Your email address will not be published. Required fields are marked *