I spent this evening in Gyeongju, Korea enhancing two AFDKO-related tools, extract-cids.pl and extract-gids.pl.
These tools are tx filters, and simply output the list of CIDs or GIDs in a font. I had been using another tool, mkrange.pl, to turn the list into ranges. One of the enhancements, which is the addition of an “-r” option, makes the mkrange.pl tool no longer necessary. In other words, the following two command lines have the same result:
% extract-cids.pl -r <font>
% extract-cids.pl <font> | mkrange.pl
The extract-cids.pl tool was additionally enhanced with an “-s” option, which outputs the ranges on a single line using a comma separator. This makes the output very convenient in terms of repurposing it, such as copying it, then pasting it into a new command line as the argument of the “-g” option that is supported by many AFDKO tools. Consider the following two command lines and their output:
% extract-cids.pl -r cidfont.ps
% extract-cids.pl -r -s cidfont.ps
I am currently in Gyeongju (경주시/慶州市), Republic of Korea (ROK), attending IRG 38. For those who are not aware, the IRG (Ideographic Rapporteur Group) is an ad hoc subcommittee of ISO/IEC JTC1/SC2/WG2 (aka WG2), and is charged with all matters related to CJK Unified Ideographs. In other words, all CJK Unified Ideograph repertoires, the latest being Extension D, are the direct result of work by the IRG. I am attending IRG 38 as the sole US and Unicode delegate, and this is the fifth IRG meeting I have attended. IRG meetings are week-long working meetings, and are held twice per year.
I don’t travel very often as part of my work at Adobe, but when I do, it is always to East Asia. (I have never been to Europe.) The last time I traveled as part of my job was two years ago, to attend IRG 34 in Nagaoka, Japan.
For those who signed up to attend the AFDKO Workshop that takes place on the 25th of this month in Tokyo, be warned that I will be conducting my portion, which is from 10AM until noon, in Japanese, or at least in my rendition of Japanese. ☺
Seriously, though, I hope that my Japanese skills, or lack thereof, don’t serve as an impediment for those who attend this workshop. My focus will be to demonstrate how particular AFDKO tools, such as tx, mergeFonts, rotateFont, autohint, and so on, can be used to not only build, but also to directly manipulate CID-keyed fonts. My hope is that I can convey useful information that can serve as inspiration for making font production workflows more efficient and less time-consuming.
For those who are attending, note that materials will be provided that will be based on a small subset of Kozuka Gothic Medium. If any attendee wants to use their own font data for this purpose, I suggest that you bring along an Adobe-Japan1-0 or greater OpenType/CFF font or CIDFont resource. I have prepared scripts that will quickly create the same materials, but using a different base font.
Wish me luck in finishing the accompanying presentation, which currently has 24 slides. In Japanese.
I am very much looking forward to meeting everyone who has signed up to attend this workshop!
I am spending most of my time preparing for the Tokyo AFDKO Workshop that Morisawa is hosting at their Tokyo office on the 25th of this month, which is a Monday. At this point, most of the sample data is ready, and I now need to prepare the accompanying presentation slides.
This will be my first opportunity to engage a larger number of font developers, and to convey to them useful tips, tricks, and techniques for several AFDKO tools, particularly tx, mergeFonts, and rotateFont. My portion of the workshop, which will consume the entire morning, will be about building and manipulating CID-keyed fonts, to include controlling the assignment of CIDs (glyphs) to FDArray elements. I will also cover the use of the special-purpose Adobe-Identity-0 ROS. I am very much looking forward to this event, and working directly with many font developers.
Note that this workshop will be conducted in Japanese.
弊社のAFDKO（Adobe© OpenType©フォント開発キット）を利用されてフォント制作を行われている開発者の方々を対象としたワークショップとなります。それぞれのツールについて有効な利用法について、弊社米国本社のKen Lunde（ケン・ランディ）を含む弊社スタッフが説明する予定です。ぜひご参加ください。
As alluded to in the February 8, 2012 CJK Type Blog article, there is a good chance that an AFDKO Workshop will take place in Japan, very soon. Please stay tuned to this blog for details, which may emerge in less than a week.
ISO/IEC 10646:2012 (Third Edition) was just published. This is the first version of the standard that includes multiple-column Code Charts for Extension B, and for CJK Compatibility Ideographs. Another significant aspect of ISO/IEC 10646:2012 is that it is equivalent to Unicode Version 6.1.
For Adobe, the publishing of this new version of the standard represents a significant milestone, because it means that every Adobe-Japan1-6 kanji is either directly encoded, or is directly associated with a registered IVS in the IVD (Ideographic Variation Database).
Speaking of Unicode Version 6.1, the printed version of the Core Specification is available via POD from Lulu, and at a very attractive price.
I just received good news, in the form of confirmation that both of my ATypI Hong Kong 2012 presentation abstracts were accepted, which means that I will definitely be attending this conference. I alluded to this in the March 30th, 2012 CJK Type Blog article. One of the abstracts is for a 30-minute presentation entitled Kazuraki: Under The Hood, which will immediately follow a 30-minute presentation entitled Kazuraki: Its Art & Design, that will be presented by my colleagues Taro Yamamoto (山本太郎) and Ryoko Nishizuka (西塚涼子). For those who are not aware, Ryoko is the typeface designer of Kazuraki (かづらき), which is the centerpiece of both 30-minute presentations. The other is for a three-hour workshop entitled Manipulating CID-keyed Fonts Using AFDKO Tools, which will be co-presented by my colleague Masataka Hattori (服部正貴).
I am very much looking forward to attending an ATypI conference for the first time, and meeting many people. If you are planning to attend ATypI Hong Kong 2012, please be sure to introduce yourself to me, in case I don’t introduce myself to you first.
For those font developers who are not aware, the official CMap resource repository for our public ROSes is the CMap Resources open source project at Open @ Adobe, which is hosted by SourceForge. When CMap resources are updated, in addition to providing the updates through this portal, an announcement is made in the CMap Resources Forum.
The UTF-16 and UTF-32 CMap resources were introduced in August of 2001, beginning with Adobe-CNS1-4. Those for Adobe-Korea1-2 and Adobe-Japan2-0 followed in January of 2002, followed by those for Adobe-GB1-4 in June of the same year. The UTF-16 and UTF-32 CMap resources for Adobe-Japan1-5 were not released until November of 2002. From that point, the UCS-2 CMap resources were deprecated, and were no longer updated. Clients that used the UCS-2 CMap resources were encouraged to use the UTF-16 or UTF-32 ones instead. For OpenType font development, in terms of building the Unicode (Format 4 and 12) ‘cmap‘ subtables, the UTF-32 CMap resources are recommended.
Years ago, I wrote a Perl script, called unicode-rows.pl, that takes a fully-qualified PostScript name—composed of a CIDFont resource name, two hyphens, and a UTF-32 CMap resource name—then generates a PostScript file that can be distilled into a PDF. The resulting PDF file is a Unicode table, arranged in groups of 256 code points. If the UTF-32 CMap resource includes even a single mapping for a particular group of 256 code points, a page is created.
I have prepared examples that are based on the UniJIS2004-UTF32-H and UniJIS-UTF32-H CMap resources.
As alluded to at the end of the May 9, 2012 CJK Type Blog article, I had plans to build additional CFR objects for testing purposes. That particular article supplied two 64K-glyph OpenType/CFF fonts, which provided BMP and Plane 1 coverage, and served as component fonts for the supplied CFR object, UnicodeGetaCFR.cfr. In today’s article, I will supply a CFR object that encompasses all of Unicode, meaning the BMP and the 16 Supplementary Planes, along with the component fonts that it references. In other words, coverage for 1,112,030 code points, each of which has a unique glyph. These represent valuable testing resources for developers who plan to support CFR objects in their products as a way to break the 64K glyph barrier.
For those AFDKO users who use, plan to use, or would like to explore the broad capabilities of its tx tool, here’s a command line that is very useful when building new versions of existing fonts, especially when only a small number of glyphs have changed:
% tx -bc -sha1 -z 400 <font_file>
The <font_file> portion of the command line can be any type of font file, such as an OpenType font, a CFF resource, a CIDFont resource, or a name-keyed Type 1 font.
As mentioned at the end of the May 15, 2012 CJK Type Blog article, I will demonstrate in this article how to build a CID-keyed font with 64K glyphs (CIDs 0 through 65534) and 256 FDArray elements. These represent two limits that are inherent in CIDFont resources.
Again, the incredibly powerful AFDKO mergeFonts tool will perform most of the work.
Today, I want to demonstrate how the AFDKO mergeFonts tool can be used to quickly and easily build a CID-keyed font that includes 64K glyphs, meaning CIDs 0 through 65534. This is the maximum number of glyphs that a CIDFont resource can contain. This font, of course, will use the special-purpose Adobe-Identity-0 ROS, and although its is a CID-keyed font, it will include only one FDArray element.
Adobe has thus far released two CID-keyed OpenType/CFF fonts that use the special-purpose Adobe-Identity-0 ROS (“ROS” is an abbreviation for /Registry, /Ordering, and /Supplement, which represent the three /CIDSystemInfo dictionary elements that are present in CIDFont and CMap resources): Kazuraki SP2N L (かづらき SP2N L) and Kenten Generic. The former is a commercial OpenType/CFF font, and the latter is an open source one. I have also developed several Adobe-Identity-0 ROS OpenType/CFF fonts for testing purposes, many of which have been provided in recent CJK Type Blog articles, the most recent of which being the May 9th, 2012 article.
The big question that may be on a font developer’s mind is under what circumstances is it appropriate to use the Adobe-Identity-0 ROS?
In the April 20, 2012 CJK Type Blog article, I wrote about the publishing of ISO/IEC 14496-28:2012 (Composite Font Representation), which provides a venue for breaking the 64K glyph barrier that is inherent in all sfnt-based font formats, including name- and CID-keyed PostScript fonts. If the number of glyphs of the combined component fonts that are referenced by a CFR object exceed 64K, would constitute breaking the 64K glyph barrier.
In my work, I need to deal with character codes on a regular basis, such as Unicode scalar values and hexadecimal values for legacy encodings. This includes writing documents that include them. For most purposes, especially when used in tables, tabular figures work best because they are monospaced. Of course, one could simply choose to use a monospaced font. But, unless a different font is actually desired for character codes, using the same typeface design is usually preferred, because it better matches the surrounding text. The issue is that very few, if any, fonts include tabular glyphs that support hexadecimal notation, specifically referring to ‘A’ through ‘F’ (or ‘a’ through ‘f’ for lowercase). Luckily, I was able to solve this particular dilemma.
In the realm of CJK Unified Ideographs, there is always talk about no more characters to encode, or that any new characters are simply unifiable variants. This is, in large part, merely wishful thinking.
In my experience, there are three important words to embrace: Never Say Never.
Almost three years ago, in a September 2009 article on the sister blog, Typblography, we showed a poster for our Kazuraki (かづらき) typeface, which was designed by Ryoko Nishizuka (西塚涼子), who was also its typeface designer. A request came in today for a PDF version of the poster, and instead of posting it into that relatively old (and now buried) article where it would not likely be noticed, I figured that it’d be best to post it here, today.
Click ☞ here ☜ to get the PDF version of the Kazuraki poster.
Enjoy! And for those in Japan, have a safe and enjoyable Golden Week!