<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>CJK Type</title>
	<atom:link href="http://blogs.adobe.com/CCJKType/feed" rel="self" type="application/rss+xml" />
	<link>http://blogs.adobe.com/CCJKType</link>
	<description>CJK Fonts, Character Sets &#38; Encodings. All CJK, all of the time…</description>
	<lastBuildDate>Tue, 15 May 2012 16:21:25 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.3.2</generator>
		<item>
		<title>Building a CID-keyed font with 64K glyphs</title>
		<link>http://blogs.adobe.com/CCJKType/2012/05/64k-glyph-cidfont.html</link>
		<comments>http://blogs.adobe.com/CCJKType/2012/05/64k-glyph-cidfont.html#comments</comments>
		<pubDate>Tue, 15 May 2012 16:04:05 +0000</pubDate>
		<dc:creator>Dr. Ken Lunde</dc:creator>
				<category><![CDATA[Building Font]]></category>

		<guid isPermaLink="false">http://blogs.adobe.com/CCJKType/?p=1652</guid>
		<description><![CDATA[Today, I want to demonstrate how the AFDKO mergeFonts tool can be used to quickly and easily build a CID-keyed font that includes 64K glyphs, meaning CIDs 0 through 65534. This is the maximum number of glyphs that a CIDFont resource can contain. This font, of course, will use the special-purpose Adobe-Identity-0 ROS, and although [...]]]></description>
			<content:encoded><![CDATA[<p>Today, I want to demonstrate how the <a href="http://www.adobe.com/devnet/opentype/afdko.html" target="_blank">AFDKO</a> <em>mergeFonts</em> tool can be used to quickly and easily build a CID-keyed font that includes 64K glyphs, meaning CIDs 0 through 65534. This is the maximum number of glyphs that a CIDFont resource can contain. This font, of course, will use the <a href="http://blogs.adobe.com/CCJKType/2012/05/sp-ai0-ros.html" target="_blank">special-purpose Adobe-Identity-0 ROS</a>, and although its is a CID-keyed font, it will include only one FDArray element.<br />
<span id="more-1652"></span><br />
We can build such a CID-keyed font by using the following three components:</p>
<ol>
<li>A <a href="http://blogs.adobe.com/CCJKType/files/2012/05/font.pfa" target="_blank">name-keyed Type 1 font</a> that includes at least the <em>.notdef</em> glyph (this will serve as the glyph for CID+0) and another glyph that will be used for CIDs 1 through 65534</li>
<li>A <em>cidfontinfo</em> file</li>
<li>A <a href="http://blogs.adobe.com/CCJKType/files/2012/05/map.txt" target="_blank">mergeFonts mapping file</a> that maps the glyphs in the name-keyed Type 1 font to CIDs 0 through 65534</li>
</ol>
<p>The name-keyed font includes the following two glyphs: <em>.notdef</em> and <em>P_zero_one</em> (&#8220;P01&#8243;). As a side note, the <a href="http://blogs.adobe.com/CCJKType/files/2012/05/extract-names.pl" target="_blank"><em>extract-names.pl</em></a> tool, which is a simple wrapper for the AFDKO <em>tx</em> tool, can be used to list the glyphs in a name-keyed font, and is used as follows, with sample output given after the command line (it skips the <em>.notdef</em> glyph in its output):</p>
<p><code>% extract-names.pl <strong>font.pfa</strong><br />
P_zero_one</code></p>
<p>The <em>cidfontinfo</em> file is below (and can be copied to a file with the same name):</p>
<p><code>FontName&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;(UnicodeP01)<br />
FullName&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;(Unicode P01)<br />
FamilyName&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;(Unicode P01)<br />
Weight&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;(Regular)<br />
version&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;(1.000)<br />
Registry&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;(Adobe)<br />
Ordering&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;(Identity)<br />
Supplement&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;0<br />
XUID&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[1 11 9273868]<br />
AdobeCopyright&nbsp;&nbsp;(Copyright 2012 Adobe Systems Incorporated. All Rights Reserved.)<br />
Trademark&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;()<br />
FSType&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;8<br />
isFixedPitch&nbsp;&nbsp;&nbsp;&nbsp;true<br />
Serif&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;false<br />
IsBoldStyle&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;false<br />
IsItalicStyle&nbsp;&nbsp;&nbsp;false</code></p>
<p>The mergeFonts mapping file contains 65,536 lines, the first of which indicates that it is a mergeFonts mapping file, followed by the name of the FDArray element. The second and subsequent lines map glyph names to CIDs. Below is an excerpt:</p>
<p><code>mergeFonts UnicodeP01-0<br />
0&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;.notdef<br />
1&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;P_zero_one<br />
2&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;P_zero_one<br />
3&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;P_zero_one<br />
4&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;P_zero_one<br />
5&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;P_zero_one<br />
<em>&lt;65,524 lines omitted></em><br />
65530&nbsp;&nbsp;P_zero_one<br />
65531&nbsp;&nbsp;P_zero_one<br />
65532&nbsp;&nbsp;P_zero_one<br />
65533&nbsp;&nbsp;P_zero_one<br />
65534&nbsp;&nbsp;P_zero_one</code></p>
<p>The <em>mergeFonts</em> command line, which is used to build the CIDFont resource is as follows:</p>
<p>% mergeFonts <strong>-cid cidfontinfo cidfont.ps map.txt font.pfa</strong></p>
<p>That&#8217;s it! The result is a CIDFont resource named <em>cidfont.ps</em> that contains 65,535 CIDs (CIDs 0 through 65534). Because there is a name- to CID-keyed conversion, the &#8220;-cid&#8221; option and a <em>cidfontinfo</em> file is required when using the <em>mergeFonts</em> tool.</p>
<p>The next CJK Type Blog article, which will follow in a day or two, will demonstrate how the same CID-keyed font can be built, but with 256 FDArray elements, which is the maximum for a CIDFont resource. Again, the <em>mergeFonts</em> tool will be used.</p>
]]></content:encoded>
			<wfw:commentRss>http://blogs.adobe.com/CCJKType/2012/05/64k-glyph-cidfont.html/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>The Special-Purpose Adobe-Identity-0 ROS</title>
		<link>http://blogs.adobe.com/CCJKType/2012/05/sp-ai0-ros.html</link>
		<comments>http://blogs.adobe.com/CCJKType/2012/05/sp-ai0-ros.html#comments</comments>
		<pubDate>Fri, 11 May 2012 19:50:02 +0000</pubDate>
		<dc:creator>Dr. Ken Lunde</dc:creator>
				<category><![CDATA[Building Font]]></category>
		<category><![CDATA[Essay]]></category>

		<guid isPermaLink="false">http://blogs.adobe.com/CCJKType/?p=1628</guid>
		<description><![CDATA[Adobe has thus far released two CID-keyed OpenType/CFF fonts that use the special-purpose Adobe-Identity-0 ROS (&#8220;ROS&#8221; is an abbreviation for /Registry, /Ordering, and /Supplement, which represent the three /CIDSystemInfo dictionary elements that are present in CIDFont and CMap resources): Kazuraki SP2N L (かづらき SP2N L) and Kenten Generic. The former is a commercial OpenType/CFF font, [...]]]></description>
			<content:encoded><![CDATA[<p>Adobe has thus far released two CID-keyed OpenType/CFF fonts that use the special-purpose <em>Adobe-Identity-0</em> ROS (&#8220;ROS&#8221; is an abbreviation for <em>/Registry</em>, <em>/Ordering</em>, and <em>/Supplement</em>, which represent the three <em>/CIDSystemInfo</em> dictionary elements that are present in CIDFont and CMap resources): <a href="http://store1.adobe.com/cfusion/store/html/index.cfm?store=OLS-US&amp;event=displayFont&amp;code=KOZK10005000" target="_blank">Kazuraki SP2N L (<span lang="ja-jp">かづらき</span> SP2N L)</a> and <a href="http://sourceforge.net/adobe/kenten-generic/" target="_blank">Kenten Generic</a>. The former is a commercial OpenType/CFF font, and the latter is an open source one. I have also developed several Adobe-Identity-0 ROS OpenType/CFF fonts for testing purposes, many of which have been provided in recent <a href="http://blogs.adobe.com/CCJKType/" target="_blank">CJK Type Blog</a> articles, the most recent of which being the <a href="http://blogs.adobe.com/CCJKType/2012/05/breaking-64k-glyph-barrier.html" target="_blank">May 9th, 2012 article</a>.</p>
<p>The big question that may be on a font developer&#8217;s mind is under what circumstances is it appropriate to use the Adobe-Identity-0 ROS?<br />
<span id="more-1628"></span><br />
The answer is surprisingly simple:</p>
<p>If the glyphs correspond to one of the public ROSes&mdash;<a href="http://wwwimages.adobe.com/www.adobe.com/content/dam/Adobe/en/devnet/font/pdfs/5080.Adobe-CNS1-6.pdf" target="_blank">Adobe-CNS1-6</a>, <a href="http://wwwimages.adobe.com/www.adobe.com/content/dam/Adobe/en/devnet/font/pdfs/5079.Adobe-GB1-5.pdf" target="_blank">Adobe-GB1-5</a>, <a href="http://wwwimages.adobe.com/www.adobe.com/content/dam/Adobe/en/devnet/font/pdfs/5078.Adobe-Japan1-6.pdf" target="_blank">Adobe-Japan1-6</a>, or <a href="http://wwwimages.adobe.com/www.adobe.com/content/dam/Adobe/en/devnet/font/pdfs/5093.Adobe-Korea1-2.pdf" target="_blank">Adobe-Korea1-2</a>&mdash;or a subset thereof, then using a public ROS is recommended. It is never a problem to build fonts that include only a subset of the glyphs in a public ROS.</p>
<p>Otherwise, the Adobe-Identity-0 ROS is a better choice.</p>
<p>One advantage of using a public ROS is that existing resources can be leveraged, such as CMap resources, GSUB feature definitions, and so on. When using the Adobe-Identity-0 ROS, these resources must be created. The <a href="http://blogs.adobe.com/CCJKType/2012/03/building-utf32-cmaps.html" target="_blank">March 7, 2012 CJK Type Blog article</a> details how to build UTF-32 CMap resources, including an example for the Adobe-Identity-0 ROS.</p>
]]></content:encoded>
			<wfw:commentRss>http://blogs.adobe.com/CCJKType/2012/05/sp-ai0-ros.html/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Towards Breaking The 64K Glyph Barrier&#8230;</title>
		<link>http://blogs.adobe.com/CCJKType/2012/05/breaking-64k-glyph-barrier.html</link>
		<comments>http://blogs.adobe.com/CCJKType/2012/05/breaking-64k-glyph-barrier.html#comments</comments>
		<pubDate>Wed, 09 May 2012 15:32:38 +0000</pubDate>
		<dc:creator>Dr. Ken Lunde</dc:creator>
				<category><![CDATA[Building Font]]></category>
		<category><![CDATA[Essay]]></category>

		<guid isPermaLink="false">http://blogs.adobe.com/CCJKType/?p=1603</guid>
		<description><![CDATA[In the April 20, 2012 CJK Type Blog article, I wrote about the publishing of ISO/IEC 14496-28:2012 (Composite Font Representation), which provides a venue for breaking the 64K glyph barrier that is inherent in all sfnt-based font formats, including name- and CID-keyed PostScript fonts. If the number of glyphs of the combined component fonts that [...]]]></description>
			<content:encoded><![CDATA[<p>In the <a href="http://blogs.adobe.com/CCJKType/2012/04/cfr.html" target="_blank">April 20, 2012 CJK Type Blog article</a>, I wrote about the publishing of <a href="http://www.iso.org/iso/iso_catalogue/catalogue_tc/catalogue_detail.htm?csnumber=59240" target="_blank">ISO/IEC 14496-28:2012</a> (<em>Composite Font Representation</em>), which provides a venue for breaking the 64K glyph barrier that is inherent in all sfnt-based font formats, including name- and CID-keyed PostScript fonts. If the number of glyphs of the combined component fonts that are referenced by a CFR object exceed 64K, would constitute breaking the 64K glyph barrier.<br />
<span id="more-1603"></span><br />
For developers who are interested in adding support for CFR objects in their OS, application, or text engine, I created a simple CFR object, <a href="http://blogs.adobe.com/CCJKType/files/2012/05/UnicodeGetaCFR.cfr" target="_blank"><em>UnicodeGetaCFR.cfr</em></a>, that references two (CID-keyed OpenType/CFF) component fonts, <a href="http://blogs.adobe.com/CCJKType/files/2012/05/UnicodeGetaBMP.otf" target="_blank"><em>UnicodeGetaBMP.otf</em></a> and <a href="http://blogs.adobe.com/CCJKType/files/2012/05/UnicodeGetaP01.otf" target="_blank"><em>UnicodeGetaP01.otf</em></a>, both of which include 64K glyphs (65,535 glyphs, meaning CIDs 0 through 65534).</p>
<p>The <em>UnicodeGetaBMP.otf</em> component font includes 65,534 instances of the same glyph, specifically that of the <em>geta</em> symbol (U+3013 &#x3013; GETA MARK; written <span lang="ja-jp">下駄</span> in Japanese), and maps all 63,486 BMP code points (U+0000 through U+D7FF and U+E000 through U+FFFD) to a unique CID. Below is a screenshot of the glyph for CID+65534:</p>
<p><img src="http://blogs.adobe.com/CCJKType/files/2012/05/geta-bmp.jpg" alt="" /></p>
<p>The <em>UnicodeGetaP01.otf</em> component font also includes 65,534 instances of the same glyph, though to distinguish them from those in the <em>UnicodeGetaBMP.otf</em> component font, they have been rotated 90 degrees. This component font maps all 65,534 Plane 1 code points (U+10000 through U+1FFFD) to a unique CID. Below is a screenshot of the glyph for CID+65534:</p>
<p><img src="http://blogs.adobe.com/CCJKType/files/2012/05/geta-p01.jpg" alt="" /></p>
<p>The CFR object, <em>UnicodeGetaCFR.cfr</em>, registers itself as a font named <em>UnicodeGetaCFR</em>, and specifies <em>UnicodeGetaBMP.otf</em> and <em>UnicodeGetaP01.otf</em> as its component fonts. Although not necessary for this particular example, the CFR object explicitly uses the <code>&lt;UnicodeCharSet></code> element to specify the Unicode ranges of each component font.</p>
<p>I plan to develop additional CFR objects that exercise other aspects of ISO/IEC 14496-28:2012, which will be made available in future CJK Type Blog articles.</p>
]]></content:encoded>
			<wfw:commentRss>http://blogs.adobe.com/CCJKType/2012/05/breaking-64k-glyph-barrier.html/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Making &#8220;Character Codes&#8221; Look Better</title>
		<link>http://blogs.adobe.com/CCJKType/2012/05/making-character-codes-look-better.html</link>
		<comments>http://blogs.adobe.com/CCJKType/2012/05/making-character-codes-look-better.html#comments</comments>
		<pubDate>Wed, 02 May 2012 16:49:04 +0000</pubDate>
		<dc:creator>Dr. Ken Lunde</dc:creator>
				<category><![CDATA[Essay]]></category>
		<category><![CDATA[Type]]></category>
		<category><![CDATA[Using Font]]></category>

		<guid isPermaLink="false">http://blogs.adobe.com/CCJKType/?p=1576</guid>
		<description><![CDATA[In my work, I need to deal with character codes on a regular basis, such as Unicode scalar values and hexadecimal values for legacy encodings. This includes writing documents that include them. For most purposes, especially when used in tables, tabular figures work best because they are monospaced. Of course, one could simply choose to [...]]]></description>
			<content:encoded><![CDATA[<p>In my work, I need to deal with character codes on a regular basis, such as Unicode scalar values and hexadecimal values for legacy encodings. This includes writing documents that include them. For most purposes, especially when used in tables, tabular figures work best because they are monospaced. Of course, one could simply choose to use a monospaced font. But, unless a different font is actually desired for character codes, using the same typeface design is usually preferred, because it better matches the surrounding text. The issue is that very few, if any, fonts include tabular glyphs that support hexadecimal notation, specifically referring to &#8216;A&#8217; through &#8216;F&#8217; (or &#8216;a&#8217; through &#8216;f&#8217; for lowercase). Luckily, I was able to solve this particular dilemma.<br />
<span id="more-1576"></span><br />
When I wrote my first two books, <a href="http://shop.oreilly.com/product/9781565920439.do" target="_blank">Understanding Japanese Information Processing</a> (O&#8217;Reilly &#038; Associates, 1993) and <a href="http://shop.oreilly.com/product/9781565922242.do" target="_blank">CJK Information Processing</a> (O&#8217;Reilly &#038; Associates, 1999), I simply used a different font for character codes that were used in the many tables. If memory serves, the font was <a href="http://en.wikipedia.org/wiki/Courier_(typeface)" target="_blank">Courier</a> or a derivative. However, when I wrote my third book, <a href="http://shop.oreilly.com/product/9780596514471.do" target="_blank">CJK Information Processing, Second Edition</a> (O&#8217;Reilly Media, 2009), <a href="http://www.microsoft.com/typography/otspec/default.htm" target="_blank">OpenType</a> offered the possibility of making the character codes <em>look better</em>, both in body text and in tables.</p>
<p>I am grateful that my esteemed colleague, Miguel Sousa, created special (and private) versions of specific faces of the Adobe Originals <a href="http://store1.adobe.com/cfusion/store/html/index.cfm?store=OLS-US&#038;event=displayFontPackage&#038;code=1778" target="_blank">Myriad Pro</a> and <a href="http://store1.adobe.com/cfusion/store/html/index.cfm?store=OLS-US&#038;event=displayFontPackage&#038;code=1776" target="_blank">Minion Pro</a> that included, as alternate glyphs, tabular versions of the following characters: &#8216;a&#8217; through &#8216;f&#8217;, &#8216;x&#8217;, &#8216;A&#8217; through &#8216;G&#8217;, &#8216;U&#8217;, &#8216;V&#8217;, &#8216;X&#8217;, and &#8216;+&#8217;. These glyphs were made accessible through the use of the &#8216;<a href="http://www.microsoft.com/typography/otspec/features_pt.htm#ssxx">ss03</a>&#8216; (<em>Stylistic Set 3</em>) GSUB feature. When I set up the <a href="http://www.adobe.com/products/indesign.html" target="_blank">Adobe InDesign</a> character styles for character codes, for use in body text and in tables, I was able to specify this particular OpenType feature. I also specified the &#8216;<a href="http://www.microsoft.com/typography/otspec/features_uz.htm#zero">zero</a>&#8216; (<em>Slashed Zero</em>) GSUB feature so that all instances of 0 (zero) were slashed to better distinguish them.</p>
<p>In terms of design, Miguel not only needed to make the glyphs tabular, to match the widths of the (default) tabular figures, he also needed to make the uppercase height match that of the figures.</p>
<p>Below is an excerpt from the book, specifically Table 4-3 that appears on page 198 in Chapter 4 (<em>Encoding Method</em>), which shows the tabular Myriad Pro glyphs being used:</p>
<p><img src="http://blogs.adobe.com/CCJKType/files/2012/05/tabular-hexadecimal.jpg" alt="" /></p>
<p>Below is another excerpt from the book, from page xxviii of the <em>Preface</em>, which shows the tabular Minion Pro glyphs being used:</p>
<p><img src="http://blogs.adobe.com/CCJKType/files/2012/05/tabular-hexadecimal-text.jpg" alt="" /></p>
<p>I continue to use these private versions of Myriad Pro and Minion Pro for other document-writing purposes.</p>
]]></content:encoded>
			<wfw:commentRss>http://blogs.adobe.com/CCJKType/2012/05/making-character-codes-look-better.html/feed</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Never Say Never</title>
		<link>http://blogs.adobe.com/CCJKType/2012/04/never-say-never.html</link>
		<comments>http://blogs.adobe.com/CCJKType/2012/04/never-say-never.html#comments</comments>
		<pubDate>Mon, 30 Apr 2012 21:40:48 +0000</pubDate>
		<dc:creator>Dr. Ken Lunde</dc:creator>
				<category><![CDATA[Essay]]></category>

		<guid isPermaLink="false">http://blogs.adobe.com/CCJKType/?p=1562</guid>
		<description><![CDATA[In the realm of CJK Unified Ideographs, there is always talk about no more characters to encode, or that any new characters are simply unifiable variants. This is, in large part, merely wishful thinking. In my experience, there are three important words to embrace: Never Say Never. While perusing IWATA Corporations&#8217;s website, I came across [...]]]></description>
			<content:encoded><![CDATA[<p>In the realm of <a href="http://en.wikipedia.org/wiki/CJK_Unified_Ideographs" target="_blank">CJK Unified Ideographs</a>, there is always talk about no more characters to encode, or that any new characters are simply unifiable variants. This is, in large part, merely wishful thinking.</p>
<p>In my experience, there are three important words to embrace: <strong>Never Say Never</strong>.<br />
<span id="more-1562"></span><br />
While perusing IWATA Corporations&#8217;s <a href="http://www.iwatafont.co.jp/" target="_blank">website</a>, I came across the <a href="http://www.iwatafont.co.jp/press/kakucyo.html" target="_blank">page</a> about their extension to Kyodo News&#8217; U-PRESS character set, which included a convenient <a href="http://www.iwatafont.co.jp/press/img/kakutyoSET.pdf" target="_blank">PDF</a>. I checked all of the characters, mainly to establish as many mappings to <a href="http://wwwimages.adobe.com/www.adobe.com/content/dam/Adobe/en/devnet/font/pdfs/5078.Adobe-Japan1-6.pdf" target="_blank">Adobe-Japan1-6</a> as possible, and found that 8 of the kanji were not in Unicode, and this effort involved checking the latest version of <a href="http://appsrv.cse.cuhk.edu.hk/~irg/irg/irg38/IRG38.htm" target="_blank">Extension E</a> (aka IRG N1830), which is soon to become standardized. The image below highlights in yellow the 8 kanji that are not yet in Unicode:</p>
<p><img src="http://blogs.adobe.com/CCJKType/files/2012/04/unicode-no.jpg" alt="" /></p>
<p>What this demonstrates is simply that CJK Unified Ideographs are genuinely an open-ended script, and that there is always a possibility that new characters will be coined or discovered.</p>
]]></content:encoded>
			<wfw:commentRss>http://blogs.adobe.com/CCJKType/2012/04/never-say-never.html/feed</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>Kazuraki Poster Redux</title>
		<link>http://blogs.adobe.com/CCJKType/2012/04/kazuraki-poster-redux.html</link>
		<comments>http://blogs.adobe.com/CCJKType/2012/04/kazuraki-poster-redux.html#comments</comments>
		<pubDate>Fri, 27 Apr 2012 16:56:17 +0000</pubDate>
		<dc:creator>Dr. Ken Lunde</dc:creator>
				<category><![CDATA[Type]]></category>

		<guid isPermaLink="false">http://blogs.adobe.com/CCJKType/?p=1543</guid>
		<description><![CDATA[Almost three years ago, in a September 2009 article on the sister blog, Typblography, we showed a poster for our Kazuraki (かづらき) typeface, which was designed by Ryoko Nishizuka (西塚涼子), who was also its typeface designer. A request came in today for a PDF version of the poster, and instead of posting it into that [...]]]></description>
			<content:encoded><![CDATA[<p>Almost three years ago, in a <a href="http://blogs.adobe.com/typblography/2009/09/kazuraki_poster.html" target="_blank">September 2009 article</a> on the sister blog, <a href="http://blogs.adobe.com/typblography/" target="_blank">Typblography</a>, we showed a <a href="http://blogs.adobe.com/typblography/kazuraki-800x1200.jpg" target="_blank">poster</a> for our <a href="http://store1.adobe.com/cfusion/store/html/index.cfm?store=OLS-US&#038;event=displayFont&#038;code=KOZK10005000" target="_blank">Kazuraki</a> (<span lang="ja-jp">かづらき</span>) typeface, which was designed by <a href="http://store1.adobe.com/cfusion/store/html/index.cfm?store=OLS-US&#038;event=displayDesignerInfo&#038;code=NISH" target="_blank">Ryoko Nishizuka</a> (<span lang="ja-jp">西塚涼子</span>), who was also its typeface designer. A request came in today for a PDF version of the poster, and instead of posting it into that relatively old (and now buried) article where it would not likely be noticed, I figured that it&#8217;d be best to post it here, today.</p>
<p>Click &#x261E; <a href="http://blogs.adobe.com/CCJKType/files/2012/04/KazurakiPosterA3-small.pdf" target="_blank"><strong>here</strong></a> &#x261C; to get the PDF version of the Kazuraki poster.</p>
<p>Enjoy! And for those in Japan, have a safe and enjoyable Golden Week!</p>
]]></content:encoded>
			<wfw:commentRss>http://blogs.adobe.com/CCJKType/2012/04/kazuraki-poster-redux.html/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Know Your Documentation</title>
		<link>http://blogs.adobe.com/CCJKType/2012/04/know-your-documentation.html</link>
		<comments>http://blogs.adobe.com/CCJKType/2012/04/know-your-documentation.html#comments</comments>
		<pubDate>Thu, 26 Apr 2012 17:27:04 +0000</pubDate>
		<dc:creator>Dr. Ken Lunde</dc:creator>
				<category><![CDATA[Building Font]]></category>

		<guid isPermaLink="false">http://blogs.adobe.com/CCJKType/?p=1528</guid>
		<description><![CDATA[For those who develop fonts&#8212;professionally or otherwise&#8212;it is prudent to know where the latest and greatest documentation is located. This is useful not only when searching for specific documentation, but also to check whether there are any updates to existing documentation. The right-side navigation bar of this blog&#8217;s landing page includes links to relevant documentation [...]]]></description>
			<content:encoded><![CDATA[<p>For those who develop fonts&mdash;professionally or otherwise&mdash;it is prudent to know where the latest and greatest documentation is located. This is useful not only when searching for specific documentation, but also to check whether there are any updates to existing documentation.</p>
<p>The right-side navigation bar of this blog&#8217;s <a href="http://blogs.adobe.com/CCJKType/" target="_blank">landing page</a> includes links to relevant documentation and resource pages, such as for <a href="http://www.adobe.com/devnet/font.html" target="_blank">font-related Adobe Technical Notes</a>, the <a href="http://www.microsoft.com/typography/otspec/" target="_blank">OpenType Specification</a> (hosted on Microsoft&#8217;s website), and even <a href="http://www.adobe.com/devnet/opentype/afdko.html" target="_blank">AFDKO</a> (<em>Adobe Font Development Kit for OpenType</em>). Also, don&#8217;t forget about the excellent font developer resources offered by <a href="https://developer.apple.com/fonts/" target="_blank">Apple</a> (<em>Fonts</em>), <a href="http://www.fontlab.com/" target="_blank">FontLab</a>, and <a href="http://www.microsoft.com/typography/default.mspx" target="_blank">Microsoft</a> (<em>Microsoft Typography</em>).</p>
]]></content:encoded>
			<wfw:commentRss>http://blogs.adobe.com/CCJKType/2012/04/know-your-documentation.html/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>The AFDKO &#8216;tx&#8217; Tool</title>
		<link>http://blogs.adobe.com/CCJKType/2012/04/tx.html</link>
		<comments>http://blogs.adobe.com/CCJKType/2012/04/tx.html#comments</comments>
		<pubDate>Thu, 26 Apr 2012 00:51:16 +0000</pubDate>
		<dc:creator>Dr. Ken Lunde</dc:creator>
				<category><![CDATA[Building Font]]></category>
		<category><![CDATA[Essay]]></category>

		<guid isPermaLink="false">http://blogs.adobe.com/CCJKType/?p=1508</guid>
		<description><![CDATA[Among the many excellent and powerful tools included in AFDKO (Adobe Font Development Kit for OpenType) is one with a two-letter name: tx. Although it has the shortest name, it is arguably one of the most powerful AFDKO tools. The tx tool is best thought of as a multi-purpose font-file&#8211;manipulation tool. For those who don&#8217;t [...]]]></description>
			<content:encoded><![CDATA[<p>Among the many excellent and powerful tools included in <a href="http://www.adobe.com/devnet/opentype/afdko.html" target="_blank">AFDKO</a> (<em>Adobe Font Development Kit for OpenType</em>) is one with a two-letter name: <em>tx</em>. Although it has the shortest name, it is arguably one of the most powerful AFDKO tools.</p>
<p>The <em>tx</em> tool is best thought of as a multi-purpose font-file&ndash;manipulation tool. For those who don&#8217;t leverage this tool in the font development activities, I strongly encourage you to explore its capabilities, which is best done by perusing its built-in help and through experimentation.<br />
<span id="more-1508"></span><br />
Executing this tool using the &#8220;-u&#8221; or &#8220;-h&#8221; command-line options provides basic help and usage information, but specifying the &#8220;-h&#8221; command-line option after a command-line option that invokes a particular mode, such as &#8220;-t1&#8243; or &#8220;-cff,&#8221; additional mode-specific documentation is displayed. For example, consider the following command line:</p>
<p><code>% tx -t1 -h</code></p>
<p>The <a href="http://blogs.adobe.com/CCJKType/files/2012/03/subr-check.pl" target="_blank"><em>subr-check.pl</em></a> and <a href="http://blogs.adobe.com/CCJKType/files/2012/03/fdarray-check.pl" target="_blank"><em>fdarray-check.pl</em></a> tools that were introduced in the <a href="http://blogs.adobe.com/CCJKType/2012/02/subroutinization.html" target="_blank">February 3, 2012</a> and <a href="http://blogs.adobe.com/CCJKType/2012/02/enumerating-fdarrays-and-cids.html" target="_blank">February 29, 2012</a> CJK Type Blog articles, respectively, are driven by the <em>tx</em> tool.</p>
<p>If no mode is specified, the text-dump mode is assumed, which can be explicitly specified with the &#8220;-dump&#8221; command-line option. This mode is useful to view high-level font information, private dictionary information, glyph names, and even human-readable charstrings. What information is presented depends on the level that is specified, which can range from &#8220;-0&#8243; through &#8220;-6&#8243; (&#8220;-1&#8243; is the default if no level is specified). For example, I can display the human-readable charstring for CID+1200 by using the following command line:</p>
<p><code>% tx -6 -g 1200 KozGoPr6N-Bold.otf<br />
tx: --- KozGoPr6N-Bold.otf<br />
## glyph[tag] {cid,iFD,LanguageGroup,path}<br />
glyph[1200] {1200,11,1,<br />
&nbsp;&nbsp;1000 width<br />
&nbsp;&nbsp;324 455 hstem<br />
&nbsp;&nbsp;36 966 vstem<br />
&nbsp;&nbsp;36 324 move<br />
&nbsp;&nbsp;966 324 line<br />
&nbsp;&nbsp;966 455 line<br />
&nbsp;&nbsp;36 455 line<br />
&nbsp;&nbsp;endchar}</code></p>
<p>The &#8220;-pdf&#8221; command-line option invokes PDF mode, which is most useful for producing a glyph synopsis in the form of a PDF file. If the &#8220;-g&#8221; command-line option is not specified, all of the glyphs in the specified font file are included in the glyph synopsis. The &#8220;-g&#8221; command-line option can be used to specify specific glyphs or glyph ranges, such as the following, which produces a glyph synopsis that includes all of the kanji glyphs in the Adobe-Japan1-6 ROS <em>KozGoPr6N-Heavy.otf</em> OpenType/CFF font:</p>
<p><code>% tx -pdf -g 656,1125-7477,7633-7886,7961-8004,8266,8267,8284,8285,8359-8717,13320-15443,16779-20316,21071-23057 KozGoPr6N-Heavy.otf glyphs.pdf</code></p>
<p>The &#8220;-afm&#8221; command-line option invokes AFM mode, and produces an AFM file.</p>
<p>The &#8220;-t1&#8243; command-line option deserves some explanation, because it supports both name- and CID-keyed fonts. It is possible to convert a CID-keyed font into a name-keyed font by additionally specifying the &#8220;-decid&#8221; command-line option. If the specified glyphs span more than one FDArray element, the &#8220;-usefd&#8221; command-line option, along with an FDArray index as its argument, must also be specified. The resulting Type 1 name-keyed font includes glyphs that are named according to their CID, specifically &#8220;cid&#8221; followed by one to five decimal digits, such as &#8220;cid1200&#8243; for CID+1200.</p>
<p>The <em>tx</em> tool can operate on a wide variety of font files, such as Type 1 fonts, CIDFont resources, CFFs, OpenType/CFF fonts (it references only the &#8216;<a href="http://www.microsoft.com/typography/otspec/cff.htm" target="_blank">CFF</a>&#8216; table), and even TrueType fonts.</p>
<p>I could go on and on&hellip;</p>
<p>If you are an avid AFDKO user, and even if you are not, and have not yet explored the capabilities of the <em>tx</em> tool, I suggest that you take the opportunity to do so. You are likely to find a capability that can save you a lot of time and effort. What I have described above is merely the tip of the proverbial iceberg.</p>
]]></content:encoded>
			<wfw:commentRss>http://blogs.adobe.com/CCJKType/2012/04/tx.html/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>The All-Important Macron</title>
		<link>http://blogs.adobe.com/CCJKType/2012/04/macron.html</link>
		<comments>http://blogs.adobe.com/CCJKType/2012/04/macron.html#comments</comments>
		<pubDate>Tue, 24 Apr 2012 17:05:01 +0000</pubDate>
		<dc:creator>Dr. Ken Lunde</dc:creator>
				<category><![CDATA[Essay]]></category>

		<guid isPermaLink="false">http://blogs.adobe.com/CCJKType/?p=1491</guid>
		<description><![CDATA[When transliterating Japanese text using Latin characters, there are three systems or methods for doing so. Of these, the Hepburn system (ヘボン式 hebon shiki) is the most commonly used one, and differs in one important way: long vowels are represented with a macron (U+00AF MACRON or U+0304 COMBINING MACRON) diacritic. Almost all signage in Japan [...]]]></description>
			<content:encoded><![CDATA[<p>When transliterating Japanese text using Latin characters, there are three systems or methods for doing so. Of these, the <a href="http://en.wikipedia.org/wiki/Hepburn_system" target="_blank">Hepburn system</a> (<span lang="ja-jp">ヘボン式</span> <em>hebon shiki</em>) is the most commonly used one, and differs in one important way: <em>long vowels are represented with a <a href="http://en.wikipedia.org/wiki/Macron" target="_blank">macron</a> (U+00AF MACRON or U+0304 COMBINING MACRON) diacritic</em>. Almost all signage in Japan that includes transliterated text, such as in train and subway stations, uses the Hepburn system. However, if we look back to the 1990s and earlier, it was not common to include glyphs for macroned vowels in fonts, whether they were for Latin or Japanese use.</p>
<p>The two other systems, the <a href="http://en.wikipedia.org/wiki/Kunrei_system" target="_blank">Kunrei system</a> (<span lang="ja-jp">訓令式</span> <em>kunrei shiki</em>) and the <a href="http://en.wikipedia.org/wiki/Nihon-shiki_romanization" target="_blank">Nippon system</a> (<span lang="ja-jp">日本式</span> <em>nippon shiki</em>), represent long vowels with a <a href="http://en.wikipedia.org/wiki/Circumflex" target="_blank">circumflex</a> (U+005E CIRCUMFLEX ACCENT or U+0302 COMBINING CIRCUMFLEX ACCENT) diacritic. It was common for Latin fonts to include glyphs for circumflexed vowels, meaning U+00C2/U+00E2 (Ââ), U+00CA/U+00EA (Êê), U+00CE/U+00EE (Îî), U+00D4/U+00F4 (Ôô), and U+00DB/U+00FB (Ûû), by virtue of being included in <a href="http://en.wikipedia.org/wiki/ISO/IEC_8859-1" target="_blank">ISO/IEC 8859-1</a> (aka <em>Latin 1</em>). However, due to limitations of Shift-JIS encoding, even Japanese fonts did not include glyphs for these characters.<br />
<span id="more-1491"></span><br />
I can think of three specific things that paved the way to broader use of macroned vowels:</p>
<p>First and foremost, Unicode and its <em>de facto</em> status for representing digital text was a key factor, and laid the foundation. These characters are encoded at U+0100/U+0101 (Āā), U+0112/U+0113 (Ēē), U+012A/U+012B (Īī), U+014C/U+014D (Ōō), and U+016A/U+016B (Ūū).</p>
<p>Second, to enable macroned vowels in mainstream Japanese fonts, a standard glyph set needed to include their glyphs. When I was developing <a href="http://wwwimages.adobe.com/www.adobe.com/content/dam/Adobe/en/devnet/font/pdfs/5078.Adobe-Japan1-6.pdf" target="_blank">Adobe-Japan1-4</a> in the late 1990s, glyphs for macroned vowels were early candidates, and eventually became CIDs 9361 through 9370. Of course, they are encoded according to Unicode in the Unicode <a href="http://sourceforge.net/adobe/cmap/" target="_blank">CMap resources</a>.</p>
<p>Third, mainstream Latin fonts began including glyphs for macroned vowels, mainly thanks to Unicode, and OpenType&#8217;s excellent support for Unicode. In terms of Adobe&#8217;s glyph sets, glyphs for macroned vowels are included in fonts that support <a href="http://blogs.adobe.com/typblography/latin_charsets/Adobe_Latin_3.html" target="_blank">Adobe Latin 3</a> (aka <em>Adobe CE</em>) or better.</p>
<p>Now, thanks to these efforts, it is relatively easy to transliterate <span lang="ja-jp">東京</span> using the more common <em>Tōkyō</em>, as opposed to the less common <em>Tôkyô</em>. The difference is shown below at a larger size:</p>
<p><H1>Tōkyō versus Tôkyô</H1></p>
]]></content:encoded>
			<wfw:commentRss>http://blogs.adobe.com/CCJKType/2012/04/macron.html/feed</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>ISO/IEC 14496-28:2012 Published</title>
		<link>http://blogs.adobe.com/CCJKType/2012/04/cfr.html</link>
		<comments>http://blogs.adobe.com/CCJKType/2012/04/cfr.html#comments</comments>
		<pubDate>Fri, 20 Apr 2012 12:49:46 +0000</pubDate>
		<dc:creator>Dr. Ken Lunde</dc:creator>
				<category><![CDATA[Essay]]></category>

		<guid isPermaLink="false">http://blogs.adobe.com/CCJKType/?p=1476</guid>
		<description><![CDATA[Born from the conclusion that OpenType&#8217;s 64K glyph barrier cannot be broken in the context of the format itself, ISO/IEC 14496-28:2012 (Composite Font Representation) was developed, and was subsequently published three days ago, on April 17, 2012, as a new ISO standard. As described in the January 26, 2012 CJK Type Blog article, CID-keyed fonts [...]]]></description>
			<content:encoded><![CDATA[<p>Born from the conclusion that OpenType&#8217;s 64K glyph barrier cannot be broken in the context of the format itself, <a href="http://www.iso.org/iso/iso_catalogue/catalogue_tc/catalogue_detail.htm?csnumber=59240" target="_blank">ISO/IEC 14496-28:2012</a> (<em>Composite Font Representation</em>) was developed, and was subsequently published three days ago, on April 17, 2012, as a new ISO standard. As described in the <a href="http://blogs.adobe.com/CCJKType/2012/01/extreme-otf.html" target="_blank">January 26, 2012 CJK Type Blog article</a>, CID-keyed fonts can include a maximum of 65,535 glyphs (CIDs 0 through 65534). Considering that <a href="http://www.unicode.org/versions/Unicode6.1.0/" target="_blank">Unicode Version 6.1</a> includes over 100K characters, with approximately 75K of which being <a href="http://en.wikipedia.org/wiki/CJK_Unified_Ideographs" target="_blank">CJK Unified Ideographs</a>, it becomes immediately apparent that a single font resource cannot support all of Unicode, let alone all of the characters for a single script (referring to CJK Unified Ideographs).<br />
<span id="more-1476"></span><br />
ISO/IEC 14496-28:2012 defines an XML format for representing composite fonts and fallback fonts, both of which can be implemented by defining a CFR (<em>Composite Font Representation</em>) object.</p>
<p><em>Composite fonts</em> are generally used for authoring purpose, either to (virtually) extend a font resource with additional component fonts, or to (virtually) replace existing glyphs with those that are referenced in a different font resource. In the context of Japanese, it is common for legacy composite font implementations to replace the glyphs for kana with those of a different font. This is based on the premise that typical Japanese text includes more kana than kanji, and changing the kana can significantly alter the <em>look and feel</em> of the text.</p>
<p><em>Fallback fonts</em> are used to display glyphs for a maximum number of code points, by selecting among various font resources, either by simple terms, such as a prioritized list, or more carefully by specifying fonts for particular code point ranges. Web browsers are ideal consumers of fallback fonts. Fallback fonts generally have less fidelity than composite fonts. Then again, the scope of composite fonts is generally more narrow, and composite fonts can serve as one of the component fonts in a fallback font.</p>
<p>Both composite fonts and fallback fonts have the potential to reference more than 64K due to the referencing of multiple font resources, each of which is limited to 64K glyphs.</p>
<p>Of course, publishing the ISO standard represents only the first step. The next steps involve establishing support for CFR in various places, ideally to replace existing composite font and fallback font implementations, almost all of which are proprietary. CFR has great potential in the cloud and for social media, mainly because it represents an interchange format for composite fonts and fallback fonts.</p>
<p>Although the text of this standard cannot be provided here, due to obvious copyright reasons, I can provide its <a href="http://blogs.adobe.com/CCJKType/files/2012/04/iso14496-28-dtd.txt" target="_blank">DTD</a> for those who are interested in examining it. For those who use Mac OS X, I would like to point out that CFR is largely based on Apple&#8217;s <em>SplicedFont</em> format, whose DTD can be found in <em>/System/Library/DTDs/SplicedFont.dtd</em> on the filesystem of that particular OS.</p>
]]></content:encoded>
			<wfw:commentRss>http://blogs.adobe.com/CCJKType/2012/04/cfr.html/feed</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
	</channel>
</rss>

