InDesign CS6…. Welcome to India!

This article talks about the overall objective of Localization in a new market or in business terms an “Emerging Market”. You might wonder, “why that specific word Emerging?” Because of the business opportunity it presents by taking a product to a new market where the demand exists, but somehow the product was not made available.

In the publishing domain, India is still one of the few countries where Print has seen a steady growth. Excerpts from one of the famous research site below:

“Contrary to most other markets in the world that continue to witness an erosion of the print media industry, in India, the sector witnessed a growth of ten percent in 2010 and is expected to continue to grow at a similar pace over the next five years. Rising literacy levels and low print media penetration offer significant headroom for growth, says a FICCI-KPMG report, recently released at FICCI FRAMES 2011 event…………”[Source All About Newspaper, publish date March`2011]

Does this present an opportunity for Adobe to expand in the Print Media space leveraging its one of the most popular Desktop publishing software InDesign®. Yes, but at what cost? Let’s weigh in the cost and benefits.

  1. Over the course of last few years, Adobe India sales force has been meeting Indian customers to understand how InDesign can be made ‘India ready’.
  2. In India, English is quite close to as being the second most spoken language just behind Hindi, giving a leeway to probably still hit the market with an English user interface (UI).
  3. The most talked about area in the frequent customer meetings was the support of Indic scripts in Print and Desktop Publishing Adobe applications. The current World-Ready composers for middle-eastern text included partial support for several Indic scripts. However, a number of bug fixes and product support requirements were needed for Adobe to officially certify and launch the product in India.

The specifics listed above did carve a path for InDesign to see support for Indic scripts in CS6 release. Based on input from the Product Management, the following 10 Indic scripts ranked highest on the priority list to support:

Each of the locales above have a good percentage of Print Media in the Indian market ranging from Newspaper, Magazines, Journals, etc. To support these locales was a tough road ahead since most of these locales use complex character combination, glyphs, hyphenation rules, dictionary support.

Phase 1 of this project included adding dictionary support in InDesign for these locales. We integrated the locale-specific open source dictionaries, evaluated them against competing products (with similar support) spanning a series of script specific test data hand-picked by linguists. The test criteria being:

  • Test maturity and quality of the dictionaries embedded
  • Misspell words intentionally and compare the corrected words
  • Ensure the words in InDesign when copied maintain their sanctity
  • Validating a few language rules, as applicable, such as hyphenation, matras, spellings, etc

Dictionary evaluation did show quite impressive results, allowing us to move to second phase of this endeavor of analyzing InDesign for Indic scripts. After a significant number of complex workflows, a few engineering tweaks along the way, we were able to achieve what we set our eyes at initially.

  • Added dictionaries and spell checkers for the 10 scripts
  • Added Hyphenation for the 10 scripts
  • Bundled 1 Indic font family: Adobe Devanagari
  • Included a script that users can run to set relevant defaults and correctly handle imports from Word docs etc.

Even though we started off this effort as a seed project, codenamed as InDesign Indic 1.0, we were able to achieve more than we shot for. InDesign proved not just compatible for the majority of the locales listed above but offered notable support for even the most complex glyphs.

Switch to the World-Ready Composer, an alternate composition engine, with a single click of indicPreferences.js in Window > Utilities > Scripts panel to explore the Indic world in InDesign. By virtue of basic Indic script support in InDesign CS6, you can now type in these languages and characters would shape and render correctly. And yes, there will be more refinements to the Indic Script support in future releases to come.

Let us know what you think and how you plan to use these features. Please visit here for the complete list of Language support in InDesign CS6.

Contributed by Harpreet Singh (Adobe India)

Announcing PSE11 and PRE11 Programs for French and German Users

This article was originally written in English. Text in other languages is provided via machine translation.

Adobe Prerelease Programs are your chance to experience, evaluate and influence upcoming products & technologies from Adobe within a smaller, more focused user environment. Prerelease Programs facilitate a symbiotic development process allowing Adobe to share products in the development stage to gather early feedback. In the process you get a chance to shape the upcoming products and adapt to the new products faster.

Multiple engagement channels are available to Prerelease participants at Adobe:

  • Access to download Prerelease software/technologies and technical documentation
  • Ability to report bugs & request features for the Prerelease software
  • Access to the Prerelease user forums for sharing ideas directly with Adobe product teams and other likeminded folks of the product’s community
  • Opportunity to participate in various product-related surveys

A Prerelease program is an endeavour to engage the real users of the product – YOU – early in the development cycle of the product, to listen and learn from you on how the product is working for you.

Current Prerelease Opportunities: How to Apply?

You may fill in the application forms for expressing your interest to join a products’ Prerelease program at Adobe. The participation will be entirely based on the requirements of the program and the credentials of the participant.

Following products have been opened up prerelease testing opportunities with French and German builds:

Photoshop Elements 11 for French Users– Sign-up now to participate in the Adobe Photoshop Elements Localized program and preview exciting new functionality!Apply now

Photoshop Elements 11 for German Users – Sign up to participate in the Adobe Photoshop Elements Localized Program and preview exciting new functionality!Apply now

Premiere Elements 11 for French Users – Sign up to participate in the Adobe Premiere Elements 11 Localized Program and preview exciting new functionality!Apply now

Premiere Elements 11 for German Users – Sign up to participate in the Adobe Premiere Elements 11 Localized Program and preview exciting new functionality!Apply now

We look forward to your participation in this pre-release program. In case of any issues of if you need more information, please feel free to contact Manish Kanwal at mkanwal@adobe.com or Nimra Khan at nikhan@adobe.com.

 

Adobe launches first forum in Portuguese

This article was originally written in Portuguese. Text in other languages is provided via machine translation.

Thank you to everyone who participated in the survey that we published last week, on the idea of ​​creating a forum in Portuguese on Adobe Forums. The poll results are available here.

In response to the preferences of most of the 25 respondents, we established the first forum in Portuguese for one of our products (Illustrator):

http://forums.adobe.com/community/international_forums/portuguese/illustrator

If you are a user of Illustrator, -visit us!

Depending on the popularity of this forum, we may consider the creation of new forums for other products.

Leandro Reis
Senior Program Manager, Globalization
Adobe Systems

Adobe Moses Tools now available for Windows

This article was originally written in English. Text in other languages is provided via machine translation.

We have an update on the Adobe Moses Tools which we announced on this blog on May 11. The tools are now available in pre-built packages for Windows! Check out the download section of the M4Loc site to get the Windows packages and for documentation and other information about the tools.

Please download the tools and let us know what you think!

–Raymond Flournoy
Senior Program Manager
Translation Technology Team

Adobe Flash – Content Creation & Localization Guidelines

This article was originally written in English. Text in other languages is provided via machine translation.

1. Purpose of this document
This blog is an outcome of an extensive study undertaken to identify best practices in localization of Flash based content.

Localizing structured content through translation management tools (like SDL WorldServer) can be a simple process. However unstructured content like text appearing in Flash videos is not easy. To add to it, tracking the efficiency of the linguistics becomes all the more difficult for unstructured content.

The blog covers all aspect of Flash localization including creating the English content the right way; so that it becomes localization “friendly”. This should result in cost and time savings; which is multiplied by the number of target languages should yield sufficient dividends.

2. Extraction of Text from a Flash animation

The difficulty of Flash animation localization depends on the globalization level of source – usually English Flash animations. During the localization process, all localizable text is extracted from the source animation, replaced by localized versions and localized strings are modified according to the style guides- including choice of font size, family etc. Occasionally localizable text is not included in the text layers but presented as graphics. Therefore the source animation has to be fixed in the first place to have it ready for localization. As a first step of Flash animation localization is to extract all localizable text from the animation. This is possible by using commercial tools such as “Flash Localization Tool” or renaming source .fla file to the .zip archive (just change extension from .fla to .zip) and extracting all .xml files from “LIBRARY” subfolder of the .zip. See Figure 1 and Figure 2 for more details.


Figure 1 – list of Flash archive with LIBRARY subfolder

Figure 2 – list of text layers in .xml format stored in the LIBRARY subfolder of a Flash archive

Flash native format is a zip archive with defined structure. Having a baisc knowledge of it enables one to easily extract all text layers from the animation. Once done, the only task remains to locate the text within the extracted .xml files and provide the translators with .ini file (if the translation is to be completed by using Trados CAT). Text layers in .xml files can be found surrounded by <characters>Your text</characters> XML tags.
The XML stores the information about font size, family and style (besides storing the localizable text). Most of this information like font family can be easily modified for all text if source font does not support the target language alphabet. For more details about Flash XML file please see Figure 8.

Best Practice # 1:
Never modify texts within text layers with help of action script. It will be difficult to understand script.

3. Pseudo localization
“Pseudo localization” is a process to ensure the animation is ready for localization and all localizable texts is in the text layers only. During this process all strings currently contained in text layers (.xml files) are replaced by pre-defined string, e.g. by “X”. Original .xml files in the LIBRARY subfolder are replaced by modified ones and when this action is completed, render.swf should be played back in Flash player. By playing these renders it is possible to locate any remaining text which was not pseudo localized for various reasons.
Text originally present in the text layers is displayed as “X” in this render. In case there is still any readable text it usually comes from embedded video, graphics or an action script. One can find example of pseudo localization in Figure 3, Figure 4 and Figure 5.

Figure 3 – Example of pseudo localization. All strings except “What’s new” were changed to “X”. “What’s new” is a graphics in Flash animation and thus can’t be easily edited.

Figure 4 – Same screenshot as in Figure “3″ but now localized into Russian. As you can see also “What’s new” is correctly localized. In this case source was fixed and localizable graphical texts were updated to text layers.

Figure 5 – Another example of pseudo localization. Displayed asset is a screenshot from an application and the only editable text is in the upper right corner. In this specific case, it was decided NOT to fake it and leave it in English as the font size was too small

Best Practice # 2:
Apply Pseudo localization to find out any localizable text whichis not in text layers within Flash project. If your project has been globalized ever since the ideal way, pseudo localization can be skiped.

 

4. Text layers in Flash animation
There are many ways to insert text into Flash animation. For example, avoid inserting graphical text (letters as graphical objects) and work with text layers only. If this is the case, DO NOT break text into multiple strings, and keep it in one string only, unless specifically required. Any separation in multiple substrings will result intext segmentation inside .xml file and can lead to loss of context during translation and thus incorrect localization of it’s substring.
In Figure 6 and Figure 7, one can find text “Loyal customers like you save with special upgrade pricing. Adobe Photoshop Elements & Adobe Premiere Elements is now a perfect 10!”
This sentence is separated into two different text layers. Such separation will result into segmentation of strings in the .xml file as well and translator will see them as two separated strings without any relationship between them, see Figure 8.

Figure 6 – “Loyal customers like you save with special upgrade pricing. Adobe Photoshop Elements & Adobe Premiere Elements is now a perfect 10!” text separation into two substrings. On this figure, the first part is visible.

Figure 7 – Second part of text “Loyal customers like you save with special upgrade pricing. Adobe Photoshop Elements & Adobe Premiere Elements is now a perfect 10!”

Figure 8 – Representation of substrings in the .xml file. Every substring is enclosed in the <DOMStaticText> tag inside .xml file.

It is not recommended to split a string into shorter substrings but rather keep them in one text layer whenever possible. Advantages of this approach are:
1. Localized string is well ordered so that localized substrings do not need to be reviewed by native speaker after localization and postprocess.
2. String is not divided into shorter substrings inside .xml file.
Today’s version of Flash allows authors to keep strings with different font family, color, style, size etc. in one text layer. Additionally, there is no need to separate them into several text layers. Keeping source strings in one text layer helps translators and localization engineers to spend less time on translation and post-processing of localized strings in the animation. It also decreases the chance of issues related to word order. In Figure 9 you can find example of complex string (part of the text is in yellow with bigger font) in one text layer.

Figure 9 – Complex string in one text layer only. There is no need to put text in yellow into a separate text layer. With today’s Flash you can create such complex strings.

Best Pratice #3:
Keep the number of text layers within Flash project as low as possible. If any text formatting is required, do it only one text layer (as much as possible).

 

5. Embedded Videos
Many a times, Flash animations may contain embedded videos. These embedded videos can be easily found in Flash object library under “Embedded Video” type, please see Figure 10.
If there are any segments to be localized within embedded videos and only the render has been provided, it will be very difficult and time consuming to localize such strings. In special cases, when the video is carefully analyzed, some text can be “faked“ with small effort.
However, animated text (Figure 11) or static one with moving background is almost impossible to edit and in order to localize this type of source it is necessary to spend many hours to fake just one or two seconds of the final render. The source simply needs to be re-created which increases the effort manifold.
With editable video project file one can modify all text layers inside embedded video and prepare localized versions of those videos.

Figure 10 – Embedded video “soccer.flv” with animated localizable string

Figure 11 – Example of localizable text within the embedded video impossible to fake, text is animated

Best Practice #4:
Embedded videos with localizable texts within Flash project increase the time and effort for localization manifold.
The best way to do is extract all text from Video, localize it, replace all localizable texts and create localized render first. Without editable video file, the only (harrowing) option is to try to fake text within embedded video, which is usually impossible. So DO NOT embed text/assets in videos if you want to localize it. Provide assets and video separetely instead.

6. Screenshot Animations
One can also create an animation inside the Flash project from a series of screenshots. See example of screenshots series in Figure 12 and Figure 13. Text in the screenshots are localizable ones and a source .psd with editable text layer needs to be created initially. Once completed and the text is replaced with a localized version, screenshooting of the localized series can be carried out.
If you have source editable .psd available, please provide it to your localization engineer. Moreover, text effects used in the localized Flash will be identical with those in the source Flash.

Figure 12 – Screenshot series. You can see both preview of the screenshot and also sequence of screenshots in the Flash timeline

Figure 13 – End of the screenshot series in the Flash animation. Text “The Big …” is animated within the series and every screenshot adds one letter of the text into the animation.

Best Practice#5: For any localizable text within a graphics (graphical text), it is important to provide source English assets. If source editable asset is not provided, it has to be re-created by localization vendor first to be able to replace text by a localized version and create render for Flash project. During re-creation of editable source asset some effects can be overseen and hence missed in localized version.

 

7. Buttons & Active Layers
It is vital to keep all buttons with localizable texts as “Movie Clip” type in the Flash project and not to use mixture of different Flash types to simulate button functionality. If button functionality consists of more Flash types, it is very likely that during post processing, the engineers may unintentionally use incorrect alignment of these objects in relation to each other inside certain frame. As a result, this will lead to a functional impact on the flash file, and in some cases, rending it corrupt
An example of corrupted button is described in Figure 14. Here the button consists of graphical text in the yellow box and active layer (shown as a blue frame) which plays role of “clickable” area. You may observe that the box is not aligned with active layer and the left side of this button will not be clickable. Additionally, right side contains an overlapped area which is active which is not correct.

Figure 14 – Corrupted button “What’s new”. Active layer (blue frame) is not aligned with yellow box.

Creating buttons as “Movie clip” will avoid similar issues in Flash animations. Sometimes, modification of assigned action script is required as well.

Best Practice #5:
Active layers outside of button in Flash can lead into misalignment within localized version of Flash. It requires once to resize button and active layer objects with every change in the no. of text characters in the button. Buttons created correctly eliminate potential misalignment risks and decrease the number of button re-sizings by a significant number.

 

8. Flickering
In some Flash players, localized text can exhibit “Flickering“ effect i.e., text is moved by one or a few pixel to left, right, up or down when the cursor is hovered over it. See Figure 15 for more details.

Figure 15 – Flickering in detail – the image is composed from two screenshots of the same part re-captured after few seconds later to see one pixel position changes

The only way how to fix the „Flickering“ issue is to convert text layers within localized Flash project into curves and render into final format. Graphical text (curves) doesn’t show the “Flickering“ effect any more

Best Practice #6:
Flickering effect is visible in some players only. It is recommended to use Flash players without Flickering effect. Otherwise, localized text layers have to be converted into curves before rendering into final format. When multiplied by number of target languages, effort will turn out to be huge.

 

9. Acknowledgements/Credits

This document was prepared from an independent study done in Photoshop Elements 11 by Jakub Škrabal & Petr Knápek from Moravia, and Akulaa Agarwal & Manish Kanwal from Adobe.

Please note that it can be applied to any Adobe/Non Adobe projects requiring localization of text and other assets in Flash files. For any questions, please feel free to reach out to Akulaa Agarwal(akulaa@adobe.com) and Manish Kanwal (mkanwal@adobe.com).

Global publishing with Adobe Digital Publishing Suite

This article was originally written in English. Text in other languages is provided via machine translation.

Digital publishing is a global business

Digital publishing is catching on worldwide, as international catalog, magazine and book publishers are increasingly producing digital versions of their publications, in an growing number of languages.

Adobe Digital Publishing Suite (DPS) offers digital publishers with the ability to create multilingual content for the enjoyment of their international readers. This article provides some basic information about the creation of multilingual publications with DPS.

Content localization: region-specific vs language-specific

Region-specific publications carry the main brand (of the magazine, retailer, etc), but are customized to an individual region or country. In the case of magazines, articles are written by local authors, often covering topics and people of local significance.

Language-specific publications translated versions of a single source of content. The articles and authors are the same, the only thing that changes is the language of the content.

Presenting the translated content

With language-specific publications, there are a few different ways to present the translated content, which can impact layout decisions.

The most common type of presentation is single-language, where each language version of the publication is downloadable as a separate application.

Multilingual applications can contain two or more sets of translations of the original content. The translations can be presented through toggling or side-by-side.
With the toggling approach, readers can navigate between content written in different languages by pressing a ‘language switch’.

The article toggling effect provides a smooth user experience, but it does require additional work (i.e. scripting) behind the scenes to make it happen.

The side-by-side approach puts the translations next to each other, typically with different font types, sizes and colors.

Authoring content in different languages

At the core of the DPS workflow is Adobe InDesign, which allows text authoring in many languages. The latest version of the product (CS6) is available in 3 flavors providing distinct levels of language support:

  • InDesign CS 6.0 - Provides core typographical support for a wide range of languages, including those written in certain non-western scripts. It’s localized into English and 16 other European languages.
  • InDesign CS 6.0 ‘CCJK’ - In addition to the core set of typographical features, provides typographical, layout grid and frame grid features for editing East Asian text. It’s localized into Chinese Simplified, Chinese Traditional, Japanese and Korean.
  • InDesign CS 6.0 ‘ME’ - In addition to the core set of typographical features, this version provides full support for bi-directional languages such as Arabic, Hebrew, Farsi, and Urdu. Find out more about the Middle Eastern features here. The ME version is available in English and French user interfaces.

The linguistic capabilities of InDesign are well documented in the product’s help pages, and in articles written by various InDesign experts. Below are some language-specific topics that can assist in the creation of multilingual content in InDesign:

Creating localizable content

In multilingual digital publishing, a critical aspect of content authoring - regardless of the language it’s originally written in – is to ensure that it’s localizable, i.e, that it can be easily adapted into (an)other language(s). Below are a few guidelines for creating localizable content in InDesign:

  • Allow for text expansion - Word length varies considerably from one language to another. For example, German and Finnish sentences are on average longer than English. Also, Asian fonts require more vertical space than Latin fonts in order to render certain complex symbols clearly. Thus, it’s important to keep some buffer space around text so that translations can fit it nicely.
  • Apply styles – It’s critical that all text formatting is based on styles, as it ensures consistent formatting across all languages, and it allows for easily changing fonts for languages whose characters are not supported by the font of the source document.
  • Link images - Linked images are much easier to manage during translation
  • Connect text frames – This will ensure text will continue to flow nicely after it’s translated.

More guidelines on content localizability with InDesign will be provided in a future post.

Localizing the content

Localization of InDesign files is typically performed by professional translation agencies, who handle exported IDML (InDesign Markup Language) files in commercial translation management systems (TMS). Ben Cornelius’ article provides a good overview of this process.

Also, some vendors are starting to offer new and innovative ways to localize InDesign content, such as 1i0′s one2edit WYSIWYG tool.

But regardless of the way the content is localized, it’s very important that the work comprehensive: everything, including not only the article text, but also titles, captions, headers, footers, footnotes, and art, should be translated or adapted.

For maximum coverage, even media features, such as audio or video clips, should be subtitled and translated, or dubbed.

Below are some examples of locale-sensitive conventions – dates and times – that need to be adapted for each region.

Publishing the content: DPS multilingual options

The bulk of the process for publishing localized or multilingual content with DPS is not any different than English or single-language content, which is described here.

But, there are a few multilingual options available.

For publications written in bi-directional languages such as Arabic, Farsi, Urdu and Hebrew, readers expect to be able to swipe pages by moving their finger from left to right (ie in the opposite direction than in an English publication), thus right edge binding is necessary.

To do activate this feature, in the Digital Publishing Suite, select Right Edge Binding in the Folio Producer page.

You can also set this in In InDesign, by selecting the Right Edge Binding checkbox in the Folio Properties dialog.

3eesho is a fine example of a bi-directional publication created with DPS.

Language tagging

Tagging your publication with language information will allow it to be searched by language from e-stores. This can be done during the building of your viewer application, in the Viewer Builder:

Localized versions and availability

The Digital Publishing Suite user interface is currently localized into English (UK, US), French, German, Italian, Spanish, and Japanese.

DPS Single Edition is available in the U.S., Canada, and Mexico. Availability is expected this year in Australia, Austria, Belgium, Bulgaria, Cyprus, Czech Republic, Denmark, Estonia, Finland, France, Germany, Greece, Hungary, Ireland, Italy, Japan, Latvia, Lithuania, Luxembourg, Malta, New Zealand, Norway, Poland, Portugal, Romania, Slovakia, Slovenia, South Africa Netherlands, Spain, Sweden, Switzerland, and the United Kingdom.

Examples of multilingual publications created with Adobe Digital Publishing Suite

Check out many examples of multilingual digital publications created with Adobe Digital Publishing Suite by visiting the Digital Publishing gallery.

AMT – Adobe Moses Tools Now Available

This article was originally written in English. Text in other languages is provided via machine translation.

I’d like to announce that AMT (Adobe Moses Tools) are at last open sourced and available for download.

All code and documentation has been submitted to M4LOC Moses for Localization.

http://code.google.com/p/m4loc/

Prepackaged DMGs for OSX users can be found here:

http://code.google.com/p/m4loc/downloads/list

Source code for those interested in building on the tools and improving them can be accessed here:

http://code.google.com/p/m4loc/source/checkout

I would be psyched to hear from folks who download and use the tools and are looking to improve them. It’s been a long road from jumping into the deep end with Moses until where we are today. There’s still plenty to get done. Let’s work on building that together.

Cheers!

Jeff

Adobe Creative Cloud launched in 8 languages

This article was originally written in English. Text in other languages is provided via machine translation.

After much anticipation, Adobe Creative Cloud has launched!

http://creative.adobe.com

Creative Cloud is a membership service which provides online services for file sharing, collaboration, and publishing, as well as access to every Adobe Creative Suite 6 application.

Language availability

The Creative Cloud website itself is available in 8 languages: Dutch, English, French, German, Italian, Japanese, Spanish, and Swedish.

The Creative Cloud membership is available through the Adobe Store in: Australia, Austria, Belgium, Brazil, Bulgaria, Canada, Cyprus, Czech Republic, Denmark, Estonia, Finland, France, Germany, Greece, Hungary, Ireland, Italy, Japan, Latvia, Lithuania, Luxemburg, Malta, Mexico, Netherlands, New Zealand, Norway, Poland, Portugal, Romania, Slovakia, Slovenia, Spain, Sweden, Switzerland, UK, and the US.

Creative Cloud members can download and install CS6 applications in any language in which the products are available. Unlike owning the traditional licensed version of a Creative Suite product, Creative Cloud membership allows you to select from multiple languages. For a complete list of languages in which CS6 applications are available, go here.

To learn more about the Creative Cloud

You can learn more about the Creative Cloud by watching 10 videos explaining common questions from installing apps, to sharing and moving files, etc. in the Creative Cloud YouTube channel (English only) and on Adobe TV (English only).

Also, you can learn more about what’s going on with the Creative Cloud by visiting the Creative Cloud Team blog (English only).

 

 

Adobe gave a presentation about Moses Tool Set on TAUS Asia Translation Summit 2012

This article was originally written in English. Text in other languages is provided via machine translation.

TAUS Asia Translation Summit 2012 was organized by Translation Automation User Society (TAUS) in cooperation with China Center for Information Industry Development (CCID) and Translators Association of China (TAC). 80+ attendees from both product companies such as Adobe, Baidu, EMC, Google and Microsoft and LSPs participated in the summit held in Beijing on April 24 – 25, 2012, as well as the complimentary half day event TAUS Open Source Machine Translation Showcase held in the same venue on April 23. The summit provides attendees an excellent platform to share knowledge and experience in MT domain.

TAUS_2012_Beijing_PresentationI was invited by TAUS to give audience an introduction of what Adobe has done on open source MT. In my presentation, I shared how Adobe makes use of the open source MT tool Moses in its localization workflow. We developed a set of tools called Moses Tool Set to simplify the usage of Moses. By using this tool, the training process of Moses can be done in an easier and intuitive way. It consists of 4 features: Corpus Clean Tool, Corpus Splitting Tool, Moses Training Harness, and Moses Scoring Harness. Each feature can not only work independently but be combined into a job which enables users to complete the whole training process in one click.

Many audience especially those from LSPs that just started their adventure of open source MT showed strong interest on the Moses Tool Set. It’s happy to see that seeking for ways to improve localization productivity is no more the responsibility only for the language service buyers. Some LSPs have also started their attempts in MT field. Moses is a good option for them because of its lower entrance cost. In the offline discussion, however, I got a lot of complaints from these potential Moses users about usage of Moses. For those who don’t dive deeply into statistical machine translation, Moses is too complicated to start with. Many parameters are required to generate a trained MT engine. Lack of a friendly user interface is another headache for them. No wonder the very first thing audience eager to know is where they can find and download Moses Tool Set.

Actually, Moses Tool Set is an open source project. Both its installer packages and source codes are available in Google Code.

Adobe participates in Colloquium “The Portuguese language on the Internet and in the Digital World” by IILP

This article was originally written in Portuguese. Text in other languages is provided via machine translation.

The Portuguese language on the Internet and in the Digital World sponsored by the International Institute of the Portuguese Language Instituto Internacional da Língua Portuguesa - IILP) (23-26 April 2012).