Tag Archives: translations

2009 would be the Year of …

Og Maciel writes about the possibility of 2009 being the Year of Translations. With the coming-out of awesome tools like Transifex, Damned Lies, Vertimus etc, it sure feels good to be even marginally involved in the process of translations.

Infrastructural pieces coming together ensure that a translation workflow that appeals to all, is easy for the end-user can be put in place with much ease. And, it would also mean a disruptive playing field for startups like Indifex. Making wide open spaces for innovation in translation workflow and infrastructure is an area that is bound to be welcomed by the folks who spend countless hours making applications, desktops and operating systems available in their local languages/locales. They don’t get appreciated often. They get recognized during release times in release notes and the like, but they do keep the engines running and the lights on. This is going to be their year.

I would venture so far as to state that in a trend of “2009 would be the Year of <insert_your_favorite_prediction>” it would be a Year of Content. Free and Open content un-encumbered by restrictive rights and legalese that would be re-distributable, would be informative, would be educational and would be able to bring about a change. Over a period of the last 24 months, methods and tools that enable content creation on Linux desktops have simplified. Especially when it comes to Indian languages. So, there are fonts available (some of them quite elegant), there are keyboard layouts, on-screen keyboards (like Indic OnScreen Keyboard or iok and even Quillpad), input methods, word-lists and like bits that form the user-experience completion when using a Linux desktop to compose content. In sort, the traditional problems in the fields of input-display-printing have been substantially addressed to bring the end-user experience at a level of where it should be easy to just plug-and-create.

There is a wealth of content in Indian languages, starting right from folk-tales that are part of the oral tradition to commercially generated content which needs to start moving into the UTF-8 encoding space. Projects like the OLPC can benefit from the availability of such forms. Work on Indic OCR remains to move forward at a much aggressive pace than what is currently, but there are signs of good things coming out of it. Digitizing data would also enable a lot of content to be archived and made available for consumption.

This is the year that should see a large part of such things happening. The marriage of content creators with the infrastructure developers is something that needs to happen as well. And, this needs to include folks from fields of comparative literature, media studies and the like. Anyone who really does generate content, should be met with and talked to regarding the need to exert themselves to become part of the process. Content already takes in a large chunk of investment outlay for the mobile players and with the availability of easy means of generating content, it would not be far to start thinking about a need to consolidate, find patterns, predict trends.

The convergence of the computing and application prowess of mobile devices, content creation workflows and upswing in the production of Indic language content for the webspace promises to make 2009 an interesting year of innovations.

Season’s Greetings to all.

…and here we go again

In an article on l10n and i18n published in this month’s edition of LinuxForYou (the article isn’t available online), Kenneth Gonsalves makes a statement as (italics are mine):

The vast majority of applications today are internationalised – the need of the hour is to provide translations in Indian languages. Except for some major applications, very little work is being done in this field. I don’t know whether it is because people are not aware of the need, are too lazy or they do not know how to!

This thought seems to be the new black. Adding on to the pre-existing notions of:

  • translations are very easy to do
  • translations are for hobbyists

On some days I am surprised about why such perceptions prevail. If any language team/community works on

  • a single distribution
  • two desktop environments
  • a browser
  • a mail client
  • an office suite
  • web-site content translation
  • release notes translation

then, assuming that most projects end up following a 6 month release cycle, it leaves folks with around 3+ months (on an optimistic schedule) to work with. In fact with “string freeze” (or, the time when the developers hand off the English versions to the translators) the effective window to actually translate is around 1 month. And, I have seen that for whatever little translations I have done for GNOME and OLPC.

And, the fact that schedules are tight can be seen on the mailing lists during desktop environment release times. So, if we can assume that the teams aren’t lazy and they know what they are doing, adding any more applications to be translated (and localized) would require capacity to be added. Which means that those who do go about FOSS evangelism and FOSS advocacy have to comprehend the following:

  • translation is not easy. Idiomatic English does not lend itself easily to translations and more importantly, message strings are sometimes not well constructed to be translated. For example, read this blog entry.
  • translation is not for hobbyists. It is a process of ensuring newer applications and releases are available in the local language. Thus, it means that teams working on translations improve quality of existing translations, check for consistency and still manage to work on newer releases. It is a serious business and folks take pride in a job well done.

If there are more applications that require translations/l10n, it would be a good effort to start coordinating with the language teams (via the IndLinux mailing list perhaps) rather than assuming that teams don’t know about such applications or, are lazy.