I wrote a short note on the history/context behind the bn-IN/bn-BD decision and other things here If you’d like to comment/discuss, I’d prefer the G+ post than this blog.
On the sidelines of the GNOME.Asia 2011 at Bangalore, an article in The Hindu “This one’s no gnome” was published. When Srinivasa Ragavan tweeted about it, I mentioned about the ill-formed comment
The paragraph of interest is here:
Excited to be in India, he concedes that community interest here is still on the lower side. Setting this straight is particularly important when it comes to GNOME localisation. “Localisation is a huge challenge here, mainly because there are so many languages, and also because of the way the alphabet font is linked. This is where Free software is critical, because smaller the user base, lesser the chances proprietary firms will take this up.” We count on enthusiastic developers, who are proud of their language and want to preserve it in the digital realm, Mr. Cameron adds. In fact, he points out there are “big business opportunities”, domestic and international, for those who commit themselves to such projects.
Brian mentions about community interest in localization being less. On the face of it, this appears to be a conclusion derived from looking at number of participants in the language communities. The way I’ve learnt to look at localization is to try and see the communities and the efforts which sustain. Localization is a steady and incremental process. The communities which take the time and make the effort to reach and maintain their place in and around the “supported” percentage for the individual Indic languages are the ones doing great work.
With each release of GNOME there is an obvious increase in the number of strings up for translation. I don’t have the exact statistics to support this fact but I guess a trend-line generated from the total number of strings available in each release super-imposed over the trend-line for the ‘supported percentage’ figure would see an increase with each release. I’m sure someone actually has this data. This basically means that within a short period of ~4 months (factor in the code freeze and string freeze etc), which may or may not also overlap with other project releases, the localization teams end up completing their existing release work, review pending modules and polish up translation consistency ensuring that with each release of the desktop there is a step towards making it more user-friendly.
That’s why localization is a big challenge. Not because of the number of languages and certainly not because of the “way the alphabet font is linked”. For what it is worth, the latter bit is more in the realms of internationalization and there are efforts at multiple levels to ensure that the remaining few outstanding issues get fixed.
This brings us to a small gripe I’ve had about a lot of Free and Open Source Software projects who take their L10n communities and participants for granted. I’ve rarely seen a project board reach out and talk with the participants to figure out what can be done to make things better. For example, would a community doing L10n do more awesome work if they were funded for a 2 day translation sprint ? Do the language communities have specific requirements on the translation and content infrastructure or, the translation workflow ? Have these issues been brought up in Technical Board meetings ? GNOME isn’t the only one which repeatedly fails to do this transparently, but it is among the highly visible FOSS projects which seems to assume that it is an obligation of its volunteer contributors to keep the statistics ticking.
- IndLinux Hindi v0.37 (Milan) released on October 2003. It was a LiveCD that allowed you to check out a localized GNU/Linux desktop environment, input and display
- The AnkurBangla LiveCD was also released somewhat later that same year
- From May 2004 onwards, the Utkarsh Project released and maintained Gujarati Localizations
- In November 2004, Fedora Core 3 shipped and you could choose to boot into an Indic locale and get your work done. You can also check this page for the RHEL release that happened.
- In 2006, the IndLinux project released Rangoli
- In June 2007, Debian Etch was released with an installer localized in Indic languages (Bengali, Gujarati, Hindi, Malayalam, Nepali, Punjabi, Tamil). As a consequence, Debian (and Ubuntu) users may experience a full Indic-localized system from scratch.
So, before you go and listen to folks talk about the “first Indian language GNU/Linux Operating System” on the media channels, keep these dates in mind.
Update: This does not in any way claim to be the only dates that are relevant. So, if you do recall the dates of releases from other groups working on Indic L10n on Linux, please feel free to leave them in comments with URLs if possible.
The post is brought to you by lekhonee-gnome v0.9