On Lingua and distributed translation

Earlier today, a tweet popped up on my screen from someone I’d just begun to follow.  MeghnaK is a (self-described) 13-year-old blogger from India.  Her blog is what you might expect from a young woman…mostly personal, with some poetry and a few newsworthy stories.  Her grasp on blogging is clearly beyond her years (or is it?  There were no blogs when I was 13) however, and her writing is impeccable.  Anyway, she remarked on Twitter that her blog is often translated into other languages so that people can read it.

Now, I’m not sure how she figured that out, but it is certainly indicative of a growing trend; newsseekers, tired of seeing the same old in the media of their native language, are seeking translated sources more and more.  And MeghnaK’s timing couldn’t have been better, as Chris Salzberg of Global Voices’ Lingua project is speaking at Berkman today on that very topic.  Chris has written a very interesting paper on his experiences with Lingua and Global Voices, published in Translation Journal.

Lingua, allegedly “one of the largest and most active translation communities in the world,” was borne of the Global Voices 2006 Summit in Delhi, India.  The project answered a need from Global Voices readers whose English comprehension was not good enough to understand the stories coming out of GV.  And since Global Voices authors translate blogs from other languages into English on a regular basis, Lingua was a natural extension of that spirit.

Today, Lingua boasts 15 languages, with 7 soon to come.  But what’s fascinating is not that Lingua exists, but how it exists.  Although each language team has an editor (or in some cases, two), the translators themselves are all volunteers. The teams utilize a distributed translation model; some teams translate pieces using a wiki, others farm out translations to individual translators.  All are still working on the process.

Of course, Lingua is not without its challenges. Chris points to “lost context” as the biggest challenge of the project, meaning, when original articles are translated into a foreign language, translators are often stumped on how to translate phrases, concepts or terms. For example, in an article on “genital excision” (also known as female genital mutilation), a Malagasy translator had difficulty translating the foreign concept. She finally settled on “circumcision of young girls.” This is a common occurrence; as a Global Voices author, I’ve had translators contact me on a number of occasions to clarify terms I’ve used in articles on Morocco; terms which are clear in English but may not be in, for example, Korean.

Another challenge faced by Lingua is the discrepancies between GV’s bloggers and its translators; while bloggers are often well-versed in “web 2.0″ and proficient as bloggers, some translators (though volunteers) often have a more professional focus in their translation work and are perhaps less familiar with the Internet.

What is clear, though, is that Lingua is a success. Readership grows by the month, and several languages, including Spanish and Italian, have made it into Google News listings. Many languages used within Lingua (French, Spanish, Portuguese, Swahili) cover multiple regions and countries, allowing for a broader reach. And Lingua has a growing number of partnerships with other organizations: GV Arabic and Al Jazeera Talk have linked up, as well as GV Chinese and China Times.

Most of all, Lingua provides one of the world’s best resources for global news and information and is setting a global precedent for distributed translation projects. As Chris wrote in his paper, “With the Internet becoming more multilingual by the day, there is a growing need for local voices to fill in the gaps of ‘global news.’” Just as Global Voices brings local voices to the forefront, Lingua brings those voices to the masses.