FOSS: Localization(自由/开源软件:本地化)

(翻译基本完成,欢迎协助校对)

[ 译者按:这本小册子是自由/开源软件本地化的综述和政策指导。
IOSN另有一本详细的本地化技术指南,见
http://www.iosn.net/l10n/l10n-howto-toolkit/guide.pdf ]

本书从本地化的简介开始,包括介绍其益处和当前进行中的本地化工作。然后对软件本地化过程中的一些技术细节和相关标准(Unicode,OpenType 字体等)问题进行探讨。最后以政府如何鼓励本地化工作和技术人员如何参与工作作为结束。

这一册提供了自由/开源软件(Free/Open Source Software, FOSS)本地化的广泛视角,对发展中国家的政府和决策人士犹有裨益。它突出了自由/开源软件本地化的益处和策略,并提供了正走在软件自由之路上的一些国家的案例研究。

本册开篇介绍本地化,以及用自由/开源软件代替私有软件的好处。第二节提供了亚太地区自由/开源软件本地化的项目和工作情况的调查,包括从越南、泰国、柬埔寨、印度和马来西亚等国家获得的有益经验和教训。本册也提供了泰国、老挝和柬埔寨三个国家本地化工作的案例研究,以及对技术、资源分配、工具和技巧、实现、成本和语言问题的一些建议。

为了帮助本地化工作者起步,我们在两个附录中提供了本地化的关键概念和技术说明,为规划软件本地化项目的管理和实施人员提供参考。

Starts off with a general introduction of localization, its benefits and current localization efforts. Delves into some of the details of what is involved technically in localizing software, the standards involved (Unicode, OpenType fonts, etc). Concludes with details of what governments can do to encourage localization efforts and how technical individuals can get involved.

This primer provides a broad perspective on the localization of Free/Open Source Software (FOSS) for the benefit of policy and decision-makers in developing countries. It highlights the benefits and strategies of FOSS localization, along with case studies from various countries that are on the road to software freedom.

The primer begins with an introduction to localization and the benefits of choosing FOSS over proprietary software. The next section provides a survey of initiatives and efforts in localization of FOSS within the Asia-Pacific region, including best practices and lessons learned specifically in countries such as Viet Nam, Thailand, Cambodia, India and Malaysia. The primer also provides three case studies of localization efforts in Thailand, Lao PDR and Cambodia, as well as recommendations on technical issues, resource allocation, skills and tools, implementation, costs and language considerations.

To help localizers get started, two annexes regarding key concepts and the technical aspects of localization are provided. These are intended for project managers and implementers who are planning software localization projects.

  • 版权页
  • 前言
  • 致谢
  • 自由/开源软件本地化的总论
  • 亚太地区本地化工作
  • 自由/开源软件本地化的建议
  • 进一步阅读
  • 资源和工具
  • 词汇表
  • 关于作者
  • 封底

版权页

(C)联合国开发计划署亚太发展信息项目(UNDP-APDIP),2005
网址:www.apdip.net
电子邮件:info@apdip.net

本书按照创作共用署名许可2.0版发表。
许可的详情请参阅如下网址:
Creative-commons.org/licenses/by/2.0/legalcode

ELSEVIER
Reed Elsevier India Private Limited
17 A/1, Lajpat Nagar IV,
新德里 110024
电话:91-11-26447160
传真:91-11-26447156
网址:www.asiaelsevier.com

ISBN: 81-8147-754-5


前言

自由软件是尊重用户自由的软件。它意味着用户可以自由地运行程序,自由地学习和改变软件(或者雇人帮他们做),自由地重新发布拷贝,以及自由地发布修改版本。其结果是用户可以自由地分享软件,并形成社区以对他们使用的软件进行有效的控制。自由软件也可能是免费的,但并非总是如此。

一些用户将这称为开源。这个词代表的是大量的开发者和用户对软件的审议带来的技术优势,使用这种软件的商业优势,以及支持其开发和使用的商业模式。而“自由软件”这个词指的是自由的社会和道德价值,以及它带来的实际益处。

分享和改变像软件这样的有用创作是社会和技术的共同要求,它引起了一场“出版”意义的革命。传统的印刷出版要求读者把作品看作固定不变的——作品完成后读者只能被动地使用。这种技术不允许没有自己的出版社的读者们改造、提高和分享书籍,而读者们也习惯了这些限制。因此当大量的公民开始使用个人计算机时,许多人都没有质疑立法和商业机构施加于软件上的类似限制。但这一次,这种限制并不是技术的自然结果。它是由认为控制用户有利可图的软件开发者带来的。这些开发者限制用户分享软件,而且不让用户得到源代码从而阻止他们改变软件。

但现在,用户有了选择:自由软件,由多达上百万开发者——一些是志愿者,一些从中获得报酬——组成的社区的产品。自由软件允许用户充分利用他们计算机的能力。它是我们逃离印刷出版的限制的一条途径,让软件在用户当中,为了用户的目的,由用户来开发。

这样的软件不会采用通过限制用户来鼓励开发的错误的激励机制,这样的机制产生人为的稀缺。

用户可以按照自己的意愿制作软件。软件的用户可以自由地运行、分享,并通过集体或个人对其进行修改。这就是自由/开源软件(Free and Open Source Software, FOSS)的世界。

非自由软件让用户孤立无援。当一个国家增加非自由软件的使用时,那不是发展,而是永久性地依赖。只有使用自由/开源软件才能实现可持续发展——它是本地人民能够自由学习、维持、改造和利用的技术。

政府怎样推动国家采用自由/开源软件呢?有两种方法。首先,在学校中使用自由软件能教会孩子们社群协作的精神,同时造就能熟练使用和维护自由软件的毕业生。第二,规定在政府部门中使用自由软件将使这些毕业生的技能有用武之地,并建立起自由软件支持的本地经济。

国际开源组织(the International Open Source Network, IOSN)是 UNDP 亚太开发信息项目的一个机构,其工作指导方针是“所有人的软件自由”(Software Freedom for All, SFA)。组织的工作包括提供支持和技术帮助,发展的中心,以及一个亚太地区自由/开源软件信息交换的场所。

通过 IOSN/SFA 项目,UNDP向政府部门、非营利组织、捐赠机构和其他组织提供政策支持和建议服务。它向自由/开源软件从业者和用户发布实用工具和学习材料,包括简单的“如何”(how to)册子和指导书,培训材料,以及 GNU/Linux 操作系统的免安装 CD。它也支持自由/开源软件的本地化及其他研发活动,并组织会议和培训课程以联络相关各方,并增强当地能力。欢迎对这些活动感兴趣或愿意参与组织这些活动的人们。

我很高兴能与 IOSN/SFA, APDIP 和 UNDP 合作,把软件自由的理念带给大众和开发部门。我们将一起协助人们跨越数字鸿沟,并用发展取代依赖。

Richard M. Stallman


致谢

本书的作者向下列个人和组织表示感谢:

感谢 Phet Sayo,UNDP-APDIP 的能力构建和合作项目专家,帮助我们起步;Kenneth Wong,IOSN 的项目顾问;Tan Wooi Tong,IOSN 的项目专家;感谢他们的耐心和勤奋工作。有他们的帮助,管理和操作这个大型项目变得简单多了。这个团队不仅推动了计划进行,也提供了让这本小册子独一无二的反馈和建议。

感谢来自 Khmer(柬埔寨)软件计划的 KhmerOS 项目协调员 Sarmad Hussain, M Sasikumar 和 Javier Sola 评审这本册子并提供了宝贵的建议。我们特别要感谢 Javier 在12-15页提供了关于自由/开源软件的益处和 Khmer 本地化的案例研究。

感谢 Vorasone Dengkayaphichith, Vincent Berment, Alberto Escudero-Pascual, Pramod Raghavendra 和 Guntapalli Karunakar 读过这本册子后提出的批评意见。

感谢 Arthit-Thai OpenOffice 项目组成员,印度和越南的成员提供调查反馈。

感谢 Fred Williams 宝贵的贡献和建议。

感谢 PAN 本地化计划(www.panl10n.net),PAN IDRC 项目和国立计算机与新兴科学大学(www.nu.edu.pk)的 Urdu 语处理研究中心(Center for Research in Urdu Language Processing, CRULP, www.crulp.org)在“本地语言计算基础”培训课程中提供了用于本书第二节的调查信息。

Anousak 将本书献给他的家人和已故的父母。


自由/开源软件本地化的总论

This primer provides a broad perspective on the localization of Free/Open Source Software (FOSS) for the benefit of policy- and decision-makers in developing countries. It highlights the benefits and strategies of FOSS localization, along with case studies from various countries that are on the road to software freedom.

The primer begins with an introduction to localization and the benefits of choosing FOSS over proprietary software. The next section provides a survey of initiatives and efforts in localization of FOSS within the Asia-Pacific region, including best practices and lessons learned specifically in countries such as Viet Nam, Thailand, Cambodia, India and Malaysia. The primer also provides three case studies of localization efforts in Thailand, Lao PDR and Cambodia, as well as recommendations on technical issues, resource allocation, skills and tools, implementation, costs and language considerations.

To help localizers get started, two annexes regarding key concepts and the technical aspects of localization are provided. These are intended for project managers and implementers who are planning software localization projects.

这本册子帮助发展中国家的政策制定者和决策者从一个宽泛的视角看自由/开源软件的本地化。本书的重点是自由/开源软件本地化的益处和策略,同时提供了走在软件自由之路上的不同国家的案例研究。

册子从对本地化的介绍和舍弃私有软件而选择自由/开源软件的好处开始。后一节提供了亚太地区自由/开源软件本地化的项目和工作情况的调查,包括从越南、泰国、柬埔寨、印度和马来西亚等国家获得的有益经验和教训。本册也提供了泰国、老挝和柬埔寨三个国家本地化工作的案例研究,以及对技术、资源分配、工具和技巧、实现、成本和语言问题的一些建议。

为帮助本地化工作者起步,本书的附录提供了与本地化相关的主要概念和技术信息,供软件本地化工程的项目管理人员和实施者使用。

计算机、英语和本地用户
本地化
本地化的重要性
什么是自由/开源软件?
什么是 GNU/Linux ?
为什么本地化自由/开源软件?


计算机、英语和本地用户

如果你看不懂说明、按钮和菜单,你几乎没法使用计算机。因此亚洲的许多国家在工作场所、学校和家庭采用现代计算机技术的进程落后于欧美国家也就不足为奇了。

计算机并不一定要用英文显示一切信息。实际上,计算机“不在乎”它显示的是什么文字,因为所有的信息最终都被转换为1和0。即使现实的信息被翻译为另一种语言,计算机也照样运行。

把软件翻译为别的语言并不是新鲜事。这个称为“本地化”的过程,技术上并不困难。但是,它需要专业的管理、翻译人员,以及财力资源,在初次翻译的工作中尤其如此。

商业软件公司已经为特定的市场做了一些软件本地化。一般来说,他们通过收取自己软件本地化版本的许可费用来回收成本。在一般公民无力支付这种费用的国家,人们要么不使用本地化软件,要么使用非法拷贝。所有国家都希望享有本地化软件的好处,但有些国家支付不起昂贵的许可费用。有了自由/开源软件,这个问题就解决了。
自由/开源软件的方便获取,加上鼓励软件本地化/多种语言翻译的政策,提供了在世界范围内推广计算机应用的理想方式。

现在是发展中国家欢迎自由/开源软件运动,通过自由/开源软件本地化来加速计算技术普及的时候了。

It is almost impossible to use a computer if you cannot read the instructions, buttons and menus. Thus it is not surprising that many countries in Asia lag behind Europe and America in the adoption and use of modern computer technologies at work, in schools, and in the home.

Computers don't have to display everything in English. In fact, the computer "doesn't care" what language is displayed, since everything is ultimately converted into ones and zeros. Even if the display is translated into another language, the computer continues to operate as before.

Translating software is nothing new. This process, which is known as `localization', is not technically difficult. However, it requires professional management, a team of translators, and financial resources, especially for the initial translations.

Commercial companies have localized software for specific markets. Typically, they recoup their costs by charging license fees for the localized versions of their software. In countries where the average citizen cannot afford to pay the fees, they either do without localized software or resort to illegal copying. All countries want the benefits of localized software, but some cannot afford the expensive licenses. With FOSS, this problem is solved. A combination of policies that encourage software localization/ translation into different languages and the ready availability of FOSS, presents an ideal means for expanding computer use worldwide.

Now is the time for developing countries to embrace the FOSS movement, and accelerate their adoption of computing technology by localizing FOSS.


本地化

本地化是为一个特定的市场(为特定的区域或文化习惯;区域通常决定了像排序方式,键盘布局,日期,时间,数字和货币格式这样的习惯)修改、翻译和定制产品的过程。对于软件的本地化,这个过程意味着生成有意义的,用户能够理解的软件界面。

本地化行业标准协会(Localization Industry Standards Association, LISA)这样定义本地化:“本地化涉及到选择一个产品并使其在语言和文化方面与它将被使用和出售到的目标区域(国家/地区和语言)相适应。”(1)一般地说,这涉及到翻译用户界面(程序向用户表达的信息)以便用户能够创建文本和数据,并修改、打印,或用电子邮件发送它们,等等。

技术上,本地化自由/开源软件与本地化商业软件没有什么分别。都要改换字体,定义键盘布局,并使用相关的标准。不同的是价格和许可制度。对于自由/开源软件,其价格较低,而且授权对所有人开放。为了节省金钱和时间,扶持本地的创新,以及与软件非法拷贝做斗争,自由/开源软件的本地化是一个更好的选择。

Localization is the process of adapting, translating and customizing a product (software) for a specific market (for a specific locale or cultural conventions; the locale usually determines conventions such as sort order, keyboard layout, date, time, number and currency formats). In terms of software localization, this means the production of interfaces that are meaningful and comprehensible to local users.

The Localization Industry Standards Association (LISA) defines localization as: "Localization involves taking a product and making it linguistically and culturally appropriate to the target locale (country/region and 1 language) where it will be used and sold." Typically, this involves the translation of the user interface (the messages a program presents to users) to enable them to create documents and data, modify them, print them, send them by e-mail, etc.

Technically localizing FOSS is no different from localizing commercial software. Fonts must be changed, keyboard layouts devised, and standards adopted. The difference is price and licensing. With FOSS, the price is lower and the license open to all. For saving money and time, nurturing local innovation, and combating illegal copying of software, FOSS localization is a better alternative.


本地化的重要性

目前,人们要使用计算机,就要先学习英语。在识字率较低的国家,这个条件阻碍了人们使用信息和通讯技术(Information and communication technologies, ICTs),农村的穷人和妇女受教育的机会更少。即使学过英语,用户也需要在外国软件的许可上花费数百美元,或者使用不断蔓延的非法软件拷贝,才能使用 ICTs。概括地说,信息技术的使用是[经济和社会]发展的关键之一,本地化的自由/开源软件是信息基础设施中重要的遗失的环节。

本地化带来如下的好处:

  • 极大地减少让用户能够使用计算机系统所需要的培训工作量。
  • 促进中小型企业采用计算机技术。
  • 帮助国家在中央、省级和区级行政部门发展计算机系统,让公务员能够完全用本地语言工作并管理含有本地语言名称和数据的数据库。
  • 协助省市级的数据分布。这对基础设施公司(电力、自来水、电话)同样适用。他们可以使用本地语言的数据库,从而降低成本和优化服务。
  • 允许公民用自己的语言书写电子邮件并沟通交流。
  • 促进本地软件开发公司为管理部门、公共事业单位和私有企业工作。
  • 向本地设计业提供高质量的字体。
  • 帮助大学培训更多的软件工程师。

这个牵涉多方的项目的受益人包括:

  • 直接的,所有本地计算机用户——可以容易地使用电脑而不用先学习英语。
  • 间接的,与政府打交道的全体公民——通过使用本地化计算机系统带来的政务效能提高。
  • 本地政府能够开发基于当地语言的数据库和应用程序。本地化带来足够的技术力量和有能力的本地开发公司。政府也将能拥有在类似的管理机构(例如省级单位)协调部署应用程序的工具,这样用最低的成本就能实现信息技术带来的执政能力的改善。
  • 软件产业。政府对兼容的计算机技术的采用鼓励软件公司开发兼容的、应用于不同部门的计算机系统,从而在境内建立稳定的软件产业。一旦(使用自由/开源软件)掌握了技术,这些公司将获得以极具竞争力的价格为外国公司开发类似项目的能力,促进在外地市场的销售。

Currently, people who want to use computers must first learn English. In a country with low literacy rates, this blocks access to information and communications technologies (ICTs), especially for the rural poor and women who do not have equal access to education. Even after having learnt English, users must pay hundreds of dollars to license foreign software, or resort to widespread illegal copying of software, in order to gain access to ICTs. In short, access to information technology is one of the keys to development, and localized FOSS applications remain a crucial missing link in communications infrastructure.

Localization brings the following benefits:
Significantly reduces the amount of training necessary to empower end-users to use a computer system.
Facilitates the introduction of computer technology in Small and Medium Enterprises (SMEs).
Opens the way for the development of computer systems for a country's national, provincial and district level administration that will allow civil servants to work entirely in the local language and manage databases of local language names and data.
Facilitates the decentralization of data at provincial and district levels. The same applies to utility companies (electricity, water, telephone), who will develop local language databases, thereby reducing costs and giving better service to citizens.
Allows citizens to communicate through e-mail in their own language.
Empowers local software development companies to work for the administration, the public sector and private companies.
Provides the local design industry with good fonts.
Helps universities train more software engineers.

The beneficiaries of this multi-stakeholder project are:
Directly, all local computer users, who will have easier access to the use of computers as they will not have to learn English first.
Indirectly, through improvements in governance using native computer systems, all local citizens in the quality of their dealings with the administration.
The local government who will have the opportunity to develop databases and applications in the local language. Sufficient technology and empowered local development companies will be available. The government will also have the tool to coordinate applications among similar administrations (e.g., provinces), so that IT-based improvements in governance can be made at the lowest possible cost.
The software industry. The government's use of standards-compliant computer technology encourages software companies to start developing compatible computer systems that will be used by the different bodies of the administration, thereby creating a stable software industry in the country. Once this expertise is developed (using FOSS), these companies will be empowered to undertake similar projects for foreign companies at extremely competitive prices, facilitating sales beyond the local market.


什么是自由/开源软件

过去十年中发生的事在之前的年代看来是不可能的。志愿的计算机科学家群体开发出了计算机操作系统,高端的图形界面(桌面),以及大量的应用程序,在质量、外观和可靠性上都能与最先进的私有软件(如微软的 Windows)媲美。

自由/开源软件(Free/Open Source Software)中的“Free”字眼是指使用、学习、修改和分享软件的自由。分享自由/开源软件的自由意味着人们可以使用和翻译这些软件而不用支付任何费用。但是,有些可以免费使用的软件,如“共享软件”(shareware)和“免费软件”(freeware),不能被用来研究、修改和分享,所以它们不是自由/开源软件。

开发自由/开源软件的曾经都是志愿爱好者。但今天,像 IBM 和升阳这样的大型计算机公司也支持和开发自由/开源软件。

越来越多的欧洲国家和地方政府制定了或正在制定提倡使用自由/开源软件代替私有系统和工具的政策。这不仅使它们不再依赖于商业软件供应商,而且也帮助扶持了它们自己的软件开发产业。

今天的自由/开源软件无疑是生机盎然,质量上乘的。因此,许多政府选择了自由/开源软件进行本地化。

The last decade witnessed a phenomenon which in the preceding one would have been thought of as impossible. A community of volunteer computer scientists has developed computer operating systems, advanced user interfaces (desktops), and a number of applications that compete in quality, appearance and robustness with some of the most advanced proprietary software (such as Microsoft Windows).

The term "free" in Free/Open Source Software refers to freedom to use, study, modify and share the software. The freedom to share FOSS implies that it can be used and translated by people without their having to pay any fees. However, some software that can be used without having to pay user fees, such as `shareware' or `freeware', cannot be studied, modified or shared, which means that they are not FOSS.

At one time, FOSS was exclusively developed by volunteer enthusiasts. Today, however, even large computer companies such as IBM and Sun Microsystems support and develop FOSS.

A growing number of European national and local administrations have developed or are developing policies to promote the use of FOSS instead of proprietary systems and tools. This not only gives them independence from commercial vendors, but also nurtures their own software development industries.

There is little doubt now that FOSS is viable and of high quality. Therefore, many governments are choosing FOSS for localization.


什么是 GNU/Linux ?

GNU/Linux 是由 Linus Torvalds 和 GNU 项目在全世界成千上万名志愿开发者协助下从头创建的一个自由的类 Unix 操作系统。它是最流行的自由/开源软件。
GNU/Linux 的源代码在 GNU 通用公共许可下开发,对每个人都自由开放。下载和使用它不需要任何授权费用。Linux 是稳定安全的,也没有“隐藏的”功能,因为所有源代码都是对公众开放的。与私有软件不同,全球成千上万的开发者都可以审查代码,发现隐患,提供安全补丁和改进代码,因此 GNU/Linux 一直以来都很难被入侵者攻破。

GNU/Linux is a free Unix-type operating system originally created by Linus Torvalds and the GNU Project with the assistance of thousands of volunteer developers around the world. It is the most popular FOSS. Developed under the GNU General Public License, the source code for GNU/Linux is freely available to everyone. And one does not have to pay a licensing fee to download and use it. It is robust and secure, and it has no "hidden" features, because the source code is publicly available. Unlike proprietary operating systems, thousands of developers across the globe can inspect the code, identify vulnerabilities, provide security patches, contribute improvements and therefore historically GNU/Linux systems cannot be easily compromised by attackers.


为什么本地化自由/开源软件?

由使用英语的公司控制,几乎占据垄断地位的英文软件不能满足一个国家的长期需要,这是人们的共识。微软和其他几个美国大公司主宰了国际软件市场,获取巨额利润,拥有庞大势力。对于资金有限的用户,支付私有软件的负担意味着用于其他关键软件的花销必须削减,同时也失去了语言自由。

虽然私有软件的质量一般都属上乘,世界各地的政府都明白对商业公司的依赖会带来高风险。如果一个公司决定不再为某个语言版本的软件提供支持,就只有熟练掌握英语的人才能有效地使用计算机。而本地的信息和通信技术专业人士一旦同时掌握了计算机技术和熟练的英语,他们往往很快被吸引到国外,导致“脑力流失”,对发展中国家造成今后几代人的损害。

自由/开源软件本地化的关键优势:

  • 减少进口依赖。
  • 本地用户不需要先学习英语。
  • 本地程序员能够获得专业知识和经验。
  • 本地人能够自行控制软件的外观和功能。
  • 新的本地技术标准和教育机会。
  • 建立本地软件产业。外国人来做本地化是很难的,因为他们一般没有本地语言的语感,语言经常有缺陷。
  • 关于本地内容的国家政策不必依赖于别人提供私有软件或硬件。
  • 根据国家需要,可以重点支持某些软件的本地化。
  • 可以使用有重要民族性的语言,即使[私有软件本地化]在经济上不可行。

私有软件的缺点:

  • 授权和维护费用高昂。
  • 英语占统治地位。
  • 被外国企业控制。
  • 依赖于私有或封闭的标准。
  • 缺少或没有本地支持。
  • 高价软件造成非法拷贝。
  • 本地软件业得不到发展。
  • 不能本地化或修改软件。

自由/开源软件通常都比私有或封闭源代码的软件更加安全。用秘鲁国会议员 Villanueva 的话说,“为了保证国家安全,政府必须使用任何组件都不能被外人控制的系统。开源系统允许政府和公民自行检查代码,检验是否有后门和间谍软件存在。”(2)

当然,在一个不确定的、时有危险的世界上,没有哪个政府敢于依赖一个他们无法控制的诡秘组织(例如一个商业软件公司)的良好意愿,而拿他们的信息基础设施的安全冒险。

It is an acknowledged fact that the near-monopoly of English language software, controlled by English- speaking companies, does not serve the long-term needs of any country. Microsoft and a few other large US corporations dominate the international software market, earning large profits and wielding enormous power. For those with limited funds, the burden of paying for proprietary software means less money for other programmes of vital importance, as well as giving up linguistic freedom.

While proprietary software is often of the highest quality, policy-makers worldwide know that it carries a high risk of dependence on commercial corporations. If a corporation decides to no longer support software in another language, only those who are fluent in English would be able to operate computers effectively. And when local ICT professionals become proficient in both computers and English, they are quickly lured away from home, leading to a "brain-drain" that can damage a developing country for generations to come.

Key Advantages of FOSS Localization
Reduced reliance on imports.
No need for local users to learn English first.
Local programmers gain expertise and experience.
Local control over software appearance and functionality.
New local technical standards and educational opportunities.
Establishment of a local software industry. It is difficult for foreigners to do localization as they
do not normally have an intuitive feel for the local language and therefore the language is com-
promised in most cases.
National policy on local content would not be dependant on the availability of proprietary
software or hardware.
Localization of applications can be prioritized according to the national needs.
Languages that are nationally important but financially unfeasible can be used.

Disadvantages of Proprietary Software
Expensive to license and maintain.
Dominated by the English language.
Controlled by foreign corporations.
Dependent on proprietary or closed standards.
Has little or no local support.
The high cost of the software leads to illegal copying of the software.
The local software industry is not developed.
Software cannot be localized or modified.

FOSS is generally much more secure than proprietary or closed source software. In the words of Peruvian Congressman Villanueva, "To guarantee national security, the State must be able to rely on systems without elements controlled from a distance. Systems with open source code allow the State and citizens to inspect the code themselves and check for back doors and spyware." (2)

Certainly, in an uncertain and sometimes dangerous world, few governments can afford to risk their information infrastructure security by relying on the goodwill of a secretive organization (such as a commercial software company) that they cannot control.



1 The Localization Industry Primer, LISA - The Localization Industry Standards Association, 2nd Edition, 2003; available from www.lisa.org/interact/LISAprimer.pdf.

2 `The OSS advocacy report: Peruvian Congressman Responds to Microsoft', March 10, 2002; available from www.studioforrecording.org/mt/Pubdomain_Bread/archivist/000013.html


亚太地区本地化工作

一项调查
案例研究


一项调查

In the winter of 2003, a survey was done on FOSS localization efforts in several Asian countries. Progress in localization varies considerably from place to place. Governmental sponsorship of the project appears to be the single most important factor in achieving rapid success in localization. Without exception, the developers surveyed cited "Freedom to Develop" as their primary reason for choosing to localize FOSS instead of using proprietary software.

2003年冬,有人对几个亚洲国家自由/开源软件本地化的工作进行了一项调查。各国本地化的进程有很大差异。政府对本地化项目的支持似乎是本地化取得快速成功的最关键因素。接受调查的开发者们无一例外地选择了“开发的自由”作为他们选择本地化自由/开源软件而不是使用私有软件的首要理由。

The CJK Initiative

=CJK 项目=

China, Japan and Korea are officially cooperating in FOSS localization. The well-funded and extensively promoted "CJK" programmes encouraging use of localized FOSS are at par with that of the wealthiest nations.

With educational and technological infrastructure and a large pool of skilled technology and language specialists, these three countries have the potential to dominate East Asian FOSS development in the next decade.

中国、日本和韩国在自由/开源软件的本地化上进行了官方合作。资金充足、推广全面的“CJK”项目谷里使用本地化的自由/开源软件,其力度可以与最富裕国家的类似项目媲美。

这三个国家拥有的教育和技术基础设施和大量熟练的技术和语言专家,使它们有潜力在今后十年中的东亚自由/开源软件开发中占主导地位。

As donors and advisors, all three countries have a track record of supporting FOSS initiatives worldwide. Others should emulate the technical and organizational competence of the CJK FOSS and localization initiative, which is being led by the the Chinese Software Industry Association (CSIA), Japanese IT Services Industry Association (JISA), and the Federation of Korean Information Industries (KFII).

This primer is too short to fully detail the impressive work of the CJK initiative. However, a number of references to CJK as well as contact information for the key leaders of the movement are provided.

作为出资者和监督者的三个国家都有支持世界各地自由/开源软件项目的记录。由中国软件行业协会(Chinese Software Industry Association, CSIA)、日本信息技术服务产业协会(Japanese IT Services Industry Association, JISA)和韩国信息业联盟(Federation of Korean Information Industries, KFII)领导的 CJK 自由/开源软件和本地化项目具有的技术实力和组织优势值得其他国家效仿。

本书没有足够的篇幅详细叙述 CJK 项目的突出成果,但提供了关于 CJK 的参考资料以及项目关键人物的联系信息。(3)

Indian Languages

=印度语言=

There are already multiple localized Indic language versions of GNU/Linux, Mozilla and OpenOffice.org available online. The depth of technical experience of Indian programmers in localization should enable them to provide assistance to other Asian countries in their localization efforts.

India has world-class IT infrastructure and skills. It also has a multiplicity of languages. Currently, the Indian Ministry of Information Technology is localizing to Hindi, Marathi, Kannada, Malayalam, Tamil, Telugu and Sanskrit.

网络上已经有多种本地化的印地语 GNU/Linux,Mozilla 和 OpenOffice.org 版本。印度程序员丰富的本地化经验将使他们能够为其他亚洲国家的本地化工作提供帮助。

印度拥有世界级的信息技术基础设施和技术力量。同时印度也是一个多语言的国家。目前,印度信息产业部正在进行印地语、马拉地语、坎纳德语、Malayalam、泰米尔语、泰卢固语和梵语的本地化工作。

At least two independent groups as well as the Ministry of Information Technology are currently localizing to Tamil.

An initiative to localize GNU/Linux to around 10 Indian languages, called Indix, is being spearheaded by the Centre for Development of Advanced Computing (CDAC), a scientific society of the Ministry of Communications and Information Technology, Mumbai. There is also a Hindi GNU/Linux and a Bangla version in progress.

除了信息产业部以外,还有两个独立的团体在进行泰米尔语的本地化。

一项将 GNU/Linux 本地化为10种印度语言版本的称为 Indix 的项目,正在由通信和信息技术部位于孟买的一个名为先进计算开发中心(Centre for Development of Advanced Computing, CDAC)的科学团体牵头开发。一个印地语和一个 Bangla 语的版本也在开发之中。

English is widely used in India. Therefore, the underlying programming languages are easily understood, making the work much easier. For English language FOSS development, South Asia can realistically compete with Europe and the United States today.

印度广泛地使用英语。因此,他们更容易理解编程语言,也使本地化工作更容易进行。对于基于英语的自由/开源软件开发,今天的南亚与欧美国家可以同场较量。

Thai

=泰国语=

Localization of FOSS in the Thai language was started in 1993 by Thai students in Japan in an effort to use Thai in Unix environments. Later, a similar movement commenced in Thailand. These two movements merged and Linux TLE and the Thai Linux Working Group were launched in 1999. However, the movement was limited to a small community until mid-2001 when the Thai version of OpenOffice.org, dubbed Pladao and sponsored by Sun Microsystems (Thailand), was announced to the public. Documentation is being carried out by private publishers who, in exchange, will retain the copyright on their work for later sale to the public in the form of books and manuals.

1993年由留日泰国学生进行的在 Unix 环境下使用泰国语的尝试,标志着将自由/开源软件本地化为泰国语版本的工作开始。不久,类似的活动在泰国开展起来。这两个项目后来合并到一起,并在1999年启动了 Linux TLE 和泰国 Linux 工作组项目。但是,运动始终都处于小群体的水平,直到2001年中由升阳微系统(泰国)公司支持的被称为 Pladao 的 OpenOffice.org 泰国语版本向公众发布。文档工作任务由私人发行商承担,作为交换,他们可以保留他们工作的版权并在今后以书籍和手册的形式向公众出售这些文档。

Only 10 individuals are actively involved in the project at this time, with development resources provided by a combination of private companies and government.

目前计划的活跃成员只有10名,开发的资源则由私人公司和政府共同提供。

Thai developers are using GNU Compiler and Tools for modifying GNU/Linux, Mozilla’s “Bugzilla” for defect tracking, and CVS for version and source control. In addition, OpenOffice.org, a productivity suite, is being localized for Thai users.

泰国的开发者们使用 GNU 编译器和工具修改 GNU/Linux,用 Mozilla 的“Bugzilla”跟踪错误,并用 CVS 进行版本和代码控制。此外,OpenOffice.org 这种办公套件,已经为泰国用户进行了本地化。

The primary difficulties the Thais are experiencing include lack of feedback from users, lack of documentation, and conflicts with other Thai localization projects because of a lack of coordination and standards.

泰国开发者面临的主要问题是缺乏来自用户的反馈,缺乏文档,以及由于缺乏协调和标准而与其他泰国本地化工程产生的冲突。

The Thais intend to roll out the software for both the private sector and public institutions, but are still in the planning stage at this time. The team has not yet seen a high-profile effort to promote the use of FOSS in Thailand. This may change in the near future, depending on government and private funding. In comparison to the CJK efforts, Thailand has fewer developers, less money, and less outside interest due to the size of its market. Other Thai localization projects are discussed on pages 7-10.

泰国准备为私有企业和公共部门同时提供本地化的软件,但目前工作仍然处于筹备阶段。在泰国还没有高级别的部门进行工作推广自由/开源软件的使用。这种情况可能在不久的将来有所改观,取决于政府和私人的资助。与 CJK 项目相比,泰国的开发者较少,资金较少,而且外界由于其市场的规模也缺乏兴趣。其他的泰国语本地化工程在案例研究部分有详细讨论。

Vietnamnese

=越南语=

FOSS localization to the Vietnamese language began in 1998, primarily as an individual effort by an enthusiast. Today, there are between 20-50 people working on the project. They are volunteers, providing their resources and time for free, and all are using FOSS tools for the work. But even with the number of individuals currently involved, there are not enough people to perform all of the necessary work.

自由/开源软件的越南语本地化开始于1998年,最初是爱好者的个人行为。今天,有20-50人为这项计划工作。他们都是志愿者,无偿贡献他们的资源和时间,而且都在工作中使用自由/开源软件工具。但现在参与到计划中的人仍然不足以进行全部必要的工作。

Vietnamese localizations of GNU/Linux, Gnome, KDE, XFCE, Fluxbox, OpenOffice.org, Mozilla, XMMS, GnuCash, Gaim and Knoppix are in progress.

目前进行中的工作包括 GNU/Linux、Gnome、KDE、XFCE、Fluxbox、OpenOffice.org、Mozilla、XMMS、GnuCash、Gaim 和 Knoppix 的越南语本地化。

Plans for introducing the software to the Vietnamese people include the publication of books, formation of user groups in major cities, and provision of the software to private and public institutions. Already, there has been some effort to promote the use of FOSS in secondary schools, universities and government offices. Recently the government adopted a FOSS policy.

将这些软件介绍给越南人民的计划包括出版书籍,在大城市组建用户组,以及向私人和公共单位提供软件。目前在中学、大学和政府推广使用自由/开源软件的工作已经展开。最近越南政府颁布了与自由/开源软件有关的政策。

Malay

=马来语=

The Malaysian Institute of Microelectronic Systems (MIMOS) has launched a Web site to document projects in FOSS within Asia with links to key resources. MIMOS’s goal is to localize key applications into the national language, Bahasa Melayu. MIMOS has already developed a local language GNU/Linux GUI and open source applications for the government. Two other localization projects are GNOME, led by Hasbullah Bin Pit, and OpenOffice.org, spearheaded by MIMOS.

马来西亚微电子系统研究院(Malaysian Institute of Microelectronic Systems, MIMOS)启动了一个网站,用于记录亚洲的自由/开源软件项目,并有重要资源的链接。MIMOS 的目标是把重要的应用程序本地化为国家通用语言 Bahasa Melayu。MIMOS 已经开发了一个 GNU/Linux 的本地语言 GUI 以及政府用的开源应用程序。此外还有 Hasbullah Bin Pit 领导的 GNOME 本地化计划和 MIMOS 牵头的 OpenOffice.org 本地化计划。

Khmer

=谷美尔语=

The Khmer Software Initiative has an ambitious though under-funded plan to localize FOSS for Khmer speakers in Cambodia and elsewhere. In addition to a localized version of the GNU/Linux user interface and some applications, they intend to create a development library, complete documentation both online and in print, and specialized training materials for developers and end users. It is not known how many individuals are working on the project currently.

谷美尔软件项目有一项虽然资金不足但目标远大的计划,为柬埔寨和其他地方的古美尔语使用者本地化自由/开源软件。除了本地化版本的 GNU/Linux 用户界面和一些应用程序外,他们还打算建立一个开发图书馆,完善网络和印刷版本的文档,以及为开发者和最终用户专门编写的培训材料。目前不清楚有多少人在为这项计划工作。

As elsewhere, the Khmer team envisions providing the software to both private and government users. A publicity campaign is planned and the developers expect significant support from the international development community to complete much of the work.

像其他国家一样,谷美尔人的开发队伍也预备同时向私人和政府用户提供软件。他们正在计划一次宣传攻势,开发者们也希望得到国际开发社区的帮助以完成大量的工作。

The quality of the planning for Khmer, a crucial first step, is very high. As with other developing countries, funding and personnel to complete the work are the main hurdles. For more information on this area, please read Khmer Case Studies on pages 12-15.

作为关键的第一步,谷美尔本地化项目的规划工作质量很高。像其他发展中国家一样,用于工作的资金和人力不足是主要的障碍。关于这一地区的更多信息请参阅后面的谷美尔案例研究。



3 See www.linuxinsider.com/story/32421.html, www.economist.com/business/displayStory.cfm?story_id=2054746, encyclopedia.thefreedictionary.com/CJK, www.chinadaily.com.cn/english/doc/2004-05/11/content_329529.htm and www.firstmonday.dk/issues/issue8_10/jesiek/.


案例研究

泰国语本地化
老挝语本地化
库美尔语本地化
本地化计划现状


泰国语本地化

Fortunately for localizers, Thai language support has been stabilized by means of standardization achieved in the preceding years. The only limitation is the readiness of the internationalization frameworks within Gnome, KDE, Mozilla and Open Office.

对泰国语的本地化工作者来说,幸运的是泰国语在过去已经通过标准化的方式稳定下来。唯一的限制是 Gnome,KDE,Mozilla 和 OpenOffice 中国际化框架的完备程度。

FOSS communities of Thailand contribute a lot to both software development and user support. Governmental organizations also play important roles in FOSS reaching the masses.

泰国的自由/开源软件社群为软件开发和用户支持都做出了很大贡献。政府组织也在大众接触自由/开源软件的过程中起了重要的作用。

Thai Language

=泰国语=

The official language of Thailand is Thai. Spoken by almost the entire population, with several dialects in different regions, Thai is a tonal, uninflected and predominantly monosyllabic language. Most polysyllabic words in the vocabulary have been borrowed, mainly from Khmer, Pali and Sanskrit.

泰国的官方语言是泰国语。几乎整个人口都说泰国语,只在不同地区有几种方言。它是一种有声调,没有词性变化和几乎完全单音节的语言。词汇中的大多数双音节词都是外来语,主要来源是库美尔语、巴厘语和梵语。

Thai belongs to the Tai language family, which includes languages spoken in Assam, northern Myanmar, Thailand, Lao PDR, northern Viet Nam, and the Chinese provinces of Yunnan, Guizhou and Guanxi.(4)

泰国语属于 Tai 语族,包括在印度 Assam 邦、缅甸北部、泰国、老挝人民民主共和国、越南北部,以及中国云南、贵州和广西省(4)使用的一些语言。

Thai script belongs to the Bhrami family. The oldest evidence of Thai script dates back over 700 years to the Sukhothai Age. Over the years, Thai script has gradually changed. Contemporary Thai script is composed of 44 consonants, 21 vowel symbols (32 sounds when combined), four tone marks, two diacritic marks and 10 decimal digits. Tone marks, diacritic marks and some vowel signs are stacked over or below the base consonants. The stacking is a shared characteristic among many South-East Asian scripts, especially those that are derived from Bhrami like Lao, Khmer and Myanmar languages. However, there are no complex precombined conjuncts in Thai, unlike most Indic scripts (Devanagari, for example). Only stacking is required. Lao is the script closest to Thai.

泰语字母属于 Bhrami 语族。书面泰语的最早证据可追溯到700多年前的 Sukhothai 时代。从古至今,泰语字母渐渐改变了。现代泰语字母包括44个辅音字母,21个元音字母(合并使用可表示32种读音),4种声调,2种变音符号和10个十进制数字。声调符号、变音符号和一些元音符号标注在基本元音的上方或下方。这种叠加标注方式是许多东南亚语言的共同特点,特别是从 Bhrami 语发展而来的语言,如老挝、库美尔和缅甸语。但是,在泰国语中没有复杂的复合音,这点不像大多数印地语字母(例如 Devanagari 语)。只需要叠加就可以表示所有语音。老挝语字母是最类似泰国语字母的。

Standardization

=标准化=

Standardization is the key to the success of Thai language support in computers. It allows interoperability and resolves many localization issues. Important standards include character set, keyboard layout and input/output method specifications.

标准化是实现计算机泰国语支持成功的关键。它允许互通性并能解决许多本地化问题。重要的标准包括字符集,键盘布局和输入/输出方法标准。

The standardization of IT in Thailand has been recognized since 1984 (5), when there were many efforts to use the Thai language in computers. More than 26 sets of code pages were defined by different vendors resulting in incompatibility. As a solution they were all unified as TIS 620-2529/1986 as the national standard by the Thai Industrial Standard Institute (TISI). A prominent legacy was the code table defined by Kasetsart University (KU) in a successful R&D effort to enable the Thai system in MS-DOS. It was the most widely adopted standard. Therefore, computer programs were obliged to support both encodings until TIS-620 became more popular and KU became obsolete.

泰国信息技术的标准化工作在1984年得到重视 (5),当时有许多致力于在计算机上使用泰国语的工作。不同的提供商制定了至少26种互不兼容的编码方案。泰国工业标准研究所(Thai Industrial Standard Institute, TISI)的解决方法是把它们统一为国家标准 TIS 620-2529/1986。这个时代一份重要的遗产是 Kasetsart 大学(KU)一次成功的研发工作,为 MS-DOS 下的泰国语系统定义的码表。它是使用最广泛的标准。因此,计算机程序不得不同时支持两种编码,直到 TIS-620 越来越流行,而 KU 编码终于退出历史舞台。

Therefore, when Microsoft released Windows into the Thai market, TIS-620 was the only encoding adopted. The same was true for Macintosh. Thus, the character encoding issue was firmly settled.

因此,当微软向泰国市场发售 Windows 时,TIS-620 是唯一的实用编码。对于 Macintosh 平台也是如此。这样,编码标准的问题就完全解决了。

In 1990, TIS-620 was amended to conform to ISO standards, but the code table was left completely unchanged. This new version was called TIS 620-2533/1990. The amendment enabled TISI to actively join many international standardization activities. For example, it submitted the character set to the European Computing Manufacturers’ Association (ECMA) for registration in the ISO 2375 repertoire, and was assigned as ISO-IR-166 so that it could be used with ISO/IEC 2022 mixed code page encoding. An example of such implementation is GNU Emacs. Around 1996, a TISI technical committee drafted a proposal for the Latin/Thai part (part 11) of ISO/IEC 8859, based on TIS-620. However, it was suspended due to the prohibition of combining characters. It was reactivated in 1999, and endorsed as an international standard in 2000.

1990年,为了与 ISO 标准兼容,TIS-620进行了修改,但码表部分完全没有改变。新版本被称为 TIS 620-2533/1990。这次修改使得 TISI 能够活跃地参与国际标准化运动中去。例如,它将字符集提交给欧洲计算制造商协会(European Computing Manufacturers' Association, ECMA)以在 ISO 2375 目录中注册,并且被分配了 ISO-IR-166 这个编号,这样泰国语就可以在 ISO/IEC 2022 混合码表编码方式下使用。另一个这种实现的例子是 GNU Emacs。1996年前后,一个 TISI 技术委员会起草了一份基于 TIS-620 的关于 ISO/IEC 8859 的拉丁/泰国语部分(第11部分)的提案。但是,提案由于对合并字符的限制而被搁置。提案在1999年重新讨论,并且在2000年成为一个国际标准。

The TIS-620 code table was pushed for inclusion in the Unicode table. Although the influence of the ISCII encoding scheme (which forced all vowels, including the leading vowels, to always be encoded after consonants) had made the Unicode consortium force Thai to change its encoding scheme, TISI defended the TIS-620 practice, as Thai script did not need such complications. Although this made Thai (and Lao) different from other Indic scripts, it saved Thai (and possibly Lao) implementations from the big hindrance of lacking supporting technology for ISCII practice at that time, as well as from the burdens of migration from the well-settled and widely-adopted practice. So all Thai language-processing codes for TIS-620 and Unicode, apart from the necessary code conversion, are fully compatible.

TIS-620 编码被要求加入 Unicode 码表中。虽然 Unicode 的 ISCII 编码系统(要求所有元音,包括音节开头的元音,都放在辅音之后)使得 Unicode 联合会迫使泰国改变其编码方案,TISI 维护了 TIS-620 的编码方式,因为泰国文字不需要这样的改动。虽然这使泰国语(可能也包括老挝语)与其他印地文字不同,但它将泰国语(以及老挝语)将当时缺乏 ISCII 编码方案技术支持带来的巨大困难中拯救出来,也避免了改变广泛应用和固定的习惯所带来的麻烦。因此所有处理 TIS-620 和 Unicode 的代码,除了必要的编码转换部分之外,都是完全兼容的。

In addition to the character set, the Thai keyboard layout was standardized as TIS 820-2531 in 1988, and later amended by adding keys for special symbols and published again as TIS 820-2538 in 1995. Another keyboard variant designed after research on character frequencies, called Pattajoti, is also available. But it is not as popular and is not a national standard.

除了字符集之外,泰国语键盘布局也在1988年标准化为 TIS 820-2531,后来增加了特殊符号键位并在1995年重新颁布为 TIS 820-2538。在研究了字符出现频率后设计的一种称为 Pattajoti 的变体也可以使用,但它没有规模相当的用户群,也不是国家标准。

Thai input/output methods were also standardized through the efforts of the Thai API Consortium (TAPIC), which was formed by a group of vendors and academies and sponsored by the National Electronics and Computer Technology Center (NECTEC). The specification, called WTT 2.0 (from its Thai abbreviation ‘Wor Tor Tor’, which stands for “Wing Took Tee” or “runs everywhere”), was published in 1991. Its contents were composed of three parts, describing character set and encoding scheme, input/output methods, and printer identification numbers.

泰国文字输入/输出方法也通过由供应商和学术界组成,并由国家电子和计算机技术中心(National Electronics and Computer Technology Center, NECTEC)资助的泰国 API 联合会(Thai API Consortium, TAPIC)的努力得以标准化。这份标准被称为 WTT 2.0(得名于其泰国语缩写 Wor Tor Tor,意思是“Wing Took Tee”,即“随处运行”),于1991年发布。其内容包括三部分,分别描述了字符集和编码体系,输入/输出方法,以及打印机识别字。

Although not endorsed by TISI as a national standard, WTT 2.0 was adopted by virtually all the major vendors who participated in the draft, including DEC, Sun, Microsoft, MacIntosh and IBM. WTT 2.0 had enjoyed being the de facto national standard for seven years, until it was dubbed TIS 1566-2541 by TISI in 1998.

虽然 TISI 并未将 WTT 2.0 作为国家标准推广,几乎所有参与草案编写的制造商,包括 DEC,升阳,微软,苹果和 IBM 都采用了这个标准。WTT 2.0 在后来7年中都是事实上的国家标准,TISI 在1998年将其定名为 TIS 1566-2541。

There are other activities (6) with international standards bodies that promote understanding of Thai language requirements among vendors. For example, in 1998 the “tis-620” MIME character set was registered with the Internet Assigned Number Authority (IANA) for information interchange on the Internet (7). Another example is an annex of ISO/IEC 14651, International String Ordering, describing how the predefined sorting order for Unicode can be tailored to match the requirements of Thai string ordering.

泰国还有其他与国际标准机构合作,促进供应商对泰国语言需求了解的活动 (6)。例如,1998年“tis-620”因特网邮件扩展(MIME)字符集在因特网数字分配管理处(Internet Assigned Number Authority, IANA)注册为因特网信息交换的标准 (7)。另一个例子是 ISO/IEC 14651 的附录《国际字符串排序》,其中描述了 Unicode 预先定义的排序顺序应当如何被修改以适应泰国语言字符串排序的需要。

With these established standards, specifications for Thai implementations are clear and interoperability is guaranteed. The standards have played an important role in several developments in the Thai computer industry, including FOSS localization.

这些标准的建立使得泰国语言的编程实现有清晰的原则可循,并能保证互操作性。这些标准在泰国计算机工业的几项开发中起着重要作用,包括自由/开源软件的本地化。

Localization

=本地化=

Thai localization of FOSS was started in 1993 by Thai students in Japan with the ThaiTeX project initiated by Manop Wongsaisuwan as a first effort to use Thai in the versatile typesetting program (8). Subsequently, the project was maintained by the Thai Linux Working Group (TLWG) (9) which was formed in 1999.

1993年在日本由 Manop Wongsaisuwan 发起,其他学生参与的 ThaiTeX 项目是第一次在多用途排版程序中使用泰国语的尝试(8),并宣告了泰国语自由/开源软件本地化的开始。此后,这个项目由1999年组建的泰国语 Linux 工作组维护。

Apart from Thai LaTeX, other surrounding UNIX environments have been modified by the same group of people to support Thai. Their work may be summarized as follows:

除了泰国语 LaTeX,这一组开发者还修改了其他周边 UNIX 环境以支持泰国语,他们的工作简述如下:

Manop Wongsaisuwan: ThaiTeX (ttex), X bit-map fonts.
Thai Project (by Vuthichai Ampornaramveth): (10) cttex (C version of ttex), xiterm+thai (Thai X terminal), likit (Thai text editor for X).
ZzzThai project (led by Poonlap Veerathanabutr): (11) thailatex-component, X bit-map fonts, Thai support in xfig, Thai-HOWTO and Thai RPMs.

  • Manop Wongsaisuwan:ThaiTeX(ttex),X 点阵字体。
  • 泰国语计划(Vuthichai Ampornaramveth领导)(10):cttex(ttex 的 C 版本),xiterm+thai(泰国语 X 终端),likit(X 下的泰国语文本编辑器)。
  • ZzzThai 计划(Poonlap Veerathanabutr领导)(11):thailatex 组件,X 点阵字体,xfig 泰国语支持,泰国语 HOWTO 和泰国语 RPM。

At the same time, other Thai support projects were developed by researchers of the NECTEC in Thailand, including:

同时,NECTEC 的研究人员开发了其他的泰国语支持项目,包括:

Virach Sornlertlamvanich: Thai support in Omega (Unicode-based TeX kernel), Thai in GNU. Emacs, machine translation, and many other NLP projects.
Surapant Meknavin: thailatex (babel-based), Thai search engine.
Phaisarn Charoenpornsawat: swath (Thai word break utility).
Theppitak Karoonboonyanan: Thai string collation , Thai Locale. (12)
The National Fonts Project: Standardized font design specification, three public domain vector fonts (Kinnari, Garuda and Norasi).

  • Virach Sornlertlamvanich:Omega(基于 Unicode 的 TeX 语言核心),GNU Emacs 泰国语支持,机器翻译,以及其他许多 NLP 项目。
  • Surapant Meknavin:thailatex(基于 babel 宏包),泰国语搜索引擎。
  • Phaisarn Charoenpornsawat:swath(泰国语分词工具)。
  • Theppitak Karoonboonyanan:泰国语字符串组合,泰国区域设置 (12)。
  • 国家字体计划:标准化字体设计规范,三种公共领域的向量字体(Kinnari,Garuda 和 Norasi)。

Another project worth mentioning is Linux SIS (School Internet Server) (13), which was initiated by NECTEC for use in the SchoolNet project (14). Although it is a server-centric distribution, it was during this project that another main task force for Thai FOSS localization was constituted. Through a mailing list for supporting its users, the volunteers agreed that another distribution for desktops was needed, and was feasible to develop, as almost all of the specialists mentioned above were there. This task was undertaken by the Thai Linux Working Group. A Web site (linux.thai.net) was created for general user support. A new GNU/Linux distribution called Linux TLE (Thai Language Extension) was created to collect, as comprehensively as possible, the existing works of Thai developers and package them for users.

另一项值得一提的工作是 Linux SIS(Linux 中小学校因特网服务器)(13),它由 NECTEC 提出,用于 SchoolNet 工程(14)中。虽然这是一个面向服务器应用的发行版本,但另一支泰国语自由/开源软件本地化的队伍在这个项目中得以建立。在用户支持邮件列表中,志愿者们认为还需要一个面向桌面的版本,而且在上述专家的帮助下开发也是可行的。这项任务由泰国 Linux 工作组承担。他们为一般用户支持工作建立了一个网站(linux.thai.net)。一个称为 Linux TLE(泰国语扩展)的 GNU/Linux 发行版本被开发出来以尽可能全面地纳入泰国开发者的本地化作品并将它们打包给用户使用。

Apart from being a tool for boosting the use of FOSS by Thai users, Linux TLE also provided a platform for development, and a test cycle where users could participate through bug reports. The ultimate goal was to improve Thai support for FOSS from the source. Therefore, getting patches checked-in to upstream projects was the final success indicator.

除了作为推进泰国用户使用自由/开源软件的工具,Linux TLE 也为开发提供了平台,以及用户可以参与报告错误的测试流程。其最终目标是从源代码水平增强自由/开源软件对泰国语的支持。因此,将补丁加入上游开发工程是评价成功的最终标准。

So far, lots of source code from TLWG and Linux TLE have been incorporated in upstream projects, including:

目前,许多来自 TLWG 和 Linux TLE 的源代码都已经被加入到上游工程中,包括:

Thai locale definition in GNU C library.
Thai keyboard maps in XFree86.
Thai XIM in XFree86.
Thai fonts in XFree86.
Thai Pango modules.
Thai string ordering in MySQL.
GTK+, GLib, Qt, KDE, Mozilla, Xpdf, etc.

  • GNU C 库中的泰国本地设置定义。
  • XFree86 中的泰国语键盘映射。
  • XFree86 中的泰国语 X 输入法。
  • XFree86 中的泰国语字体。
  • 泰国语 Pango 模块。
  • MySQL 中的泰国语字符串排序。
  • GTK+,GLib,Qt,KDE,Mozilla,Xpdf 等等。

In 2000, Linux TLE was handed over to NECTEC for maintenance. Three versions (3.0, 4.0 and 4.1) were
released and gained a lot of recognition from users in all parts of the country. A dedicated Web site was
created for it at opentle.org in 2003. TLWG continued to build its user and developer communities.

2000年,Linux TLE 被移交给 NECTEC 进行维护。发布的三个版本(3.0,4.0和4.1)都受到了全国各地用户的广泛赞誉。2003年项目有了专门的网站 opentle.org。TLWG 继续建设其用户和开发者社区。

Some of the TLWG projects that are hosted and maintained by the community are:

社区存放和维护的一些项目包括:

libthai (15): a library of Thai basic support routines, including character sets conversion, input/output methods, word break, string collation, etc. Some plug-ins for Pango GTK+ IM are also provided.
thaifonts-scalable (16): a collection of scalable fonts available to the public, plus some fonts developed in-house. All fonts are maintained and improved based on standard technical specifications.
thailatex (17): Babel-based Thai support for LaTeX document preparation, based on Surapant
Meknavin's work.

  • libthai (15):一个泰国语基本支持函数库,包括字符集转换,输入/输出方法,分词,字符串处理等功能。也提供了一些 Pango GTK+ 输入法的插件。
  • thaifonts-scalable (16):一套提供给公众使用的可缩放字体,以及一些个人开发的字体。所有字体都按照标准的技术要求维护和改进。
  • thailatex (17):基于 Babel 的 LaTeX 排版系统泰国语支持,由 Surapant Meknavin 的工作发展而来。

Other significant development efforts to boost the Thai environment in FOSS are Pladao (18) and OfficeTLE (19). Both projects aim to develop Thai support in OpenOffice.org. Pladao was initiated by Sun Microsystems (Thailand) and was subsidized by Algorithms Co. OfficeTLE was initiated and operated by NECTEC. Both of these projects are working in parallel, emphasizing different aspects. Pladao is feature-rich, while OfficeTLE emphasizes Thai language processing quality. Many hope that they will merge, possibly through upstream projects.

其他重要的推进了自由/开源软件的泰国语环境改善的开发工作还有 Pladao (18) 和 OfficeTLE (19)。两个计划的目的都是开发 OpenOffice.org 的泰国语支持。Pladao 是由升阳微系统(泰国)公司发起,由 Algorithms 公司赞助。OfficeTLE 是由 NECTEC 发起和运作。两个计划并行工作,强调不同的方面。Pladao 功能丰富,而 OfficeTLE 强调处理泰国语的质量。许多人希望这两个计划能够通过上游开发项目合并。

Obstacles

=阻碍=

These are the obstacles to the localization of FOSS in Thailand:

以下是泰国自由/开源软件本地化的主要障碍:

Too few developers. With the growth of the user base, the lack of FOSS developers to serve the growing requirements and expectations is a big problem. The premature growth of FOSS adoption in Thailand has unbalanced the community, both in terms of the ratio between users and developers, and in terms of unrealistic expectations with regard to price and freedom. Worse, the community existing developers have been fragmented because they are often employed by competing businesses. Proprietary competition has reduced the traditional cooperation responsible for the progress of FOSS in Thailand.

Misconception. If one wishes a movement to be initiated by the government, it is necessary for the government to have a good understanding of all of the concerns. The government must realize the benefits of FOSS as a means for developing technologies, as well as for improving the technical skills of the developers. Otherwise, it just becomes exploitative. According to some, even though the Thai government has popularized FOSS through the recent campaign for affordable PCs, they have also damaged public opinion of FOSS by their failure to provide proper support. Premature promotion of FOSS to uninitiated users when it is not ready would only create a bad impression. On the other hand, claiming that Microsoft's decision to lower the price of proprietary software is actually a success for FOSS, can be seen as exploiting FOSS as a negotiation tool. Moreover, "localization" tends to be perceived by the government as an activity that is about development of local GNU/Linux distributions, which is not true. Thus, many government policies simply miss the point and sometimes make things worse.

  • 开发者太少。随着用户群的增长,缺乏自由/开源软件开发者满足不断增长的需求和愿望成了一个大问题。泰国自由/开源软件的发展不成熟,使得社区的用户和开发者比例不平衡,价格及自由的制约也无法满足不切实际的期望。更糟糕的是,现有的开发者群体由于开发者被竞争的企业雇佣而变得分散。私企的竞争在泰国减少了自由/开源软件的发展所必须的传统的合作精神。
  • 误解。如果政府要牵头进行一场运动,它必须对所有牵涉的问题都有很好的理解。政府必须认识到自由/开源软件作为一种技术开发和增强开发者技能的方式的好处。否则,它就变得只知索取。据说,尽管泰国政府最近通过廉价 PC 的推广活动使自由/开源软件广为人知,他们无法提供适当的支持导致了公众对自由/开源软件的不佳看法。时机不成熟时对没有发动起来的用户推广不完善的自由软件只会造成坏印象。另一方面,宣称微软降低私有软件价格的决定是自由/开源软件的胜利,可以看作是将自由/开源软件当作谈判工具来利用。而且,“本地化”容易被政府错误地认为是开发自有的 GNU/Linux 发行版本。因此,许多政府的政策常常起不到应有的作用,或者使情况更糟。



4 Thai Language Audio Resource Center, `Some Historical Background of Thai Language'; available from thaiarc.tu.ac.th/thai/thai.htm
5 Koanantakool, T. and the Thai API Consortium, Computer and Thai Language, National Electronics and Computer Technology Center, 1987, in Thai.
6 Karoonboonyanan, T. and Koanantakool T., Standardization Activities and Open Source Movements in Thailand, Country Report, MLIT-4, Myanmar; also available at www.nectec.or.th/it-standards/mlit99/mlit99-country.html.
7 Tantsetthi, T., ‘Campaign for Internet-Standard-Conforming Thai Usage’; available from software.thai.net/tis-620.
8 Wongsaisuwan, M., `Introduction to ThaiTeX'; available from thaigate.nii.ac.jp/files/thaitex.pdf.
9 Thai Linux Working Group, `Thai LaTeX'; available from inux.thai.net/plone/TLWG/thailatex.
10 Ampornaramveth, V., `NACSIS R&D Thai Project Page'; available from thaigate.nii.ac.jp.
11 eucthai, `ZzzThai Project'; available from www.fedu.uec.ac.jp/ZzzThai/.
12 Karoonboonyanan, T., `Thai Sorting Algorithms'; available from linux.thai.net/thep/tsort.html; Karoonboonyanan, T., Raruenrom S. and Boonma P., `Thai-English Bilingual Sorting'; available from linux.thai.net/thep/blsort.html; Karoonboonyanan, T., `Thai Locale'; available from linux.thai.net/thep/th-locale.
13 National Electronics and Computer Technology Center, `Linux SIS: Linux School Internet Server'; available from www.nectec.or.th/linux-sis.
14 National Electronics and Computer Technology Center, `SchoolNet Thailand'; available from www.school.net.th.
15 Thai Linux Working Group, `LibThai Library'; available from linux.thai.net/plone/TLWG/libthai/ and libthai.sourceforge.net.
16 Thai Linux Working Group, `ThaiFonts-Scalable'; available from linux.thai.net/plone/TLWG/thaifonts_scalable/.
17 Thai Linux Working Group, `Thai LaTeX'; available from linux.thai.net/plone/TLWG/thailatex/.
18 See www.pladao.org.
19 See opentle.org/office-tle.


老挝语本地化

"Laonux" is the name of the Lao language version of GNU/Linux. This section deals with Laonux implementation. An overview of its success and obstacles to its development is also provided.

“Laonux” 是老挝语版本 GNU/Linux 的名字。这一节主要关注 Laonux 的实现,也提供了其开发过程的成功和困难的概述。

Traditionally, the Lao language and its literature have been written in two scripts, Lao and Tham. The Tham script is derived from the Lanna script (of present-day Chiang Mai and northern Thailand), which in turn originates from ancient Mon (20). The Lao language belongs to the Tai language family, which includes languages spoken in Assam, northern Myanmar, Thailand, Lao PDR, northern Viet Nam, and the Chinese provinces of Yunnan, Guizhou, and Guanxi (21). Lao script is believed to have originated from the Khmer writing system. It originated in the Grantha script, which was the southern form of the ancient Indian Brahmi writing system (22).

传统的老挝语言和文字用两种字符表示,老挝字符和 Tham 字符。Tham 字符是从 Lanna (现在的清迈和泰国北部)字符发展而来,而 Lanna 字符又是起源于古代 Mon 语言 (20)。老挝语属于 Tai 语族,该语族包括印度 Assam 邦,缅甸北部,泰国,老挝人民民主共和国、越南北部,以及中国的云南、贵州、广西使用的语言 (21)。一般认为老挝文字起源于库美尔书写系统。它的来源是 Grantha 文字,是古代印度 Brahmi 书写系统的南方变体 (22)。

Lao script shares characteristics with other South-East Asian writing systems. It follows complex rules of layout involving consonants, vowels, special symbols, conjuncts and ligatures. Spaces are not used to separate words, and vowels appear before and after, under and over consonants.

老挝文字与其他东南亚书写系统有类似的性质。它复杂的书写规则牵涉到辅音、元音、特殊符号、复合词和连写字符。单词间不用空格分隔,元音可能表示在辅音的前或后,以及上方或下方。

Obstacles and Successes

=困难和成功=

In localizing FOSS for the Lao script, the following tasks must be completed:

在自由/开源软件的老挝语本地化中,必须完成以下的任务:

Identifying the technical obstacles to be overcome.
Creating the world's first English/Lao technical dictionary.
Finding and coordinating technical volunteers.
Establishing de facto technical standards.
Performing and testing the work.
User training and education.
Identifying the sources of funding.

  • 发现需要克服的技术障碍。
  • 建立世界上第一个英语/老挝语技术词典。
  • 寻找并组织技术志愿者。
  • 建立事实上的技术标准。
  • 培训和教育用户。
  • 发现资金来源。

The work of investigating how GNU/Linux can be localized for Lao began in the summer of 1999 and proceeded slowly at first. Months of research and experimentation identified the technical difficulties and tools available to resolve them. Anousak Souphavanh, working from Rochester, New York, eventually formed a team of volunteers, teachers and students from the National University of Lao PDR to focus on translating KDE into Lao. The KDE desktop in Lao allows the end user to access the basic functionalities of a computer, including email, Web browser and office applications.

对 GNU/Linux 本地化如何进行的研究开始于1999年夏天,开始时进展缓慢。数个月的研究和实验发现了一些技术问题和解决问题的工具。在纽约州 Rochester 工作的 Anousak Souphavanh 最终建立起一支由志愿者以及老挝人民民主共和国国立大学师生组成的队伍,专注于把 KDE 翻译成老挝语。老挝语 KDE 桌面允许最终用户使用计算机的基本功能,包括 email,web 浏览器和办公室套件。

Until 2002, the work progressed slowly but steadily, aided largely by helpful and frequent advice from fellow FOSS enthusiasts worldwide. The community regularly provides invaluable information and assistance in tracking down documentation of technical issues, relating experiences in solving similar problems, or simply encouraging the volunteers in their efforts.

到2002年,在世界各地自由/开源软件支持者大量有益建议的帮助下,本地化工作进展虽然缓慢但稳定。社区在提供技术文档,相关问题的经验,或鼓励志愿者工作方面都提供了宝贵的信息和支持。

In 2002, the Jhai Foundation provided a small stipend to support the development of Laonux. As before, research had to be performed on the technical issues and tools, and more volunteers had to be found for the required translation and programming. The programming work was completed in 2003. However, the bulk of the translating remains undone. The major obstacle is the lack of an English/Lao technical dictionary. Without this, any translation will be inconsistent and confusing to users.

2002年,Jhai 基金会提供了一笔小额资金用于支持 Laonux 的开发。像以前一样,对技术问题和工具的研究仍在进行,而翻译和编程需要更多的志愿者。编程工作在2003年完成。但是大量的翻译工作还未结束。主要的困难是缺少一部英语/老挝语技术词典。没有它,翻译就不一致,容易使用户迷惑。

Upon completion of the dictionary, it is relatively simple to translate the remaining message strings and integrate them into the desktop environment. This work can be performed by either professional or student translators at a relatively low cost. But until the dictionary is completed, the work will continue at a snail's pace.

到词典完成时,翻译剩下的信息字符串并将它们整合到桌面环境中就比较容易。无论专业人员还是学生翻译者都可以以较低的成本完成这项工作。但是词典完成之前,工作只能以极其缓慢的速度进行。

Funding for the dictionary and translators is now being sought from international development agencies. In addition, funding for professional documentation, training and user education is required.

目前项目正在从国际开发组织寻找词典和翻译工作所需的资助。此外,也需要专业文档、培训和用户教育的资金支持。

Standardization

=标准化=

The lack of technical standards for the Lao script and its implementation in software remains a difficult challenge. The following standards must be completed and accepted officially:

缺乏老挝文字及其计算机实现的标准带来了巨大的挑战。官方必须完成并采纳以下的标准:

Character set.
Keyboard layout.
Unicode fonts.
Input methods.
Output methods.

  • 字符集。
  • 键盘布局。
  • Unicode 字体。
  • 输入法。
  • 输出方法。

To date this project has yielded some very useful de facto technical standards. Lao officials now recognize the vital importance of these standards and have adopted the goal as their own. These standards and the technical dictionary could dramatically speed up the ensuing. Until the majority of Lao developers agree to follow standards, further development will be hampered.

目前本地化计划已经产生了一些非常有用的技术标准。老挝官方现在认识到了这些标准的重要性并且设立了他们自己的目标。这些标准和技术词典可以极大地加速以上的目标。只有大部分的老挝开发者服从以上标准,今后的发展才不会受阻碍。

Localization

=本地化=

Team members have resolved the following issues. These solutions follow ISO standards and should be adopted by future localization efforts to avoid duplication of effort and incompatibility between systems.

团队成员已经解决了以下的问题。这些解决方案遵循 ISO 标准,应当在将来的本地化工作中利用,以避免重复劳动和系统的不兼容。

First, Unicode fonts were used instead of the existing fonts that override glyphs of English letters as per keyboard layout because these fonts are not standards compliant. Unicode is now the worldwide standard for FOSS developers. Also, development tools such as Kbabel and KDE require Unicode compatibility.

首先,用 Unicode 字体代替了现有的用英语字母替换为老挝语字符的键盘布局形式的字体,因为这些字体与标准不兼容。Unicode 现在是自由/开源软件开发的国际标准。而且,Kbabel 和 KDE 这样的开发工具也需要 Unicode 兼容性。

Laonux is a KDE localization, which is why the rendering of fonts is handled by the Qt libraries (as opposed to Pango or X Window). The Qt library file “Qt-qfont_X11.cpp” was modified slightly to point to the appropriate range for Lao. Initially Lao fonts were not rendered properly while stacking combined characters (i.e., consonant and a vowel). With the help of Theppitak Karoonboonyanan, patches were submitted to fix the rendering. Resolving these issues was a major breakthrough for Laonux development.

Laonux 使用本地化的 KDE 界面,因此字体的渲染是由 Qt 库来进行(而不是使用 Pango 或 X Window)。为了给老挝语字符安排合适的指针范围,Qt 库文件“Qt-qfont_X11.cpp”被开发者进行了少量修改。开始时堆叠字符(例如辅音和一个元音)的老挝语字体没有被正确渲染。在 Theppitak Karoonboonyanan 的帮助下,他们提交了补丁用来修正渲染问题。解决这些问题是 Laonux 开发的重大突破。

Inputs are handled by XKB, the keyboard map for X. It is used in converting key-strokes to key symbols defined in “X11/keysymdefs.h”, with language switching capability. XKB Lao keys were created with the appropriate Lao Unicode range. Finally, the GNU C library definition for Lao was created. This is mainly UTF-8 locale settings for date/time and related issues.

输入是由 X 键盘映射(XKB)处理。它被用来将击键动作根据“X11/keysymdefs.h”中的定义转换成符号,并具有选择语言的能力。开发者建立了覆盖老挝语 Unicode 范围的 XKB 键盘映射。最后,老挝语的 GNU C 开发库的定义也被建立。这主要包括处理日期/时间和其他相关问题的 UTF-8 区域设置。

Future Plans

=未来规划=

Continue work on Laonux.
Begin work on localizing OpenOffice.org.
Continue translation work for PHP Web portal tools.
Begin work on localizing Mozilla and other Web tools.
Train users and technical staff for localization projects.

  • 继续 Laonux 的工作。
  • 开始 OpenOffice.org 的本地化工作。
  • 继续 PHP Web 门户工具的翻译工作。
  • 开始本地化 Mozilla 和其他 Web 工具。
  • 培训本地化计划的用户和技术人员。

The first priority is to create the English/Lao technical dictionary. Whether this will be funded by governmental sources, grants or international aid is unclear. Without this dictionary, translating and localizing the software cannot be completed. Another priority is to continue cooperating with other regional software localization efforts, especially where they have already addressed similar issues.

首要的任务是建立英语-老挝语的技术词典。还不清楚这将由政府赞助,还是由其他基金或国际援助来资助。没有这部词典,软件的翻译和本地化无法完成。另一个需要优先进行的工作是继续与其他的地区性软件本地化工作合作,特别是那些已经遇到类似问题的项目。

Additional modifications and updates to ISO and Lao government IT standards are required for the long term. These include Input/Output methods, keyboard layout, collation, locales, and additional Lao OpenType Unicode fonts standards.

为长远打算,需要对 ISO 和老挝政府信息技术标准进行添加和更新。这些标准包括输入/输出方法,键盘布局,编码,区域设置,以及附加的老挝语 OpenType Unicode 字体标准。

Cooperation with universities for localization is ongoing. With most of the technical issues resolved, localization is now primarily a language issue. To avoid unnecessary anglicisms, professional linguists are needed. If technical professionals are involved in this process, they are likely to impose what they already know, which is English. It is far better for the future to adopt native language terms wherever practical. However, without funding, it is likely that a hodge-podge of anglicisms will prevail.

与大学的本地化合作正在进行。解决了大部分的技术问题后,本地化目前主要是一个语言问题。为避免不必要的西方腔调,需要专业的语言学家。参与本地化过程的技术专业人士倾向于使用他们所理解的语言,带有英语的色彩。如果可行,使用本土语言的词汇对将来更有好处。但是,没有资金的支持,产品难免会夹杂大量的英语腔调。

On the technical level, support for Lao script in IBM's ICU library is needed. ICU is the script support base for OpenOffice.org and for Java (released by IBM and Sun Microsystems). It is a complete library, including script rendering, layout, collation (sorting of words), line breaking, spell-checking, etc.

在技术层面上,需要 IBM 的 ICU 库对老挝语的支持。ICU 是 OpenOffice.org 和 Java (由 IBM 和升阳微系统公司发布)的字符支持库。它是一个包括字符渲染、外观、词语排序,断行,拼写检查等功能的完整的库。

Some difficult and important technical work has been done. However, full integration in ICU will force us to define all of these issues very clearly, and this will pave the way for the developers to create Lao applications in Java and C++. Without this, professional Lao software development can be difficult.

这需要完成一些困难而重要的工作。但是,对 ICU 的全面整合将迫使我们非常清晰地定义这些要素,而这将为使用 Java 和 C++ 的开发者铺平道路。做不到这点的话,专业的老挝语软件开发可能将十分困难。



20 See www.lan-xang.com/literature/lit_3.html.
21 Thai Language Audio Resource Center, `Some Historical Background of Thai Language'; available from thaiarc.tu.ac.th/thai/thai.htm.
22 See seasrc.th.net/font/alphabet.htm.


库美尔语本地化 (23)

Khmer Language

=库美尔语=

As in the case of Thai and Lao, Khmer script originates from the Grantha script, the south Indian form of the ancient Indian Brahmi writing system.

像泰国和老挝的文字一样,库美尔文字起源于一种古代印度 Brahmi 书写系统的南印度形式, Grantha 文字。

Khmer script follows complex rules of layout in which consonants may take two different forms (e.g., the small form is placed on a lower line if it immediately follows another consonant). Space is used not to separate words but to indicate a pause in reading (very much like a comma in English). Vowels pronounced after a consonant may appear before, after, above, below; before and after (formed by two glyphs); before and above; below and above; or under and after the consonant.

库美尔文字的排列规则比较复杂,辅音可以有两种形式(例如,当辅音紧跟另一个辅音时,采用小写形式写在较低处)。空格不仅表示分词,也表示读音的停顿(很像英语中的逗号)。在辅音后发音的元音可以出现在其前,后,上方,下方,或者同时出现在辅音前后(用两个符号表示),前方和上方,后方和上方,或下方和后方。

At present, the definition of the language is so poor that even the number of vowels in the language is not clear. The number of vowels in the official reference (the only available dictionary) is different from the number of vowels taught in schools. The reference dictionary is sorted phonetically, making a systematic collation algorithm that will follow the same order impossible. Words starting with the same consonant may be ordered under different listings depending on how that consonant is pronounced in that word. As in the Lao localization project, an English/Khmer technical dictionary is not available, and the lack of it severely hampers the efforts to translate software into the local language.

目前,这种语言的定义还很不严格,甚至元音的个数都尚不清楚。在官方的参考书(唯一的词典)中规定元音的个数与学校中教授的原因个数都不同。参考词典是按语音顺序排列的,使得系统的排列算法无法按照相同的顺序实现。以相同元音起头的单词可能因为其辅音读音的不同被编入不同的部分中。像老挝的本地化计划一样,由于缺乏英语/库美尔语技术词典,将软件翻译成本地语言的工作受到了严重的阻碍。

Obstacles and Successes

=困难和成功=

When the KhmerOS project was first being considered, the technical situation was as follows:

当最开始考虑库美尔OS项目时,技术方面的现状如下:

Khmer had already been included in Unicode. Fixed in 1996 by a team of people who had no contact with the Cambodian government, the definition in Unicode was later disputed, but to no avail. The Unicode Consortium refused to change anything, including the addition of necessary Khmer vowels, on the basis that these could be formed by combining other existing Khmer characters. The Consortium only permitted adding comments to the existing standard. The standard is now considered fixed by the Khmer government (in its 4.0).

Microsoft had published OpenType specifications for Khmer, and included the language in its Uniscribe complex text layout engine. MS Publisher worked very well in Khmer, but MS still did not handle either line-breaking or sorting. MS Word crashed quite often while using Khmer.

Some OpenType Khmer fonts already existed, though none in the public domain.

No FOSS programs were implemented in Khmer in the GNU/Linux environment, but some FOSS (such as Mozilla) worked well in Khmer under Windows, using the Microsoft Uniscribe engine.

Some people had been considering FOSS in Khmer, but the idea had not gone beyond mailing list discussions.

There has been an amazing proliferation of legacy (non-Unicode) fonts. Up to 26 different font encodings had been defined. They worked well enough under MS Word by modifying "normal.dot".

  • 库美尔语已被纳入 Unicode 当中。这个在1996年由一个与柬埔寨政府没有联络的小组制定的标准存在争议,但没有结果。Unicode 联合会拒绝做任何改变,包括增加必要的元音,理由是现有的库美尔字符可以通过合并表示这些元音。联合会只批准了对现有标准的批注。库美尔政府现在将标准(4.0版)视为修订完毕。
  • 微软发布了库美尔语的 OpenType 说明,并将这种语言包含于它的 Uniscribe 复杂文本显示引擎中。MS Publisher 软件能很好地处理库美尔语。但微软还不能处理断行或排序。使用库美尔语时 MS Word 经常崩溃。
  • 已经有了一些 OpenType 库美尔字体,但它们都不属于公共领域。
  • GNU/Linux 环境下没有使用库美尔语实现的自由/开源软件,但有些自由/开源软件(例如 Mozilla)在 Windows 下能很好地用库美尔语在微软 Uniscribe 引擎支持下工作。
  • 有人在考虑库美尔语自由/开源软件,但想法都停留在邮件列表中的讨论阶段。
  • 旧式(非 Unicode)字体的使用出人意料地广泛。有多达26种不同的字体编码。通过修改“normal.dot”它们在 MS Word 下工作得很好。

It is also important to understand the social situation, as follows:

了解以下的社会状况也很重要:

The lack of computers in the Khmer language increases the digital divide. Only people who speak English have access to jobs that require the use of computers.

Because computer interfaces are in English, the words used for computer-related work are also in English, introducing many `'anglicisms `'that could have been easily avoided if the software had been translated.

As people have to memorize new English words (in the menus), training for computer use takes a long time and it is soon forgotten if it is not used.

There is no English/Khmer technical dictionary.

The development of databases for governmental purposes is not possible, as no database management systems that handle either legacy or Unicode Khmer encodings have been developed.

In order to join the World Trade Organization, Cambodia has passed an intellectual property law that, when in force will, in theory, compel people to pay for proprietary software. FOSS allows developing countries to comply with the requirements of WTO without having to spend large amounts of money.

  • 缺乏使用库美尔语的计算机增大了数字鸿沟。只有会说英语的人才能获得需要使用计算机的工作机会。
  • 因为计算机使用英文界面,描述与计算机有关的工作的词也都使用英语,带来了许多通过翻译软件可以避免的“英语腔”。
  • 因为人们需要记住(软件菜单中)新的英语单词,计算机培训需要很长时间,而且不使用很快就会遗忘。
  • 没有英语-库美尔语的技术词典。
  • 不能开发政府使用的数据库,因为没有开发出能处理库美尔语旧式编码或 Unicode 编码的数据库管理系统。
  • 为了加入世界贸易组织(World Trade Organization, WTO),柬埔寨通过了一部知识产权法。一旦它生效,将迫使人们支付私有软件的费用。自由/开源软件允许发展中国家满足 WTO 的规定而不用花费大量资金。

Spanish computer scientist Javier Solá, who lives in Cambodia and has more experience with computers and strategy than with FOSS, has decided to see what can be done in this situation. He has started writing an ambitious project with the following deliverables:

居住在柬埔寨的西班牙计算机科学家 Javier Solá 对计算机和政策有着比对自由/开源软件更多的经验,他决定尝试在这样的现状下可以做些什么工作。他开始进行一项目标远大的计划,要提供以下的成果:

A full computer operating system, office suite and entertainment applications needed by an average computer user – entirely in the Khmer language. A user will see only the Khmer language script on his/her screen. The system will include full documentation in Khmer, in electronic and paper formats.

A “development library” (a set of programs) to be used by software developers to include support for the Khmer language in their applications; documentation of the development library.

A set of up to 50 computer fonts (Unicode OpenType fonts) to be used in the application menus, for word-processing or for computer design.

A keyboard layout, supporting drivers and 5,000 physical keyboards that support the abovementioned Unicode fonts.

5,000 copies of an installation disk that would be easy to use and would include all of the above software and documentation; with 1,000 of them accompanied by a full printed documentation.

Training materials addressed to end-users, including the use of the system and the applications, and a training course for typing with the new keyboard.
Computer end-user trainers trained to teach the new system using the above-mentioned materials: University professors, students and software development company personnel trained in advanced GNU/Linux and FOSS and application development using the Khmer script support tools provided by the project; computer vendor personnel trained in the installation of the system.

FOSS expertise centres in universities, including trained professors, students and computers with Internet connectivity.

Software development companies empowered to develop applications based on FOSS that require support for the Khmer language script.

Government personnel trained to use the system. Personnel expertise on software purchasing, coordination of applications between similar administrations, and analysis of priority applications for improved governance.

Marketing materials for deployment of the system.

Widespread publicity of the system directly or through computer and software vendors.

  • 一个一般用户需要的完整的计算机操作系统、办公套件和娱乐软件——全部使用库美尔语。用户在他/她的屏幕上只会看到库美尔文字。系统将包括用库美尔语写成的完整文档,包括电子版和纸上格式。
  • 供软件开发者在他们的程序中加入库美尔语支持时使用的一套“开发库”(一系列程序);开发库的文档。
  • 在应用程序菜单中使用,以及用于字处理和计算机设计的多达50种的计算机字体(Unicode OpenType 字体)。
  • 支持上述字体的一种键盘布局,对应的驱动程序和5,000个键盘。
  • 包括所有上述软件和文档的,容易使用的5,000份安装盘;其中1,000份附带完整的印刷版文档。
  • 发给最终用户使用的培训材料,包括系统和程序的使用,以及用新键盘打字的培训教程。
  • 培训最终用户的人员也要使用上述的材料接受培训,他们包括学会了 GNU/Linux 和自由/开源软件的高级使用以及用计划提供的库美尔文字支持工具进行软件开发的大学教授和学生,也包括学习过安装这套系统的计算机厂商工作人员。
  • 大学中的自由/开源软件技术中心,包括培训过的教授,学生和可以访问因特网的计算机。
  • 有能力开发基于自由/开源软件并支持库美尔文字的应用程序的软件开发公司。
  • 政府中接受过系统使用培训的人员。
  • 软件采购,协调类似机构的应用,以及为改进执政效能分析优先应用的个人技能。
  • 部署系统需要的市场推广材料。
  • 直接地或者通过计算机和软件厂商对系统进行广泛的宣传。

The project not only considers the final goal but also the order in which the programs will be translated. First Mozilla, an email client, and then OpenOffice.org, an office suite, will be released under MS Windows. Finally these will be included in a fully Khmer GNU/Linux release when the user interface is translated.

这个计划不仅考虑了最终目标,也考虑了翻译软件的顺序。首先是 email 客户端 Mozilla,然后是办公套件 OpenOffice.org,都在微软 Windows 下发布。最终这些软件在用户界面翻译完毕后都会被包括在完整的库美尔 GNU/Linux 发行版中。

After considering and later rejecting (for lack of real industry interest) the creation of an Industry Consortium, the project soon found a home in the local Open Forum of Cambodia, an NGO with a long history of supporting technologies for social purposes in Cambodia.

在考虑并放弃了成立行业联合会的想法(由于缺乏行业利益)之后,计划很快在本地的柬埔寨开放论坛安置下来,这是一个在柬埔寨长期支持有利社会的技术发展的非政府组织。

Sharing the idea of the project with anybody who would listen, setting up a Web site (www.khmeros.info), and initiating contact with various people interested in Khmer and computers, brought in the first volunteers.

通过与任何有兴趣的人分享计划的想法,设立专门网站(www.khmeros.info),以及与许多对库美尔语和计算机有兴趣的人们的初步接触,第一批志愿者参与到了计划中。

Typographer Danh Hong put into the public domain one of his Khmer OpenType fonts. This was an important preliminary step to get the FOSS community interested in this language.

字型设计师 Danh Hong 将他的一种库美尔语 OpenType 字体提供给公众。这是让自由/开源软件社区开始对这种语言感兴趣的重要的第一步。

Lin Chear, a Canadian engineer of Khmer origin, started looking at Pango and in a few days created the necessary programs to support Khmer in Gnome and its family of products. Khmer will be supported by version 1.6 of Pango. KDE followed soon after.

一位库美尔裔加拿大工程师 Lin Chear 开始研究 Pango 并且用几天时间开发了在 Gnome 和其相关产品中支持库美尔语的必要程序。1.6 版的 Pango 将提供对库美尔语的支持。KDE 紧随其后。

With the help of Lin Chear, the European maintainer of Qt, developed the routines needed for supporting Khmer in KDE and programs that use Qt for language support. Lin Chear applied a Pango patch to Mozilla and got a version of Mozilla working in Khmer on GNU/Linux.

在 Lin Chear 的帮助下,Qt 的欧洲维护者开发了在 KDE 和使用 Qt 作为语言支持的程序中的库美尔语支持。Lin Chear 在 Mozilla 上使用了一个 Pango 补丁并获得了一个能在 GNU/Linux 上用库美尔语工作的 Mozilla 版本。

Back in Phnom Penh, the KhmerOS project established itself in a small office at the Open Forum facilities. Using USD1,500 from the first small donations, they hired two computer scientists and with a couple of old donated computers, started creating a glossary of Khmer computer terms as preliminary work before starting the translation. A donation from a Hong Kong businessman would later permit the purchase of new computers.

在 Phnom Penh,KhmerOS 计划在开放论坛的设施中为自己建立了一个小办公室。他们用第一笔小额捐款中的1,500美元雇请了两个计算机科学家,加上一些捐助的旧计算机,开始建立库美尔语的计算机术语词典,作为翻译工作开始前的预备工作。来自一位香港商人的捐助将用于购买新的计算机。

Meanwhile, work and discussion regarding keyboards and collation algorithms are ongoing. Implementations in the Thai language show that dictionary based line-breaking can be done in languages that do not separate their words. Work on conversion of texts in legacy fonts to Unicode encoding has also started.

同时,关于键盘和排列算法的工作正在进行。泰国语的实现表明基于词典的断行可以用于没有分词标志的语言中。将旧式字体写成的文本转换成 Unicode 编码的工作也展开了。

“Evangelization” for Unicode is a necessary part of the project’s work. Large amounts of money are donated to Cambodia or used by NGOs for computer-related projects, such as creating a database of all Cambodian laws. If these projects are not done in Unicode, the databases will be useless in a few years.

Unicode 的“传教”工作也是计划的必要部分。大量的金钱被柬埔寨或非政府组织用于有关计算机的计划,比如建立所有柬埔寨法律的数据库。如果这些计划不使用 Unicode 完成,这些数据库在几年内就没有用了。

Future plans include increasing the project team to five translators with at least one of them a professional translator, and releasing email, browsing, word processing and spreadsheet programs in Khmer (Thunderbird and Firefox from Mozilla and OpenOffice.org Writer and Calc).

未来的规划包括将队伍增加到5名翻译人员,其中至少一人是专业翻译,并且发布库美尔语的 email,浏览器,字处理和电子表格软件(Mozilla 的 Thunderbird 和 Firefox 以及 OpenOffice.org 的 Writer 和 Calc)。

These programs will all be released on MS Windows (2000 and XP), and will have full documentation in Khmer, as well as basic training modules in Khmer to be used by computer training professionals. Training materials are important in order to fix the language and terminology used by teachers, assuring standardization and avoiding use of too many ‘’anglicisms’’.

这些软件将在 Windows (2000和 XP)上发布,并附带完整的库美尔语文档,以及电脑培训专业人士使用的用库美尔语教授的基本培训课程。在教师使用的培训材料中修正语言和术语是很重要的,这样能保证标准化并避免使用过多的“英语腔”。

The translation of Gnome or KDE (or both) began in 2004, as well as a series of minor applications that will allow the release of a complete GNU/Linux-based system in Khmer in the second half of 2005.

Gnome 和 KDE 以及一系列小程序的翻译都开始于2004年,这使得在2005年下半年发布完整的基于 GNU/Linux 的库美尔语操作系统成为可能。

Funding is still a major concern. Project speed is adjusted to suit funding, which comes either through grants or through contracts with corporations or other NGOs that need translated programs or services related to Khmer Unicode that can be provided by the project itself.

资金仍然是一个主要问题。计划的进度必须依照资金的情况调整。资金来自资助组织,或者那些需要库美尔语 Unicode 相关软件翻译的公司或非政府组织与计划签订的合同。

On the technical level, support is still required for Khmer script in IBM’s ICU library. Full integration in ICU will compel the project to define all such issues clearly, as well as open the way for developing Khmer applications in Java.

在技术层面上,仍然需要 IBM 的 ICU 库对库美尔文字的支持。完全整合到 ICU 中将迫使计划清晰地定义所有有关事项,也为用 Java 开发库美尔语程序提供了可能性。

Another concern is being able to use low-priced computers. In the Microsoft environment, Khmer Unicode will work only in Windows 2000 and XP, which require modern computers with a fair amount of memory and large hard disks. The way to low-priced (i.e., secondhand) Khmer computers will definitely have to be either through GNU/Linux or by getting Khmer script support in earlier versions of Windows such as Windows 98.

另一个问题是使用廉价的计算机。在微软的环境中,库美尔语 Unicode 只能在 Windows 2000 和 XP 下工作,这两个系统都需要配备大量内存和大硬盘的现代计算机。实现廉价的(例如二手的)库美尔语计算机必须通过 GNU/Linux 系统或者让库美尔语支持在 Windows 98 这样的早期 Windows 版本上工作。

The project’s distribution strategy is to try to have the software pre-installed by computer vendors and distributed by “two-dollars-a-CD” software vendors. This is to allow easy copying and wide distribution, thereby saving the project the cost of making copies.

计划的发行策略是让计算机厂商预装软件,以及通过“两美元一张盘”的软件供应商发行。这有助于方便的拷贝和广泛传播,因此为计划节省了制作拷贝的费用。

The government, under an IDRC grant, is pushing for the localization of Windows and MS Office. They have neither plans nor resources to get involved in FOSS localization, but are said to be interested in coordinating efforts so as not to duplicate work and to ensure that the Khmer language (collation, etc.) is implemented consistently in both platforms.

政府在 IDRC 的一项资助下正在推行 Windows 和微软 Office 的本地化。他们没有加入自由/开源软件本地化的计划和资源,但据说为避免重复劳动和正确一致地在两个平台上实现库美尔语(排列算法等等)支持,对合作有兴趣。

The project aims not to be sectarian about FOSS. It is better to have people using email in Khmer on Windows in the short run than to try to make them change to a completely new system abruptly. There are still two or three years before the intellectual property law is expected to be strongly enforced. This period can be used as transition time, assuring a gradual success rate, rather than trying to force an overnight change and thereby failing.

计划并不限于自由/开源软件。让人们短期内能够在 Windows 上用库美尔语使用电子邮件比试图让他们突然转换到一个全新的系统上更有好处。在知识产权法得到严格执行前还有两三年时间。这一阶段可以作为过渡,确保逐步的成功迁移,而不是强迫突然的改变而导致失败。



23 本案例由 Javier Sola 撰写。


本地化计划现状

A survey(24) of the status of localization projects was conducted by the PAN Localization Project(25) during the training on "Fundamentals of Local Language Computing" in Lahore, Pakistan in January 2004. Participants from 13 Asian countries were asked to provide information about the status of localization tasks in their countries. The survey revealed the status of standardization, localization and national policy for local language computing of the participants' countries. Please note that the informal survey technique employed gives only a rough picture of the actual situation.

一项关于本地化计划现状的调查 (24) 由 PAN 本地化计划 (25) 于2004年一月在巴基斯坦的 Lahore 进行的“本地语言计算基础”培训中完成。他们要求来自13个亚洲国家的学员提供自己国家本地化工作的状况信息。调查显示了学员所在国家本地语言计算的标准化、本地化以及国家政策情况。需要注意这种不正式的调查方式只能给出较粗略的图景。

Table 1 summarizes the survey on standardization for character set, keyboard layout, key-pad layout (e.g., for mobile phones), collation sequence, terminology translation and locale definition.

表1概括了关于字符集标准化,键盘布局,小键盘布局(例如手机使用的键盘),码表序列,术语翻译和区域设置定义的调查结果。

Table 1: PAN Survey: Local Standards
Country Language Char Set KBD Keypad Coll. Seq. Interface Term. Locale
Myanmar Burmese o * *
Cambodia Khmer o o
Mongolia Mongolian o o ? * * o
Lao PDR Lao o o
Nepal Nepali * * * * *
Sri Lanka Sinhalese o o * * *
Thailand Thai o o * * * o
Bhutan Dzongkha o o * *
China Tibetan o o ?
Japan Japanese o o o o o o
Bangladesh Bangla o ? * * o
Afghanistan Pashto o o ? *
Iran Farsi o o * * *
Parkistan Urdu o * * * *

o: complete; *: partially done; ?: don't know; blank: no work done.

表1 PAN 调查:本地化标准
国家 语言 字符集 键盘 小键盘 码表 界面术语 区域设置
缅甸 缅甸语 o * *
柬埔寨 库美尔语 o o
蒙古 蒙古语 o o ? * * o
老挝人民民主共和国 老挝语 o o
尼泊尔 尼泊尔语 * * * * *
斯里兰卡 僧伽罗语 o o * * *
泰国 泰国语 o o * * * o
不丹 Dzongkha o o * *
中国 藏语 o o ?
日本 日语 o o o o o o
孟加拉 孟加拉语 o ? * * o
阿富汗 普什图语 o o ? *
伊朗 波斯语 o o * * *
巴基斯坦 乌尔都语 o * * * *

o:完成 *:部分完成 ?:情况不明 空白:工作未进行。

From the table, character set and keyboard layout seems to be well defined for most countries, while other issues need more work.

从表中可以看出,大多数国家的字符集和键盘布局都有完善的定义,而其他的部分需要进一步的工作。

Table 2 summarizes the current status of localization of applications on the GNU/Linux platform, namely, keyboard input, fonts, sorting and find/replace utilities, natural language processing, spell checker, and thesaurus. It also shows whether any GNU/Linux distribution has been released in the country.

表2概括了 GNU/Linux 平台上应用程序本地化的现状,包括键盘输入,字体,排序和查找/替换程序,自然语言处理,拼写检查,以及词汇库。这张表也显示了该国是否发行了 GNU/Linux 的版本。

Table 2 PAN survey: Basic Localized Applications on GNU/Linux
Country Language KBD Driver Font Coll. Find/Replace NLP Spell Check Thes. Linux Distr.
Myanmar Burmese *
Cambodia Khmer
Mongolia Mongolian o o * * * ? o
Lao PDR Lao
Nepal Nepali o * *
Sri Lanka Sinhalese * o * *
Thailand Thai o * o o * * o
Bhutan Dzongkha
China Tibetan * * ?
Japan Japanese o o o o ? ? ? o
Bangladesh Bangla o * * ? *
Afghanistan Pashto * ? o
Iran Paisi o * o * o * * *
Pakistan Urdu * * o *

o: complete; *: partially done; ?: don't know; blank: no work done

表2 PAN 调查:GNU/Linux 上基本本地化的程序
国家 语言 键盘驱动 字体 码表 查找/替换 自然语言处理 拼写检查 词汇库 Linux发行版
缅甸 缅甸语 *
柬埔寨 库美尔语
蒙古 蒙古语 o o * * * ? o
老挝人民民主共和国 老挝语
尼泊尔 尼泊尔语 o * *
斯里兰卡 僧伽罗语 * o * *
泰国 泰国语 o * o o * * o
不丹 Dzongkha
中国 藏语 * * ?
日本 日语 o o o o ? ? ? o
孟加拉 孟加拉语 o * * ? *
阿富汗 普什图语 * ? o
伊朗 波斯语 o * o * o * * *
巴基斯坦 乌尔都语 * * o *

o: 完成;*:部分完成;?:不明;空白:工作未进行。

The survey shows that FOSS localization activities are taking place in many countries. Sharing expertise and resources among countries can boost progress in this area.

调查表明自由/开源软件本地化活动正在许多国家开展。在国家间分享技术和资源能加速这一地区的工作进程。

For message translation, a number of Asian language localization projects are underway. All of these projects are tracked in real time on the Internet. The following links provide the current status of some of these projects.

在信息翻译方面,许多亚洲语言本地化计划正在进行。所有这些计划都在因特网上被实时追踪。以下的链接提供了其中一些项目的当前情况。

As of June 2004, Japanese and Korean translators have already completed most of their initial work in FOSS localization. China is close behind, followed by India and the South-East Asian nations. The situation is highly fluid, however, and the above sites should be checked for detailed and updated information on the current status.

到2004年六月,日本和韩国的翻译人员已经完成了自由/开源软件本地化的大部分初始工作。中国紧随其后,后面则是印度和东南亚国家。但是本地化情况非常多变,需要访问上述网站获取关于现状的详细的更新信息。



24 Hussain, S. and Gul S., PAN Localization Project: A regional initiative to develop local language computing capacity in Asia, 2004.
25 See www.panl10n.net/.


自由/开源软件本地化的建议

亚洲自由/开源软件本地化的实现
本地化计划的技能和工具需求
自由/开源软件本地化的成本
本地化工作
附录A. 本地化关键概念
附录B. 本地化技术一览


亚洲自由/开源本地化的实现

亚洲是世界自由/开源软件本地化的主战场。垄断私有软件公司还没有在亚洲占据统治地位。出于国家利益的考虑,亚洲许多政府推行了鼓励多元的软件开发,特别是自由/开源软件的政策。

某些国家目前的本地化工作只限于散布在世界各地的少数热心人的成果。他们的工作很少得到报偿,而他们的翻译工作无组织的特点会产生意想不到的歧义。

大多数情况下,志愿者都是程序员而不是语言学家。他们需要得到翻译人员,科技作家和测试人员的帮助。因为本地化主要与语言而不是与程序有关,非技术人员人数应该达到程序员的五倍。在制定正式的自由/开源软件政策之前,技术词典和本地化标准的支持工作就可以开始。本地语言专家只需要普通职业人士的工资和办公室待遇。技术作家和测试人员可以在几个月内训练完毕。

支持非常相似的语言(如泰国语,老挝语和库美尔语)分享代码和技术资源的国际合作本地化是非常有效的。它对社会整体创造的价值是无法否定的——新词典和技术标准使得技术和翻译人员能够低成本的、一致地本地化任何自