Difference between revisions of "Localization and Internationalization"

m
Protected "Localization and Internationalization": Excessive spamming ([edit=autoconfirmed] (indefinite) [move=autoconfirmed] (indefinite))
 
m (Protected "Localization and Internationalization": Excessive spamming ([edit=autoconfirmed] (indefinite) [move=autoconfirmed] (indefinite)))
 
(90 intermediate revisions by 7 users not shown)
Line 101: Line 101:
==Globalization==
==Globalization==
The term globalization (G11N) is often used synonymously with internationalization.  But usually it encompasses both Internationalization and Localization. It is a process which involves design, implementation and localization.
The term globalization (G11N) is often used synonymously with internationalization.  But usually it encompasses both Internationalization and Localization. It is a process which involves design, implementation and localization.
== The Importance of Localization ==
Currently, people who want to use computers must first learn English. In a country with low [[w:Literacy|literacy rates]], this blocks access to [[w:Information technology|information and communications technologies]] (ICTs), especially for the rural poor and women who do not have equal access to education. Even after having learnt English, users must pay hundreds of dollars to license foreign software, or resort to widespread illegal copying of software, in order to gain access to ICTs. In short, access to information technology is one of the keys to development, and localized FOSS applications remain a crucial missing link in communications infrastructure.
[[w:Localization|Localization]] brings the following benefits:
# Significantly reduces the amount of training necessary to empower end-users to use a computer system.
# Facilitates the introduction of computer technology in [[w:Small and medium enterprises|Small and Medium Enterprises]] (SMEs).
# Opens the way for the development of computer systems for a country's national, provincial and district level administration that will allow civil servants to work entirely in the local language and manage databases of local language names and data.
# Facilitates the decentralization of data at provincial and district levels. The same applies to utility companies (electricity, water, telephone), who will develop local language databases, thereby reducing costs and giving better service to citizens.
# Allows citizens to communicate through e-mail in their own language.
# Empowers local software development companies to work for the administration, the public sector and private companies.
# Provides the local design industry with good fonts.
# Helps universities train more software engineers.
The beneficiaries of this multi-stakeholder project are:
# Directly, all local computer users, who will have easier access to the use of computers as they will not have to learn English first.
# Indirectly, through improvements in governance using native computer systems, all local citizens in the quality of their dealings with the administration.
# The local government who will have the opportunity to develop databases and applications in the local language. Sufficient technology and empowered local development companies will be available. The government will also have the tool to coordinate applications among similar administrations (e.g., provinces), so that IT-based improvements in governance can be made at the lowest possible cost.
# The [[w:Software industry|software industry]]. The government's use of standards-compliant computer technology encourages software companies to start developing compatible computer systems that will be used by the different bodies of the administration, thereby creating a stable software industry in the country. Once this expertise is developed (using FOSS), these companies will be empowered to undertake similar projects for foreign companies at extremely competitive prices, facilitating sales beyond the local market.
Source: http://en.wikibooks.org/wiki/FOSS_Localization/Introduction


==Culturally Biased wrong Assumptions==
==Culturally Biased wrong Assumptions==
Line 112: Line 134:
*'''Words contains consonants and vowels''':Arabic and Hebrew don't require vowels.
*'''Words contains consonants and vowels''':Arabic and Hebrew don't require vowels.


===Rules of Thumb for Software Internationalization===
==Rules of Thumb for Software Internationalization==
Internationalized software must enable easy porting to other locales. A locale defines language and specific cultural conventions. The process of adjusting internationalized software to a particular locale is called localization (a common acronym for this term is L10N). You can think of software internationalization as a prerequisite for localization. Localization consists of more than just translating the user interface. Consider North America and Britain, for instance. Seemingly, they use the same language. However, not only do these locales differ in spelling (program vs. programme, realize vs. realise, color vs. colour etc.), certain cultural conventions such date formatting (in Europe the date format is DDMMYYY whereas in north America it's MMDDYYYY), currency, and measurement system. Other locales exhibit additional cultural differences. In Germany and other European countries, the sign of a decimal fraction (also called radix) is a comma, e.g., 10,5. In North America, the radix is called "decimal point", and as the name suggests, it's represented as follows: 10.5.
Internationalized software must enable easy porting to other locales. A locale defines language and specific cultural conventions. The process of adjusting internationalized software to a particular locale is called localization (a common acronym for this term is L10N). You can think of software internationalization as a prerequisite for localization. Localization consists of more than just translating the user interface. Consider North America and Britain, for instance. Seemingly, they use the same language. However, not only do these locales differ in spelling (program vs. programme, realize vs. realise, color vs. colour etc.), certain cultural conventions such date formatting (in Europe the date format is DDMMYYY whereas in north America it's MMDDYYYY), currency, and measurement system. Other locales exhibit additional cultural differences. In Germany and other European countries, the sign of a decimal fraction (also called radix) is a comma, e.g., 10,5. In North America, the radix is called "decimal point", and as the name suggests, it's represented as follows: 10.5.


Line 124: Line 146:


==Locale==
==Locale==
A locale denotes a specific language along with its conventional information such as date, currency, calendar, number format etc.It also includes the following
A locale denotes a specific language along with its conventional information such as date, currency, calendar, number format etc.It also includes the following:
# Names of the months
# Names of the months
# Days of the week
# Days of the week
Line 136: Line 158:
===Script Type===
===Script Type===
'''Alphabetic''':
'''Alphabetic''':
Individual units  for writing are composed of consonants, and in some cases vowels. When compined they spell out words phonetically. Eg: Indic, Arabic, Latic, Greek etc.
Individual units  for writing are composed of consonants, and in some cases vowels. When combined they spell out words phonetically. Eg: Indic, Arabic, Latic, Greek etc.


'''Syllabic''':
'''Syllabic''':
Line 165: Line 187:
In Chinese and Japanese text is written vertically
In Chinese and Japanese text is written vertically


===Other Chracteristics===
===Other Characteristics===


'''Diacritics''':
'''Diacritics''':
Line 171: Line 193:


'''Word seperator''':
'''Word seperator''':
Most of the languages use space as word seperator.Exceptions are Chinese, Thai , and Japanese
Most of the languages use space as word separator. Exceptions are Chinese, Thai , and Japanese


'''Punctuation''':
'''Punctuation''':
Line 187: Line 209:
#[[wikipedia:Unicode | Wikipedia page on Unicode]]
#[[wikipedia:Unicode | Wikipedia page on Unicode]]
#[http://unicode.org Unicode.org]
#[http://unicode.org Unicode.org]
==Internationalized Resource Identifiers==
Internationalized Resource Identifiers (IRI) is also known as Multilingual Web Addresses.
Currently Web addresses are typically expressed using Uniform Resource Identifiers or URIs. This restricts Web addresses to a small number of characters: basically, just upper and lower case letters of the English alphabet, European numerals and a small number of symbols. Recent developments enable you to add non-ASCII characters to Web addresses.
Detailed information is available from [http://www.w3.org/International/articles/idn-and-iri/ An Introduction to Multilingual Web Addresses]


==Input Methods==
==Input Methods==
Line 261: Line 290:


==Related Links==
==Related Links==
*[http://en.wikibooks.org/wiki/FOSS_Localization FOSS Localization]
*[http://developer.gnome.org/doc/tutorials/gnome-i18n/developer.html Gnome L10N Guidelines for Developers by Christian Rose]
*[[wikipedia:Date and time notation by country|Date and time notation by country]]
*[[wikipedia:Date and time notation by country|Date and time notation by country]]
*[[wikipedia:List of languages by name|List of languages by name]]
*[[wikipedia:List of languages by name|List of languages by name]]
Line 268: Line 299:
*[[ಕನ್ನಡ| ಕನ್ನಡ (Kannada)]] - Kannada FOSS Team
*[[ಕನ್ನಡ| ಕನ್ನಡ (Kannada)]] - Kannada FOSS Team
*[[తెలుగు|తెలుగు (Telugu)]] - Telugu FOSS Team
*[[తెలుగు|తెలుగు (Telugu)]] - Telugu FOSS Team
  Feel free to edit this page and share your knowledge/experience in this subject - [[User:Santhosh|Santhosh Thottingal]]