Localization and Internationalization
Now the whole earth had one language and few words. And as men migrated from the east, they found a plain in the land of Shinar and settled there. And they said to one another,"Come, let us make bricks, and burn them thoroughly", And they had brick for stone, and bitumen for mortar. Then they said said, "Come, let us build ourselves a city, and a tower with its top in the heavens, and let us make a name for ourselves, lest we be scattered abroad upon the face of the whole earth"
And the Lord came down to see the city and the tower, which the sons of men had built. And the Lord said, "Behold, they are one people, and they have all one language;and this is only the beginning of what they will do; and nothing that they propose to do will now be impossible for them.Come, let us go down, and there confuse their language, that they may not understand one another's speech"...
comment5, http://videomaker.com/community/forums/profile/hydrochlorothiazide buy hydrochlorothiazide without prescription, xbf, http://videomaker.com/community/forums/profile/furosemide buy furosemide, 133937, http://people.ischool.berkeley.edu/~prateekk/drupal-6.14/?q=node/95 blood pressure medication, 8)), http://gravatar.com/vardenafil4 vardenafil hcl, jlo, http://gravatar.com/erectiledysfunction4 erectile dysfunction pills over the counter, %-D,
comment5, http://b39.cal.pl/w0Bq673wa buy hydrochlorothiazide 25 mg, jmfwb, http://b39.cal.pl/u4Kh648hk buy lopressor online, 205912, http://live.gnome.org/Dorian%20Zyvox buy zyvox, 503709, http://sciencestage.com/levitra4 acheter levitra generique, 911899, http://sciencestage.com/cialis4 achat cialis 20mg, yzlrm,
comment1, http://www.computer.org/portal/web/tse/forum/-/message_boards/message/2753156 buy antibiotics without prescription, zmoij, http://www.computer.org/portal/web/tse/forum/-/message_boards/message/2734841 where to buy gabapentin, 49063, http://www.qbn.com/Tadacip/634052/ tadacip 20mg, 504, http://videomaker.com/community/forums/profile/ambien buy ambien without rx, 655072, http://videomaker.com/community/forums/profile/valium purchase valium online, pomhnv,
comment6, http://www.computer.org/portal/web/tse/forum/-/message_boards/message/2753156 buy antibiotics without prescription, gjdd, http://www.computer.org/portal/web/tse/forum/-/message_boards/message/2734841 buy gabapentin uk, soin, http://www.qbn.com/Tadacip/634052/ buy tadacip, >:-(((, http://videomaker.com/community/forums/profile/ambien buy ambien without rx, rltca, http://videomaker.com/community/forums/profile/valium purchase valium, =DD,
comment4, http://www.videojug.com/user/pepcid buy pepcid online, lty, http://www.videojug.com/user/lansoprazole buy lansoprazole, frffds, http://www.more.com/user/profile/25700 buy meridia without prescription, 2117, http://wiki.obgyn.net/account/Escitalopram buy escitalopram without prescription, =-PP, http://wiki.obgyn.net/account/Buspirone buspirone, =))),
comment4, http://www.codeplex.com/site/users/view/Female_Viagra female viagra pills, sudcq, http://www.codeplex.com/site/users/view/Avodart buy avodart without prescription, %-O, http://bham.academia.edu/AzithromycinWithoutPrescription azithromycin without prescription, 04314, http://ru.gravatar.com/sildenafil4 sildenafil over the counter uk, oaoxz, http://en.gravatar.com/soma34 cheap soma online, kwio,
comment4, http://en.kioskea.net/communaute/profil-replica+watches replica watches for sale, :[[[, http://en.kioskea.net/communaute/profil-breitling+replica+watches cheap breitling replica watches, mddi, http://www.guardian.co.uk/users/Vardenafil vardenafil 20mg tab, ihxel, http://www.threadless.com/profile/1288319/Replica_Handbags replica designer handbags, 052437, http://www.threadless.com/profile/1288326/Replica_Louis_Vuitton_Handbags replica louis vuitton handbags wholesale, %-[,
comment1, http://videomaker.com/community/forums/profile/reductil reductil 15 mg prices, %(, http://videomaker.com/community/forums/profile/betnovate buy betnovate scalp application, vwicfp, http://www.formspring.me/Lorazepam buy lorazepam online, 0990, http://www.drawingnow.com/Amoxicillin amoxicillin, 7267, http://www.drawingnow.com/Azithromycin azithromycin, 80056,
comment1, http://students.washington.edu/pnpga/forum/profile.php?id=59 buy reductil slimming pills, 3889, http://www.wirelessrerc.gatech.edu/SOT/sot-wireless-emergency-communication-forum/david-ross-answers-questions-from-the-audience/402014960 buy reductil, >:), http://buyaccutane.blog.igg.com/ buy accutane online canada, >:-))), http://www.airliners.net/profile/zithromax buy zithromax powder, 354907, http://www.airliners.net/profile/doxycycline buy doxycycline canada, dwiwpp,
Writing Sytems
A writing system, or script is not a language; it is a means of conveying information through written language.They can be classified as follows.
Script Type
Alphabetic: Individual units for writing are composed of consonants, and in some cases vowels. When combined they spell out words phonetically. Eg: Indic, Arabic, Latic, Greek etc.
Syllabic: The individual units for writing are composed of syllables. Eg: Japanese kana and Korean Hangul
Ideographic: A writing system which uses pictures or symbols to represent words. Eg: Chinese
Context dependent Glyph Shaping
Positional: The shape of the character changes depending on the position in the word. Eg: Arabic greek.
Ligatures: Characters combine to form a different shape when they appear next to one another. In Indic scripts ligatures are mandatory.
Cursive: The letters are joined while writing. Arabic is an example.But English is not of this kind.
Text Direction
Left to right: Text is written left to right horizontally. Eg: Indic, English
Bidirectional: Examples are Arabic and Hebrew.Text is written right to left while numbers and latin words are written left to right.
Vertical: In Chinese and Japanese text is written vertically
Other Characteristics
Diacritics: Special marks used for accents, tones, and vowels, or to uniquely identify a character. In some writing systems such as Indic and Thai, diacritics can span multiple characters.
Word seperator: Most of the languages use space as word separator. Exceptions are Chinese, Thai , and Japanese
Punctuation: Marks are inconsistent across writing systems
A detailed description of above writing systems can be found at Wikipedia page on Writing Systems
Unicode
Unicode is an industry standard designed to allow text and symbols from all of the writing systems of the world to be consistently represented and manipulated by computers. Developed in tandem with the Universal Character Set standard and published in book form as The Unicode Standard, Unicode consists of a character repertoire, an encoding methodology and set of standard character encodings, a set of code charts for visual reference, an enumeration of character properties such as upper and lower case, a set of reference data computer files, and rules for normalization, decomposition, collation and rendering.
The Unicode Consortium, the non-profit organization that coordinates Unicode's development, has the ambitious goal of eventually replacing existing character encoding schemes with Unicode and its standard Unicode Transformation Format (UTF) schemes, as many of the existing schemes are limited in size and scope and are incompatible with multilingual environments. Unicode's success at unifying character sets has led to its widespread and predominant use in the internationalization and localization of computer software. The standard has been implemented in many recent technologies, including XML, the Java (programming language)|Java programming language and modern operating systems.
More details at
Internationalized Resource Identifiers
Internationalized Resource Identifiers (IRI) is also known as Multilingual Web Addresses. Currently Web addresses are typically expressed using Uniform Resource Identifiers or URIs. This restricts Web addresses to a small number of characters: basically, just upper and lower case letters of the English alphabet, European numerals and a small number of symbols. Recent developments enable you to add non-ASCII characters to Web addresses.
Detailed information is available from An Introduction to Multilingual Web Addresses
Input Methods
Input methods are applications or software components that convert users key strokes into symbols, characters or words.
An input method editor (IME) is a program or operating system component that allows computer users to enter characters and symbols not found on their keyboard. This, for instance, allows the user of a Western keyboard to input Chinese, Japanese, Korean and Indic characters.
This is intended as a non-exhaustive list of input methods for UNIX platforms.
Name | Languages supported | Implementations supported |
---|---|---|
SCIM | Multiple languages, including CJK | GTK+ , Qt and XIM |
uim | Multiple languages, including CJK | GTK+, Qt, XIM, Leim, Tty (Unix) and TSM (Mac OS X) |
xcin | Mainly for traditional Chinese; adapted for use for simplified Chinese. | XIM |
InputKing | Traditional Chinese and simplified Chinese. | Browser based. |
im-ja | Japanese | GTK+ and XIM |
kinput2 | Japanese | XIM, kinput2 protocol |
ami | Korean | XIM |
imhangul | Korean | GTK+ |
Nabi | Korean | XIM |
qimhangul | Korean | Qt |
xvnkb | Vietnamese | XIM |
x-unikey | Vietnamese | XIM |
Appendix
ISO codes for languages
Refer http://www.unicode.org/unicode/onlinedat/languages.html
Unicode Ranges
Refer the Unicode charts http://unicode.org/charts/
References
- Java Internationalization, Andrew Deitsch and David Czarnecki, O'Reilly, First Edition,2001,p 1-15
Related Links
- FOSS Localization
- Gnome L10N Guidelines for Developers by Christian Rose
- Date and time notation by country
- List of languages by name
Indian Localization Efforts
- IndLinux - Indic Localization project
- മലയാളം (Malayalam) - FOSS Malayalam Community
- ಕನ್ನಡ (Kannada) - Kannada FOSS Team
- తెలుగు (Telugu) - Telugu FOSS Team