
Localization and Internationalization

From FSCI Wiki

Now the whole earth had one language and few words. And as men migrated from the east, they found a plain in the land of Shinar and settled there. And they said to one another, "Come, let us make bricks, and burn them thoroughly." And they had brick for stone, and bitumen for mortar. Then they said, "Come, let us build ourselves a city, and a tower with its top in the heavens, and let us make a name for ourselves, lest we be scattered abroad upon the face of the whole earth."

And the Lord came down to see the city and the tower, which the sons of men had built. And the Lord said, "Behold, they are one people, and they have all one language; and this is only the beginning of what they will do; and nothing that they propose to do will now be impossible for them. Come, let us go down, and there confuse their language, that they may not understand one another's speech"...

-Genesis

Writing Systems

A writing system, or script, is not a language; it is a means of conveying information through written language. Writing systems can be classified as follows.

Script Type

Alphabetic: Individual units for writing are composed of consonants, and in some cases vowels. When combined they spell out words phonetically. Eg: Indic, Arabic, Latin, Greek, etc.

Syllabic: The individual units for writing are composed of syllables. Eg: Japanese kana. Korean Hangul is also written in syllable blocks, although each block is built from alphabetic letters.

Ideographic: A writing system which uses pictures or symbols to represent words. Eg: Chinese.

Context dependent Glyph Shaping

Positional: The shape of the character changes depending on its position in the word. Eg: Arabic, and Greek (the final sigma).

Ligatures: Characters combine to form a different shape when they appear next to one another. In Indic scripts ligatures are mandatory.

Cursive: The letters are joined while writing. Arabic is an example; printed English is not.
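
The following is a minimal Python sketch of how positional shaping and ligatures show up at the code point level. It uses only the standard `unicodedata` module. The sample characters (Arabic letter BEH, the Greek word ΟΔΟΣ and the Devanagari conjunct KSSA) are chosen here for illustration and are not taken from this page. Actual glyph selection is done by the font and the rendering engine, not by this code.

```python
import unicodedata

# Positional shaping: Unicode keeps compatibility code points for the
# contextual shapes of the Arabic letter BEH (U+0628). Their decomposition
# tags name the position: isolated, final, initial, medial.
beh_forms = {}
for cp in (0xFE8F, 0xFE90, 0xFE91, 0xFE92):
    ch = chr(cp)
    beh_forms[cp] = unicodedata.decomposition(ch)
    print(f"U+{cp:04X} {unicodedata.name(ch)} -> {beh_forms[cp]}")

# Greek sigma has a distinct word-final shape; Python's lower() applies it.
lowered = "\u039F\u0394\u039F\u03A3".lower()  # "ΟΔΟΣ"
print(lowered, [f"U+{ord(c):04X}" for c in lowered])

# Ligature: the Devanagari conjunct KSSA is stored as three code points
# (KA + VIRAMA + SSA) but is normally drawn as a single shape.
conjunct = "\u0915\u094D\u0937"
print(conjunct, len(conjunct), [unicodedata.name(c) for c in conjunct])
```

In normal text only the base letter (U+0628) is stored, and the renderer picks the shape from context.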

Text Direction

Left to right: Text is written left to right horizontally. Eg: Indic, English

Bidirectional: Examples are Arabic and Hebrew. Text is written right to left, while numbers and Latin words are written left to right.

Vertical: Chinese and Japanese text can be written vertically, top to bottom.
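
Every Unicode character carries a bidirectional class that layout software uses to decide direction. The sketch below reads that class with Python's standard `unicodedata.bidirectional`. The `SAMPLES` table and the `bidi_class` helper are names invented for this illustration, and the code only reports the property; it does not implement the full bidirectional algorithm.

```python
import unicodedata

# Hypothetical sample set: one character per direction-related class.
SAMPLES = {
    "Latin letter A": "A",
    "Hebrew letter alef": "\u05D0",
    "Arabic letter beh": "\u0628",
    "European digit 1": "1",
    "Arabic-Indic digit 1": "\u0661",
}

def bidi_class(ch):
    """Return the Unicode bidirectional class of a single character."""
    return unicodedata.bidirectional(ch)

for label, ch in SAMPLES.items():
    # L = left to right, R / AL = right to left, EN / AN = numbers
    print(f"{label}: {bidi_class(ch)}")
```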

Other Characteristics

Diacritics: Special marks used for accents, tones, and vowels, or to uniquely identify a character. In some writing systems such as Indic and Thai, diacritics can span multiple characters.

Word separator: Most languages use a space as the word separator. Exceptions are Chinese, Thai, and Japanese.

Punctuation: Punctuation marks differ across writing systems.
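
The diacritics and word separator points can be tried out in Python. This is a small sketch using the standard `unicodedata` module; the sample strings are picked here for illustration. It shows that an accented letter can be stored as a base letter plus a combining mark, and that splitting on whitespace does not find word boundaries in Thai.

```python
import unicodedata

# A precomposed "e with acute" decomposes into "e" + COMBINING ACUTE ACCENT.
composed = "\u00E9"
decomposed = unicodedata.normalize("NFD", composed)
print([unicodedata.name(c) for c in decomposed])

# Combining marks have a non-zero combining class or a mark category (Mn, Mc).
acute_class = unicodedata.combining("\u0301")
vowel_sign_category = unicodedata.category("\u093F")  # DEVANAGARI VOWEL SIGN I
print(acute_class, vowel_sign_category)

# Whitespace splitting works for English but not for Thai, which is written
# without spaces between words.
english_words = "most languages use spaces".split()
thai_words = "\u0E20\u0E32\u0E29\u0E32\u0E44\u0E17\u0E22".split()  # "Thai language"
print(len(english_words), len(thai_words))
```

Segmenting Thai, Chinese or Japanese text into words needs a dictionary or statistical segmenter, which is outside the scope of this sketch.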

A detailed description of the above writing systems can be found on the Wikipedia page on Writing Systems.

Unicode

Unicode is an industry standard designed to allow text and symbols from all of the writing systems of the world to be consistently represented and manipulated by computers. Developed in tandem with the Universal Character Set standard and published in book form as The Unicode Standard, Unicode consists of a character repertoire, an encoding methodology and set of standard character encodings, a set of code charts for visual reference, an enumeration of character properties such as upper and lower case, a set of reference data computer files, and rules for normalization, decomposition, collation and rendering.
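
Some of the character properties mentioned above can be inspected from Python's standard `unicodedata` module. The `properties` helper and the three sample characters are assumptions made for this sketch; the values come from whichever Unicode version the Python build ships with.

```python
import unicodedata

def properties(ch):
    """Collect a few Unicode properties of a single character."""
    return {
        "code_point": f"U+{ord(ch):04X}",
        "name": unicodedata.name(ch, "<unnamed>"),
        "category": unicodedata.category(ch),  # e.g. Ll, Lu, Lo
        "uppercase": ch.upper(),
        "lowercase": ch.lower(),
    }

# Latin small a, German sharp s, Devanagari letter A
for ch in ("a", "\u00DF", "\u0905"):
    print(properties(ch))
```

Note that case mapping is not always one to one: the uppercase of the German sharp s is the two-letter string "SS", and Devanagari has no case at all.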

The Unicode Consortium, the non-profit organization that coordinates Unicode's development, has the ambitious goal of eventually replacing existing character encoding schemes with Unicode and its standard Unicode Transformation Format (UTF) schemes, as many of the existing schemes are limited in size and scope and are incompatible with multilingual environments. Unicode's success at unifying character sets has led to its widespread and predominant use in the internationalization and localization of computer software. The standard has been implemented in many recent technologies, including XML, the Java programming language and modern operating systems.
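
To make the UTF schemes concrete, the sketch below encodes the same characters in UTF-8, UTF-16 and UTF-32 using Python's built-in codecs. The `encodings` helper and the sample characters are illustrative choices, not part of any standard API. Big-endian variants are used so that no byte order mark is added.

```python
def encodings(ch):
    """Return the code point and the UTF-8/16/32 bytes (hex) of a character."""
    return {
        "code_point": f"U+{ord(ch):04X}",
        "utf8": ch.encode("utf-8").hex(),
        "utf16": ch.encode("utf-16-be").hex(),
        "utf32": ch.encode("utf-32-be").hex(),
    }

# Latin A, Devanagari letter A, and a character outside the Basic
# Multilingual Plane (U+1F600), which needs a surrogate pair in UTF-16.
for ch in ("A", "\u0905", "\U0001F600"):
    print(encodings(ch))
```

The same code point takes one to four bytes in UTF-8, two or four bytes in UTF-16, and always four bytes in UTF-32.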

More details at

  1. Wikipedia page on Unicode
  2. Unicode.org

Internationalized Resource Identifiers

Internationalized Resource Identifiers (IRIs) are also known as Multilingual Web Addresses. Currently, Web addresses are typically expressed using Uniform Resource Identifiers (URIs). This restricts Web addresses to a small set of characters: basically, just the upper and lower case letters of the English alphabet, European numerals and a small number of symbols. Recent developments make it possible to use non-ASCII characters in Web addresses.
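
A rough sketch of how an IRI can be mapped to an ASCII-only URI is given below, using only the Python standard library. The `iri_to_uri` function is a hypothetical helper written for this page, and the example address is made up. It assumes the host name is converted with Python's built-in `idna` codec (which follows the older IDNA 2003 rules) and that the remaining parts are percent-encoded as UTF-8. Real browsers and libraries handle many more cases.

```python
from urllib.parse import quote, urlsplit, urlunsplit

def iri_to_uri(iri):
    """Convert an IRI to an ASCII-only URI (simplified illustration)."""
    parts = urlsplit(iri)
    # Host name: each label becomes Punycode with an "xn--" prefix.
    # Simplification: user:password@ information is dropped.
    host = (parts.hostname or "").encode("idna").decode("ascii")
    if parts.port is not None:
        host += f":{parts.port}"
    # Other components: non-ASCII characters become %XX escapes of their
    # UTF-8 bytes. "%" is kept so existing escapes are not encoded twice.
    path = quote(parts.path, safe="/%")
    query = quote(parts.query, safe="=&%")
    fragment = quote(parts.fragment, safe="%")
    return urlunsplit((parts.scheme, host, path, query, fragment))

print(iri_to_uri("http://b\u00FCcher.example/wiki/\u65E5\u672C\u8A9E"))
```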

Detailed information is available from An Introduction to Multilingual Web Addresses

Input Methods

Input methods are applications or software components that convert users' keystrokes into symbols, characters or words.

An input method editor (IME) is a program or operating system component that allows computer users to enter characters and symbols not found on their keyboard. This, for instance, allows the user of a Western keyboard to input Chinese, Japanese, Korean and Indic characters.
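
The core idea of a keystroke-to-character conversion can be sketched in a few lines of Python. This is a toy illustration only, not how SCIM or any other real input method is implemented: the `ROMAJI_TO_KANA` table is a tiny hypothetical mapping from Latin keystrokes to Japanese hiragana, and `convert` is a made-up helper that applies a greedy longest-match rule.

```python
# Hypothetical, deliberately tiny keystroke table (Latin keys -> hiragana).
ROMAJI_TO_KANA = {
    "a": "\u3042",   # あ
    "i": "\u3044",   # い
    "ka": "\u304B",  # か
    "ki": "\u304D",  # き
    "na": "\u306A",  # な
    "ni": "\u306B",  # に
}

def convert(keystrokes, table=ROMAJI_TO_KANA):
    """Replace the longest matching key sequences with their characters."""
    longest = max(len(key) for key in table)
    output = []
    i = 0
    while i < len(keystrokes):
        # Try the longest candidate first, then shorter ones.
        for size in range(min(longest, len(keystrokes) - i), 0, -1):
            chunk = keystrokes[i:i + size]
            if chunk in table:
                output.append(table[chunk])
                i += size
                break
        else:
            # No entry matched: pass the keystroke through unchanged.
            output.append(keystrokes[i])
            i += 1
    return "".join(output)

print(convert("kana"))  # two keystroke pairs become two hiragana characters
```

Real input methods add candidate selection, dictionaries and integration with the toolkit (GTK+, Qt, XIM), none of which is modelled here.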

The following is a non-exhaustive list of input methods for UNIX platforms.


Name | Languages supported | Implementations supported
--- | --- | ---
SCIM | Multiple languages, including CJK | GTK+, Qt and XIM
uim | Multiple languages, including CJK | GTK+, Qt, XIM, Leim, Tty (Unix) and TSM (Mac OS X)
xcin | Mainly for traditional Chinese; adapted for use for simplified Chinese | XIM
InputKing | Traditional Chinese and simplified Chinese | Browser based
im-ja | Japanese | GTK+ and XIM
kinput2 | Japanese | XIM, kinput2 protocol
ami | Korean | XIM
imhangul | Korean | GTK+
Nabi | Korean | XIM
qimhangul | Korean | Qt
xvnkb | Vietnamese | XIM
x-unikey | Vietnamese | XIM

Source: Wikipedia page on Input Method Editor

Appendix

ISO codes for languages

Refer to http://www.unicode.org/unicode/onlinedat/languages.html

Unicode Ranges

Refer to the Unicode charts at http://unicode.org/charts/
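
As a small illustration of how such ranges are used, the sketch below looks up which block a character falls in. The `BLOCKS` table lists only five blocks selected for this example and `block_of` is a hypothetical helper; the complete list is in the charts linked above.

```python
# A handful of Unicode blocks as (first, last) code points, inclusive.
BLOCKS = {
    "Hebrew": (0x0590, 0x05FF),
    "Arabic": (0x0600, 0x06FF),
    "Devanagari": (0x0900, 0x097F),
    "Malayalam": (0x0D00, 0x0D7F),
    "Hiragana": (0x3040, 0x309F),
}

def block_of(ch):
    """Return the name of the listed block containing ch, or None."""
    cp = ord(ch)
    for name, (first, last) in BLOCKS.items():
        if first <= cp <= last:
            return name
    return None

for ch in ("\u0905", "\u0D2E", "\u05D0", "A"):
    print(f"U+{ord(ch):04X}", block_of(ch))
```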

References

  • Andrew Deitsch and David Czarnecki, Java Internationalization, First Edition, O'Reilly, 2001, pp. 1-15

Related Links

Indian Localization Efforts