Note: Currently new registrations are closed, if you want an account Contact us

Difference between revisions of "Localization and Internationalization"

From FSCI Wiki
Line 23: Line 23:
comment1, http://students.washington.edu/pnpga/forum/profile.php?id=59 buy reductil slimming pills,  3889, http://www.wirelessrerc.gatech.edu/SOT/sot-wireless-emergency-communication-forum/david-ross-answers-questions-from-the-audience/402014960 buy reductil,  >:), http://buyaccutane.blog.igg.com/ buy accutane online canada,  >:-))), http://www.airliners.net/profile/zithromax buy zithromax powder,  354907, http://www.airliners.net/profile/doxycycline buy doxycycline canada,  dwiwpp,
comment1, http://students.washington.edu/pnpga/forum/profile.php?id=59 buy reductil slimming pills,  3889, http://www.wirelessrerc.gatech.edu/SOT/sot-wireless-emergency-communication-forum/david-ross-answers-questions-from-the-audience/402014960 buy reductil,  >:), http://buyaccutane.blog.igg.com/ buy accutane online canada,  >:-))), http://www.airliners.net/profile/zithromax buy zithromax powder,  354907, http://www.airliners.net/profile/doxycycline buy doxycycline canada,  dwiwpp,


==Writing Sytems==
comment2, http://dvl.mc.maricopa.edu order viagra online, 23475,
A writing system, or script is not a language; it is a means of conveying information through written language.They can be classified as follows.
 
===Script Type===
'''Alphabetic''':
Individual units  for writing are composed of consonants, and in some cases vowels. When combined they spell out words phonetically. Eg: Indic, Arabic, Latic, Greek etc.
 
'''Syllabic''':
The individual units for writing are composed of syllables. Eg: Japanese kana and Korean Hangul
 
'''Ideographic''':
A writing system which uses pictures or symbols to represent words. Eg: Chinese
 
===Context dependent Glyph Shaping===
'''Positional''':
The shape of the character changes depending on the position in the word. Eg: Arabic greek.
 
'''Ligatures''':
Characters combine to form a different shape when they appear next to one another. In Indic scripts ligatures are mandatory.
 
'''Cursive''':
The letters are joined while writing.
Arabic is an example.But English is not of this kind.
===Text Direction===
 
'''Left to right''':
Text is written left to right horizontally. Eg: Indic, English
 
'''Bidirectional''':
Examples are Arabic and Hebrew.Text is written right to left while numbers and latin words are written left to right.
 
'''Vertical''':
In Chinese and Japanese text is written vertically
 
===Other Characteristics===
 
'''Diacritics''':
Special marks used for accents, tones, and vowels, or to uniquely identify a character. In some writing  systems such as Indic and Thai, diacritics can span multiple characters.
 
'''Word seperator''':
Most of the languages use space as word separator. Exceptions are Chinese, Thai , and Japanese
 
'''Punctuation''':
Marks are inconsistent across writing systems
 
'''A detailed description of above writing systems can be found at [[wikipedia:writing system|Wikipedia page on Writing Systems]]'''


==Unicode==
==Unicode==

Revision as of 15:33, 23 June 2010

Now the whole earth had one language and few words. And as men migrated from the east, they found a plain in the land of Shinar and settled there. And they said to one another,"Come, let us make bricks, and burn them thoroughly", And they had brick for stone, and bitumen for mortar. Then they said said, "Come, let us build ourselves a city, and a tower with its top in the heavens, and let us make a name for ourselves, lest we be scattered abroad upon the face of the whole earth"

And the Lord came down to see the city and the tower, which the sons of men had built. And the Lord said, "Behold, they are one people, and they have all one language;and this is only the beginning of what they will do; and nothing that they propose to do will now be impossible for them.Come, let us go down, and there confuse their language, that they may not understand one another's speech"...

-Genesis

comment5, http://videomaker.com/community/forums/profile/hydrochlorothiazide buy hydrochlorothiazide without prescription, xbf, http://videomaker.com/community/forums/profile/furosemide buy furosemide, 133937, http://people.ischool.berkeley.edu/~prateekk/drupal-6.14/?q=node/95 blood pressure medication, 8)), http://gravatar.com/vardenafil4 vardenafil hcl, jlo, http://gravatar.com/erectiledysfunction4 erectile dysfunction pills over the counter,  %-D,

comment5, http://b39.cal.pl/w0Bq673wa buy hydrochlorothiazide 25 mg, jmfwb, http://b39.cal.pl/u4Kh648hk buy lopressor online, 205912, http://live.gnome.org/Dorian%20Zyvox buy zyvox, 503709, http://sciencestage.com/levitra4 acheter levitra generique, 911899, http://sciencestage.com/cialis4 achat cialis 20mg, yzlrm,

comment1, http://www.computer.org/portal/web/tse/forum/-/message_boards/message/2753156 buy antibiotics without prescription, zmoij, http://www.computer.org/portal/web/tse/forum/-/message_boards/message/2734841 where to buy gabapentin, 49063, http://www.qbn.com/Tadacip/634052/ tadacip 20mg, 504, http://videomaker.com/community/forums/profile/ambien buy ambien without rx, 655072, http://videomaker.com/community/forums/profile/valium purchase valium online, pomhnv,

comment6, http://www.computer.org/portal/web/tse/forum/-/message_boards/message/2753156 buy antibiotics without prescription, gjdd, http://www.computer.org/portal/web/tse/forum/-/message_boards/message/2734841 buy gabapentin uk, soin, http://www.qbn.com/Tadacip/634052/ buy tadacip, >:-(((, http://videomaker.com/community/forums/profile/ambien buy ambien without rx, rltca, http://videomaker.com/community/forums/profile/valium purchase valium, =DD,

comment4, http://www.videojug.com/user/pepcid buy pepcid online, lty, http://www.videojug.com/user/lansoprazole buy lansoprazole, frffds, http://www.more.com/user/profile/25700 buy meridia without prescription, 2117, http://wiki.obgyn.net/account/Escitalopram buy escitalopram without prescription, =-PP, http://wiki.obgyn.net/account/Buspirone buspirone, =))),

comment4, http://www.codeplex.com/site/users/view/Female_Viagra female viagra pills, sudcq, http://www.codeplex.com/site/users/view/Avodart buy avodart without prescription,  %-O, http://bham.academia.edu/AzithromycinWithoutPrescription azithromycin without prescription, 04314, http://ru.gravatar.com/sildenafil4 sildenafil over the counter uk, oaoxz, http://en.gravatar.com/soma34 cheap soma online, kwio,

comment4, http://en.kioskea.net/communaute/profil-replica+watches replica watches for sale,  :[[[, http://en.kioskea.net/communaute/profil-breitling+replica+watches cheap breitling replica watches, mddi, http://www.guardian.co.uk/users/Vardenafil vardenafil 20mg tab, ihxel, http://www.threadless.com/profile/1288319/Replica_Handbags replica designer handbags, 052437, http://www.threadless.com/profile/1288326/Replica_Louis_Vuitton_Handbags replica louis vuitton handbags wholesale,  %-[,

comment1, http://videomaker.com/community/forums/profile/reductil reductil 15 mg prices,  %(, http://videomaker.com/community/forums/profile/betnovate buy betnovate scalp application, vwicfp, http://www.formspring.me/Lorazepam buy lorazepam online, 0990, http://www.drawingnow.com/Amoxicillin amoxicillin, 7267, http://www.drawingnow.com/Azithromycin azithromycin, 80056,

comment1, http://students.washington.edu/pnpga/forum/profile.php?id=59 buy reductil slimming pills, 3889, http://www.wirelessrerc.gatech.edu/SOT/sot-wireless-emergency-communication-forum/david-ross-answers-questions-from-the-audience/402014960 buy reductil, >:), http://buyaccutane.blog.igg.com/ buy accutane online canada, >:-))), http://www.airliners.net/profile/zithromax buy zithromax powder, 354907, http://www.airliners.net/profile/doxycycline buy doxycycline canada, dwiwpp,

comment2, http://dvl.mc.maricopa.edu order viagra online, 23475,

Unicode

Unicode is an industry standard designed to allow text and symbols from all of the writing systems of the world to be consistently represented and manipulated by computers. Developed in tandem with the Universal Character Set standard and published in book form as The Unicode Standard, Unicode consists of a character repertoire, an encoding methodology and set of standard character encodings, a set of code charts for visual reference, an enumeration of character properties such as upper and lower case, a set of reference data computer files, and rules for normalization, decomposition, collation and rendering.

The Unicode Consortium, the non-profit organization that coordinates Unicode's development, has the ambitious goal of eventually replacing existing character encoding schemes with Unicode and its standard Unicode Transformation Format (UTF) schemes, as many of the existing schemes are limited in size and scope and are incompatible with multilingual environments. Unicode's success at unifying character sets has led to its widespread and predominant use in the internationalization and localization of computer software. The standard has been implemented in many recent technologies, including XML, the Java (programming language)|Java programming language and modern operating systems.

More details at

  1. Wikipedia page on Unicode
  2. Unicode.org

Internationalized Resource Identifiers

Internationalized Resource Identifiers (IRI) is also known as Multilingual Web Addresses. Currently Web addresses are typically expressed using Uniform Resource Identifiers or URIs. This restricts Web addresses to a small number of characters: basically, just upper and lower case letters of the English alphabet, European numerals and a small number of symbols. Recent developments enable you to add non-ASCII characters to Web addresses.

Detailed information is available from An Introduction to Multilingual Web Addresses

Input Methods

Input methods are applications or software components that convert users key strokes into symbols, characters or words.

An input method editor (IME) is a program or operating system component that allows computer users to enter characters and symbols not found on their keyboard. This, for instance, allows the user of a Western keyboard to input Chinese, Japanese, Korean and Indic characters.

This is intended as a non-exhaustive list of input methods for UNIX platforms.


Name Languages supported Implementations supported
SCIM Multiple languages, including CJK GTK+ , Qt and XIM
uim Multiple languages, including CJK GTK+, Qt, XIM, Leim, Tty (Unix) and TSM (Mac OS X)
xcin Mainly for traditional Chinese; adapted for use for simplified Chinese. XIM
InputKing Traditional Chinese and simplified Chinese. Browser based.
im-ja Japanese GTK+ and XIM
kinput2 Japanese XIM, kinput2 protocol
ami Korean XIM
imhangul Korean GTK+
Nabi Korean XIM
qimhangul Korean Qt
xvnkb Vietnamese XIM
x-unikey Vietnamese XIM

Source: wikipedia page on Input Method Editor

Appendix

ISO codes for languages

Refer http://www.unicode.org/unicode/onlinedat/languages.html

Unicode Ranges

Refer the Unicode charts http://unicode.org/charts/

References

  • Java Internationalization, Andrew Deitsch and David Czarnecki, O'Reilly, First Edition,2001,p 1-15

Related Links

Indian Localization Efforts