20 Sep 2000 cwryu   » (Journeyer)

Why CJKV language processing is an issue:

  • Multibyte encodings are used. They are incompatible with each other, as well as with iso-8859-N or other 8-bit encodings.
  • The number of "alphabets" are relatively big. Even no native Chinese person knows the exact number. For Korean, there are 19+21+27 alphabets but they can be "combined" into 19*21*(27+1) glyphs.
  • And, some American/European hackers' ASSUMPTION. They assumes iso-8859-1 encoding. They assumes linebreaks can occur between any bytes. In some cases CJK support brings inefficiency.

Latest blog entries     Older blog entries

New Advogato Features

FOAF updates: Trust rankings are now exported, making the data available to other users and websites. An external FOAF URI has been added, allowing users to link to an additional FOAF file.

Keep up with the latest Advogato features by reading the Advogato status blog.

If you're a C programmer with some spare time, take a look at the mod_virgule project page and help us with one of the tasks on the ToDo list!