Name: Roozbeh Pournader
Member since: 2001-02-04 18:09:16
Last Login: 2009-12-08 20:42:04
Notes: Just interested in internationalization and localization (for years). Living and working in Silicon Valley, California. You can read my curriculum vitae, if you like, to get some idea of what I have done. I also have a Persian blog called UTF-8. You can contact me at the address roozbeh gmail com.
If you wish to join us, information on events are at http://voices4iran.org/.
29 Jun 2009 (updated 29 Jun 2009 at 05:46 UTC) »
I lost at least three hours today finding about this, and I found about it by accident, because I had Calendrical Tabulations at hand and happened to look at the Chinese calendar column. There are several conflicting pieces of information on the internet here and there, which really confused me to the point that I thought the actual algorithm is not publicly available.
Just wanted to share a bit of my own experience with being overweight, losing a lot of it, and then gaining some of it back:
I highly recommend The Hacker’s Diet, available online for free. It is written by John Walker, of AutoCAD fame.
The very short book helped me lose about 15 kilos easily (and with no exercising) a few years ago. I have started to diet again these days, with a goal of losing about 30 pounds (almost the same amount, but I know live in the US).
Even if you hate diets and diet books, still read it. I would recommend reading it even if you are not overweight!
Footnote: The author of the book has made all the code he used in the book (with several updates) available as public domain code online. He also runs a server with the tools installed for public use, if you are the lazy type, like me. It's all here.
6 Mar 2009 (updated 6 Mar 2009 at 03:19 UTC) »
Font files don’t have that information directly. How would a font designer know that his font supports Arbuan Papiamento just fine, which uses a different orthography than Papiamento as written in Netherlands Antilles, for example? What about African or native American languages? Or Mongolian? Or Kurdish? He just designs and tests glyphs for characters and languages he is interested in. If the resulting font happens to support Filipino too, good for him and his users, if it doesn’t, he may not care. At best, a list of the languages the font designer believes the font is supporting may be found somewhere in the documentation.
In the present freedesktop stack, the language support detection task is done by fontconfig. When an application, like Firefox, wants to display text in some language, a text layout engine, like Pango, will ask fontconfig for a font that supports displaying text in the language (possibly with some other properties, like the font being bold and sans serif). fontconfig then uses its various font suggestion rules and orthography files to give the best font it can find back to the engine. If FontConfig doesn't know anything about the language, or has wrong information, it may give you something totally off, like a Latin or Devanagari font for a language written in the Arabic script.
What font designers may not know (or care about), fontconfig needs to know. The usual way of knowing, especially for not-very-famous fonts or languages, is through orthography files. These files contain a list of Unicode characters that play a letter-like role in the language. For example, for French, it is a list of basic Latin letters plus all the ligatures (like œ) and accented letters (like ï). fontconfig runs the list through each font installed on your machine and sees if it has glyphs for all the characters listed. If it does, the font is assumed to support the language.
Getting back to my own story, I thought of checking orthography files to see which languages my packaged fonts support. But when I looked into a few, I found several bugs and unsupported languages. Behdad encouraged me to fix them early, for a chance for them to get them into fontconfig 2.7.
During the past few weeks, I’ve been trying to hunt things down and fix them during my free time. I achieved my first target of matching glibc locales (those without ‘@’). I’m now on my second target of matching languages with two-letter codes; remaining are: Akan, Avestan, Cree, Ewe, Herero, Sichuan Yi, Javanese, Kanuri, Kongo, Kuanyama, Luba-Katanga, Nauru, Navajo, North Ndebele, Ndonga, Ojibwa, Pali, Quechua, Rundi, Sango, Shona, Sundanese, Tahitian, and Zhuang. After that, there are thousands of languages with three letter codes, which would need an army the size of SIL International.
Everything I did is in my git tree here. If you want to help, file bugs with your findings at http://bugs.freedesktop.org/. You can also check out the existing orthography bugs to avoid duplication.
roozbeh certified others as follows:
Others have certified roozbeh as follows:
[ Certification disabled because you're not logged in. ]
FOAF updates: Trust rankings are now exported, making the data available to other users and websites. An external FOAF URI has been added, allowing users to link to an additional FOAF file.
Keep up with the latest Advogato features by reading the Advogato status blog.
If you're a C programmer with some spare time, take a look at the mod_virgule project page and help us with one of the tasks on the ToDo list!