Freshmeat page: http://freshmeat.net/projects/wv/?topic_id=849
wv is a library to parse microsoft word documents. Supported are word 6, 95, 97 and 2000. It is used by abiword among others as the base of its word import ability
wv comes with a sample program named wvHtml which converts word to html documents. The wvHtml.xml file is configurable to modify the translations of word features to html, so much so that wvLaTeX.xml and wvGroff.xml which are also included can be used to get latex and groff translations of the word documents. These two files need more work and as I know neither groff or much latex help would be appreciated
The project home page has an online conversion utility to test out wv without compiling it for yourself
This project has the following developers:
New HTML Parser: The long-awaited libxml2 based HTML parser code is live. It needs further work but already handles most markup better than the original parser.
Keep up with the latest Advogato features by reading the Advogato status blog.
If you're a C programmer with some spare time, take a look at the mod_virgule project page and help us with one of the tasks on the ToDo list!