Freshmeat page: http://freshmeat.net/projects/xml2
These are simple tools for converting XML and HTML to and from a line-oriented format designed for processing with classic Unix pipeline tools (grep, sed, awk, cut, sh, etc.). See the home page for some cute examples. They are best suited for "quick and dirty" transformations and extractions of XML or HTML data.
The xml2 utilities resemble Sean McGrath's Pyxie in spirit, but use a significantly different encoding that (I believe) offers more "context" for line-oriented tools to chew on.
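To make the idea of a line-oriented encoding concrete, here is a rough Python sketch that flattens a small XML document into one `path=value` line per text node or attribute. It is only an illustration: the function name `flatten`, the sample document, and the exact line syntax (slash-separated paths, `@` for attributes) are assumptions made for this example, and the real xml2 output may differ in details such as how repeated siblings or mixed content are handled.

```python
import xml.etree.ElementTree as ET


def flatten(xml_text):
    """Return a list of 'path=value' lines for an XML document.

    Hypothetical helper for illustration; not the xml2 implementation.
    Text that follows a child element (ElementTree's "tail") is ignored.
    """
    root = ET.fromstring(xml_text)
    lines = []

    def walk(elem, path):
        here = f"{path}/{elem.tag}"
        # One line per attribute, marked with '@'.
        for name, value in elem.attrib.items():
            lines.append(f"{here}/@{name}={value}")
        text = (elem.text or "").strip()
        if text:
            lines.append(f"{here}={text}")
        elif not elem.attrib and len(elem) == 0:
            # Empty leaf element: emit the bare path so it is not lost.
            lines.append(here)
        for child in elem:
            walk(child, here)

    walk(root, "")
    return lines


# Made-up sample document.
doc = (
    '<feed>'
    '<entry id="1"><title>Hello</title></entry>'
    '<entry id="2"><title>World</title></entry>'
    '</feed>'
)

for line in flatten(doc):
    print(line)
```

Because every line carries its full path, a filter such as `grep '^/feed/entry/title='` followed by `cut -d= -f2-` would pull out the titles without any XML-aware tooling, which is the kind of "context" a path-per-line encoding gives to line-oriented tools.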