Created 25 Nov 2006 at 03:17 UTC by StevenRainwater.Homepage: http://www.rainwaterreptileranch.org/steve/sw/odp/
odp2db is a collection of Perl programs that can be used to parse Open Directory Project (ODP) data dumps and insert the data into an SQL database. Both the structure.rdf.u8 and content.rdf.u8 files are parsed. A minimal table structure is included that is suitable for loading the database but probably not useful for any real work. The XML::Parse and DBI Perl modules are required. I developed this for use with PostgreSQL but have tried to stick with standard ANSI SQL as much as possible so it should work with MySQL and anything else supported by DBI with only very minimal changes.
License: GNU GPL
This project has the following developers:
New HTML Parser: The long-awaited libxml2 based HTML parser code is live. It needs further work but already handles most markup better than the original parser.
Keep up with the latest Advogato features by reading the Advogato status blog.
If you're a C programmer with some spare time, take a look at the mod_virgule project page and help us with one of the tasks on the ToDo list!