Freshmeat page: http://freshmeat.net/projects/sitescooper/
Sitescooper is a sophisticated offline reader for the web; it snarfs news from lots of popular news sites, reformats it and trims off superfluous HTML and images, maintains a "last-read" database of articles, and can convert into formats suitable for reading on low memory/small screen hardware such as Palm handhelds. Sitescooper is written in Perl.
This project has the following developers:
New HTML Parser: The long-awaited libxml2 based HTML parser code is live. It needs further work but already handles most markup better than the original parser.
Keep up with the latest Advogato features by reading the Advogato status blog.
If you're a C programmer with some spare time, take a look at the mod_virgule project page and help us with one of the tasks on the ToDo list!