Created 11 Jul 2010 at 09:25 UTC by mindcrime.
Homepage: http://code.google.com/p/heceta
Freshmeat page: http://freshmeat.net/projects/heceta
Heceta is part of the ScrewPile suite of tools for building intelligent applications. It is a search engine that draws on information from Neddick, Quoddy, and "Other" to provide deeper, more insightful search results than simple document content analysis can. Intranet search in organizations is usually very poor, largely because PageRank-style algorithms don't work well when there are few links between documents. By supplementing content analysis with scoring based on tags, social graph connections, activity-stream information, etc., and applying machine-learning and artificial-intelligence techniques, Heceta aims to do a better job of locating the knowledge and information a user needs. This is not a totally novel idea; it is sometimes referred to as Social Search.
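To make the idea of supplementing content analysis with other signals concrete, here is a minimal sketch in Python. It is not Heceta's actual code: the `Signals` fields, the weights, and the function names are all hypothetical, and a simple weighted sum stands in for whatever machine-learning techniques the project applies.

```python
from dataclasses import dataclass
from typing import Dict, List


@dataclass
class Signals:
    """Hypothetical per-document signals, each normalized to [0, 1]."""
    content: float   # text-match score from ordinary content analysis
    tags: float      # overlap between query terms and the document's tags
    social: float    # closeness of the document's author to the searcher
    activity: float  # recent activity-stream events involving the document


# Assumed weights for illustration only; a real system might learn these.
WEIGHTS = {"content": 0.5, "tags": 0.2, "social": 0.2, "activity": 0.1}


def combined_score(s: Signals) -> float:
    """Blend the content score with the social signals as a weighted sum."""
    return (WEIGHTS["content"] * s.content
            + WEIGHTS["tags"] * s.tags
            + WEIGHTS["social"] * s.social
            + WEIGHTS["activity"] * s.activity)


def rank(docs: Dict[str, Signals]) -> List[str]:
    """Return document ids ordered from highest to lowest combined score."""
    return sorted(docs, key=lambda doc_id: combined_score(docs[doc_id]),
                  reverse=True)


if __name__ == "__main__":
    docs = {
        "wiki-page": Signals(content=0.9, tags=0.0, social=0.0, activity=0.0),
        "team-note": Signals(content=0.6, tags=1.0, social=1.0, activity=0.5),
    }
    print(rank(docs))
```

In this sketch a document with a weaker text match can still outrank a stronger one when its tags, author, and recent activity are relevant to the searcher, which is the effect the description attributes to Social Search.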
License: Apache License 2.0