13 May 2005 connolly   » (Master)

Grokking Triples from Spreadsheets

Sean notes that there are lots of triples in spreadsheets. Yup. After my Aug 2003 trip to Montreal for Extreme, I used Gnumeric as an RDF authoring tool to collect all the gas receipts and such. The Makefile has this stanza to convert the spreadsheet to RDF:

triplog.rdf: triplog.xml grokSheet.xsl
	$(XSLTPROC) --novalid grokSheet.xsl triplog.xml >$@

I haven't scrubbed the data, so this is somewhat incomplete as a demo.

Yes, this is another GRDDL-style transformation.
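For anyone who wants the gist without reading XSLT, here is a rough Python sketch of the same idea: each row becomes a subject, each column header a property, each cell a value. It is not what grokSheet.xsl does in detail. The namespace, the column names, and the sample rows are all made up for illustration, and it reads CSV rather than Gnumeric's XML.

```python
import csv
import io

# Hypothetical namespace; the real triplog vocabulary is not shown in this post.
NS = "http://example.org/trip#"

# Made-up sample rows standing in for the spreadsheet contents.
SHEET = """date,vendor,amount
2003-08-03,Station A,23.50
2003-08-04,Station B,31.10
"""

def rows_to_triples(text):
    """Yield (subject, predicate, object): one blank node per row, one property per column."""
    reader = csv.DictReader(io.StringIO(text))
    for n, row in enumerate(reader, start=1):
        subject = "_:row%d" % n
        for column, value in row.items():
            if value:  # an empty cell contributes no triple
                yield (subject, NS + column, value)

def to_ntriples(triples):
    """Serialize the triples as N-Triples lines with plain literal objects."""
    lines = []
    for s, p, o in triples:
        escaped = o.replace("\\", "\\\\").replace('"', '\\"')
        lines.append('%s <%s> "%s" .' % (s, p, escaped))
    return "\n".join(lines)

triples = list(rows_to_triples(SHEET))
print(to_ntriples(triples))
```

Blank nodes per row keep the sketch simple; if a column holds a natural key, minting a URI from it would make the rows linkable from elsewhere.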

A comment on Sean's blog said "don't forget RDBs". Of course not. See Relational Databases and the Semantic Web; I hope to update my implementation, dbview.py, to use SPARQL before too much longer.

Hmm... where are timbl's slides on RDF, trees, tables, and such?
