<?xml version="1.0"?>
<rss version="2.0">
  <channel>
    <title>Advogato blog for mhausenblas</title>
    <link>http://www.advogato.org/person/mhausenblas/</link>
    <description>Advogato blog for mhausenblas</description>
    <language>en-us</language>
    <generator>mod_virgule</generator>
    <pubDate>Sun, 26 May 2013 09:13:54 GMT</pubDate>
    <item>
      <pubDate>Sun, 24 Mar 2013 17:12:30 GMT</pubDate>
      <title>Cloud Cipher Capabilities</title>
      <link>http://www.advogato.org/person/mhausenblas/diary.html?start=67</link>
      <guid>http://webofdata.wordpress.com/2013/03/24/cloud-encryption-support/</guid>
      <description>&lt;p&gt;
  &lt;em&gt;&#x2026; or, the lack of it.&lt;/em&gt;
&lt;/p&gt;
&lt;p&gt;A recent discussion at a customer made me having a closer look around support for encryption in the context of XaaS cloud service offerings as well as concerning Hadoop. In general, this can be broken down into over-the-wire (cf. SSL/&lt;a href="http://en.wikipedia.org/wiki/Transport_Layer_Security" &gt;TLS&lt;/a&gt;) and back-end encryption. While the former is widely used, the latter is rather seldom to find.&lt;/p&gt;
&lt;p&gt;Different reasons might exits why one wants to encrypt her data, ranging from preserving a competitive advantage to end-user privacy issues. No matter why someone wants to encrypt the data, the question is do systems support this (transparently) or are developers forced to code this in the application logic.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;IaaS-level&lt;/strong&gt;. Especially in this category, file storage for app development, one would expect wide support for built-in encryption.&lt;/p&gt;
&lt;ul&gt;&lt;li&gt;Amazon&#x2019;s S3 indeed provides &lt;a href="http://aws.amazon.com/about-aws/whats-new/2011/10/04/amazon-s3-announces-server-side-encryption-support/" &gt;server-side support for encryption&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;Google Storage &lt;a href="https://developers.google.com/storage/docs/developer-guide" &gt;does not encrypt files&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;Same for Rackspace&#x2019;s Cloud Files &#x2013; &lt;a href="http://www.rackspace.com/knowledge_center/product-faq/cloud-files" &gt;no encryption&lt;/a&gt;, ATM&lt;/li&gt;
&lt;li&gt;As well as for Microsoft&#x2019;s Azure storage &#x2013; &lt;a href="http://social.msdn.microsoft.com/Forums/en-US/windowsazuresecurity/thread/d8b461bd-87c4-4552-99ed-aab9faa16457" &gt;not encrypting files&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;And last but not least, HP Cloud&#x2019;s Object Storage is in good company by &lt;a href="https://docs.hpcloud.com/api/object-storage" &gt;not supporting encryption&lt;/a&gt;&lt;/li&gt;
&lt;/ul&gt;&lt;p&gt;On the &lt;strong&gt;PaaS level&lt;/strong&gt; things look pretty much the same: for example, &lt;a href="http://aws.amazon.com/elasticbeanstalk/" &gt;AWS Elastic Beanstalk&lt;/a&gt; provides no support for encryption of the data (unless you consider S3) and concerning Google&#x2019;s App Engine, &lt;a href="http://stackoverflow.com/questions/6040673/encrypting-user-data-on-app-engine" &gt;good practices for data encryption&lt;/a&gt; only seem to emerge.&lt;/p&gt;
&lt;p&gt;Offerings on the &lt;strong&gt;SaaS level&lt;/strong&gt; provide an equally poor picture:&lt;/p&gt;
&lt;ul&gt;&lt;li&gt;Dropbox offers encryption &lt;a href="https://www.dropbox.com/help/27/en" &gt;via S3&lt;/a&gt;.&lt;/li&gt;
&lt;li&gt;Google Drive and Microsoft Skydrive seem to not offer any encryption options for storage.&lt;/li&gt;
&lt;li&gt;Apple&#x2019;s iCloud is a notable exception: not only does it provide support but also &lt;a href="http://support.apple.com/kb/ht4865" &gt;nicely explains it&lt;/a&gt;.&lt;/li&gt;
&lt;li&gt;For many if not most of the above SaaS-level offerings there are plug-ins that enable encryption, such as provided by &lt;a href="http://www.syncdocs.com/how-to-set-up-google-drive-encryption/" &gt;Syncdocs&lt;/a&gt; or &lt;a href="http://www.cloudfogger.com/en/" &gt;CloudFlogger&lt;/a&gt;&lt;/li&gt;
&lt;/ul&gt;&lt;p&gt;In &lt;strong&gt;Hadoop-land&lt;/strong&gt; things also look rather &lt;a href="http://stackoverflow.com/questions/7649936/using-encryption-with-hadoop" &gt;sobering&lt;/a&gt;; there are few activities around making HDFS or the likes do encryption such as &lt;a href="https://launchpad.net/ecryptfs" &gt;ecryptfs&lt;/a&gt; or &lt;a href="http://www.gazzang.com/encrypt-hadoop" &gt;Gazzang&#x2019;s&lt;/a&gt; offering. Last but not least: for Hadoop in the cloud, encryption is available via AWS&#x2019;s EMR by using S3.&lt;/p&gt;
&lt;br/&gt;Filed under: &lt;a href="http://webofdata.wordpress.com/category/big-data/" &gt;Big Data&lt;/a&gt;, &lt;a href="http://webofdata.wordpress.com/category/cloud-computing/" &gt;Cloud Computing&lt;/a&gt;, &lt;a href="http://webofdata.wordpress.com/category/fyi/" &gt;FYI&lt;/a&gt;  &lt;a href="http://feeds.wordpress.com/1.0/gocomments/webofdata.wordpress.com/856/" &gt;
  &lt;img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/webofdata.wordpress.com/856/"/&gt;&lt;/a&gt; &lt;img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=webofdata.wordpress.com&amp;amp;blog=6169642&amp;amp;post=856&amp;amp;subd=webofdata&amp;amp;ref=&amp;amp;feed=1" width="1" height="1"/&gt;</description>
    </item>
    <item>
      <pubDate>Sun, 10 Mar 2013 14:09:04 GMT</pubDate>
      <title>Elephant filet</title>
      <link>http://www.advogato.org/person/mhausenblas/diary.html?start=66</link>
      <guid>http://webofdata.wordpress.com/2013/03/10/elephant-filet/</guid>
      <description>&lt;p&gt;End of January I participated in a panel discussion on &lt;a href="https://www.ciscolive365.com/connect/sessionDetail.ww?SESSION_ID=6169" &gt;Big Data&lt;/a&gt;, held during the &lt;a href="http://www.computerweekly.com/guides/Cisco-Live-London-2013" &gt;CISCO live&lt;/a&gt; event in London. One of my fellow panelists, I believe it was Sean of CISCO, said there something along the line:&lt;/p&gt;
&lt;blockquote&gt;
  &lt;p&gt;&#x2026; ideally the cluster is at 99% utilisation, concerning CPU, I/O, and network &#x2026;&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;This stuck in my head and I gave it some thoughts. In the following I will elaborate a bit on this in the context of where Hadoop is used in a shared setup, for example in &lt;a href="http://webofdata.wordpress.com/2012/11/08/hosted-mapreduce-hadoop/" &gt;hosted offerings&lt;/a&gt; or, say, within an enterprise that runs different systems such as Storm, Lucene/Solr, and Hadoop on one cluster.&lt;/p&gt;
&lt;p&gt;In essence, we witness two competing forces: from the perspective of a single user who expects performance vs. the view of the cluster owner or operator who wants to optimise throughput and maximise utilisation. If you&#x2019;re not familiar with these terms you might want to read up on Cary Millsap&#x2019;s Thinking Clearly About Performance (&lt;a href="http://cacm.acm.org/magazines/2010/9/98033-thinking-clearly-about-performance-part-1/fulltext" &gt;part 1&lt;/a&gt; | &lt;a href="http://cacm.acm.org/magazines/2010/10/99486-thinking-clearly-about-performance-part-2/fulltext" &gt;part 2&lt;/a&gt;).&lt;/p&gt;
&lt;p&gt;Now, in such as shared setup we may experience a spectrum of loads: from compute intensive over I/O intensive to communication intensive, illustrated in the following, not overly scientific figure:&lt;br/&gt;&lt;a href="http://webofdata.files.wordpress.com/2013/03/shared-hadoop.png" &gt;&lt;img class="aligncenter size-large wp-image-844" alt="Utilisations" src="http://webofdata.files.wordpress.com/2013/03/shared-hadoop.png?w=750&amp;amp;h=498" width="750" height="498"/&gt;&lt;/a&gt;&lt;/p&gt;
&lt;p&gt;Here are a some observations and thoughts for potential starting points of deeper research or experiments.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Multitenancy&lt;/strong&gt;. We see more and more deployments that require &lt;a href="http://www.mapr.com/company/press-releases/mapr-enables-hadoop-as-a-service-with-multi-tenancy-security-and-end-to-end-management" &gt;strong&lt;/a&gt; &lt;a href="http://www.slideshare.net/treasure-data/hadoop-meets-cloud-with-multitenancy-16107610" &gt;support&lt;/a&gt; &lt;a href="http://extremehadoop.wordpress.com/tag/multitenant/" &gt;for&lt;/a&gt; multitenancy; check out the &lt;a href="http://hadoop.apache.org/docs/r1.1.1/capacity_scheduler.html" &gt;CapacityScheduler&lt;/a&gt;, learn from &lt;a href="http://developer.yahoo.com/blogs/hadoop/posts/2010/08/apache_hadoop_best_practices_a/" &gt;best practices&lt;/a&gt; or use a distribution that natively supports the specification of &lt;a href="http://www.mapr.com/doc/display/MapR/Node+Topology" &gt;topologies&lt;/a&gt;. Additionally, you might still want to keep an eye on &lt;a href="http://serengeti.cloudfoundry.com/" &gt;Serengeti&lt;/a&gt; &#x2013; VMware&#x2019;s Hadoop virtualisation project &#x2013; that seems to have gone quiet in the past months, but I still have hope for it.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Software Defined Networks (SDN)&lt;/strong&gt;. See Wikipedia&#x2019;s &lt;a href="http://en.wikipedia.org/wiki/Software-defined_networking" &gt;definition&lt;/a&gt; for it, it&#x2019;s not too bad. CISCO, for example, is &lt;a href="http://blogs.cisco.com/tag/software-defined-networking/" &gt;very active&lt;/a&gt; in this area and only recently there was a special issue in the recent IEEE Communications Magazine (&lt;a href="http://dl.comsoc.org/comsocdl/?publication=TOC2631&amp;amp;label=IEEE%20Communications%20Magazine%2C%202013%20February" &gt;February 2013&lt;/a&gt;) covering SDN research. I can perfectly see &#x2013; and indeed this was also briefly discussed on our CISCO live panel back in January &#x2013; how SDN can enable new ways to optimise throughput and performance. Imagine a SDN that is dynamically workload-aware in the sense of that it knows the difference of a node that runs a &lt;a href="http://wiki.apache.org/hadoop/TaskTracker" &gt;task tracker&lt;/a&gt; vs. a &lt;a href="http://wiki.apache.org/hadoop/DataNode" &gt;data node&lt;/a&gt; vs. a &lt;a href="http://wiki.apache.org/solr/SolrCloud" &gt;Solr shard&lt;/a&gt; &#x2013; it should be possible to transparently better the operational parameters and everyone involved, both the users as well as the cluster owner benefit from it.&lt;/p&gt;
&lt;p&gt;As usual, I&#x2019;m very interested in what you think about the topic and looking forward learning about resources in this space from you.&lt;/p&gt;
&lt;br/&gt;Filed under: &lt;a href="http://webofdata.wordpress.com/category/big-data/" &gt;Big Data&lt;/a&gt;, &lt;a href="http://webofdata.wordpress.com/category/cloud-computing/" &gt;Cloud Computing&lt;/a&gt;, &lt;a href="http://webofdata.wordpress.com/category/fyi/" &gt;FYI&lt;/a&gt;, &lt;a href="http://webofdata.wordpress.com/category/nosql/" &gt;NoSQL&lt;/a&gt;  &lt;a href="http://feeds.wordpress.com/1.0/gocomments/webofdata.wordpress.com/843/" &gt;
  &lt;img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/webofdata.wordpress.com/843/"/&gt;&lt;/a&gt; &lt;img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=webofdata.wordpress.com&amp;amp;blog=6169642&amp;amp;post=843&amp;amp;subd=webofdata&amp;amp;ref=&amp;amp;feed=1" width="1" height="1"/&gt;</description>
    </item>
    <item>
      <pubDate>Tue, 1 Jan 2013 21:13:43 GMT</pubDate>
      <title>MapR, Europe and me</title>
      <link>http://www.advogato.org/person/mhausenblas/diary.html?start=65</link>
      <guid>http://webofdata.wordpress.com/2013/01/01/mapr-europe-and-me/</guid>
      <description>&lt;p&gt;&lt;a href="http://www.mapr.com" &gt;&lt;img class="alignright  wp-image-841" alt="MapR" src="http://webofdata.files.wordpress.com/2013/01/mapr.jpeg?w=252&amp;amp;h=148" width="252" height="148"/&gt;&lt;/a&gt;You might have already heard that &lt;a href="http://www.mapr.com/" &gt;MapR&lt;/a&gt;, the leading provider of enterprise-grade Hadoop and friends, is launching its &lt;a href="http://www.mapr.com/company/press-releases/mapr-launches-european-operation" &gt;European operations&lt;/a&gt;.&lt;/p&gt;
&lt;p&gt;&lt;em&gt;Guess what?&lt;/em&gt; I&#x2019;m joining MapR Europe as of January 2013 in the role of Chief Data Engineer EMEA and will support our technical and sales teams throughout Europe. Pretty exciting times ahead!&lt;/p&gt;
&lt;p&gt;As an aside: as I recently &lt;a href="http://webofdata.wordpress.com/2012/09/02/large-scale-interactive-analysis/" &gt;pointed out&lt;/a&gt;, I very much believe that Apache Drill and Hadoop offer great synergies and if you want to learn more about this come and join us at the Hadoop Summit where my Drill talk has been accepted for the &lt;a href="http://hadoopsummit.org/amsterdam/sessions/" &gt;Hadoop Futures&lt;/a&gt; session.&lt;/p&gt;
&lt;br/&gt;Filed under: &lt;a href="http://webofdata.wordpress.com/category/announcement/" &gt;Announcement&lt;/a&gt;, &lt;a href="http://webofdata.wordpress.com/category/big-data/" &gt;Big Data&lt;/a&gt;, &lt;a href="http://webofdata.wordpress.com/category/fyi/" &gt;FYI&lt;/a&gt;, &lt;a href="http://webofdata.wordpress.com/category/nosql/" &gt;NoSQL&lt;/a&gt;  &lt;a href="http://feeds.wordpress.com/1.0/gocomments/webofdata.wordpress.com/837/" &gt;
  &lt;img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/webofdata.wordpress.com/837/"/&gt;&lt;/a&gt; &lt;img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=webofdata.wordpress.com&amp;amp;blog=6169642&amp;amp;post=837&amp;amp;subd=webofdata&amp;amp;ref=&amp;amp;feed=1" width="1" height="1"/&gt;</description>
    </item>
    <item>
      <pubDate>Thu, 8 Nov 2012 10:10:00 GMT</pubDate>
      <title>Hosted MapReduce and Hadoop offerings</title>
      <link>http://www.advogato.org/person/mhausenblas/diary.html?start=64</link>
      <guid>http://webofdata.wordpress.com/2012/11/08/hosted-mapreduce-hadoop/</guid>
      <description>&lt;p&gt;
  &lt;a href="http://webofdata.files.wordpress.com/2012/11/mr-in-the-cloud.png" &gt;
    &lt;img class=" wp-image-822 alignleft" title="Hadoop in the cloud" alt="Hadoop in the cloud" src="http://webofdata.files.wordpress.com/2012/11/mr-in-the-cloud.png?w=224&amp;amp;h=134" height="134" width="224"/&gt;&lt;/a&gt;
&lt;/p&gt;
&lt;p&gt;Today&#x2019;s question is: &lt;em&gt;where are we regarding MapReduce/Hadoop in the cloud?&lt;/em&gt; That is, what are the offerings of Hadoop-as-a-Service or other hosted MapReduce implementations, currently?&lt;/p&gt;
&lt;p&gt;A year ago, InfoQ ran a story &lt;a href="http://www.infoq.com/news/2011/10/Hadoop-as-a-Service" &gt;Hadoop-as-a-Service from Amazon, Cloudera, Microsoft and IBM&lt;/a&gt; which will serve us as a baseline here. This article contains the following statement:&lt;/p&gt;
&lt;blockquote&gt;
  &lt;p&gt;According to a 2011 TDWI survey, 34% of the companies use big data analytics to help them making decisions. Big data and Hadoop seem to be playing an important role in the future.&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;One year later, we learn from a recent MarketsAndMarkets study, &lt;a href="http://www.marketsandmarkets.com/Market-Reports/hadoop-market-766.html" &gt;Hadoop &amp;amp; Big Data Analytics Market &#x2013; Trends, Geographical Analysis &amp;amp; Worldwide Market Forecasts (2012 &#x2013; 2017)&lt;/a&gt; that &#x2026;&lt;/p&gt;
&lt;blockquote&gt;
  &lt;p&gt;The Hadoop market in 2012 is worth $1.5 billion and is expected to grow to about $13.9 billion by 2017, at a [Compound Annual Growth Rate] of 54.9% from 2012 to 2017.&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;In the past year there have also been some quite &lt;a href="http://gigaom.com/2012/08/13/considerations-for-hadoop-in-the-cloud/" &gt;vivid&lt;/a&gt; &lt;a href="http://cloudconexpoconference2012.sched.org/event/7a61ae390b77c773c1808e977d93f44c#.UJoBS2morOI" &gt;discussions&lt;/a&gt; around the topic &#x2018;Hadoop in the cloud&#x2019;.&lt;/p&gt;
&lt;p&gt;So, here are some current offerings and announcements I&#x2019;m aware of:&lt;/p&gt;
&lt;ul&gt;&lt;li&gt;Amazon&#x2019;s Elastic MapReduce (&lt;a href="http://aws.amazon.com/elasticmapreduce/" &gt;EMR&lt;/a&gt;), featuring MapR&#x2019;s rock-solid and fast &lt;a href="http://aws.amazon.com/elasticmapreduce/mapr/" &gt;Hadoop distribution&lt;/a&gt;.&lt;/li&gt;
&lt;li&gt;Google&#x2019;s App Engine, a PaaS offering, allows for &lt;a href="https://developers.google.com/appengine/docs/python/dataprocessing/overview" &gt;experimental MapReduce processing in Python&lt;/a&gt;.&lt;/li&gt;
&lt;li&gt;Microsoft&#x2019;s Azure, also a PaaS offering, now has &lt;a href="https://www.hadooponazure.com/" &gt;Hadoop support&lt;/a&gt;.&lt;/li&gt;
&lt;li&gt;VMware has launched &lt;a href="http://serengeti.cloudfoundry.com/" &gt;Project Serengeti&lt;/a&gt; to enable rapid deployment of Hadoop clusters in their Cloud Foundry environment.&lt;/li&gt;
&lt;li&gt;HStreaming&#x2019;s Cloud Beta &lt;a href="http://www.hstreaming.com/products/cloud/" &gt;hooked up with AWS&lt;/a&gt; as well.&lt;/li&gt;
&lt;li&gt;There is a report on a &lt;a href="http://jaigak.blogspot.ie/2012/07/paas-on-hadoop-yarn-idea-and-prototype.html" &gt;PaaS on Hadoop Yarn &#x2013; Idea and Prototype&lt;/a&gt; available.&lt;/li&gt;
&lt;/ul&gt;&lt;p&gt;&#x2026; and now it&#x2019;s up to you dear reader &#x2013; I would appreciate it if you could point me to more offerings and/or announcements you know of, concerning MapReduce and Hadoop in the cloud!&lt;/p&gt;
&lt;br/&gt;Filed under: &lt;a href="http://webofdata.wordpress.com/category/big-data/" &gt;Big Data&lt;/a&gt;, &lt;a href="http://webofdata.wordpress.com/category/cloud-computing/" &gt;Cloud Computing&lt;/a&gt;, &lt;a href="http://webofdata.wordpress.com/category/fyi/" &gt;FYI&lt;/a&gt;  &lt;a href="http://feeds.wordpress.com/1.0/gocomments/webofdata.wordpress.com/820/" &gt;
  &lt;img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/webofdata.wordpress.com/820/"/&gt;&lt;/a&gt; &lt;img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=webofdata.wordpress.com&amp;amp;blog=6169642&amp;amp;post=820&amp;amp;subd=webofdata&amp;amp;ref=&amp;amp;feed=1" width="1" height="1"/&gt;</description>
    </item>
    <item>
      <pubDate>Mon, 5 Nov 2012 12:11:09 GMT</pubDate>
      <title>MapReduce for and with the kids</title>
      <link>http://www.advogato.org/person/mhausenblas/diary.html?start=63</link>
      <guid>http://webofdata.wordpress.com/2012/11/05/mapreduce-for-kids/</guid>
      <description>&lt;p&gt;Last week was Halloween and of course we went trick-or-treating with our three kids which resulted in piles of sweets in the living room. Powered by the sugar, the kids would stay up late to count their harvest and while I was observing them at it, I was wondering if it possible to explain the &lt;a href="http://research.google.com/archive/mapreduce.html" &gt;MapReduce&lt;/a&gt; paradigm to them, or even better: &lt;strong&gt;doing&lt;/strong&gt; MapReduce with them. &lt;/p&gt;
&lt;p&gt;Now, it turns out that Halloween and counting kinds of sweets are a perfect setup. Have a look at the following:&lt;/p&gt;
&lt;p&gt;
  &lt;a href="http://webofdata.files.wordpress.com/2012/11/mr-halloween.png" &gt;
    &lt;img class="aligncenter size-medium wp-image-810" title="MapReduce for counting kinds of sweet after Halloween harvest." alt="MapReduce for counting kinds of sweet after Halloween harvest." src="http://webofdata.files.wordpress.com/2012/11/mr-halloween.png?w=500" width="500"/&gt;&lt;/a&gt;
&lt;/p&gt;
&lt;p&gt;So, the goal was to figure how many sweets of a certain kind (like, Twix) we now have available overall, for consumption. &lt;/p&gt;
&lt;p&gt;We started off with every child having her or his pile of sweets in front of them. Now, in the first step I&#x2019;d ask the kids to shout how many of the sweet X they have in their own pile. So one kid would go like &lt;em&gt;I&#x2019;ve got 4 fizzers&lt;/em&gt;, etc. &#x2026; and then we&#x2019;d gather all the same sweets and their respective counts together. Second, we&#x2019;d add up the individual counts for each kind of sweet which would give us the desired result: number of X in total.&lt;/p&gt;
&lt;p&gt;&lt;em&gt;Lesson learned: MapReduce is a child&#x2019;s play. Making kids sharing sweets is certainly not &#x2013; believe me, I speak out of experience&lt;/em&gt; &lt;img src="http://s1.wp.com/wp-includes/images/smilies/icon_wink.gif" alt=";)" class="wp-smiley"/&gt;&lt;/p&gt;
&lt;br/&gt;Filed under: &lt;a href="http://webofdata.wordpress.com/category/big-data/" &gt;Big Data&lt;/a&gt;, &lt;a href="http://webofdata.wordpress.com/category/cloud-computing/" &gt;Cloud Computing&lt;/a&gt;, &lt;a href="http://webofdata.wordpress.com/category/experiment/" &gt;Experiment&lt;/a&gt;, &lt;a href="http://webofdata.wordpress.com/category/nosql/" &gt;NoSQL&lt;/a&gt;  &lt;a href="http://feeds.wordpress.com/1.0/gocomments/webofdata.wordpress.com/808/" &gt;
  &lt;img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/webofdata.wordpress.com/808/"/&gt;&lt;/a&gt; &lt;img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=webofdata.wordpress.com&amp;amp;blog=6169642&amp;amp;post=808&amp;amp;subd=webofdata&amp;amp;ref=&amp;amp;feed=1" width="1" height="1"/&gt;</description>
    </item>
    <item>
      <pubDate>Thu, 20 Sep 2012 11:09:45 GMT</pubDate>
      <title>Denormalizing graph-shaped data</title>
      <link>http://www.advogato.org/person/mhausenblas/diary.html?start=62</link>
      <guid>http://webofdata.wordpress.com/2012/09/20/denormalising-graph-shaped-data/</guid>
      <description>&lt;p&gt;As nicely &lt;a href="http://highlyscalable.wordpress.com/2012/03/01/nosql-data-modeling-techniques/" &gt;pointed out&lt;/a&gt; by&#xA0;Ilya Katsov:&lt;/p&gt;
&lt;blockquote&gt;
  &lt;p&gt;Denormalization can be defined as the copying of the same data into multiple documents or tables in order to simplify/optimize query processing or to fit the user&#x2019;s data into a particular data model.&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;So, I was wondering, why is &#x2013; in Ilya&#x2019;s write-up &#x2013; &lt;strong&gt;denormalization&lt;/strong&gt; not considered to be applicable for GraphDBs?&lt;/p&gt;
&lt;p&gt;I suppose the main reason is that the relationships (or links as we use to call them in the Linked Data world) are typically not resolved or dereferenced, which means &lt;a href="https://github.com/tinkerpop/gremlin/wiki" &gt;traversing&lt;/a&gt; the graph is fast, but for a number of &lt;a href="http://www.ovaistariq.net/199/databases-normalization-or-denormalization-which-is-the-better-technique/" &gt;operations&lt;/a&gt; such as range queries, denormalized data would be better.&lt;/p&gt;
&lt;p&gt;Now, the question is: can we achieve this in GraphDBs, incl. &lt;a href="http://www.garshol.priv.no/blog/231.html" &gt;RDF stores&lt;/a&gt;?&#xA0;I would hope so. Here are some design ideas:&lt;/p&gt;
&lt;ul&gt;&lt;li&gt;Up-front: when inserting new data items (nodes), immediately dereference the links (embedded links).&lt;/li&gt;
&lt;li&gt;Query-time: apply database &lt;a href="http://www.cwi.nl/2010/1082/databasecracking" &gt;cracking&lt;/a&gt;.&lt;/li&gt;
&lt;/ul&gt;&lt;p&gt;Here is the question for you, dear reader: &lt;em&gt;are you aware of people doing this? My google skills have failed me so far &#x2013; happy to learn about it in greater detail!&lt;/em&gt;&lt;/p&gt;
&lt;br/&gt;Filed under: &lt;a href="http://webofdata.wordpress.com/category/big-data/" &gt;Big Data&lt;/a&gt;, &lt;a href="http://webofdata.wordpress.com/category/cloud-computing/" &gt;Cloud Computing&lt;/a&gt;, &lt;a href="http://webofdata.wordpress.com/category/idea/" &gt;Idea&lt;/a&gt;, &lt;a href="http://webofdata.wordpress.com/category/linked-data/" &gt;Linked Data&lt;/a&gt;, &lt;a href="http://webofdata.wordpress.com/category/nosql/" &gt;NoSQL&lt;/a&gt;  &lt;a href="http://feeds.wordpress.com/1.0/gocomments/webofdata.wordpress.com/803/" &gt;
  &lt;img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/webofdata.wordpress.com/803/"/&gt;&lt;/a&gt; &lt;img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=webofdata.wordpress.com&amp;amp;blog=6169642&amp;amp;post=803&amp;amp;subd=webofdata&amp;amp;ref=&amp;amp;feed=1" width="1" height="1"/&gt;</description>
    </item>
    <item>
      <pubDate>Sun, 2 Sep 2012 20:08:55 GMT</pubDate>
      <title>Interactive analysis of large-scale datasets</title>
      <link>http://www.advogato.org/person/mhausenblas/diary.html?start=61</link>
      <guid>http://webofdata.wordpress.com/2012/09/02/large-scale-interactive-analysis/</guid>
      <description>&lt;p&gt;The value of large-scale datasets &#x2013; stemming from IoT sensors, end-user and business transactions, social networks, search engine logs, etc. &#x2013; apparently lies in the patterns buried deep inside them. Being able to identify these patterns, analyzing them is vital. Be it for detecting fraud, determining a new customer segment or predicting a trend. As we&#x2019;re moving from the billions to trillions of records (or: from the terabyte to peta- and exabyte scale) the more &#x2018;traditional&#x2019; methods, including MapReduce seem to have reached the end of their capabilities. The question is: what now?&lt;/p&gt;
&lt;p&gt;But a second issue has to be addressed as well: in contrast to what current large-scale data processing solutions provide for in batch-mode (arbitrarily but in line with the state-of-the-art defined as any query that takes longer than 10 sec to execute) the need for &lt;strong&gt;interactive analysis&lt;/strong&gt; increases. Complementary, &lt;a href="http://en.wikipedia.org/wiki/Visual_analytics" &gt;visual analytics&lt;/a&gt; may or may not be helpful but come with their &lt;a href="http://www.computer.org/cms/Computer.org/ComputingNow/homepage/2012/0812/W_CG_TheTop10Challenges.pdf" &gt;own set of challenges&lt;/a&gt;.&lt;/p&gt;
&lt;p&gt;Recently, a proposal for a new Apache Incubator group called &lt;a href="http://wiki.apache.org/incubator/DrillProposal" &gt;Drill&lt;/a&gt; has been made. This group aims at building a:&lt;/p&gt;
&lt;blockquote&gt;
  &lt;p&gt;&#x2026; distributed system for interactive analysis of large-scale datasets [...] It is a design goal to scale to 10,000 servers or more and to be able to process petabyes of data and trillions of records in seconds.&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;Drill&#x2019;s design is supposed to be informed by Google&#x2019;s &lt;a href="http://research.google.com/pubs/pub36632.html" &gt;Dremel&lt;/a&gt; and wants to efficiently process nested data (think: &lt;a href="http://code.google.com/p/protobuf/" &gt;Protocol Buffers&lt;/a&gt;). You can learn more about requirements and design considerations from Tomer Shiran&#x2019;s &lt;a href="http://wiki.apache.org/incubator/DrillProposal?action=AttachFile&amp;amp;do=get&amp;amp;target=Drill+slides.pdf" &gt;slide set&lt;/a&gt;.&lt;/p&gt;
&lt;p&gt;In order to better understand where Drill fits in in the overall picture, have a look at the following (admittedly na&#xEF;ve) plot that tries to place it in relation to well-known and deployed data processing systems:&lt;/p&gt;
&lt;p&gt;
  &lt;a href="http://webofdata.files.wordpress.com/2012/09/a-comparison-of-large-scale-data-processing-systems.png" &gt;
    &lt;img class="aligncenter size-full wp-image-799" title="A comparison of large-scale data processing systems" src="http://webofdata.files.wordpress.com/2012/09/a-comparison-of-large-scale-data-processing-systems.png?w=750&amp;amp;h=527" alt="" width="750" height="527"/&gt;&lt;/a&gt;
&lt;/p&gt;
&lt;p&gt;BTW, if you want to test-drive Dremel, you can do this already today; it&#x2019;s an &lt;a href="http://en.wikipedia.org/wiki/Cloud_computing#Infrastructure_as_a_service_.28IaaS.29" &gt;IaaS&lt;/a&gt; service offered in Google&#x2019;s cloud computing suite, called &lt;a href="https://developers.google.com/bigquery/" &gt;BigQuery&lt;/a&gt;.&lt;/p&gt;
&lt;br/&gt;Filed under: &lt;a href="http://webofdata.wordpress.com/category/big-data/" &gt;Big Data&lt;/a&gt;, &lt;a href="http://webofdata.wordpress.com/category/cloud-computing/" &gt;Cloud Computing&lt;/a&gt;, &lt;a href="http://webofdata.wordpress.com/category/fyi/" &gt;FYI&lt;/a&gt;, &lt;a href="http://webofdata.wordpress.com/category/nosql/" &gt;NoSQL&lt;/a&gt;  &lt;a href="http://feeds.wordpress.com/1.0/gocomments/webofdata.wordpress.com/790/" &gt;
  &lt;img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/webofdata.wordpress.com/790/"/&gt;&lt;/a&gt; &lt;img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=webofdata.wordpress.com&amp;amp;blog=6169642&amp;amp;post=790&amp;amp;subd=webofdata&amp;amp;ref=&amp;amp;feed=1" width="1" height="1"/&gt;</description>
    </item>
    <item>
      <pubDate>Sun, 8 Jul 2012 19:07:09 GMT</pubDate>
      <title>Schema.org + WebIntents = Awesomeness</title>
      <link>http://www.advogato.org/person/mhausenblas/diary.html?start=60</link>
      <guid>http://webofdata.wordpress.com/2012/07/08/schema-org-webintents-awesomeness/</guid>
      <description>&lt;p&gt;Imagine you search for a camera, say a Canon EOS 60D, and in addition to the usual search results you&#x2019;re as well offered a choice of actions you can perform on it, for example &lt;em&gt;share&lt;/em&gt; the result with a friend, write a &lt;em&gt;review&lt;/em&gt; for the item or, why not directly &lt;em&gt;buy&lt;/em&gt; it?&lt;/p&gt;
&lt;p&gt;
  &lt;a href="http://webofdata.files.wordpress.com/2012/07/ser.png" &gt;
    &lt;img src="http://webofdata.files.wordpress.com/2012/07/ser.png?w=750&amp;amp;h=363" alt="Enhancing SERP with actions" title="Enhancing SERP with actions" width="750" height="363" class="aligncenter size-full wp-image-781"/&gt;&lt;/a&gt;
&lt;/p&gt;
&lt;p&gt;Sounds far fetched? Not at all. In fact, all the necessary components are available and deployed. With &lt;a href="http://schema.org/" &gt;Schema.org&lt;/a&gt; we have a way to describe the things we publish on our Web pages, such as books or cameras and with &lt;a href="http://webintents.org/" &gt;WebIntents&lt;/a&gt; we have a technology at hand that allows us to interact with these things in a flexible way.&lt;/p&gt;
&lt;p&gt;Here are some starting points in case you want to dive into WebIntents a bit:&lt;/p&gt;
&lt;ul&gt;&lt;li&gt;Check out the WebIntents &lt;a href="http://demos.webintents.org/" &gt;demos&lt;/a&gt; and learn about the &lt;a href="http://usecases.webintents.org/" &gt;use cases&lt;/a&gt;.&lt;/li&gt;
&lt;li&gt;During the recent Google IO 2012 there was a &lt;a href="http://webintents-io12.appspot.com/#1" &gt;session on WebIntents&lt;/a&gt;.&lt;/li&gt;
&lt;li&gt;A SitePoint &lt;a href="http://www.sitepoint.com/podcast-169-web-intents/" &gt;podcast on WebIntents&lt;/a&gt; is available.&lt;/li&gt;
&lt;li&gt;The Chrome Web Store has a number of &lt;a href="https://chrome.google.com/webstore/category/collection/webintent_apps" &gt;apps that support WebIntents&lt;/a&gt;.&lt;/li&gt;
&lt;li&gt;There is a &lt;a href="http://www.w3.org/TR/web-intents/" &gt;W3C Working Draft&lt;/a&gt; for the core specification as well as a W3C Wiki page that discusses &lt;a href="http://www.w3.org/wiki/WebIntents/schema.org_Types" &gt;passing schema.org data through WebIntents&lt;/a&gt;.&lt;/li&gt;
&lt;/ul&gt;&lt;p&gt;PS: I started to develop a &lt;a href="https://github.com/mhausenblas/intent.iom" &gt;proof of concept&lt;/a&gt; for mapping Schema.org terms to WebIntents and will report on the progress, here. Stay tuned!&lt;/p&gt;
&lt;br/&gt;Filed under: &lt;a href="http://webofdata.wordpress.com/category/experiment/" &gt;Experiment&lt;/a&gt;, &lt;a href="http://webofdata.wordpress.com/category/idea/" &gt;Idea&lt;/a&gt;, &lt;a href="http://webofdata.wordpress.com/category/linked-data/" &gt;Linked Data&lt;/a&gt;, &lt;a href="http://webofdata.wordpress.com/category/w3c/" &gt;W3C&lt;/a&gt;  &lt;a href="http://feeds.wordpress.com/1.0/gocomments/webofdata.wordpress.com/778/" &gt;
  &lt;img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/webofdata.wordpress.com/778/"/&gt;&lt;/a&gt; &lt;a href="http://feeds.wordpress.com/1.0/godelicious/webofdata.wordpress.com/778/" &gt;
  &lt;img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/webofdata.wordpress.com/778/"/&gt;&lt;/a&gt; &lt;a href="http://feeds.wordpress.com/1.0/gofacebook/webofdata.wordpress.com/778/" &gt;
  &lt;img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/webofdata.wordpress.com/778/"/&gt;&lt;/a&gt; &lt;a href="http://feeds.wordpress.com/1.0/gotwitter/webofdata.wordpress.com/778/" &gt;
  &lt;img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/webofdata.wordpress.com/778/"/&gt;&lt;/a&gt; &lt;a href="http://feeds.wordpress.com/1.0/gostumble/webofdata.wordpress.com/778/" &gt;
  &lt;img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/webofdata.wordpress.com/778/"/&gt;&lt;/a&gt; &lt;a href="http://feeds.wordpress.com/1.0/godigg/webofdata.wordpress.com/778/" &gt;
  &lt;img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/webofdata.wordpress.com/778/"/&gt;&lt;/a&gt; &lt;a href="http://feeds.wordpress.com/1.0/goreddit/webofdata.wordpress.com/778/" &gt;
  &lt;img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/webofdata.wordpress.com/778/"/&gt;&lt;/a&gt; &lt;img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=webofdata.wordpress.com&amp;amp;blog=6169642&amp;amp;post=778&amp;amp;subd=webofdata&amp;amp;ref=&amp;amp;feed=1" width="1" height="1"/&gt;</description>
    </item>
    <item>
      <pubDate>Thu, 10 May 2012 15:06:44 GMT</pubDate>
      <title>Turning tabular data into entities</title>
      <link>http://www.advogato.org/person/mhausenblas/diary.html?start=59</link>
      <guid>http://webofdata.wordpress.com/2012/05/10/turning-tabular-data-into-entities/</guid>
      <description>&lt;p&gt;Two widely used data formats on the Web are CSV and JSON. In order to enable fine-grained access in an hypermedia-oriented fashion I&#x2019;ve started to work on &lt;a href="https://github.com/mhausenblas/tride" &gt;Tride&lt;/a&gt;, a mapping language that takes one or more CSV files as inputs and produces a set of (connected) JSON documents. &lt;/p&gt;
&lt;span style="text-align:center; display: block;"&gt;
  &lt;a href="http://webofdata.wordpress.com/2012/05/10/turning-tabular-data-into-entities/" &gt;
    &lt;img src="http://img.youtube.com/vi/Z6ueSqwgaYU/2.jpg" alt=""/&gt;&lt;/a&gt;
&lt;/span&gt;
&lt;p&gt;In the 2 min &lt;a href="http://youtu.be/Z6ueSqwgaYU" &gt;demo video&lt;/a&gt; I use two CSV files (&lt;a href="https://github.com/mhausenblas/tride/blob/master/test/people.csv" &gt;people.csv&lt;/a&gt; and &lt;a href="https://github.com/mhausenblas/tride/blob/master/test/group.csv" &gt;group.csv&lt;/a&gt;) as well as a mapping file (&lt;a href="https://github.com/mhausenblas/tride/blob/master/test/group-map.json" &gt;group-map.json&lt;/a&gt;) to produce a set of interconnected JSON documents.&lt;/p&gt;
&lt;p&gt;So, the following mapping file:&lt;br/&gt;&lt;code/&gt;&lt;/p&gt;
&lt;pre&gt;{
 "input" : [
  { "name" : "people", "src" : "people.csv" },
  { "name" : "group", "src" : "group.csv" }
 ],
 "map" : {
  "people" : {
   "base" : "http://localhost:8000/people/",
   "output" : "../out/people/",
   "with" : { 
    "fname" : "people.first-name", 
    "lname" : "people.last-name",
    "member" : "link:people.group-id to:group.ID"
   }
 },
 "group" : {
  "base" : "http://localhost:8000/group/",
  "output" : "../out/group/",
   "with" : {
    "title" : "group.title",
    "homepage" : "group.homepage",
    "members" : "where:people.group-id=group.ID link:group.ID to:people.ID"
   }
  }
 }
}&lt;/pre&gt;
&lt;p/&gt;
&lt;p&gt;&#x2026; produces JSON documents representing groups. One concrete example output is shown below:&lt;br/&gt;&lt;a href="http://webofdata.files.wordpress.com/2012/05/example-group-doc.png" &gt;&lt;img src="http://webofdata.files.wordpress.com/2012/05/example-group-doc.png?w=750&amp;amp;h=240" alt="" title="Example JSON document representing a group" width="750" height="240" class="alignleft size-full wp-image-774"/&gt;&lt;/a&gt;&lt;/p&gt;
&lt;br/&gt;Filed under: &lt;a href="http://webofdata.wordpress.com/category/demo/" &gt;demo&lt;/a&gt;, &lt;a href="http://webofdata.wordpress.com/category/experiment/" &gt;Experiment&lt;/a&gt;, &lt;a href="http://webofdata.wordpress.com/category/fyi/" &gt;FYI&lt;/a&gt;  &lt;a href="http://feeds.wordpress.com/1.0/gocomments/webofdata.wordpress.com/763/" &gt;
  &lt;img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/webofdata.wordpress.com/763/"/&gt;&lt;/a&gt; &lt;a href="http://feeds.wordpress.com/1.0/godelicious/webofdata.wordpress.com/763/" &gt;
  &lt;img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/webofdata.wordpress.com/763/"/&gt;&lt;/a&gt; &lt;a href="http://feeds.wordpress.com/1.0/gofacebook/webofdata.wordpress.com/763/" &gt;
  &lt;img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/webofdata.wordpress.com/763/"/&gt;&lt;/a&gt; &lt;a href="http://feeds.wordpress.com/1.0/gotwitter/webofdata.wordpress.com/763/" &gt;
  &lt;img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/webofdata.wordpress.com/763/"/&gt;&lt;/a&gt; &lt;a href="http://feeds.wordpress.com/1.0/gostumble/webofdata.wordpress.com/763/" &gt;
  &lt;img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/webofdata.wordpress.com/763/"/&gt;&lt;/a&gt; &lt;a href="http://feeds.wordpress.com/1.0/godigg/webofdata.wordpress.com/763/" &gt;
  &lt;img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/webofdata.wordpress.com/763/"/&gt;&lt;/a&gt; &lt;a href="http://feeds.wordpress.com/1.0/goreddit/webofdata.wordpress.com/763/" &gt;
  &lt;img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/webofdata.wordpress.com/763/"/&gt;&lt;/a&gt; &lt;img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=webofdata.wordpress.com&amp;amp;blog=6169642&amp;amp;post=763&amp;amp;subd=webofdata&amp;amp;ref=&amp;amp;feed=1" width="1" height="1"/&gt;</description>
    </item>
    <item>
      <pubDate>Mon, 2 Apr 2012 06:09:01 GMT</pubDate>
      <title>Linked Data &#x2013; the best of two worlds</title>
      <link>http://www.advogato.org/person/mhausenblas/diary.html?start=58</link>
      <guid>http://webofdata.wordpress.com/2012/04/02/linked-data-best-of-two-worlds/</guid>
      <description>&lt;p&gt;On the one hand you have &lt;strong&gt;structured data sources&lt;/strong&gt; such as relational DB, &lt;a href="http://nosql-database.org/" &gt;NoSQL datastores&lt;/a&gt; or &lt;a href="http://www.odbms.org/" &gt;OODBs&lt;/a&gt; and the like that allow you to query and manipulate data in a structured way. This typically involves &lt;strong&gt;schemata&lt;/strong&gt; (either upfront with RDB or sort of dynamically &lt;a href="http://highlyscalable.wordpress.com/2012/03/01/nosql-data-modeling-techniques/" &gt;with NoSQL&lt;/a&gt; that defines the data layout and the types of the fields), a notion of &lt;strong&gt;object identity&lt;/strong&gt; (for example a unique row ID in RDB or a document ID in a document store) and with it to be able to refer to data items in different containers (e.g. a foreign key in a RDB) as well as  the possibility to create and use indices to speed up look-up/query.&lt;/p&gt;
&lt;p&gt;On the other hand you have the Web, a globally distributed hypermedia system, mainly for consumption by humans. There the main primitives are: an &lt;a href="http://googleblog.blogspot.com/2008/07/we-knew-web-was-big.html" &gt;enormous collection&lt;/a&gt; of hyperlinked documents over the Internet with millions of servers and billions of clients (desktop, mobile devices, etc.), in its core based on simple standards: URL, HTTP, HTML.&lt;/p&gt;
&lt;p&gt;Now, the idea with Linked Data is a simple one: take the best of both worlds and combine it, yielding large-scale structured data (incl. schema and object identity to allow straightforward manipulation) based on established Web standards (in order to benefit from the deployed infrastructure).&lt;/p&gt;
&lt;p&gt;Sounds easy? In fact it is. The devil is in the detail. As with any piece of technology, once you start implementing it, questions arise. For example, must Linked Data be solely based on RDF or are other wire formats such as &lt;a href="http://jsonauts.github.com/" &gt;JSON&lt;/a&gt;, &lt;a href="http://www.w3.org/TR/microdata/" &gt;Microdata&lt;/a&gt; or &lt;a href="http://www.odata.org/" &gt;Atom&lt;/a&gt; &#x2018;allowed&#x2019;? Should we use distributed vocabulary management (as mandated by the Semantic Web) or is it OK to use &lt;a href="http://schema.org" &gt;Schema.org&lt;/a&gt;? Depending on whom you ask you currently may get different answers but in this case I lean towards diversity &#x2013; at the end of the day what matters are URIs (object identity), HTTP (data access) and &lt;strong&gt;some&lt;/strong&gt; way to represent the data in a structured format.&lt;/p&gt;
&lt;br/&gt;Filed under: &lt;a href="http://webofdata.wordpress.com/category/big-data/" &gt;Big Data&lt;/a&gt;, &lt;a href="http://webofdata.wordpress.com/category/fyi/" &gt;FYI&lt;/a&gt;, &lt;a href="http://webofdata.wordpress.com/category/linked-data/" &gt;Linked Data&lt;/a&gt;, &lt;a href="http://webofdata.wordpress.com/category/nosql/" &gt;NoSQL&lt;/a&gt;  &lt;a href="http://feeds.wordpress.com/1.0/gocomments/webofdata.wordpress.com/759/" &gt;
  &lt;img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/webofdata.wordpress.com/759/"/&gt;&lt;/a&gt; &lt;a href="http://feeds.wordpress.com/1.0/godelicious/webofdata.wordpress.com/759/" &gt;
  &lt;img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/webofdata.wordpress.com/759/"/&gt;&lt;/a&gt; &lt;a href="http://feeds.wordpress.com/1.0/gofacebook/webofdata.wordpress.com/759/" &gt;
  &lt;img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/webofdata.wordpress.com/759/"/&gt;&lt;/a&gt; &lt;a href="http://feeds.wordpress.com/1.0/gotwitter/webofdata.wordpress.com/759/" &gt;
  &lt;img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/webofdata.wordpress.com/759/"/&gt;&lt;/a&gt; &lt;a href="http://feeds.wordpress.com/1.0/gostumble/webofdata.wordpress.com/759/" &gt;
  &lt;img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/webofdata.wordpress.com/759/"/&gt;&lt;/a&gt; &lt;a href="http://feeds.wordpress.com/1.0/godigg/webofdata.wordpress.com/759/" &gt;
  &lt;img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/webofdata.wordpress.com/759/"/&gt;&lt;/a&gt; &lt;a href="http://feeds.wordpress.com/1.0/goreddit/webofdata.wordpress.com/759/" &gt;
  &lt;img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/webofdata.wordpress.com/759/"/&gt;&lt;/a&gt; &lt;img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=webofdata.wordpress.com&amp;amp;blog=6169642&amp;amp;post=759&amp;amp;subd=webofdata&amp;amp;ref=&amp;amp;feed=1" width="1" height="1"/&gt;</description>
    </item>
  </channel>
</rss>
