<?xml version="1.0"?>
<rss version="2.0">
  <channel>
    <title>Advogato blog for aigeek</title>
    <link>http://www.advogato.org/person/aigeek/</link>
    <description>Advogato blog for aigeek</description>
    <language>en-us</language>
    <generator>mod_virgule</generator>
    <pubDate>Thu, 23 May 2013 02:38:10 GMT</pubDate>
    <item>
      <pubDate>Tue, 12 Sep 2000 18:02:05 GMT</pubDate>
      <title>12 Sep 2000</title>
      <link>http://www.advogato.org/person/aigeek/diary.html?start=14</link>
      <guid>http://www.advogato.org/person/aigeek/diary.html?start=14</guid>
      <description>Turned in &lt;A href="http://www.aigeek.com/aimsc/"&gt;my
dissertation&lt;/a&gt; and put it and my code online.  Anyone
interested in automatic music analysis?  I know a lot more
than when I started and I'd like to throw away all the code
and start fresh, but I'm definitely taking a break first.
</description>
    </item>
    <item>
      <pubDate>Sun, 3 Sep 2000 00:25:16 GMT</pubDate>
      <title>3 Sep 2000</title>
      <link>http://www.advogato.org/person/aigeek/diary.html?start=13</link>
      <guid>http://www.advogato.org/person/aigeek/diary.html?start=13</guid>
      <description>I changed &lt;A
href="http://www.aigeek.com/geek/autojot/"&gt;autojot&lt;/a&gt; so
instead of logging everything you hit with your browser and
getting it later (maybe from your cache), instead it caches
everything itself for indexing.  A little wasteful of space,
but it solves problems of authentication and cache misses. 
As long as you run the indexer once a week or so, it
shouldn't use too much space.</description>
    </item>
    <item>
      <pubDate>Sat, 2 Sep 2000 03:15:02 GMT</pubDate>
      <title>2 Sep 2000</title>
      <link>http://www.advogato.org/person/aigeek/diary.html?start=12</link>
      <guid>http://www.advogato.org/person/aigeek/diary.html?start=12</guid>
      <description>Finally made a Debian package for &lt;A
href="http://www.aigeek.com/waterfall/"&gt;waterfall&lt;/a&gt;.  Yay,
my first package.</description>
    </item>
    <item>
      <pubDate>Mon, 21 Aug 2000 11:35:37 GMT</pubDate>
      <title>21 Aug 2000</title>
      <link>http://www.advogato.org/person/aigeek/diary.html?start=11</link>
      <guid>http://www.advogato.org/person/aigeek/diary.html?start=11</guid>
      <description>&lt;P&gt;
&lt;A href="http://www.advogato.org/person/mhat/"&gt;mhat&lt;/a&gt;: How
do you
specify the subsets?  I want to be able to specify music by
its
contents instead of by its measly metadata.  Examples:
"uptempo jazz
with lots of brass" or "danceable techno 110-130bpm".

</description>
    </item>
    <item>
      <pubDate>Tue, 15 Aug 2000 21:34:34 GMT</pubDate>
      <title>15 Aug 2000</title>
      <link>http://www.advogato.org/person/aigeek/diary.html?start=10</link>
      <guid>http://www.advogato.org/person/aigeek/diary.html?start=10</guid>
      <description>&lt;P&gt;
&lt;A
href="http://www.advogato.org/person/jmelesky/"&gt;&lt;B&gt;jmelesky&lt;/b&gt;&lt;/a&gt;:
I've heard good things about &lt;A
href="http://www1.fatbrain.com/asp/bookinfo/bookinfo.asp?theisbn=0262133601"&gt;Foundations
of Statistical Natural Language Processing&lt;/a&gt; by Manning &amp;amp;
Schutze, and it's used in the &lt;A
href="http://www.cogsci.ed.ac.uk/school/study/msc/course/dil/"&gt;statistical
linguistics class&lt;/a&gt; here.


&lt;p&gt; &lt;P&gt;
&lt;A
href="http://www.advogato.org/person/raph/"&gt;&lt;B&gt;raph&lt;/b&gt;&lt;/a&gt;,
I replied in detail to your waterfall comments in
&lt;A
href="http://www.advogato.org/person/aigeek/diary.html?start=9"&gt;my
previous entry&lt;/a&gt;, but you may not have seen my reply since
I'm
clobbering its recency with this one.  Have you considered
adding a
threaded view of messages in your copious free time?

</description>
    </item>
    <item>
      <pubDate>Tue, 15 Aug 2000 14:12:09 GMT</pubDate>
      <title>15 Aug 2000</title>
      <link>http://www.advogato.org/person/aigeek/diary.html?start=9</link>
      <guid>http://www.advogato.org/person/aigeek/diary.html?start=9</guid>
      <description>&lt;P&gt;
&lt;A
href="http://www.advogato.org/person/jmelesky/"&gt;jmelesky&lt;/a&gt;:
Markov chains are easy, but probabilistic grammars are more
accurate.
You can even learn them from source data, if you have a
bunch of
"grammatical" source in a useful format.


&lt;p&gt; &lt;p&gt; &lt;p&gt; &lt;A
href="http://www.advogato.org/person/raph/"&gt;raph&lt;/a&gt;
wrote about 
&lt;A href="http://www.aigeek.com/waterfall/"&gt;waterfall&lt;/a&gt;:
&lt;BLOCKQUOTE&gt;&lt;I&gt;Very pretty stuff, but the fact that the
spectrum
display is linear, rather than logarithmic, really bothers
me.&lt;/i&gt;
&lt;/blockquote&gt;

&lt;p&gt; &lt;p&gt; &lt;p&gt; &lt;P&gt;
It's not completely linear, but it's close and it bothers me
too (and
I wrote it).  The problem is that XMMS gives crappy
frequency
resolution to vis plugins (43Hz per linear band).  If
waterfall used a
log2 scale like your ears do, there would only be ten
bands.  I don't
think that would look very good.  Instead it uses 75 bands,
but doing
that on a log scale gets pretty screwed up with round off
error.  I
copied the band sizes from the XMMS spectrum analyzer, but
they're not
ideal.  The first 59 bands (up to ~2.5kHz) are 1:1 linear,
and the
rest are on a log scale.

&lt;p&gt; &lt;p&gt; &lt;P&gt;
The best looking way to fix it would be to have have
waterfall do its
own FFT at greater resolution.  This would slow it down a
bit.


&lt;p&gt; &lt;p&gt; &lt;P&gt;The other solution is to use a real log scale, but
with
fewer bands.
The downside to this is that the low freqs will still be
squished
together (due to poor resolution), though the high freqs
will
be squished
even more, as they should be.  I should probably just make
this
run-time configurable, but it's easy enough to change in the
source if
you care enough to recompile.  -DFEWBANDS will drop it from
75 to 20,
or you can roll your own (waterfall.h line 28, waterfall.c
lines
760-768).
</description>
    </item>
    <item>
      <pubDate>Sat, 12 Aug 2000 01:32:21 GMT</pubDate>
      <title>12 Aug 2000</title>
      <link>http://www.advogato.org/person/aigeek/diary.html?start=8</link>
      <guid>http://www.advogato.org/person/aigeek/diary.html?start=8</guid>
      <description>&lt;p&gt;
Found a dumb mistake in my feature extractor a couple weeks
ago.  I
finally bit the bullet last week and fixed it, rescaned all
my data
(100GB!), and retrained my classifiers.  Got a 10% boost in
accuracy,
so it's not only more correct, but useful too, which is a
nice bonus.
My simple linear network can distinguish between techno and
pop music
with 91% accuracy, and it's only using spectral features. 
This is
going to rock once I add tempo, beat strength, and rhythm
signatures.
I'm in a feature freeze until I submit the thesis though. 
Must...
document...!

</description>
    </item>
    <item>
      <pubDate>Thu, 10 Aug 2000 21:03:38 GMT</pubDate>
      <title>10 Aug 2000</title>
      <link>http://www.advogato.org/person/aigeek/diary.html?start=7</link>
      <guid>http://www.advogato.org/person/aigeek/diary.html?start=7</guid>
      <description>I needed a break from thesis writing, so I spent a few
minutes and added something from my project back into &lt;A
href="http://www.aigeek.com/waterfall/"&gt;my XMMS spectrum
analyzer&lt;/a&gt;.  Oooh.  Pretty entropy.  (I released it too,
so everyone can play.)</description>
    </item>
    <item>
      <pubDate>Sat, 5 Aug 2000 14:03:01 GMT</pubDate>
      <title>5 Aug 2000</title>
      <link>http://www.advogato.org/person/aigeek/diary.html?start=6</link>
      <guid>http://www.advogato.org/person/aigeek/diary.html?start=6</guid>
      <description>&lt;p&gt;Wow.  Last night, someone showed me a short New Scientist
blurb about automatic music classification (my thesis
topic), and today it's &lt;A
href="http://slashdot.org/article.pl?sid=00/08/04/1249258&amp;mode=nested"&gt;all
over slashdot&lt;/a&gt;.  If that had happened six months ago, I
could have gotten other people contributing code to my
thesis.  I probably would have had an easier time keeping my
motivation up too.  Working alone gets old after a few
months.

&lt;p&gt; &lt;P&gt;
The weirdest part though is seeing other people say some of
the things
I've been spouting for years, mostly about all the great
applications
for this.  I kept thinking, "Yes! Yes! Yes!  Now help me
write it!"


&lt;p&gt; &lt;p&gt;I'd been planning to start an open project on music
analysis after I finished here.  I have to spend the next
month writing up my results, but I can't waste all that good
publicity, so I created a SourceForge project today, project
name "vole".  Picking a name was hard until I stopped trying
to make it relevant.  I also quickly put up  &lt;A
href="http://www.aigeek.com/geek/music/"&gt;some links&lt;/a&gt; to
music analysis research in case anyone is actually
interested.</description>
    </item>
    <item>
      <pubDate>Mon, 24 Jul 2000 18:19:23 GMT</pubDate>
      <title>24 Jul 2000</title>
      <link>http://www.advogato.org/person/aigeek/diary.html?start=5</link>
      <guid>http://www.advogato.org/person/aigeek/diary.html?start=5</guid>
      <description>&lt;P&gt;&lt;A
href="http://www.advogato.org/person/tladuca/"&gt;tladuca&lt;/a&gt;
wrote:
&lt;BLOCKQUOTE&gt;Few people take that step to get there heads
around some
large and complex project and help contribute to projects
that are
much more valuable to the community, [..] I really wish we
each had a
mechanism where other Advogato users could reply to our
diary entries
and have them show up.&lt;/blockquote&gt;

&lt;p&gt; &lt;P&gt;Your path is clear.  :-)

&lt;p&gt; &lt;P&gt;I agree with that feature wish though.  I don't think
diaries and
articles are fundamentally different.  I think it would be
nice if I
could see everything written by a certain person, regardless
of the
forum, and for each message, see the context and surf the
threads.  
But I'm too busy to do anything about it right now.

&lt;p&gt; &lt;P&gt;
In regards to your larger point, I think there are reasons
to work on
small independent projects that few people will use.

&lt;p&gt; &lt;UL&gt;
  &lt;LI&gt;It's easier to work on tiny projects.  There's no
paradigm to
      inherit, no code to read, and no maintainer to
convince.
  &lt;LI&gt;Bad code doesn't affect anyone else.
  &lt;LI&gt;It's fun to own a project.
&lt;/ul&gt;

&lt;p&gt; &lt;P&gt;
There are points in favor of working on more popular
packages too.


&lt;p&gt; &lt;UL&gt;
  &lt;LI&gt;More people will benefit from your work.
  &lt;LI&gt;More people will discuss your ideas with you.
  &lt;LI&gt;More people will discuss your code with you.
  &lt;LI&gt;More people will contribute to the software you
use.
  &lt;LI&gt;More poeple will have heard of you and might certify
you on Advogato.  ;-)
  &lt;LI&gt;You'll get more teamwork experience.
  &lt;LI&gt;Working in a team is fun.
  &lt;LI&gt;You might learn from the code you have to read.
  &lt;LI&gt;You might learn from the architecture you have to
conform to.
&lt;/ul&gt;

&lt;p&gt; &lt;P&gt;
In the past, I've done lots of my own independent projects,
mostly
because I haven't been aware of existing things that were
close enough
to what I wanted.  I often submit small patches to other
people's
projects, but doing something major on someone else's
project has a
political overhead and I don't usually take the time to deal
with it.
It would probably be good for me though.  Maybe I'll make
time once
I'm working again.  (Being a grad student is far more time
consuming
than being an employee.)

</description>
    </item>
  </channel>
</rss>
