Older blog entries for deven (starting at number 13)

Given that every certification appears to dilute the others, I've removed all certifications I had given to others, except for geoffeg, who is personally known to me. Although people are encouraged to certify others liberally (especially as Apprentice), it doesn't seem to help, so I'll be minimalist for the moment.

Why is it that you can claim to be God, Satan or Jesus and get certified as a Master without an uproar, but being honest and forthright gets you reamed? This sort of animosity is supposed to help grow the free software community?

hacker, I never requested anyone to certify me as a Master, or as anything else for that matter. Yet ncm singled me out as an example of so-called "abuse" of the system, claiming that I "requested" Master certification. I did nothing of the sort!

I've never asked anyone to certify me -- I have complained that the certification system seems to be broken when I used to be certified and now I'm not, despite having more than a few listed certifications. That complaint was because I was worried that the system might be malfunctioning unnoticed -- I wasn't trolling for additional certifications. However, when more people certified me, I thanked them (and you) for it. But did I request those certifications? Never.

I have previously listed sixteen orthogonal concepts that people seem to want this single certification metric to convey, and most of these can be derived or inferred from Raph's description. Since it obviously can't squeeze sixteen dimensions into one, I chose (somewhat arbitrarily) to pick just one dimension (level of programming skill) to use for my certifications. I may not have the same background and experience as another given individual in terms of what code and tools I've worked with, but when it comes to innate programming ability, I'll go up against most people any day.

Give me all sixteen dimensions individually and I'll be glad to pick the best choice on each dimension. Maybe I should have done them all and averaged it, I don't know. I didn't think it would matter, because I didn't think anyone would care what I put for myself. If people really trusted the system to work correctly, nobody would have made an issue of it in the first place.

People talk about Advogato as if it has a "web of trust", but that is extremely misleading. PGP has a web of trust, but using that terminology here misleads people, because that's not what Advogato has. Instead, this truly is a computerized "good ol' boys" network, even if unintentionally. I believe raph has stated (though I can't remember where) that Advogato was arbitrarily focused on free software because he liked free software and wanted to support it, but that the true purpose of creating the site was as a testbed for his research, to see if the certification system would effectively weed out the spammers and trolls while allowing legitimate users to discuss free software without all the noise (from spammers and trolls).

While it's been pretty effective at that goal, it's not clear that it's been as good about letting legitimate participants in, especially recently. I suspect the chosen algorithm may not scale as well in practice was it was intended to, because the more people and more certifications there are, the more those certifications seem to be diluted. Raph states: "The trust metric used to evaluate the certificates is most robust when the certificate graph is dense." In practice, it seems to be excellent at keeping the "good ol' boys" in, and most others out. Since most of Raph's writings here seem to focus on effectiveness in keeping out the bad people, it's not clear whether he ever paid close attention to the flip side of the coin, letting in the good people.

I just downloaded a copy of the people page, and ran a few counts across it. The majority of the accounts are uncertified! Out of 4,760 total accounts, 3316 (69.7%) are uncertified, greyed out as "Observer". Only 322 (6.8%) are certified as "Apprentice". Another 798 (16.8%) are certified as "Journeyer". Finally, 324 (6.8%) are certified as "Master". More people are certified as Master than Apprentice! Last year, that page was mostly green with a little grey, today it's mostly grey with a little green.

Does this make sense? Are there really over 3,000 spammers and trolls here? The barbarians are at the gates? I find this very difficult to believe. It seems much more likely that the certification system is broken, since it's keeping out more than twice as many people as it's letting in. Maybe the implementation is flawless and the algorithm itself isn't scaling well. If the goal was primarily to keep out spammers and trolls (as it seems to be), but it's keeping out the majority of potential participants, isn't that cause for concern? Shouldn't this be investigated, especially if the algorithm isn't scaling as expected?

I complained about the certification system to try to draw attention to this; it doesn't seem to be behaving as intended. Of course, the "good ol' boys" who are solidly entrenched in the system are happy to say that it's working just fine, thank you. On the other hand, Raph says that "the metric is being pickier than usual, almost certainly too picky. Apologies to those who should be cert'ed but aren't."

Raph designed and implemented this certification system, and he isn't convinced it's working right. Isn't it ironic that I get vilified for daring to question it? If everyone removed their certifications from me, and I was left uncertified because of that, I would have been disappointed, but I wouldn't have complained that the system was broken. However, the certifications that used to be quite sufficient were no longer, and that seemed amiss, so (after giving it a month), I tried to draw some attention to it by complaining more vocally. I've been unfairly attacked in return, but at least Raph has noticed that something is amiss with the certifications. Hopefully he'll pay more attention to this issue now, if he can find the time...

hacker, it appears you removed the certification you gave me last week? Why?

Here I am, with sixteen certifications now, and again the system doesn't even consider me to be certified at Apprentice level. Come on! How many does it take?!?

2 Jul 2001 (updated 2 Jul 2001 at 20:57 UTC) »

Thanks, hacker. I do realize that more than one certification might often be necessary, but how much is enough? It seems like a single Apprentice certification from a Master should be sufficient to get certified as an Apprentice; I've seen one Journeyer certification from a Journeyer make the person in question qualify as a Journeyer. Those first 9 certifications seem like they should have been enough, but somehow they weren't.

Now I'm wondering if each additional certification somewhere else in the system that doesn't connect to you dilutes the value of the certifications that you did receive, allowing you to fall back below the threshold...

I want to thank the several people who added "Apprentice" certifications to me today; now the system finally says that I'm certified. I still think there is something amiss with the certification system.

Let me review just the first nine certifications:

  • miniver certified me as a Journeyer, and he used to be certified as a Journeyer -- now he shows no certification at all, just like I did. He has ten listed certifications from people who are themselves certified; why is he uncertified now?
  • nixnut certified me as a Journeyer, and he is currently certified at the Apprentice level.
  • matt certified me as an Apprentice, and he is currently certified at the Apprentice level.
  • ncm certified me as an Apprentice, and he is currently certified at the Master level.
  • edw certified me as an Apprentice, and he is currently certified at the Apprentice level.
  • mettw certified me as an Apprentice, and he is currently certified at the Journeyer level.
  • cbbrowne certified me as an Apprentice, and he is currently certified at the Master level.
  • jLoki certified me as an Apprentice, and he is currently certified at the Journeyer level.
  • mobius certified me as an Apprentice, and he is currently certified at the Master level.
  • khoerling certified me as a Journeyer, and he is currently certified at the Apprentice level.
  • lerdsuwa certified me as an Apprentice, and he is currently certified at the Journeyer level.
Now, with certifications from 3 Masters, 2 Journeyers, 3 Apprentices and one other person, shouldn't I have already been considered an "Apprentice" at the least? With those 9 certifications, I wasn't, and I don't think I was with 11 either. Now I'm up to 14 certifications, and it says I'm certified now.

How many certifications does it take? edw has 4 certifications (not counting himself), and is currently certified at the Apprentice level. Why wasn't I considered certified at all with 8 certifications from other certified users?

Is there some kind of "anti-certification" going around, that miniver and I have been tagged with, or what? Why would I have previously been certified and suddenly become uncertified? This is very confusing...

What is wrong with the certification system?!?

It's been broken for over a month now, with no explanation. I've got NINE listed certifications, but the system won't let me post because I supposedly haven't been certified!

By the way, I finally switched over to Mozilla as my primary web browser in November, 2000. It still has problems, but it's usable enough that I've stuck with it. I'm still a bit disappointed with the longstanding bugs and frequent regressions, but I don't have a better alternative right now. (I'm not a KDE fan, but I wonder if I should try Konquerer anyhow? People keep raving about it, and it can supposedly use embedded Mozilla for HTML rendering...)

Well, I don't know what's going on, but the system isn't acknowledging any certification for me anymore despite the NINE certifications currently listed on my public page. :-(

I was going to reply to nmw's post in this article about LISP and interned strings, but I guess I'll have to "reply" here for now. He said "Try doing that in Python, or Perl, or Tcl. You just can't, [...]" Well, I wondered about that, so I thought about whether something of the sort could be done in Perl, my preferred HLL.

Here's what I came up with:

sub AUTOLOAD {
  my($sub, $space, $name) = $AUTOLOAD =~ /^.*:((?:(.*)_)?(.*))$/;
  $interned_value{$space}{$name} = int($interned_value{$space}{""}++)
    unless exists $interned_value{$space}{$name};
  eval "sub $sub { $interned_value{$space}{$name}; }";
  goto &$sub;
}

The idea is to reference undefined subroutines like &foo_bar in the program, and the AUTOLOAD routine would create a new subroutine returning an integer index to use for a scalar array. The final underscore (if any) in the name would determine the namespace; &foo_name and &bar_name could end up with the same value. (The idea is to avoid wasting entries in a scalar array; think of "foo" and "bar" as structure names.) Once defined, the subroutines would continue to return the same value, which is kind of like interned values in LISP.

I don't know if this is actually faster, though. The subroutine lookup may well be slower than a hash lookup would be. However, this could be VERY useful to conserve memory if many similar data structures are used; using scalar arrays with interned symbolic indexes would be more space efficient than using many hashes with string indexes. (Hopefully a later version of Perl may have a more natural and efficient way to do this sort of thing.)

This isn't a perfect solution, but it does show that Perl is more flexible than it may appear at first. Of course, since Perl is pretty damn fast to begin with, tricks like this usually aren't necessary to get decent performance. Even with its more natural interned symbols, I doubt LISP can approach Perl's performance in general...

Okay, I was a little rushed to catch my bus the other day, so I didn't bother to put a link. The Kuro5hin story, "Release often, sure. But early?" would actually be an interesting topic for discussion on Advogato. Since there's already a discussion underway on Kuro5hin, if anyone has good ideas for me, I urge to you reply to my posting that I copied into my previous diary entry...

The RSA patent expires today! Okay, it's a moot point, since RSA released the algorithm into the public domain 2 weeks early as a P.R. stunt. Regardless, it's a Good Thing (tm).

The rest of this diary entry is a copy of a Kuro5hin.org posting, subject: "I wonder how common this problem is?"

I've been struggling with similar issues. I've got a conferencing system (fairly different from IRC, but similar idea) which I started writing in 1992. I've spent far more of the time since then not working on it than I've spent working on it. Nevertheless, it's about 10,000 lines of C++ code at this point, and it does some clever things.

I originally wrote the system as a simplistic tool to chat in real time with several family members, avoiding long-distance costs. It evolved over time to a halfway capable imitation of an older conferencing system I used years earlier at RPI. It's not quite as sophisticated yet as the experienced users of RPI's conferencing systems expect, and the interface seems arcane to many who are unfamiliar with those systems. Accordingly, there are only a small handful of regular users.

Currently, the source code is unreleased. I've often considered releasing it as free software (i.e. Open Source), but never quite felt ready to do it for a variety of reasons:

  • The project never felt close to "done", and I've been reluctant to release it while I have a mile-long TODO list.
  • I'd like to be able to use the code as an example of the kind of work I'm capable of, but there are some serious defects in the design and coding that I'd want to fix before it can be the caliber of example I want.
  • Between things I want to fix and things I want to add, there are some major architectural changes likely. Involving other developers makes this more difficult.
  • At the same time, releasing the source usually suggests accepting patches from others, which would dilute the code; right now it's 100% my own code; diluted code wouldn't serve as well as an example.
  • I never seem to find enough time to work on the project, and maintaining a full Open Source project with all the ancillary bits (website, CVS server, mailing list, etc.) takes time.
  • I'm concerned about trying to build and maintain a userbase. It's harder to find motivation to add features and fix bugs when few people will ever benefit from it. Releasing the code would enable someone else to "steal" users with my own code, especially if I don't find time to compete with any potential forked codebase.
  • I've wondered if there's any potential to make money from the code, that might not be possible if the source is freely available. I still wonder about this, but I'm not convinced it's worth pursuing. I'd rather find a way to make money on it as an Open Source project, if possible. Meanwhile, I do have a day job.
  • And, of course, I simply haven't gotten a Round Tuit (tm).

After recent reflection, I've concluded that my best course of action would probably be to release the code as Open Source, but not under the GPL at this time. I don't want to encourage a code fork, and I want to be able to maintain a pristine version of the code for now; I don't know that I'm ready yet to incorporate outside code and start the dilution process. I could maintain a patch pool of outside patches and possibly my own code fork, my code vs. the patched version. The QPL seems like the most appropriate license at the moment; I'd consider switching to the GPL at a later time.

Even so, I need to find some time to audit the code for things I really don't want to be seen, and to clean up some of the ugliest kludges, and to get the infrastructure setup, etc. I don't know when I'll find the time, but that's basically my current thinking.

Anyone have any thoughts or suggestions for me?

4 older entries...

New Advogato Features

New HTML Parser: The long-awaited libxml2 based HTML parser code is live. It needs further work but already handles most markup better than the original parser.

Keep up with the latest Advogato features by reading the Advogato status blog.

If you're a C programmer with some spare time, take a look at the mod_virgule project page and help us with one of the tasks on the ToDo list!