Older blog entries for Ankh (starting at number 120)

XML Query as Middleware is something people have been saying for a while now (including me in a talk at Emerging Technologies next week). The idea that everything going over the glue-pipes is XML turns out to be as startling in its way as the idea some 30 years earlier that everything could be newline-terminated ASCII streams of text. It's something I've been looking forward to for over a decade, but it snuck up on me and startlified me by coming out of XML Query.

orkut: I've been surprised at some of the invitations. It's also clear the site isn't scaling all that well, although there doesn't seem to be anything that can't be fixed. The lists of thumbnails of friends, sorted by number of friends, seem to encourage competition. But I really wish they'd start making the friend network available in RDF, issuing RSS feeds of recently joining people per community, and so forth. It's not really giving enough back to the broader Web community, I think.

Pictures: I had a request to scan the Slavonic entries from the almost three-hundred-year-old (1713) Oratio Dominica that I have online, and an offer to transcribe them. It's not clear how to do the transcription, and I may need to use a TEI Writing System Declaration and the Unicode private use area in the absence of Old Church Slavonic characters in Unicode 4. Until then, though, I'm busy with castle siege engines.

XML at the W3C is moving along at its own sweet pace; XML 1.1, Namespaces 1.1, Infoset 2nd Edition and XML 1.0 3rd Edition have all been published as W3C Recommendations. XML Parsers should be updated to accept both 1.1 and 1.0 documents (validating them appropriately). If you generate XML, it's OK to continue to generate XML 1.0, unless of course you need the new features, such as Unicode-compliant line end support (NEL) and extra name characters in element names, attributes, IDs etc.

nymia, thanks for commenting on the pictures scanned from old books. I keep thinking of getting a domain for them like oldengravins.org or something and seeing if others will contribute.

Orkut.com says to me, You are connected to -31062 people through 72 friends. A negative number of people? Could it be that the programmer used a signed 16-bit integer somewhere and more than 32767 people signed up?? At any rate the number seems to be moving towards zero, strengthening the hypothesis.
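To make the overflow hypothesis concrete, here is a minimal sketch of how a count would look if it were squeezed into a signed 16-bit integer. This is only my guess at the mechanism, not anything known about orkut's code; the function name to_int16 is made up for illustration.

```python
def to_int16(n):
    """Reinterpret an integer as a signed 16-bit value (two's complement)."""
    n &= 0xFFFF                      # keep only the low 16 bits
    return n - 0x10000 if n >= 0x8000 else n

# A true count of 34474 people would be displayed as -31062.
print(to_int16(34474))

# As the true count keeps growing, the displayed value moves towards zero.
print(to_int16(34475))
```

Under this assumption the counter first goes negative once the real number passes 32767, and each new person moves the displayed figure one step closer to zero, which fits what the site is showing.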

I notice that of the images in my deviant art gallery, the ones marked as Wallpaper/Medieval seem to be one to two orders of magnitude more popular than the others. I am guessing that a lot of people browse the site by category and choose Wallpapers, so I've moved all my pictures of the right size that might be useful for screen backgrounds into that category, to see if it made a difference. It did.

What interests me about this is that it presumably indicates that the Deviant Art site is browsed by a lot of people who are not part of the community. This is very different from orkut, where you have to be a member to be signed up. Of course, the popularity of sites like Digital Blasphemy indicates a strong demand for screen backgrounds too, so it's hard to interpret the skimpy data I have ;-)

Speaking of orkut, if you know me and want to be invited, get in touch and I'll happily invite you. Don't forget to tell me what colour socks you're wearing.

I've been running Knoppix for a few days, because my laptop died and I'm borrowing a system that doesn't boot except from floppy. It'd be nice to have Flash installed - even nicer if fewer Web sites used Flash, especially now there is SVG - and it'd be nice to have more memory, and it'd be nice if Knoppix's hardware detection was as good as Mandrake's seems to be - Knoppix didn't find the sound on this system for example - but it's a lot better than having to walk up the road to use an Internet Cafe.

Speaking of Flash, I went to see Lord of the Rings / Return of the King last night. I tried to book a ticket online, but the Paramount web site is totally useless without Flash. Luckily there was plenty of room, and I saw the film projected onto an IMAX screen. I love the Alan Lee drawings in the closing credits, and definitely felt it was the best of the three films. I forgive them their deviations from the book - not that they care about what I think, but I do :-)

Someone asked me to scan in some samples of Serbian script I have; when my laptop is back I'll do that. Possibly tomorrow.

When I get back from the O'Reilly Emerging Technology conference next week I want to post an article about why I use the particular distribution of Linux I do (when my laptop is working!) - not to start a distribution war, but to try and elicit discussion about the maturity of Linux, and a bit about the cultural and social problems we have to solve. If you're willing to help by reading a draft, please let me know.

elanthis, my favourite rogue-like game was always Omega, for what it's worth, by (I think) Laurence Brothers. Part of that was that the humour was clearly separated from the mechanism. Looking at AweMud's "screenshots", the game mechanics seem very in-your-face; I think I'd find it felt more like wargaming as a result than a fantasy/quest sort of game, but maybe I'm showing my bias (or my age?). I assume that calling a forest clearing a "room" is just poor scripting, and not part of the game engine. The last two face-to-face roleplaying campaigns I ran (in 1989/1990 and in 1992 or so) didn't have any combat, which was just as well as I didn't write a combat system for them. The players seemed to enjoy them, though.

I had to take my laptop to be repaired today, and stopped off at BCE Place (between Yonge and Bay, south of King, in Toronto) and ate in the Movenpick there. When I came out, there was a photography exhibition there, in the Canada Trust (or whatever bank it is) part of the building.

You might expect something very corporate. It was called "Toronto, a celebration" or something. There were maybe 100 photos printed maybe 2 feet across, mostly showing events like the Jazz Festival.

There were some pretty neat photos. One (visible from quite some distance away because it was on the end of the display) was of someone wearing very tight white undies or shorts with "gaypride" printed on them. It could have been in a beer ad or something... except I realised after a moment that the wearer had a beard.

Another was of two men wearing suits and ties, kissing; it was called "Just Married". There were pics of the Hindu Festival, the Pride Parade, Caribana, Jazz Festival, and quite a few others I've forgotten now. But it was neat to see the ones I mentioned, in a context where you might expect controversy to be avoided.

Of course, gay marriage isn't all that controversial here :-) The exhibition has a web page, although I can't see it right now (using Knoppix while I have no laptop; I should try Gnoppix!)

Speaking of controversial, I've been playing with orkut too. It feels somewhat validating to get a message, so-and-so wanted to add you as a friend, but then there's pressure to get as many friends as possible (I'm resisting). I'd like to distinguish friends, co-workers, acquaintances, lust-bodies and so forth, but then, what's the point? You can do almost nothing with it.

I can see interesting possibilities if it's integrated with Google's main search. For example, "weight my Google search so that pages made by friends, or linked to by them, are higher in the results".

It doesn't have the focus of a community site like deviantart.com (that's a link to my page there if you care; follow the "Gallery" link there to see the pictures I put there) nor the shared interaction of Advogato.

I'm off to San Diego next week to speak at O'Reilly's Emerging Technology conference. I'm really not sure what the audience will be like; I'm guessing it'll be more analysts, consultants, people trying to decide if new technology is important to them, than programmers trying to learn it, compared to (say) OSCON, but I'll see. I'll be talking about XML Query, which is actually pretty cool despite being fairly large and complex.

I've been working on a search engine for my Pictures of Ruined castles and other engravings and woodcuts from old books. This uses XML Query and RDF and SQL and XHTML and XML and CSS and SVG all at the same time, so it's very Acronym-friendly. I'll share the URL after the conference; I need to do a bit more work on it first, to make it more usable. It's pretty neat to discover how much you can do with XML Query in an all-XML world.

By the way, I've noticed that people visiting my picture site from Advogato tend to be most interested in the Oratio Dominica, a complete scan of an 18th Century book giving the Lord's Prayer in over 100 languages and in lots of different scripts. Next most popular is Fry's Pantographia, another old book of scripts. Maybe geeks like languages?

I'm remembering having lunch at the Ras Cafe just off Central Square, when I lived in Cambridge MA. I ordered a sandwich and a coffee and heard Ettore and Anna right next to me ask if I wanted to join them.

All who knew Ettore, you are in my thoughts.

Went (along with my partner and Larry, a friend from Oswego, NY) to a Pow Wow, the first I've ever been to, at the Toronto Skydome. They had over 600 dancers, and I got to see and learn about a lot of aboriginal things, although it's only scratching the surface really. Someone was selling tee-shirts saying, Homeland security: fighting terrorists since 1492. If I felt slightly out of place (unlike Clyde and Larry, I can't plausibly claim any Native blood inside me at all) I did not feel unwelcome, and for once my wandering around dressed in bright orange and bare feet didn't make anyone bat an eyelid.

Larry is a song-writer, so he went to the Music Industry Workshop and said afterwards it was pretty helpful. I had been going to suggest that he put some of his music on mp3.com, but with that being bought by CNet, maybe iTunes is looking better.

An aboriginal event was an interesting contrast to a week of meetings in Japan. Now I think I need to sleep a lot.

Before I do that, though, I've tried to rescue www.valinor.sorcery.net, a web site where I had a list of IRC clients and a glossary that were quite widely linked to. The ISP decided to close port 6667 (used by IRC) without any explanation, and several months later when I finally got an answer, it was that the port (and port 7000) was closed for security reasons for all of their customers. I mind that less than that it took months to find out, and the only refunds they are prepared to give are credits for a service I've no intention of using. It's not worth the small claims court, but I wouldn't use Interland again. It's a shame, because HostPro, the original hosting company I used, was very responsive, but Interland bought them.

Just got back from a week in Japan for W3C AC meetings. Very tired. My laptop died on the 2nd day there; back in Toronto it took an hour and a half for an engineer to fix it. A fuse had blown. It's taking more than an hour and a half for me to recover from the 26 hours of travel from Shin-Yokohama to Toronto, though. Well, OK, 26 hours from when I got up to get breakfast to when I collapsed in a heap at home, attended by two anxious cats!

A couple of people asked me, incredulously, what did you do in meetings all day without a laptop? and part of me thinks there's something wrong with this picture. First, along came powerpoint presentations (or their HTML slidemaker equivalent) and the visual aid, that once added interest and delight, was reduced to an absurd summary to help remove the need to communicate clearly. Then people realised that they didn't need to listen to the speaker. And now it's hard to speak and be heard, because people expect to be able to view porn or play Ages of Corruption or some other game on their laptops, or read email and chat on IRC and buy socks on eBay.

On the other hand, in the audience, I wrote a fairly detailed outline for a novel, and practiced my calligraphy.

wspace, you're missing the point. The point of XML is not that it's more or less ugly than indented text, it's that I don't have to parse it, it's that every XML-aware tool can already read it. Consider using XML Query to find all conf entries with timeout < 10:

/host[timeout < 10]

That's more succinct than the corresponding perl, and this is for a format with only one level of nesting and no structure in individual values.
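For comparison, here is a rough sketch of the same filter written out by hand in a general-purpose language (Python here rather than perl). The document layout is my assumption for illustration: a conf root holding host elements, each with name and timeout children. None of those names come from the original format being discussed.

```python
import xml.etree.ElementTree as ET

# Hypothetical configuration document, invented for this example.
CONF = """<conf>
  <host><name>alpha</name><timeout>5</timeout></host>
  <host><name>beta</name><timeout>30</timeout></host>
  <host><name>gamma</name><timeout>9</timeout></host>
</conf>"""

def hosts_with_short_timeout(xml_text, limit=10):
    """Return the names of host entries whose timeout is below limit."""
    root = ET.fromstring(xml_text)
    names = []
    for host in root.findall("host"):
        # A host with no timeout element is treated as never matching,
        # as the predicate in the query would; values are assumed numeric.
        timeout = float(host.findtext("timeout", default="inf"))
        if timeout < limit:
            names.append(host.findtext("name"))
    return names

print(hosts_with_short_timeout(CONF))
```

Even with a library doing the parsing, the loop, the conversion and the comparison all have to be spelled out, where the query says the same thing in a single predicate.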

Why would you want humans to edit configuration files anyway?

Thomasvs, for getting rid of stuff, one approach is to go without for a while. I admit it doesn't work for me very well, though. Stuff is a form of security, it shows I'm real.

Exercise: obtain a pair of pyjamas or a track suit or a pair of shorts and a plain tee-shirt (and if you live in the US or Canada, a pair of flip-flops); also obtain a toothbrush and toothpaste, a blank pad of sketch or writing paper and a pen. Wear (only) the items you just purchased, and carry the others in the plastic carrier bag they came in, and take public transport to a hotel. Stay there for a week, writing about your feelings. If possible, ask for a hotel room without a television, or unplug it.

When you return, ask yourself which of the items you own you wanted or needed during that week, and why.

As I said, this tends not to work for me, partly because on my return I get too embroiled in everyday life to do an actual purge.

MichaelCrawford, like you, I emigrated to Canada, although from the UK not from the US. When I became a Canadian citizen it was because I felt (and feel) that I should contribute more to my surroundings. But if you move to Canada, do it because of a love of Canada, not only because of a hatred or fear of what the US has become. If you do decide you like it here, I'm sure you'll be very welcome :-)

My husband is in the process of becoming a landed immigrant so he can join me here in Toronto.

This summer was busier than I have ever been, I think. I wore shoes for more of it than I like, too.

I'm trying to start catching up publishing all the talks I gave, starting at a conference on the topic of the preservation and archiving of digital media in March.

But sometimes I get distracted; I've been playing with Deviant Art (thanks to halcy0n for suggesting it) and also transcribing a 1718 Grammar of Heraldry, a small book I happen to own.

Where do nudists carry their USB memory sticks?

It's Pride Weekend here in Toronto. A friend from Sorcerynet (Ben/riot) is visiting us, and he has been enjoying seeing thousands (literally) of openly Queer people wandering about, holding hands and kissing in the street.

Probably I should post some notes from Guadec but I have been too busy, and others have no doubt said all that I would say.

Life here has been insanely busy since April, and is likely to stay that way for another month or so, unless I run out of health and strength!

I have been generating some SVG with XML Query (using Galax and also the latest Saxon). This is a lot of fun.

I've been experimenting with SVG recently. I posted some notes on my initial experiences. Probably I should get a book on SVG and benefit from someone else going through this phase. The most irritating discovery so far is that Adobe's SVG Zone crashes the Adobe SVG plugin. (I won't give a link here, for obvious reasons)

You can see what I've been up to at www.holoweb.net/~liam/download/yggdrasil6.svgz (if it doesn't work for you try the .svg file instead; if you get a 404 not found, it's because that's a temporary area and this is a work in progress; I'll post a proper link when there's something more stable.)

I also updated my short technical booklist of XML and other books, to add the XSLT cookbook and the RSS book from ORA, both pretty good.

Finally put up a binary RPM package for lq-text, my text retrieval package. It's compiled on Mandrake Cooker, feedback wanted - download it here if you want. I don't have a devel or src package yet, sorry. I will put up the tarball with the .spec file in it when I get a chance.

I wrote lq-text in 1989, after discovering that the cheapest commercial text indexing packages of any use cost upwards of US$30,000. Well, they had a lot more features than lq-text, but I posted the source on net.sources or comp.sources.unix or whatever it was called back then, and it was quite popular for a while. Now, since it doesn't have a GUI, it's probably less interesting, but I've used it in a few Web projects, so it's not dead.

Someone contacted me a year or two ago and said they'd been reviewing commercial text retrieval products, and that mine was the only indexer they found that could get through all their test data. Probably this is because I assumed most data structures would not fit in memory, back when 4MBytes was a lot of memory for a process to use. Well, outside of editors written at MIT :-)

 

Tomorrow I fly to XML Query face to face meetings, and then to Budapest for WWW 2003, the Web Conference. The streets of Budapest will soon embrace my bare feet. Lucky Budapest.
