Older blog entries for Ankh (starting at number 147)

Working on a trip to New Delhi, hoping to meet some potential W3C Members who can contribute a lot to our specifications and to the World Wide Web.

The article I posted here on Advogato about the future of XML (go look at the front page) has attracted a lot of interest on the xml-dev mailing list and elsewhere. Some people feel XML should never ever be changed, that the Specification is a Holy and Sacred Text. Some feel that the specification could be changed as long as no line of code had to be changed and no documents were affected. Some are concerned mostly with interoperability, and are OK with some code changes as long as no documents are affected. Some are willing to change documents but not code. And some are willing to change both, but only beneath the SAX or DOM level. There has even been talk of sacrificing children, something not currently mandated by any W3C specification. (nor, as far as I am aware, by any specification from IEEE, IETF, Oasys, ISO, ICE, not any other organisation or consortium in the computer industry. )

It's nice to see that people are listening, though, and expressing opinions. How else can we learn how people feel?


xach, maybe W3C should define Assembly Language for the Web, with no curly braces. Maybe there could be an arbitrary number of registers each defined by a URI. Text hadling in assembly tends to be hard, but hey, real programmers don't use library functions! :-)

pjrm, you might like to look at the XSL-FO specification as a way of formatting arbitrary XML content. If you do come up with new table layouts, the XSL Working Group may be interested to hear about them -- but note that there's been a lot of research on table layouts over the past five hundred or so years, so although there's certainly the possibility of innovation, it helps your case if you study what has gone before. I can give you some references, although for computer algorithms I'm afraid that most of the work has been propietary.

You mention minimising wasted space, so I should note that an experienced graphic designer will often increase spacing in tables to improve readability. One of the guiding principles is to have related items clearly aligned one with the other, of course.

I wouldn't hold out too much hope of getting changes into HTML at this point, but if you (or your organisation) wants to join W3C and participate in the HTML or CSS Working group, I can give you the necessary information, of course.

mirwin, I'm very pleased that you like the pictures from old books. Interestingly, I note that quite a few people visit the page when I mention it in Advogato, but look only at that single Web page, and hence don't get to see the pictures. I should redesign the page. But I digress. For those pictures that are marked on their corresponding HTML pages as being in the public domain, you can do anything you like with them as long as it's in accordance with the usage guidelines on my page. For the others, I have sent you email.

dfenwick, I recently tried running textedit (it comes with the xview-clients package). It starts masssively faster than gedit on my system. It has some features that seem weird at first (e.g. the ability to surround the selection with |> and <|, a feature used by Sun's calendar manager) but it has drag and drop, and from a user perspective the Sun OpenWindows desktop was probably slightly more integrated than Gnome.

So why the huge slowdown? Partly the number of shared libraries I expect, and partly the use of more layers. Is the code easier to maintain? That's not clear to me; having been inside both XView and Gtk+, the glue is more visible in XView, and it's not so clean, but then it's more than ten years older. Gedit has syntax highlighting I think (I've never seen it) but as a notepad replacement it actually seems less useful than textedit, which can easily reformat a paragraph or insert the output of a command.

The difference might be in the focus of the development team. The open source community tends to have a hard time in thinking of people trying to get every-day tasks done, rather than programmers using their code. The Gnome project has made huge and fabulous advances here, and the Gnome human interface guidlines have also helped dramatically, so we are definitely seeing improvements. Bt it's a pain that a 600MHz pentium system should feel even slightly sluggish.

Last week I posted an article about the possible future of XML at W3C that has had some comment also on the xml-dev mailing list.

cdfrey, you might like to investigate XSLT or even XML Query as alternatives that, after an initial culture shock, can sometimes be a lot cleaner.

I finally broke down and added Goodle Adsense Ads to my pictures scanned from old books and my eighteenth-century dictionry of thieving slang. The revenues are adding up to about US$2/day, which isn't a lot but will probably pay for Web hosting if we need to move the server.

In order to make it less likely we need to move server I have made a bandwidth theft image that's used if people use an img element to embed one of my images in their Web page. I've had several mail messages as a result, requesting permission. In most cases people are happy using a lower resolution image -- google's image search tends to find the largest resolution available, and many people assume they can use images they find on the Web however tehy please.

Actually it's quite common for people to use width and height attributes to scale down a one megabyte image to generate a thumbnail, not understanding the performance implications. Sigh.


gicmo, x means you cam make the directory your current working directory (also needed for descending into subdirectories). r means you can read the dierctory file itself, which lets you get a directory listing. Making a Unix directory mode -rwx--x--x (0711) means that other users can fetch files from the directory only if they know the filenames, and is sometimes used for incoming ftp servers or for use with a Web server such as Apache to prevent a directory listing.

I've been mulling over ways to garner public feedback - especially from open source developers - about what we (W3C) should be doing with one of the sets of specifications we publish - the XML family as I sometimes think of it. Is it time to start XML 2.0?

Where would you go (or post) to ask people why they're not using XML? There are lots of good reasons not to use XML, and lots of good reasons to use it, so I'm particularly interested in people who would like to go with XML but who feel they can't.

Performance is the most commonly-cited reason but it's not the only one. If we made some sort of more compact way to transmit XML that was more efficient to process, but still reasonably easy to deal with, would it make a difference?

What about the other specs in the family?

For my part, I'd like to see more work on integrating SVG and XSL-FO, and notes on working with both W3C XML Schema and Relax NG. I don't know how to make those things happen. I'd like to see some hot next-generation linking and multimedia love for the World Wide Web, but the most widely-used browser is quietly stagnating. I'd like to see more use of client-side XML, but even though there is XSLT in all the browsers that seem to me worth the time of day :-), the interaction between CSS, XSLT, DOM and HTML (and especially HTML forms) is too ill-defined. I want to run client-side XSLT on CSS files too, but that's another matter.

It's hard to say to people that we need better typography on screens. Even when typographic research indicates that typeface design, rendering quality, line and word spacing, choice of typeface, kerning, and many other factors, can all affect reading time, comprehension, recall and tiredness, none of that really translates into product decisions for most people. Oops, got into a rant there!

Anyway, maybe I'll post something on the front page of Advogato and see if anyone replies. Maybe an article on xml.com or somewhere too, if the editor will accept one from a man in bare feet.


Added Goodle Adwords to some of my Web pages. The income isn't huge, but it would be enough to pay for Web hosting, I think, if it keeps up at the rate of the first five days.

I also decided to do something about people who link to my images without giving me credit. A lot of people use an img element with height and width attributes to scale an image down, and set the src attribute to point to my Web site. If they do that now, and point to any but the smallest resolution image, they get this image, thanks to some mod_rewrite trickery.

I've had a few mail messages from people worried about their blogs, and served the image about 3,000 times so far, saving maybe as much as a gigabyte in transfer. I don't pay for the transfer right now, but I might have to shortly.


It's too cold in Toronto for long barefoot walks. There's snow everywhere. My husband and I are planning on buying a house shortly, and one of the candidates has over 80 acres of land. It's not very expensive, either, since it's quite some way outside Toronto, with the main concern I have being the difficulty of getting to the airport. Buying a house is very, very different in North America to England, where I last did it some eighteen years ago, mostly because of the multi-listing service (MLS).

Enuogh for now. Still recovering from trip to Australia. This travel thing can be tiring.

dwmw2 oops, I shouldn't post arguments when I'm jet-lagged, sorry! Altghough I have in fact erad your aritcle, I was confusing it with another, and for that I apologise.

I'm not going to respond further right now, simply because I'm clearly too tired, but I'll try to do so tomorrow, or whenever the fog clears, if it ever does.

On IP spoofing, I should try and clarify: the problem is one of traceability and accountability. People have suggest trust networks, but those fail in the face of forged mail.

There are North American ISPs that block port 25 too (other than to the designated mail server, of course), and although (as you rightly say) it doesn't block all forged mail, it does, when combined with disallowing forged IP addresses, leave a much clearer audit trail.

I think I was trying to agree with you about the forwarding, by the way.


I'm back from Brisbane, but (as expected) fairly tired. Kudos to DSTC for hosting us. (Us here is the XML Query, XSL and XML Schema Working Groups of the W3C, maybe a total of under 30 people working on Schema 1.1, XML Query, XSLT 2.0, and a number of shared specs between XSL and XQuery such as XPath and the Formal Semantics document). The meetings went pretty well, although it was a lot of work.

I've been wondering if I have the energy to go to Guadec this year. Probably not, unfortunately, although I'll be at XTech (used tobe called XML Europe) and very possibly at the WWW conference in Japan. I might also go to Doors of Perception, a conference on design that's interested me for overa decade.

A lot of talkback today...

nutella, yes, zhaoway's link was very interesting. I've also been reading Anthony Hall's The American Empire and the Fourth World [referrer link; see also www.goodminds.com for the Native-owned booksellers where I bought my copy], a book that I recommend, although I found I had to ask someone from the US to explain terms with which I was unfamiliar from time to time, such as Manifest Destiny. I don't think a conspiracy theory is needed - I agree with Noam Chomsky (e.g. see his essay reproduced in You are being lied to) - you only need to believe in the idea that wealth and power often (sometimes?) lead to greed and ruthlessness. I've often thought that people elected into power should lose all their possessions and monies, so that gaining personal wealth could never be a reason to stand for election. The main problem with that idea is that if you make politicians go naked then the politicians who would be elected would mostly be porn stars, I suppose. But California seems to be discovering that film stars aren't always entirely bad news, and maybe it'd help the US get over it's inexplicable fascination with and simultaneous disapproval of sex and nudity.

badvogato - I remember a breakfast conversation between Tim Berners-Lee and Alan Kay (and I think connolly and some others, maybe Yuri Rubinsky was there although I'm not sure now) as a result of Alan Kay saying in his keybote that HTML was the MS-DOS of the Web. There was no resolution, but I'll note in passing that the strength of marked-up text is that the reader, the user, the consumer, says how they want to use the text, not the author or producer.

dcoombs, if your socks are frictionless how do they stay on your feet? Maybe it would be safer if you were to work barefoot?

elanthis don't worry, you're still young :-) Arpeggios are closely related to broken chords; I'm not sure if there's really a difference at all in practice except that "broken chord" doesn't sound as pertentious! Morals as preached are, however, clearly different from morals as practiced. Sometimes this can be good (e.g. the Roman Catholic Church may have been responsible for tens of millions of deaths in Africa by telling lies about condoms; if the people had not gone with the morals the priests espoused, they'd perhaps be uninfected today; similarly Augustine's teaching that the christians needed to keep the Jews around as a reminder that "they" killed Christ, but not let them have power, may have been a major factor in the preaching that led to the massacres of Jews by so-called Christians in the Crusades (literally, Cross Wars) and later. Moral teaching must, it seems to me, be limited to general principles: absolutism in details is barely distinguishable from fanaticism. the Puritans whom you mention chose to forget that Jesus is described in the Gospels as turning water innto wine, and actually instructing his followers to drink wine. He also received an erotic massage from a prostitute, and it would be pretty weird to believe that someone who hung out with prostitutes would be a virgin. The "morality" of abstaining from sex and drugs (including alcohol) is mostly about trying to enforce a "work ethic" to keep poor people busy and productive. Harm no-one, love those around you, and try to accept people as they are, and everything else falls into place, I think. But it's easier to say than to accomplish.

deekayen, neat, the BBC survey said I'm a spatial thinker. If that's so, how come I get lost so easily? :-)

cdfrey, I think most people read http://www.advogato.org/recentlog.html once every day or two, or use the RSS feed. I suppose you could then search that page for your advogato nickname if you're too busy to read all the diaries :-)

dwmw2, although that old "why not SPF" article reappears every now and again, it's not at all clear that the arguments in it are sound. Certainly the author does not substantiate them very well. Neither a whitelist nor a blacklist seems to solve all use cases.

I personally continue to believe that the first step in addressing spam is actually for ISPs to be liable for forged src addresses in IP packets that they forward, and for forged email to be treated as fraud. If a cable modem sends a packet from a home computer up to the ISP, the ISP should check that the src field of the packet is correct before accepting the packet. This is a low cost thing to do and would make fake email headers massively easier to deal with.

This, combined with disallowing a direct connection to or from port 25 except by arrangement, which would remove the ability in most cases for a virus or worm to send or receive email, would make the use of armies of trojan'd Windows XP sysetms (or Linux systems or MacOS systems) to send spam pretty much a non-stater.

For now, SPF represents a good step forward, although it's true that there can be problems with people who use forwarding services that don't themselves publish SF records.

We do publish an SPF record at W3C although I don't yet do so on the server that I share with my brother (holoweb.net) because of a (cough) difference of opinion between Network Solutions and my brother. If solutions are liquids that dissolve things, network solutions are presumably things thet take away networking, and our expectations at times have been met :-)

I'm in Brisbane for XML Query Working Group meetings, kindly hosted by DSTC on the Queensland University Campus. I and some colleagues also spoke at a workshop on Friday.

Much more thinking about XML things than open source things of late, although as usual I've been following Mandrake Linux (cooker). We're slowly getting to the point where more and more of users' needs are met, but meeting 80% of everyone's needs is not as good as meeting all of the needs of 80% of people.

Someone wrote to me recently to ask for an office suite and database to help her run her small business. I don't think postgresql or mysql arethe sort of thing she had in mind :-) although OpenOffice will probably meet her needs in practice, perhaps along with gnumeric. We still don't have anything like Quark or InDesign, although scribus is starting to approach an old version of PageMaker, and passportout is also interesting for a more technical user.

The open source world can meet most of the day to day needs of a great number of people, which is a solid and remarkable achievement. It's not clear to me that computers have anywhere reached anything like their full potential, and meeting all of the uses people might ever have for computers is obviously not possible, so the right question is to ask where the growth in computing capability is most aligned with people's needs. That's somewhere that open source does seemto have a clar advantage.


steved, I've been interested in the reciprocality project for a while, even though the Web site isn't perhaps as grokkable as I'd wish ;-)

haruspex, well, Bush is now a Kingit seems.

Clyde and I were sick over christmas, so we didn't do anything. We might have a late Christmas in a day or two.

It's interesting (for a massochist like me at least) to watch discussions on the xml-dev mailing list about restricted XPath patterns. Someone wanted to design a language, and saved development time by using an XML-based syntax and a SAX-based parser. It's a perfectly valid use of XML, and a good reminder of why we don't have a single data model: he takes SAX events and builds the equivalent of a parse tree for his language.

Next year has to be (for me at least) the opening of public discussions on what comes after XML 1.1, XML Schema 1.1, XPath 2, XSLT 2, XQuery 1.0 and 1.1, XSL-FO 1.1, and generally where W3C should be going with XML.

In the mean-time my 2.6.8 kernel has a problem when the system gets too busy: the system clock jumps back an hour, then a few seconds later jumps forward again! I'm not sure why but it really confuses X and also sound players. The whole system (including the console, not just X) freezes while it does this. I'm wondered for a while if it was using free disk space on my cheap external USB disk for swap, and having to wait for the disk to spin up, but a program that accessed the disk every few seconds didn't seem to help. Maybe I'll try a newer kernel.

I spent some time thinking about timezones, dates, XML attributes and elements, and XML Schema. I'm not sure that was very productive or enlightening but it was more interesting than it might sound.

I've also spent some time comparing the Deviant Art and PhotoSIG art community sites. I've been trying to understand what makes online communities succeed or fail. Part of it seems to involve a mixture of self-policing and community ratings combined with a reward or status system. Where the community Web site doesn't have an obvious rating for people, the yusers often invent one (e.g. Pageviews on Deviant Art, an essentially irrelevent statistic). I'd like to see a standardised format for a community trust metric, so people can say, I'm a top-tier Deviant Art contributer (or whatever) and aggregate that with their Advogato Journeyer and Slashdot Gigaflamer status :-) perhaps to give them status on some other site, or to help build federated identities.

raph - One of the difficulties with greyscale fonts in the past has been how to print them on one-bit-per-pixel device such as a laser printer or typesetter. Print has generally been seen as the objective of type design for most people in the past.

Another difficulty is that early windowing systems had 1-bit-per-pixel APIs that made it hard to change.

It's a pain because it also precluded colour outline fonts.

10 Dec 2004 (updated 12 Dec 2004 at 01:24 UTC) »

[update: this didn't make it onto the recentlog page before, I think, for what little it's worth.]

Voted for gnome board, then left for meetings in Boston and Ottawa, and the mail server I share with my brother promptly stopped working. Or more precisely the router where it's hosted imploded. The Internet ate my homework. Must be the wrong sort of snow.

The first meeting in Boston were a W3C Architecture Team face to face meeting in the oddly-shaped Stata Centre at MIT.

The BBC were there filming in preparation for the next meeting, the W3C's 10th Birthday Party. So there was a brief glimpse of me on the BBC news at ten, it seems. I'm sorry if I spoilt your dinner.

The 10th anniversary was interesting, although seeing most of the W3C Team wearing suits was a little freaky. I even wore shoes and socks. Sometimes you have to make compromises :-)

Then we had an Advisory Committee meeting, at which the organizations involved in the W3C, the Working Group chairs and the W3C staff (the "Team") all get together. Well, a lot of them do.

There's a difficult balance in running a consortium for developing open, public, freely available standards. How do you get the stakeholders around the table and also get public participation and buy-in? It varies from technology to technology, and works better in some cases than others. After the AC meetings we had a Team Day, and for part of that we wandered around Boston interviewing people. It's surprising how few people know that W3C does anything more than HTML and CSS, or that working groups have to deal with public comments. Well, sometimes it's surprising, and sometimes it's a reminder that W3C doesn't have a large marketing budget, and the staff don't have enough time to do enough outreach right now.

This week I've been in Ottawa for XML Binary Characterization meetings. We're trying to work out whether W3C should specify a single format for more efficient and compact interchange of XML. A lot of people want this, and a lot of others don't want the inevitable disruption that a new representation of XML might well cause. I'm beginning to get a much clearer picture of our options.

My husband (Clyde) came with me to Ottawa, and tomorrow we'll wander round the city a bit. We ate tonight at the Sweetgrass Aboriginal Bistro, and I highly recommend it. Reservations needed.

A piece of good news while I was in the meetings: The Canadian Supreme Court's decision means that the Canadian government is now likely to make clear that same sex marriage is legal. It's already legal in several Provinces, including here in Ontario. The previous prime minister, Jean Chr├ętien, pointed out that you can't (by definition) have a vote on specific minority rights, since votes are about majority views. Instead, you have to agree to support minority rights, and then the rest follows. Unfortunately, not everyone is happy, and that, too, is inevitable. The unhappy people tend to be in the majority, and therefore tend to think themselves in the right, but being in a majority has nothing to do with whether people are right, wrong, accepting of others or judgemental, loving and compassionate or filled with hate.


DV, try the Adobe SVG viewer. You may need an old build of Mozilla; I have one that works with version 3 of the plugin, mozilla-2003-03-09-svg, if you want. Newer versions don't work reliably for me.

This week we had the W3C 10th Birthday Party in Boston. The BBC filmed us (amongst many others) so I hear there was a brief glimpse of the back of me on the news. Fame at last. Next to top the music charts! Well, OK, maybe not.

Ten years ago seems a long time now, and the first North American Web conference in that crazy old hotel in Chicago with such frenetic excitement is a world away.

During the day itself I often found myself thinking of my former friend and boss at SoftQuad, Yuri Rubinsky - he would have been so delighted in so much that happened with the Web and with XML, and especially with all the progress on accessibility and internationaliszation, although perhaps sad that so many organizations don't quite seem to understand it all.


raph, I suspect xterm (or libXft) is expecting an outline to convolve or do analytical anti-aliasing with, not a greyscale bitmap. The old X server-side font code only ever worked with 1-bit-deep fonts unfortunately. NeWS used to support colour and greyscale Type 3 PostScript fonts, but if you used them with xterm much weirdness ensued! It'd be nice to see them work, though.

nymia, every now and again I try to post something about writing efficient code, to try and remind people that in fact it matters. In some ways my Linux/Gnome desktop feels slower than a Sunview system from over a decade ago (although in other ways it's a huge amount faster of course). Sun worked hard on speeding up their (kernel-based) window system (SunView, predating X or NeWS), and some of the hacks seemed ugly -- not giving a window a mouse event immediately, in case the mouse was just passing over the window; compiling several unrelated applications together into a single program that looked at argv[0] because people usually ran them all, and it increased the amount of shared code, lots of ugliness :-) But sometimes some ugliness gives a big benefit. And sometimes it's the ugliness that makes the desktop sluggish, when people code poorly.

Sometimes a perl program is faster than a C program, either because it uses a better algorithm or because it's using code built-in to Perl and heavily optimised, instead of one-off C code.

badvogato, I have only lived in Canada for 15 years or so, and I don't know coulianou (should I?). I had no idea there was a homoerotic cult around Osama Ben Laden though.

Speaking of us gays reminds me - although more heterosexual people have AIDS than gays these days - this news could be good news for AIDS and HIV sufferers. It doesn't use an expensive US-developed drug, so maybe that's why it wasn't on all the US news media?? Or maybe it's because it's not yet been reported by a reliable and trusted source? Chicken and egg??

138 older entries...

New Advogato Features

New HTML Parser: The long-awaited libxml2 based HTML parser code is live. It needs further work but already handles most markup better than the original parser.

Keep up with the latest Advogato features by reading the Advogato status blog.

If you're a C programmer with some spare time, take a look at the mod_virgule project page and help us with one of the tasks on the ToDo list!